Pull requests: flashinfer-ai/flashinfer

Pull requests list

Allow cudnn prefill kernels to be called natively
#1317 opened Jul 24, 2025 by Anerudhan Draft
5 tasks done
minor: add trtllm_gen_mla benchmark
#1316 opened Jul 24, 2025 by yyihuang
5 tasks done
test qkvo quantization not equal to 1.
#1314 opened Jul 24, 2025 by weireweire
5 tasks
Wrap cudnn backend to unified interface
#1312 opened Jul 23, 2025 by cyx-6
5 tasks
Refactor Fused Moe Module
#1309 opened Jul 23, 2025 by wenscarl
5 tasks
Api regression test for trtllmgen fp8 moe
#1308 opened Jul 23, 2025 by aleozlx
5 tasks done
fix: a workaround to make fp8 kv-cache work for prefill
#1304 opened Jul 22, 2025 by chenyang78
2 tasks
3rparty: upgrade cutlass dependency to v4.1.0
#1299 opened Jul 22, 2025 by yzh119
5 tasks
Add weight layout
#1297 opened Jul 21, 2025 by aleozlx
5 tasks done
add mm_fp4 use cutlass backend for large bs
#1296 opened Jul 21, 2025 by ttyio
5 tasks done
Add native cudnn_decode for improved cudnn decode performance
#1283 opened Jul 18, 2025 by Anerudhan
5 tasks done
ci: add github actions to upload sdist to pypi
#1270 opened Jul 16, 2025 by yzh119
5 tasks
Bug fix: fix duplicate launch in POD
#1267 opened Jul 16, 2025 by Edenzzzz
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261 opened Jul 15, 2025 by EmilienM
3 of 5 tasks
refactor: separate SM100 and legacy TRT-LLM comm modules
#1259 opened Jul 15, 2025 by EmilienM
3 of 5 tasks
Mnnvl memory with custom communicator
#1245 opened Jul 14, 2025 by wenscarl Draft
5 tasks
feat: Restore convenience FLASHINFER_ENABLE_AOT option
#1235 opened Jul 8, 2025 by mgorny
3 of 5 tasks