Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fp8 attention are now part of cuDNN 9.17.1
#2241 opened Dec 18, 2025 by Anerudhan Draft
5 tasks done
agent: add CLAUDE.md and claude skills
#2240 opened Dec 18, 2025 by yzh119 Loading…
5 tasks done
Allreduce auto backend improvements
#2239 opened Dec 18, 2025 by nvmbreughe Loading…
3 of 5 tasks
fix: Handle zeros in Mistral Large 3 MoE inference
#2238 opened Dec 18, 2025 by dbari Draft
8 of 9 tasks
feat: Fused RMSNorm + FP4 Quantization Kernels in CuTe-DSL
#2233 opened Dec 17, 2025 by bkryu Loading…
3 of 5 tasks
cicd / testing: Add xfails tracker script
#2227 opened Dec 16, 2025 by kahyunnam Loading…
5 tasks done
misc: support checks unit test tracking
#2224 opened Dec 16, 2025 by jimmyzho Loading…
5 tasks
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh Loading…
5 tasks done
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19 Loading…
5 tasks
Sampling non contiguous
#1916 opened Oct 12, 2025 by zcin Loading…
5 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.