Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix filtered topk unified kernel
#2319 opened Jan 9, 2026 by murphymatt Loading…
5 tasks done
tiny support glm routing
#2313 opened Jan 8, 2026 by b8zhong Loading…
Fix: FilteredTopKUnifiedKernel read value out of length
#2308 opened Jan 8, 2026 by HarryWu99 Loading…
4 tasks done
Fused MoE non gated Relu2 FP8
#2304 opened Jan 7, 2026 by amitz-nv Draft
5 tasks
Selective State Update kernel (mamba)
#2301 opened Jan 6, 2026 by ishovkun Loading…
5 tasks done
bugfix: skip CUTLASS kernel generation when AOT cache exists
#2248 opened Dec 19, 2025 by yongwww Loading…
3 of 5 tasks
fix: Handle zeros in Mistral Large 3 MoE inference
#2238 opened Dec 18, 2025 by dbari Draft
8 of 9 tasks
misc: support checks unit test tracking
#2224 opened Dec 16, 2025 by jimmyzho Loading…
5 tasks
feat: Input/output Dump + Replay Mode for API Logging Level 10
#2206 opened Dec 11, 2025 by bkryu Loading…
3 of 5 tasks
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
ProTip! Filter pull requests by the default branch with base:main.