Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

CuTe DSL: add SM120 NVFP4 warp MMA unpack support
#3185 opened Apr 24, 2026 by alecco Loading…
Add Snake activation functor for EVT
#3184 opened Apr 24, 2026 by emre570 Loading…
Fix Thor MLA decode arch dispatch
#3173 opened Apr 17, 2026 by iloveai8086 Loading…
WIP: OSS CI Testing for v4.5
#3171 opened Apr 16, 2026 by zekunf-nv Collaborator Loading…
Fix incorrect example paths in CuTeDSL docstrings
#3151 opened Apr 6, 2026 by Weili-0234 Loading…
Fix Hopper FMHA performance regression on CUDA < 13.1
#3137 opened Mar 31, 2026 by arvin-chou Loading…
5 of 6 tasks
feat(CuTeDSL): print benchmark time from Blackwell dense_gemm CLI
#3136 opened Mar 30, 2026 by aidando73 Contributor Loading…
Fix elementwise_apply.py inactive-30d
#3129 opened Mar 25, 2026 by HydraQYH Contributor Loading…
Enable strict C++ compiler warnings with -Werror inactive-30d
#3123 opened Mar 22, 2026 by maxwbuckley Loading…
3 of 4 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.