-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Pull requests: NVIDIA/cutlass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fixes for #3340, stale comments on thread layout shape
#3341
opened Jun 22, 2026 by
SpookOrSpooky
Loading…
[CuTeDSL] Fix make_ptr ZeroDivisionError on sub-byte dtypes
#3334
opened Jun 19, 2026 by
waynehacking8
Loading…
[CuTeDSL] Fix cute.is_major crash on List[int] mode
#3333
opened Jun 19, 2026 by
waynehacking8
Loading…
[CuTeDSL] Fix unsigned integer constant lowering above INT64_MAX (#3312)
#3332
opened Jun 19, 2026 by
waynehacking8
Loading…
[CuteDSL] Fix source location info for cute.arch.elect_one
#3317
opened Jun 14, 2026 by
pchen7e2
Loading…
[Tutorial] Fix race condition in 2SM TMEM alloc for Cute Blackwell Tutorial 04/05
#3316
opened Jun 14, 2026 by
pchen7e2
Loading…
[CuTeDSL] Fix _ScalarData internal methods triggering its own struct.scalar deprecation
#3311
opened Jun 10, 2026 by
Johnsonms
Contributor
Loading…
Fix CUTLASS Blackwell FMHA register spills on DRIVE Thor (sm_110a)
#3308
opened Jun 8, 2026 by
pzhao-eng
Loading…
Fixed integer overflow in make_cute_packed_stride batch stride computation
#3307
opened Jun 7, 2026 by
a123pal
Loading…
Enable
smem_merge_branch_allocs option on branched mega-kernel examples
#3304
opened Jun 5, 2026 by
LongshengDu
Contributor
Loading…
adding DSL INT8 MMA support on SM80+, with CuTe C++ example
#3302
opened Jun 5, 2026 by
ayghri
Loading…
Fix CUDA 12.8 __nv_atomic_load_n call signature in SubbyteReference
#3301
opened Jun 5, 2026 by
wanghemeng
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.