-
Notifications
You must be signed in to change notification settings - Fork 54
Issues: NVIDIA/Fuser
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Swizzle tiles in matmul without introducing larger grid due to nondivisible splits
Matmuls
#3942
opened Feb 21, 2025 by
jacobhinkle
Allow separate sub-DAG for load and compute warp groups with warp-specialized circular buffering.
Matmuls
TMA
#3941
opened Feb 21, 2025 by
rdspring1
MarkAliasesPrepare to recognize meta ops with DID loop split.
allocation domain
issues related to allocation domain support
Multi-GPU
#3902
opened Feb 15, 2025 by
wujingyue
Fix ReorderShardedAxis and MakeReshardingContiguous for DID loop split.
Multi-GPU
#3900
opened Feb 15, 2025 by
wujingyue
Feature request: Consider privatization instead of forwarding in fusion segmentation
Segmentation
Issues related to nvFuser Segmentation
#3832
opened Feb 5, 2025 by
naoyam
Feature request: Extend the privatization to improve segmentation
Segmentation
Issues related to nvFuser Segmentation
#3830
opened Feb 5, 2025 by
naoyam
Feature request: Fusing sibling exprs in segmentation
Segmentation
Issues related to nvFuser Segmentation
#3829
opened Feb 5, 2025 by
naoyam
Automate performant Hopper matmul
H100 Perf
improve performance on H100
Matmuls
#3819
opened Feb 4, 2025 by
jacobhinkle
3 of 22 tasks
Reduction scheduler fails to recognize iter domains not captured by reference
bug
Something isn't working
#3811
opened Feb 3, 2025 by
naoyam
Make FusionProfile object not a singleton and allow copying
#3771
opened Jan 28, 2025 by
kshitij12345
pytest benchmark reporting incorrect benchmark time
bug
Something isn't working
Python Benchmarks
#3753
opened Jan 23, 2025 by
jjsjann123
Run the IdModel exact-ness validation as part of the launch-time validation
idmodel
#3752
opened Jan 23, 2025 by
naoyam
TensorDomain::flatten should squeeze broadcast IDs as done by the usual reshape transform
#3691
opened Jan 9, 2025 by
naoyam
Optimize matrix multiplication performance using tma multicast
Matmuls
#3689
opened Jan 9, 2025 by
rdspring1
1 of 7 tasks
Inlining error in Hopper matmul with AxisMapping and grid swizzling
Matmuls
#3671
opened Jan 6, 2025 by
jacobhinkle
Previous Next
ProTip!
Adding no:label will show everything without a label.