-
Notifications
You must be signed in to change notification settings - Fork 428
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Manage dependencies and add missing
einops
req
#1859
opened Jun 7, 2025 by
ksivaman
Loading…
7 of 13 tasks
[PyTorch] Add support for FP8 current scaling in operation-based API
enhancement
New feature or request
testing
Improvements to tests or testing infrastructure
#1858
opened Jun 6, 2025 by
timmoon10
Loading…
6 of 14 tasks
Use public API instead of removed private function in
te_llama.py
#1856
opened Jun 6, 2025 by
janekb04
Loading…
2 of 13 tasks
Add support for Fused Attn MLA head_dim_qk != head_dim_v
#1851
opened Jun 4, 2025 by
KshitijLakhani
•
Draft
13 tasks
Draft: Add support for overlapping wgrad NCCL AG with dgrad GEMM
#1849
opened Jun 4, 2025 by
djns99
Loading…
4 of 13 tasks
[PyTorch] Inference mode disables initializing quantized weights with column-wise usage
2.5.0
bug
Something isn't working
enhancement
New feature or request
#1847
opened Jun 4, 2025 by
timmoon10
Loading…
6 of 13 tasks
[JAX] Collective GEMM custom op + primitive + minimal supporting functions
jax
#1846
opened Jun 3, 2025 by
denera
Loading…
5 of 13 tasks
[JAX] TensorUsage + FP8 GEMM with all layouts handling on BW
2.5.0
#1844
opened Jun 3, 2025 by
phu0ngng
Loading…
8 of 13 tasks
[PyTorch Debug] Fixed the empty tensor bug in statistics computation
#1843
opened Jun 3, 2025 by
pggPL
Loading…
8 of 13 tasks
Make quantize_ respect the usages of the quantizer
#1836
opened May 31, 2025 by
ptrendx
Loading…
13 tasks
[PyTorch] Use FP16 tols for distributed tests with TF32 compute
#1831
opened May 28, 2025 by
timmoon10
Loading…
6 of 13 tasks
Add cuBLASMp-backed GEMM-like API to TE common
#1824
opened May 27, 2025 by
mk-61
Loading…
4 of 13 tasks
[PyTorch][MoE] Reduce CPU Overhead By Fuse Torch Empty Calls
performance
Performance issues
#1793
opened May 16, 2025 by
zhongbozhu
Loading…
1 of 13 tasks
[common] Added support of FP4 data type
#1779
opened May 13, 2025 by
Oleg-Goncharov
Loading…
6 of 13 tasks
[PyTorch] Update PyTorch FSDP2 test to cover all TE layer types
testing
Improvements to tests or testing infrastructure
#1777
opened May 12, 2025 by
denera
Loading…
8 of 13 tasks
linear: clear row-wise weight at the end of forward
#1770
opened May 12, 2025 by
kshitij12345
•
Draft
Previous Next
ProTip!
Adding no:label will show everything without a label.