Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add MXFP8 attention
#2719 opened Mar 1, 2026 by cyanguwa Draft
13 tasks
pass params_dtype to qk_norm creation
#2718 opened Feb 28, 2026 by pstjohn Loading…
[JAX] CGEMM with Shardy
#2714 opened Feb 27, 2026 by phu0ngng Loading…
8 of 13 tasks
Enable dequantization from MXFP8 tensor with only columnwise data
#2712 opened Feb 26, 2026 by ptrendx Loading…
13 tasks
[JAX] Support calling MOE router kernels from JAX side
#2711 opened Feb 26, 2026 by tdophung Loading…
1 of 13 tasks
[Draft] Newton-Schulz via cuSOLVERMp
#2706 opened Feb 25, 2026 by vcherepanov-nv Loading…
6 of 13 tasks
[All] Added better error messages
#2705 opened Feb 25, 2026 by ptrendx Loading…
[Draft][PyTorch] torch.compile support for TE Linear
#2701 opened Feb 24, 2026 by pggPL Draft
13 tasks
[PyTorch] Zero-initialize learnable softmax_offset in DotProductAttention
#2694 opened Feb 20, 2026 by fjosw Loading…
7 of 13 tasks
NVFP4 primary weight support
#2691 opened Feb 19, 2026 by WanZzzzzz Loading…
10 of 13 tasks
[PyTorch] Error out if constructing LayerNormLinear with row tensor parallelism bug Something isn't working
#2688 opened Feb 17, 2026 by timmoon10 Loading…
6 of 13 tasks
[PyTorch] torch.compile support for permutation functions
#2686 opened Feb 17, 2026 by pggPL Loading…
9 of 13 tasks
[PyTorch] Add dtype information to QuantizedTensorStorage class
#2676 opened Feb 12, 2026 by ptrendx Loading…
1 of 13 tasks
[Common] MOE Split dBias cpu_overhead enhancement New feature or request MoE
#2674 opened Feb 11, 2026 by Oleg-Goncharov Loading…
8 of 13 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.