-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[#10243][feat] Draft - DONT REVIEW - Add TRT-LLM attention backend with hybrid model support to AutoDeploy
#11430
opened Feb 10, 2026 by
MrGeva
Loading…
1 task
[None][feat] AutoDeploy: Add nemotron v2 acc test
#11429
opened Feb 10, 2026 by
nvchenghaoz
Loading…
1 task
[https://nvbugs/5832481][test] Add gpt-oss-120b-Eagle3-throughput case on DGX-Spark
#11419
opened Feb 10, 2026 by
JennyLiu-nv
Loading…
1 task done
[https://nvbugs/5809169][unwaive] Unwaive TestGPTOSS test
#11417
opened Feb 10, 2026 by
peaceh-nv
Loading…
[https://nvbugs/5809169][unwaive] Unwaive TestGPTOSS test
#11416
opened Feb 10, 2026 by
peaceh-nv
Loading…
[https://nvbugs/5624818][fix] Add unittest for GPT-OSS non-paged_context_fmha
#11415
opened Feb 10, 2026 by
pengbowang-nv
•
Draft
1 task done
[#11109][feat] AutoDeploy: GLM 4.7 Flash Improvements
#11414
opened Feb 10, 2026 by
bmarimuthu-nv
Loading…
1 task done
[https://nvbugs/5880261][fix] fix cacheTransceiver
#11409
opened Feb 10, 2026 by
chuangz0
Loading…
1 task done
[None][chore] Merge residual+hidden into layer norm at the end of each MTP, and remove a % operation
#11406
opened Feb 10, 2026 by
hnover-nv
Loading…
[TRTLLM-10329][feat] Fix weight loading for Nemotron 3 models on DGX Spark
#11405
opened Feb 10, 2026 by
pamelap-nvidia
Loading…
1 task done
[None][chore] Fix gpu memory requirement in stress test
#11404
opened Feb 10, 2026 by
dominicshanshan
Loading…
1 task done
[None][fix] Fix silent MPI failures on models with custom tokenizers
#11399
opened Feb 10, 2026 by
jthomson04
Loading…
1 task done
[TRTLLM-10948][feat] Add GPU energy monitoring to trtllm-bench
Community want to contribute
PRs initiated from Community
#11397
opened Feb 10, 2026 by
inciaf
Loading…
1 task
[https://nvbugs/5868038][fix] Gracefully terminate disagg serving servers to prevent leftover subprocess warnings
#11395
opened Feb 9, 2026 by
peihu-nv
Loading…
1 task done
[None][feat] Integrate all reduce from flashinfer
#11391
opened Feb 9, 2026 by
NVShreyas
Loading…
1 task done
[https://nvbugs/5823783][fix] Fix multi-node trust_remote_code hang i…
#11383
opened Feb 9, 2026 by
JunyiXu-nv
Loading…
1 task done
[None][feat] Remove non flash attetnion style fmha_v2 kernel for hopper
#11381
opened Feb 9, 2026 by
pengbowang-nv
•
Draft
1 task done
[TRTLLM-9644][infra] Implement isolation for slurm
#11380
opened Feb 9, 2026 by
EmmaQiaoCh
•
Draft
1 task done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.