Content-Length: 352668 | pFad | http://github.com/NVIDIA/TensorRT-LLM/issues/#start-of-content

32 Issues · NVIDIA/TensorRT-LLM · GitHub
Skip to content

Issues: NVIDIA/TensorRT-LLM

Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

Cannot find 'setup.py' nor 'pyproject.toml' in TensorRT-LLM/3rdparty/cutlass/python bug Something isn't working
#4995 opened Jun 6, 2025 by hoangledoan
2 of 4 tasks
Scaffolding tests failing on main branch with thread leaks and RuntimeError bug Something isn't working triaged Issue has been triaged by maintainers
#4974 opened Jun 6, 2025 by ccs96307
Feature Request: Enable chunked prefill by default in trtllm-serve or provide CLI flag feature request New feature or request. This includes new model, dtype, functionality support
#4947 opened Jun 5, 2025 by Nekofish-L
Feature Request: Add Llama_Nemotron_Nano_VL Support feature request New feature or request. This includes new model, dtype, functionality support
#4937 opened Jun 5, 2025 by guruprasad-atx
Feature Request: Add Prometheus Metrics Endpoint to trtllm-serve feature request New feature or request. This includes new model, dtype, functionality support
#4926 opened Jun 5, 2025 by Nekofish-L
[AutoDeploy] Investigate DemoLLM Token Generation AutoDeploy bug Something isn't working
#4841 opened Jun 2, 2025 by lucaslie
Title: KeyError: 'gemma3' error in GemmaConfig.from_hugging_face when converting Gemma 3 model bug Something isn't working triaged Issue has been triaged by maintainers
#4825 opened Jun 2, 2025 by bebilli
2 of 4 tasks
Driver crash during warmup of DeepSeek-R1-FP4 bug Something isn't working
#4816 opened May 31, 2025 by pathorn
1 of 4 tasks
The output of Gemma 3 4B for TensorRT and Transformers is not the same, even when using float32 bug Something isn't working triaged Issue has been triaged by maintainers
#4815 opened May 31, 2025 by Alireza3242
1 of 4 tasks
Feature support: eagle multimodal inputs feature request New feature or request. This includes new model, dtype, functionality support
#4787 opened May 30, 2025 by liyi-xia
How is the performance of the model with pytorch as the backend Investigating Performance TRTLLM model inference speed, throughput, efficiency. Latency, benchmarks, regressions, opts. triaged Issue has been triaged by maintainers
#4745 opened May 29, 2025 by oppolll
ProTip! Find all open issues with in progress development work with linked:pr.








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/NVIDIA/TensorRT-LLM/issues/#start-of-content

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy