vLLM · GitHub

Pinned

  1. vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 47.9k stars · 7.6k forks

  2. llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python · 1.4k stars · 133 forks

Repositories

Showing 10 of 16 repositories

Sponsors

  • @Stack-ML
  • @imkero
  • @comet-ml
  • @HiddenPeak
  • @terrytangyuan
  • @mhupfauer
  • @dvlpjrs
  • @vincentkoc
  • @robertgshaw2-redhat
  • Private Sponsor
