Pinned

  1. vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 59.3k stars · 10.5k forks

  2. llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python · 2k stars · 246 forks

  3. recipes Public

    Common recipes to run vLLM

    146 stars · 48 forks

Repositories

Showing 10 of 24 repositories
  • vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 59,320 stars · Apache-2.0 · 10,477 forks · 1,865 issues (30 need help) · 1,175 pull requests · Updated Oct 2, 2025
  • vllm-gaudi Public

    Community maintained hardware plugin for vLLM on Intel Gaudi

    Python · 11 stars · 46 forks · 1 issue · 42 pull requests · Updated Oct 2, 2025
  • ci-infra Public

    This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

    HCL · 22 stars · 39 forks · 0 issues · 18 pull requests · Updated Oct 2, 2025
  • guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python · 603 stars · Apache-2.0 · 85 forks · 85 issues (5 need help) · 31 pull requests · Updated Oct 1, 2025
  • aibrix Public

    Cost-efficient and pluggable Infrastructure components for GenAI inference

    Go · 4,279 stars · Apache-2.0 · 466 forks · 218 issues (19 need help) · 23 pull requests · Updated Oct 1, 2025
  • llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python · 2,031 stars · Apache-2.0 · 246 forks · 59 issues (13 need help) · 35 pull requests · Updated Oct 1, 2025
  • semantic-router Public

    Intelligent Mixture-of-Models Router for Efficient LLM Inference

    Go · 1,582 stars · Apache-2.0 · 174 forks · 69 issues (15 need help) · 15 pull requests · Updated Oct 1, 2025
  • vllm-spyre Public

    Community maintained hardware plugin for vLLM on Spyre

    Python · 35 stars · Apache-2.0 · 24 forks · 6 issues · 18 pull requests · Updated Oct 1, 2025
  • vllm-ascend Public

    Community maintained hardware plugin for vLLM on Ascend

    Python · 1,178 stars · Apache-2.0 · 466 forks · 528 issues (6 need help) · 152 pull requests · Updated Oct 1, 2025
  • recipes Public

    Common recipes to run vLLM

    146 stars · Apache-2.0 · 48 forks · 4 issues · 4 pull requests · Updated Oct 1, 2025