EmbeddedLLM
Repositories
- LMCache Public Forked from LMCache/LMCache
ROCm support for ultra-fast and cheaper long-context LLM inference
- vllm Public Forked from vllm-project/vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
- JamAIBase Public
The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.
- vllmtests Public
Tools for testing vLLM correctness and performance regressions
- aiter-api-watcher Public
Monitors the fast-changing ROCm/aiter repository and alerts users when AITER functions of interest (e.g. those used in vLLM or SGLang) have been updated at a given commit.
- vllm-rocmfork Public Forked from ROCm/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs