Issues: vectorch-ai/ScaleLLM

Issues list

kv cache: cache aware router
#456 opened May 1, 2025 by guocuimi
kv cache: fp8/int8 kv cache support
#455 opened May 1, 2025 by guocuimi
model: add qwen new models
#454 opened May 1, 2025 by guocuimi
model: add deepseek new models
#453 opened May 1, 2025 by guocuimi
kernel: tile scheduling
#452 opened May 1, 2025 by guocuimi
kernel: attention kernel for sm_75/70
#451 opened May 1, 2025 by guocuimi
kernel: deepep integration
#449 opened May 1, 2025 by guocuimi
kernel: attention kernel for sm_120
#447 opened May 1, 2025 by guocuimi
kernel: attention kernel for sm_90
#445 opened May 1, 2025 by guocuimi
feat: EP support for Deepseek models
#444 opened May 1, 2025 by guocuimi
kernel: Grouped GEMM for MOE
#443 opened May 1, 2025 by guocuimi
RuntimeError: Timed out
#310 opened Aug 16, 2024 by spongxin
Introducing the Mamba model
#165 opened Apr 28, 2024 by guocuimi