forked from vllm-project/vllm
Issues: EmbeddedLLM/vllm
[Bug]: Fix the triton chunked prefill decode V1 bug
bug
Something isn't working
#63
opened May 14, 2025 by
tjtanaa
2 tasks done
[Feature]: Integrate AITER MLA V1 from Upstream
enhancement
New feature or request
#62
opened May 14, 2025 by
tjtanaa
1 task done
[Feature]: MTP on ROCm
enhancement
New feature or request
#61
opened May 13, 2025 by
tjtanaa
1 task done
[Feature]: Move to use AITER v0.1.1
enhancement
New feature or request
#58
opened May 7, 2025 by
tjtanaa
1 task done
[Feature]: Optimize DeepSeek V3 for V1 Engine
enhancement
New feature or request
#57
opened May 7, 2025 by
tjtanaa
1 task done
[Usage]: Check if lmcache works on ROCm
question
Further information is requested
#56
opened May 7, 2025 by
tjtanaa
1 task done
[Usage]: Document how to use AITER on vLLM
documentation
Improvements or additions to documentation
#55
opened May 7, 2025 by
tjtanaa
1 task done
[Usage]: PD-disaggregated Inferencing
question
Further information is requested
#54
opened May 7, 2025 by
tjtanaa
1 task done
[Bug]: direct_register_custom_ops incur overhead
bug
Something isn't working
#52
opened May 6, 2025 by
tjtanaa
1 task done
[Bug]: Qwen 2.5 72B TPOT regression
bug
Something isn't working
#51
opened Apr 30, 2025 by
tjtanaa
1 task done
[Bug]: Regression in DeepSeekV3 by 20%
bug
Something isn't working
#50
opened Apr 30, 2025 by
tjtanaa
1 task done
[Feature]: Check if the moe_2stage is compatible with no-EP and EP of Qwen3 MoE FP8 model
enhancement
New feature or request
#49
opened Apr 30, 2025 by
tjtanaa
1 task done
[Bug]: Benchmark MoE tuning script bug on ROCm
bug
Something isn't working
#48
opened Apr 30, 2025 by
tjtanaa
1 task done
[Feature]: CK MoE 2 stage
enhancement
New feature or request
#47
opened Apr 30, 2025 by
tjtanaa
1 task done
[Feature]: Enable MLA for V1 on AMD [Triton MLA]
enhancement
New feature or request
#46
opened Apr 23, 2025 by
tjtanaa
1 task done
[RFC]: All Ops should be determined during init and wrapped in a Layer Module to avoid envs.ENVIRON overhead
enhancement
New feature or request
help wanted
Extra attention is needed
#45
opened Apr 22, 2025 by
tjtanaa
1 task done
[Bug]: Bugfix LoRA for ROCm: incompatible Triton arguments are passed
bug
Something isn't working
#42
opened Apr 17, 2025 by
tjtanaa
1 task done
[Bug]: Bugfix AITER RMSNORM
bug
Something isn't working
#39
opened Apr 16, 2025 by
tjtanaa
1 task done
AITER Block Scaled A8W8 GEMM() for V1 Engine
enhancement
New feature or request
#33
opened Apr 16, 2025 by
vllmellm
Enable AITER Linear() for V1 Engine
enhancement
New feature or request
#30
opened Apr 16, 2025 by
vllmellm
[Feature]: Roadmap for 2nd Quarter 2025
enhancement
New feature or request
#29
opened Apr 15, 2025 by
tjtanaa
1 of 5 tasks