Pull requests: deepseek-ai/DeepEP

Pull requests list

Fix racing condition in large batch size
#440 opened Oct 2, 2025 by fzyzcjy
Add permute extension to hybrid-ep
#439 opened Sep 30, 2025 by Autumn1998
Add cpp and python linter.
#435 opened Sep 26, 2025 by sphish
opt ll dispatch layered algo
#425 opened Sep 24, 2025 by alpha-baby
Support per tensor transfer
#416 opened Sep 18, 2025 by ayrnb
Add imbalance factor in test_low_latency
#393 opened Sep 4, 2025 by JianboDong
Feature/sm free normal kernel
#347 opened Jul 31, 2025 by ZhiyiHu1999
4 tasks
Support nvfp4 low latency mode dispatch
#341 opened Jul 30, 2025 by shifangx
Support nvfp4 intranode dispatch
#339 opened Jul 30, 2025 by jershi425
Support prefill with 2 GPUs
#331 opened Jul 27, 2025 by fzyzcjy
enhance warp copy efficiency in cached_notify()
#315 opened Jul 18, 2025 by ZhiyiHu1999
support low latency dispatch tma
#293 opened Jul 10, 2025 by ayrnb
Tiny support custom nvcc flags
#280 opened Jul 5, 2025 by fzyzcjy
Allow using few SMs for low-latency mode
#277 opened Jul 3, 2025 by fzyzcjy
Computation communication overlap
#249 opened Jun 24, 2025 by fzyzcjy Draft
Support other NVLink scenarios
#218 opened Jun 17, 2025 by fzyzcjy