-
Notifications
You must be signed in to change notification settings - Fork 944
Pull requests: deepseek-ai/DeepEP
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support CUDA Graph for internode dispatch normal kernel
#438
opened Sep 30, 2025 by
yifeizhang-c
Loading…
Fix get_nvl_buffer_size_hint and get_rdma_buffer_size_hint
#434
opened Sep 26, 2025 by
yuantailing
Loading…
atomic_clean_flag in low latency combine, seems useless ?
#409
opened Sep 16, 2025 by
zhoutianzi666
Loading…
[Feat] Single Batch Overlap (SBO): Overlaping of Down GEMM with Combine Send
#390
opened Sep 2, 2025 by
Zqy11
Loading…
delete some redundant logic using cg::this_grid().sync();
#357
opened Aug 9, 2025 by
zhoutianzi666
Loading…
feat(buffer): implement dynamic buffer resizing for simplicity
#340
opened Jul 30, 2025 by
MengAiDev
Loading…
Optimize low latency combine recv kernel (about 3.0x speedup)
#248
opened Jun 23, 2025 by
fzyzcjy
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-03.