[Feat] Implement kv cache broadcast in MLA #367

harrisonyhq · 2025-11-18T01:22:34Z

Purpose

Implement KV cache broadcast for the MLA scenario: the KV cache is loaded only on rank 0 and then broadcast to the other GPUs in the TP group.

Modifications

Skip the load and wait steps on TP ranks greater than 0.
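
The load-once-then-broadcast pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code in `ucm_connector.py`: `SimulatedTPGroup`, `load_kv_for_rank`, `load_from_store`, and `run_demo` are hypothetical names, and the in-process group only stands in for a real collective (a real implementation would presumably use something like `torch.distributed.broadcast` over the TP group).

```python
from typing import Callable, List, Optional, Tuple

KVBlocks = List[bytes]


class SimulatedTPGroup:
    """Hypothetical in-process stand-in for a tensor-parallel group."""

    def __init__(self, world_size: int) -> None:
        self.world_size = world_size
        self._buffer: Optional[KVBlocks] = None

    def broadcast(self, blocks: Optional[KVBlocks], rank: int, src: int = 0) -> KVBlocks:
        # The source rank publishes its data; every rank returns a copy of it.
        if rank == src:
            if blocks is None:
                raise ValueError("source rank must provide data")
            self._buffer = list(blocks)
        if self._buffer is None:
            raise RuntimeError("source rank has not broadcast yet")
        return list(self._buffer)


def load_kv_for_rank(
    rank: int,
    group: SimulatedTPGroup,
    load_from_store: Callable[[], KVBlocks],
) -> KVBlocks:
    # Only rank 0 touches the external store; other ranks skip load and wait.
    blocks = load_from_store() if rank == 0 else None
    # All ranks take part in the broadcast and end up with the same blocks.
    return group.broadcast(blocks, rank=rank, src=0)


def run_demo(world_size: int = 4) -> Tuple[int, List[KVBlocks]]:
    load_count = 0

    def load_from_store() -> KVBlocks:
        nonlocal load_count
        load_count += 1
        return [b"block-0", b"block-1"]  # placeholder payload, assumed for the demo

    group = SimulatedTPGroup(world_size)
    # Ranks are visited in order here, so rank 0 always broadcasts first.
    per_rank = [load_kv_for_rank(r, group, load_from_store) for r in range(world_size)]
    return load_count, per_rank
```

In this sketch the store is read once regardless of the TP size, which is the saving the PR is aiming for; it assumes every rank needs an identical copy of the cache, as the Purpose section implies for MLA.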

Test

ucm/integration/vllm/ucm_connector.py

harrisonyhq requested review from hek14, mag1c-h, qyh111 and ygwpz as code owners November 18, 2025 01:22

harrisonyhq requested review from flesher0813 and removed request for hek14 and mag1c-h November 18, 2025 01:22

harrisonyhq force-pushed the dev-ucm-v1 branch from 56497f1 to dc9aada Compare November 18, 2025 03:44

harrisonyhq changed the title ~~[Feat] Implement kv cache broadcast in MLA~~ [WIP] Implement kv cache broadcast in MLA Nov 18, 2025

ygwpz reviewed Nov 18, 2025

ucm/integration/vllm/ucm_connector.py (outdated, resolved)

[Feat] Implement kv cache broadcast in MLA in ucm_connector

16cae41

harrisonyhq force-pushed the dev-ucm-v1 branch from dc9aada to 16cae41 Compare November 19, 2025 03:17

harrisonyhq changed the title ~~[WIP] Implement kv cache broadcast in MLA~~ [Feat] Implement kv cache broadcast in MLA Nov 19, 2025

harrisonyhq requested a review from ygwpz November 19, 2025 03:45

harrisonyhq self-assigned this Nov 19, 2025

[Style] Change wait for broadcast into single task method

cc68938

ygwpz approved these changes Nov 19, 2025

ygwpz merged commit a2fdffc into ModelEngine-Group:dev-ucm-v1 Nov 19, 2025
3 checks passed
