fix: `DocVLMPredictor` device use error #4348

gouzil · 2025-07-09T12:46:34Z

fix

在使用 PaddleOCR 时遇到的 PaddleX DocVLMPredictor 显存分配异常情况

复现代码

from paddleocr import PPStructureV3

pipeline = PPStructureV3(device="gpu:1") # device 指定为任意，非默认 device 即可

表现形式

当使用 PPStructureV3(device="cpu"), 同时有 gpu 时，DocVLMPredictor 下的几个模型会被加载到 gpu:0 上，并占用大约 2034MiB 显存

修复方案

修改 device 设置策略，优先加载 kwargs 传入，当 device 为 None 时加载 pp_option 中的 device。（paddlex 框架内居然有两种 device 加载选项挺奇怪的）

TODO

现有的 paddlex/inference/models/common/vlm/flash_attn_utils.py 下的 is_flash_attn_available 函数也有类似问题，需要换一种形式确认是否支持 flash_attention

paddle-bot · 2025-07-09T12:46:41Z

Thanks for your contribution!

gouzil · 2025-07-10T03:00:38Z

@luotao1 能帮忙看看 cla 为啥过不了嘛，还有应该找谁 review

BluebirdStory

哈喽，感谢PR。

先来说一下为什么会有两套的device设置：因为目前doc_vlm之下的vlm模型只支持动态图的推理，和PaddleX下其他使用静态图的模型不太一样，所以在设计最初避开了对pp_option的使用，做了隔离，直接通过传入的device来在指定设备上推理。
再来说一下为什么pp structure会有相关问题：因为对于PPStructureV3(device="cpu")这里传入的device参数，PaddleOCR内部会直接封装出pp_option，并不会再单独传入device参数给doc_vlm的predictor，所以这导致在structure产线下，doc_vlm模块收到的device参数永远是None，从而导致在有gpu的情况下，会加载部分模型参数到0号线卡上。
这个PR中，先对device检查，之后再检查pp_option，确实是个解决这个问题的很好的方法，但是依然存在一些问题：
- self.pp_option.device_type != "cpu"如果设置device为cpu，那self.device依然是None，依然会加载模型到gpu:0上。

方便的话，辛苦更正上述问题重新PR。

gouzil · 2025-07-16T13:44:56Z

self.pp_option.device_type != "cpu"如果设置device为cpu，那self.device依然是None，依然会加载模型到gpu:0上。

done，我这没有权限能查看 ci 日志，能帮忙看看 ci 挂了是啥原因嘛

BluebirdStory · 2025-08-19T13:59:36Z

哈喽，感谢PR。

依然存在一些小问题哈，在你的逻辑里：

                self.device = constr_device(
                    self.pp_option.device_type,
                    (
                        str(self.pp_option.device_id)
                        if self.pp_option.device_type != "cpu"
                        else None
                    ),
                )

如果设置self.pp_option.device_type为gpu，但是self.pp_option.device_id如果为默认参数None，那么会导致str(self.pp_option.device_id)为"None"，进而输入给constr_device导致错误的device_id形式。

方便的话，辛苦更正上述问题重新PR。

感谢您的共享精神，besides，代码的修改所涉及的所有变量，辛苦考虑到每一种可能的情况。

…edictor_gpu_memory_usage

gouzil · 2025-09-06T03:17:25Z

方便的话，辛苦更正上述问题重新PR。

Done

fix: DocVLMPredictor device use error

ff63b80

paddle-bot bot added the contributor External developers label Jul 9, 2025

gouzil changed the title ~~fix: DocVLMPredictor device use error~~ fix: DocVLMPredictor device use error Jul 9, 2025

luotao1 closed this Jul 11, 2025

luotao1 reopened this Jul 11, 2025

luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Jul 11, 2025

luotao1 assigned luotao1 and BluebirdStory Jul 11, 2025

BluebirdStory requested changes Jul 16, 2025

View reviewed changes

fix

b239eab

gouzil closed this Jul 16, 2025

gouzil reopened this Jul 16, 2025

gouzil added 3 commits September 5, 2025 23:14

Merge branch 'develop' of github.com:gouzil/PaddleX into fix/DocVLMPr…

248e356

…edictor_gpu_memory_usage

Merge branch 'develop' of github.com:gouzil/PaddleX into fix/DocVLMPr…

afb7eb5

…edictor_gpu_memory_usage

fix: pp_option device id

7cae6ce

gouzil requested a review from BluebirdStory September 22, 2025 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: `DocVLMPredictor` device use error #4348

fix: `DocVLMPredictor` device use error #4348

gouzil commented Jul 9, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Jul 9, 2025

Uh oh!

gouzil commented Jul 10, 2025

Uh oh!

BluebirdStory left a comment

Uh oh!

gouzil commented Jul 16, 2025 •

edited

Loading

Uh oh!

BluebirdStory commented Aug 19, 2025

Uh oh!

gouzil commented Sep 6, 2025

Uh oh!

Uh oh!

fix: DocVLMPredictor device use error #4348

Are you sure you want to change the base?

fix: DocVLMPredictor device use error #4348

Conversation

gouzil commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

fix

复现代码

表现形式

修复方案

TODO

Uh oh!

paddle-bot bot commented Jul 9, 2025

Uh oh!

gouzil commented Jul 10, 2025

Uh oh!

BluebirdStory left a comment

Choose a reason for hiding this comment

Uh oh!

gouzil commented Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BluebirdStory commented Aug 19, 2025

Uh oh!

gouzil commented Sep 6, 2025

Uh oh!

Uh oh!

fix: `DocVLMPredictor` device use error #4348

fix: `DocVLMPredictor` device use error #4348

gouzil commented Jul 9, 2025 •

edited

Loading

gouzil commented Jul 16, 2025 •

edited

Loading