Dynamic Merging of Word Bounding Boxes #12283

satvik-27199 · 2024-05-24T23:34:58Z

satvik-27199
May 24, 2024

Currently, the algorithm for merging word bounding boxes combines words into a single bounding box if they are close enough. While this works in many cases, it can cause issues in others where words should remain separate despite their proximity. This leads to inaccurate text extraction and poor representation of the original content.

Example:

Consider the following text snippet:

Word1 Word2 Word3 Word4

In the current implementation, if Word1 and Word2 are close enough, they might be combined into a single bounding box as Word1Word2. Similarly, Word3 and Word4 might also be merged if they are close. This results in incorrect extraction as shown below:

Word1Word2 Word3Word4

I wonder if there any sophisticated merging strategy that takes into account:

The average character width and spacing.
The context of the text, ensuring words that are naturally separate remain so.
A threshold that adjusts dynamically based on font size and text density.

GreatV · 2024-05-25T00:32:26Z

GreatV
May 25, 2024
Maintainer

Could you please tell me which algorithm you are describing and if a corresponding example is provided?

0 replies

JoshvirN · 2025-03-28T07:50:09Z

JoshvirN
Mar 28, 2025

I would like some insight into this as well. I trained paddle detection model on word level but the inference still shows line boxes. Is there a way to get unmerged boxes?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamic Merging of Word Bounding Boxes #12283

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Dynamic Merging of Word Bounding Boxes #12283

satvik-27199 May 24, 2024

Replies: 2 comments

GreatV May 25, 2024 Maintainer

JoshvirN Mar 28, 2025

satvik-27199
May 24, 2024

GreatV
May 25, 2024
Maintainer

JoshvirN
Mar 28, 2025