Fridayxiao

process

  1. Add HTML-first conversion strategy: Prioritize HTML-based conversion for arXiv papers, falling back to PDF method when HTML is unavailable
  2. PDF conversion improvements: Modify PDF conversion function parameters to ignore images, addressing poor OCR performance and excessive processing time
  3. Performance enhancements: Significantly improve conversion speed for papers with HTML versions while reducing resource consumption
    modified: pyproject.toml
    modified: src/arxiv_mcp_server/tools/download.py
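
The HTML-first strategy with PDF fallback described in item 1 could look roughly like the sketch below. This is not the code from `download.py`: the function name `convert_paper` and the three injected callables (`fetch_html`, `html_to_markdown`, `pdf_to_markdown`) are hypothetical, chosen so the control flow can be shown without assuming any particular conversion library.

```python
from typing import Callable, Optional, Tuple


def convert_paper(
    paper_id: str,
    fetch_html: Callable[[str], Optional[str]],
    html_to_markdown: Callable[[str], str],
    pdf_to_markdown: Callable[[str], str],
) -> Tuple[str, str]:
    """Return (markdown, source), where source is "html" or "pdf".

    All three callables are hypothetical stand-ins:
      fetch_html       returns the paper's HTML, or None if no HTML version exists
      html_to_markdown converts an HTML string to markdown
      pdf_to_markdown  downloads and converts the PDF for the given paper id
    """
    try:
        html = fetch_html(paper_id)
    except Exception:
        # Treat any fetch failure (network error, 404, ...) as "HTML unavailable".
        html = None

    if html:
        # Fast path: an HTML version exists, so the PDF step is skipped entirely.
        return html_to_markdown(html), "html"

    # Fallback: no usable HTML, so convert from the PDF instead.
    return pdf_to_markdown(paper_id), "pdf"
```

Passing the fetch and conversion steps in as callables keeps the fallback logic testable without network access; in the real tool they would presumably be bound to the actual HTML download and PDF conversion functions.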

xch added 4 commits June 27, 2025 18:08
process

1. Add HTML-first conversion strategy: Prioritize HTML-based conversion for arXiv papers, falling back to PDF method when HTML is unavailable
2. PDF conversion improvements: Modify PDF conversion function parameters to ignore images, addressing poor OCR performance and excessive processing time
3. Performance enhancements: Significantly improve conversion speed for papers with HTML versions while reducing resource consumption
	modified:   pyproject.toml
	modified:   src/arxiv_mcp_server/tools/download.py
modified:   src/arxiv_mcp_server/tools/download.py