GitHub - kimtth/rag-multimodal-semantic-chunking: 🖼️📄E2E Multi-modal Document Preprocessing for Search Indexing with Azure Document Intelligence

📄 Multi-modal Document Preprocessing with Azure Document Intelligence

📝 Generate a document parsed results using Document Intelligence, and output it in Markdown format. > output
🖼️ Extract figures from documents and save them as PNG images. > output
🤖 Generate figure descriptions using Azure OpenAI Multimodal.
📝 Update markdown outputs with generated descriptions. > output
📊 Extract tables and convert them into Excel files. > output
📖 Text Chunking to markdown ouputs using MarkdownHeaderTextSplitter, RecursiveContentChunker, and SemanticContentChunker (TBD) > markdown chuck output | recursive chunk output

python doc_intelli_workflow.py

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
actor		actor
data		data
output		output
.env.sample		.env.sample
.gitignore		.gitignore
README.md		README.md
doc_intelli_workflow.py		doc_intelli_workflow.py
pyproject.toml		pyproject.toml