πŸ–ΌοΈπŸ“„E2E Multi-modal Document Preprocessing for Search Indexing with Azure Document Intelligence

kimtth/rag-multimodal-semantic-chunking

📄 Multi-modal Document Preprocessing with Azure Document Intelligence

✨ Features

  1. πŸ“ Generate a document parsed results using Document Intelligence, and output it in Markdown format. > output
  2. πŸ–ΌοΈ Extract figures from documents and save them as PNG images. > output
  3. πŸ€– Generate figure descriptions using Azure OpenAI Multimodal.
  4. πŸ“ Update markdown outputs with generated descriptions. > output
  5. πŸ“Š Extract tables and convert them into Excel files. > output
  6. πŸ“– Text Chunking to markdown ouputs using MarkdownHeaderTextSplitter, RecursiveContentChunker, and SemanticContentChunker (TBD) > markdown chuck output | recursive chunk output
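
To make the chunking step (item 6) more concrete, here is a minimal, dependency-free sketch of the two ideas it names: splitting Markdown at headers, then recursively splitting oversized sections by progressively finer separators. This is not the repository's code and does not use MarkdownHeaderTextSplitter or RecursiveContentChunker; the function names `split_by_headers`, `recursive_split`, and `chunk_markdown`, as well as the `max_chars` parameter, are hypothetical and chosen only for illustration.

```python
import re
from typing import Dict, List, Sequence

# Matches ATX-style headers such as "## Section". Assumption: the input has no
# fenced code blocks containing lines that start with "#".
HEADER_RE = re.compile(r"^(#{1,6})\s+(.*)$")


def split_by_headers(markdown: str) -> List[Dict]:
    """Split Markdown into sections and record each section's header path."""
    sections: List[Dict] = []
    path: Dict[int, str] = {}  # header level -> current title at that level
    buf: List[str] = []

    def flush() -> None:
        text = "\n".join(buf).strip()
        if text:
            headers = [path[level] for level in sorted(path)]
            sections.append({"headers": headers, "text": text})
        buf.clear()

    for line in markdown.splitlines():
        match = HEADER_RE.match(line)
        if match:
            flush()
            level = len(match.group(1))
            # A new header closes any header at the same or a deeper level.
            for stale in [k for k in path if k >= level]:
                del path[stale]
            path[level] = match.group(2).strip()
        else:
            buf.append(line)
    flush()
    return sections


def recursive_split(text: str, max_chars: int,
                    separators: Sequence[str] = ("\n\n", "\n", " ")) -> List[str]:
    """Split text into pieces of at most max_chars, trying coarse separators first."""
    if len(text) <= max_chars:
        return [text] if text else []
    for i, sep in enumerate(separators):
        if sep not in text:
            continue
        chunks: List[str] = []
        current = ""
        for part in text.split(sep):
            candidate = part if not current else current + sep + part
            if len(candidate) <= max_chars:
                current = candidate
                continue
            if current:
                chunks.append(current)
            if len(part) > max_chars:
                # The part alone is too long: retry with the finer separators.
                chunks.extend(recursive_split(part, max_chars, separators[i + 1:]))
                current = ""
            else:
                current = part
        if current:
            chunks.append(current)
        return chunks
    # No separator applies: fall back to a hard cut.
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]


def chunk_markdown(markdown: str, max_chars: int) -> List[Dict]:
    """Header-aware chunking: split by headers, then cap each section's size."""
    chunks: List[Dict] = []
    for section in split_by_headers(markdown):
        for piece in recursive_split(section["text"], max_chars):
            chunks.append({"headers": section["headers"], "text": piece})
    return chunks


if __name__ == "__main__":
    sample = "# Title\n\nIntro text.\n\n## Section A\n\nalpha beta gamma delta\n"
    for chunk in chunk_markdown(sample, max_chars=12):
        print(chunk)
```

Keeping the header path with every chunk is one way to preserve document context for search indexing; a production pipeline would more likely rely on an established splitter library than on this sketch.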

🚀 Usage

python doc_intelli_workflow.py

📚 Learn More
