Advanced PDF analysis solution that processes multiple document collections and extracts relevant content based on specific personas and use cases.
Challenge_1b/
├── Collection 1/ # Travel Planning
│ ├── PDFs/ # South of France guides
│ ├── challenge1b_input.json # Input configuration
│ └── challenge1b_output.json # Analysis results
├── Collection 2/ # Adobe Acrobat Learning
│ ├── PDFs/ # Acrobat tutorials
│ ├── challenge1b_input.json # Input configuration
│ └── challenge1b_output.json # Analysis results
├── Collection 3/ # Recipe Collection
│ ├── PDFs/ # Cooking guides
│ ├── challenge1b_input.json # Input configuration
│ └── challenge1b_output.json # Analysis results
└── README.md
- Challenge ID: round_1b_002
- Persona: Travel Planner
- Task: Plan a 4-day trip for 10 college friends to South of France
- Documents: 7 travel guides
- Challenge ID: round_1b_003
- Persona: HR Professional
- Task: Create and manage fillable forms for onboarding and compliance
- Documents: 15 Acrobat guides
- Challenge ID: round_1b_001
- Persona: Food Contractor
- Task: Prepare vegetarian buffet-style dinner menu for corporate gathering
- Documents: 9 cooking guides
{
"challenge_info": {
"challenge_id": "round_1b_XXX",
"test_case_name": "specific_test_case"
},
"documents": [{"filename": "doc.pdf", "title": "Title"}],
"persona": {"role": "User Persona"},
"job_to_be_done": {"task": "Use case description"}
}{
"metadata": {
"input_documents": ["list"],
"persona": "User Persona",
"job_to_be_done": "Task description"
},
"extracted_sections": [
{
"document": "source.pdf",
"section_title": "Title",
"importance_rank": 1,
"page_number": 1
}
],
"subsection_analysis": [
{
"document": "source.pdf",
"refined_text": "Content",
"page_number": 1
}
]
}- Persona-based content analysis
- Importance ranking of extracted sections
- Multi-collection document processing
- Structured JSON output with metadata
Note: This README provides a brief overview of the Challenge 1b solution structure based on available sample data.