---
title: GAIA Agent System
emoji: 🤖
colorFrom: indigo
colorTo: indigo
sdk: gradio
sdk_version: 5.25.2
app_file: ./src/app.py
pinned: false
hf_oauth: true
hf_oauth_expiration_minutes: 480
short_description: Multi-Agent AI System for GAIA Benchmark Questions
suggested_hardware: cpu-upgrade
models:
  - Qwen/Qwen2.5-7B-Instruct
  - Qwen/Qwen2.5-32B-Instruct
  - Qwen/Qwen2.5-72B-Instruct
tags:
  - GAIA
  - multi-agent
  - LangGraph
  - benchmark
  - reasoning
  - web-search
  - file-processing
  - question-answering
---

# 🤖 GAIA Agent System

A multi-agent AI system that answers GAIA benchmark questions, orchestrated with LangGraph.

## Features

- **Multi-Agent Architecture**: Router, Web Research, File Processing, Reasoning, and Synthesizer agents
- **LangGraph Orchestration**: Workflow management with state tracking
- **Unit 4 API Integration**: Official GAIA benchmark submission and scoring
- **Smart Model Selection**: Tiered Qwen 2.5 models (7B/32B/72B) to balance cost and performance
- **Comprehensive Tools**: Wikipedia search, web scraping, mathematical calculations, and file analysis

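The routing and tiered model selection described above can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: the model IDs come from the Space metadata, but the tier names, the complexity thresholds, the keyword heuristics, and the functions `select_model` and `route_question` are all assumptions made for this example. The real system uses LangGraph for orchestration, which is left out here to keep the sketch dependency-free.

```python
# Model IDs are taken from the Space metadata. The tier names and the
# thresholds used below are illustrative assumptions, not project values.
MODEL_TIERS = {
    "small": "Qwen/Qwen2.5-7B-Instruct",
    "medium": "Qwen/Qwen2.5-32B-Instruct",
    "large": "Qwen/Qwen2.5-72B-Instruct",
}


def select_model(complexity: float) -> str:
    """Map a complexity score in [0, 1] to a model ID (hypothetical thresholds)."""
    if not 0.0 <= complexity <= 1.0:
        raise ValueError("complexity must be between 0 and 1")
    if complexity < 0.3:
        return MODEL_TIERS["small"]
    if complexity < 0.7:
        return MODEL_TIERS["medium"]
    return MODEL_TIERS["large"]


def route_question(question: str, has_file: bool = False) -> list[str]:
    """Choose which agents to run, using simple keyword heuristics (assumed)."""
    text = question.lower()
    agents = []
    if has_file:
        agents.append("file_processing")
    if any(key in text for key in ("who", "when", "where", "wikipedia", "latest")):
        agents.append("web_research")
    if any(key in text for key in ("calculate", "how many", "sum", "average")):
        agents.append("reasoning")
    if not agents:
        # Fall back to the reasoning agent when no heuristic matches.
        agents.append("reasoning")
    # The synthesizer always runs last to combine the other agents' results.
    agents.append("synthesizer")
    return agents
```

In this sketch a cheap model would handle routing-level decisions while harder questions escalate to the larger tiers; the cut-off values would need tuning against real benchmark runs.
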
## Usage

1. **Official GAIA Evaluation**: Log in with Hugging Face and run the complete benchmark
2. **Manual Testing**: Test individual questions with detailed reasoning analysis
3. **File Processing**: Upload and analyze CSV, image, code, and audio files

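As a rough illustration of the file-processing step, an uploaded file could first be classified by extension and then handed to a type-specific analyzer. The extension map and the helpers `classify_file` and `summarize_csv` are hypothetical names for this sketch; the file types the app actually supports, and how it analyzes them, may differ.

```python
import csv
import io
from pathlib import Path

# Hypothetical extension map covering the four file kinds named above.
FILE_KINDS = {
    ".csv": "csv",
    ".png": "image",
    ".jpg": "image",
    ".jpeg": "image",
    ".py": "code",
    ".js": "code",
    ".mp3": "audio",
    ".wav": "audio",
}


def classify_file(filename: str) -> str:
    """Return the file kind for a filename, or "unknown" if unrecognized."""
    return FILE_KINDS.get(Path(filename).suffix.lower(), "unknown")


def summarize_csv(text: str) -> dict:
    """Return the column names and data-row count of CSV text."""
    rows = list(csv.reader(io.StringIO(text)))
    header = rows[0] if rows else []
    # The first row is treated as the header, so it is not counted as data.
    return {"columns": header, "row_count": max(len(rows) - 1, 0)}
```

For example, `classify_file("report.csv")` yields `"csv"`, after which `summarize_csv` could give an agent a compact description of the table before any deeper analysis.
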
Check out the configuration reference at <https://huggingface.co/docs/hub/spaces-config-reference>