---
title: GAIA Agent System
emoji: 🤖
colorFrom: yellow
colorTo: pink
sdk: gradio
sdk_version: 5.25.2
app_file: ./src/app.py
pinned: false
hf_oauth: true
# optional, default duration is 8 hours/480 minutes. Max duration is 30 days/43200 minutes.
hf_oauth_expiration_minutes: 480
# Required scopes for Qwen model access via Inference API
hf_oauth_scopes:
- inference-api
short_description: Multi-Agent AI System for GAIA Benchmark Questions
suggested_hardware: cpu-upgrade
models:
- Qwen/Qwen2.5-7B-Instruct
- Qwen/Qwen2.5-32B-Instruct
- Qwen/Qwen2.5-72B-Instruct
tags:
- GAIA
- multi-agent
- LangGraph
- benchmark
- reasoning
- web-search
- file-processing
- question-answering
---
# 🤖 GAIA Agent System
A multi-agent AI system that answers GAIA benchmark questions, orchestrated with LangGraph.
## Features
- **Multi-Agent Architecture**: Router, Web Research, File Processing, Reasoning, and Synthesizer agents
- **LangGraph Orchestration**: Intelligent workflow management with state tracking
- **Unit 4 API Integration**: Official GAIA benchmark submission and scoring
- **Smart Model Selection**: Tiered Qwen 2.5 models (7B/32B/72B) to balance cost and performance
- **Comprehensive Tools**: Wikipedia search, web scraping, mathematical calculations, file analysis
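
As a rough illustration of the tiered model selection listed above, the sketch below picks one of the three Qwen 2.5 models based on a simple complexity score. It is not the code used by this Space: the function names `estimate_complexity` and `select_model`, the scoring cues, and the thresholds are all assumptions made for the example. Only the model IDs come from the Space metadata.

```python
# Model IDs as listed under `models:` in the Space metadata.
SMALL_MODEL = "Qwen/Qwen2.5-7B-Instruct"
MEDIUM_MODEL = "Qwen/Qwen2.5-32B-Instruct"
LARGE_MODEL = "Qwen/Qwen2.5-72B-Instruct"

# Hypothetical cue words that suggest multi-step reasoning.
REASONING_CUES = ("calculate", "compare", "step by step")


def estimate_complexity(question: str, has_file: bool = False) -> int:
    """Hypothetical heuristic: one point each for a long question,
    an attached file, and a multi-step reasoning cue."""
    score = 0
    if len(question.split()) > 40:
        score += 1
    if has_file:
        score += 1
    if any(cue in question.lower() for cue in REASONING_CUES):
        score += 1
    return score


def select_model(question: str, has_file: bool = False) -> str:
    """Map the complexity score to a model tier (thresholds are assumed)."""
    score = estimate_complexity(question, has_file)
    if score == 0:
        return SMALL_MODEL
    if score == 1:
        return MEDIUM_MODEL
    return LARGE_MODEL


if __name__ == "__main__":
    print(select_model("What is the capital of France?"))
    print(select_model("Calculate the total in the sheet", has_file=True))
```

Under this scheme, short factual questions go to the smallest model, and the 72B model is reserved for questions that combine several complexity signals.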
## Usage
1. **Official GAIA Evaluation**: Log in with Hugging Face and run the complete benchmark
2. **Manual Testing**: Test individual questions with detailed reasoning analysis
3. **File Processing**: Upload and analyze CSV, images, code, and audio files
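
For the file-processing mode, one plausible first step is to sort each upload into one of the four supported categories before handing it to a tool. The sketch below does this by file extension. The `classify_upload` function and the extension sets are hypothetical and only illustrate the idea; the Space may detect file types differently.

```python
from pathlib import Path

# Hypothetical extension sets for the four categories named in the README.
FILE_CATEGORIES = {
    "csv": {".csv"},
    "image": {".png", ".jpg", ".jpeg", ".gif"},
    "code": {".py", ".js", ".ts"},
    "audio": {".mp3", ".wav"},
}


def classify_upload(filename: str) -> str:
    """Return the category for an uploaded file, or "unknown"."""
    suffix = Path(filename).suffix.lower()  # case-insensitive match
    for category, suffixes in FILE_CATEGORIES.items():
        if suffix in suffixes:
            return category
    return "unknown"


if __name__ == "__main__":
    for name in ("data.CSV", "photo.jpeg", "script.py", "clip.wav", "notes.txt"):
        print(name, "->", classify_upload(name))
```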
Check out the configuration reference at <https://huggingface.co/docs/hub/spaces-config-reference>