Chris commited on
Commit
3779577
·
1 Parent(s): d8abd6c

Final 4.2.1

Browse files
Files changed (1) hide show
  1. README.md +30 -1
README.md CHANGED
@@ -10,11 +10,40 @@ pinned: false
10
  hf_oauth: true
11
  # optional, default duration is 8 hours/480 minutes. Max duration is 30 days/43200 minutes.
12
  hf_oauth_expiration_minutes: 480
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ---
14
 
15
  # 🤖 GAIA Agent System
16
 
17
  Advanced Multi-Agent AI System for GAIA Benchmark Questions using LangGraph orchestration.
18
 
19
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
 
 
10
  hf_oauth: true
11
  # optional, default duration is 8 hours/480 minutes. Max duration is 30 days/43200 minutes.
12
  hf_oauth_expiration_minutes: 480
13
+ short_description: Advanced Multi-Agent AI System for GAIA Benchmark Questions using LangGraph orchestration with specialized agents for web research, file processing, and mathematical reasoning.
14
+ suggested_hardware: cpu-upgrade
15
+ models:
16
+ - Qwen/Qwen2.5-7B-Instruct
17
+ - Qwen/Qwen2.5-32B-Instruct
18
+ - Qwen/Qwen2.5-72B-Instruct
19
+ tags:
20
+ - GAIA
21
+ - multi-agent
22
+ - LangGraph
23
+ - benchmark
24
+ - reasoning
25
+ - web-search
26
+ - file-processing
27
+ - question-answering
28
  ---
29
 
30
  # 🤖 GAIA Agent System
31
 
32
  Advanced Multi-Agent AI System for GAIA Benchmark Questions using LangGraph orchestration.
33
 
34
+ ## Features
35
+
36
+ - **Multi-Agent Architecture**: Router, Web Research, File Processing, Reasoning, and Synthesizer agents
37
+ - **LangGraph Orchestration**: Intelligent workflow management with state tracking
38
+ - **Unit 4 API Integration**: Official GAIA benchmark submission and scoring
39
+ - **Smart Model Selection**: Tiered Qwen 2.5 models (7B/32B/72B) for optimal cost/performance
40
+ - **Comprehensive Tools**: Wikipedia search, web scraping, mathematical calculations, file analysis
41
+
42
+ ## Usage
43
+
44
+ 1. **Official GAIA Evaluation**: Login with HuggingFace and run complete benchmark
45
+ 2. **Manual Testing**: Test individual questions with detailed reasoning analysis
46
+ 3. **File Processing**: Upload and analyze CSV, images, code, and audio files
47
+
48
+ Check out the configuration reference at <https://huggingface.co/docs/hub/spaces-config-reference>
49