gaia / README.md
bstraehle's picture
Update README.md
9d9b4d9 verified
---
title: General AI Assistant - MCP Client & Server
emoji: 🤖
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 6.0.1
app_file: app.py
pinned: true
license: apache-2.0
short_description: Solves GAIA Benchmark & Humanity's Last Exam problems
tags:
- building-mcp-track-consumer
- mcp-in-action-track-consumer
---
# General AI Assistant
Prototype **multi-agent AI platform** with high autonomy, including code generation & execution, browser automation, and multi-modal reasoning. The system can solve multiple <a href="https://arxiv.org/pdf/2311.12983">GAIA Benchmark</a> Level 1, 2, 3 and even <a href="https://arxiv.org/pdf/2501.14249">Humanity's Last Exam</a> problems.
<video controls src="https://bstraehle.s3.us-west-2.amazonaws.com/gaia.mp4"></video>
## Architecture
<a href="https://bstraehle.s3.us-west-2.amazonaws.com/gaia.png">Agents & tools</a>
## Stack
- <a href="https://huggingface.co/">Hugging Face</a> (Spaces & Datasets)
- <a href="https://www.gradio.app/">Gradio</a> (UI, API & MCP Server)
- <a href="https://www.crewai.com/">CrewAI</a> (Agent Framework)
- <a href="https://phoenix.arize.com/">Arize AI</a> (Observability)
- <a href="https://www.postman.com/ai-engineer/generative-ai-apis/overview">Postman</a> (API & MCP Client - filter by GAIA)</p>
## Models
- <a href="https://ai.google/get-started/our-models/">Gemini</a> (gemini-3-pro-preview & gemini-2.5-pro - fallback on daily rate limit)
- <a href="https://platform.openai.com/docs/models">OpenAI</a> (gpt-4.1)
- <a href="https://docs.anthropic.com/en/docs/about-claude/models">Anthropic</a> (claude-sonnet-4-5-latest)
## Social
[LinkedIn](https://www.linkedin.com/posts/activity-7400926611958706176-HuPV)
## Author
[Bernd Straehle](https://huggingface.co/bstraehle)