Spaces:
Running
Running
| title: General AI Assistant - MCP Client & Server | |
| emoji: 🤖 | |
| colorFrom: gray | |
| colorTo: gray | |
| sdk: gradio | |
| sdk_version: 6.0.1 | |
| app_file: app.py | |
| pinned: true | |
| license: apache-2.0 | |
| short_description: Solves GAIA Benchmark & Humanity's Last Exam problems | |
| tags: | |
| - building-mcp-track-consumer | |
| - mcp-in-action-track-consumer | |
| # General AI Assistant | |
| Prototype **multi-agent AI platform** with high autonomy, including code generation & execution, browser automation, and multi-modal reasoning. The system can solve multiple <a href="https://arxiv.org/pdf/2311.12983">GAIA Benchmark</a> Level 1, 2, 3 and even <a href="https://arxiv.org/pdf/2501.14249">Humanity's Last Exam</a> problems. | |
| <video controls src="https://bstraehle.s3.us-west-2.amazonaws.com/gaia.mp4"></video> | |
| ## Architecture | |
| <a href="https://bstraehle.s3.us-west-2.amazonaws.com/gaia.png">Agents & tools</a> | |
| ## Stack | |
| - <a href="https://huggingface.co/">Hugging Face</a> (Spaces & Datasets) | |
| - <a href="https://www.gradio.app/">Gradio</a> (UI, API & MCP Server) | |
| - <a href="https://www.crewai.com/">CrewAI</a> (Agent Framework) | |
| - <a href="https://phoenix.arize.com/">Arize AI</a> (Observability) | |
| - <a href="https://www.postman.com/ai-engineer/generative-ai-apis/overview">Postman</a> (API & MCP Client - filter by GAIA)</p> | |
| ## Models | |
| - <a href="https://ai.google/get-started/our-models/">Gemini</a> (gemini-3-pro-preview & gemini-2.5-pro - fallback on daily rate limit) | |
| - <a href="https://platform.openai.com/docs/models">OpenAI</a> (gpt-4.1) | |
| - <a href="https://docs.anthropic.com/en/docs/about-claude/models">Anthropic</a> (claude-sonnet-4-5-latest) | |
| ## Social | |
| [LinkedIn](https://www.linkedin.com/posts/activity-7400926611958706176-HuPV) | |
| ## Author | |
| [Bernd Straehle](https://huggingface.co/bstraehle) |