GAIA Benchmark Agent with LangGraph
This project implements an agent using the LangGraph framework to solve the GAIA benchmark tasks provided by the Hugging Face Agents Course (Unit 4).
✅ Uses LangChain + LangGraph
✅ Handles 20 benchmark questions
✅ Submits answers for scoring via the Hugging Face leaderboard
How to run
- Clone this repo and push it to a public Hugging Face Space.
- Install the dependencies.
- Launch the Space and log in to Hugging Face.
- Click "Run Evaluation & Submit All Answers" to evaluate your agent.
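For orientation, the submit step can be pictured as collecting one answer per task and packaging them into a single request body. The sketch below is only an illustration: `build_submission` is a hypothetical helper, and the field names (`username`, `agent_code`, `task_id`, `submitted_answer`) are assumptions based on the course's starter template, so check `api_client.py` for what this project actually sends.

```python
from typing import Dict


def build_submission(username: str, agent_code: str, answers: Dict[str, str]) -> dict:
    """Package per-task answers into one submission body.

    Hypothetical helper: the field names are assumed, not confirmed by this repo.
    """
    return {
        "username": username.strip(),
        # Link to the Space's code, so the submission can be traced to its source.
        "agent_code": agent_code,
        # One entry per benchmark question, keyed by its task id.
        "answers": [
            {"task_id": task_id, "submitted_answer": answer}
            for task_id, answer in answers.items()
        ],
    }


payload = build_submission(" alice ", "https://example.com/code", {"t1": "Paris"})
```

No network call is made here; the sketch only shows the shape of the data that the button press would send.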
Project files
- main.py: Gradio app runner
- graph_builder.py: Builds the LangGraph pipeline
- llm_node.py: Defines LLM node logic
- formatter_node.py: Cleans up answers for exact-match submission
- api_client.py: Wraps GAIA API interactions
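Because answers are scored by exact match, the formatter step matters as much as the model's reasoning. The following is a minimal sketch of what such a cleanup could look like, not the code in `formatter_node.py`: `format_answer` is a hypothetical name, and the specific rules (dropping an answer label, keeping the first line, trimming quotes and a trailing period) are assumptions about what usually breaks exact-match comparison.

```python
import re


def format_answer(raw: str) -> str:
    """Reduce a raw model reply to a bare answer string (illustrative rules only)."""
    text = raw.strip()
    # Drop a leading "FINAL ANSWER:" or "Answer:" label if the model added one.
    text = re.sub(
        r"^(final\s+answer|answer)\s*[:\-]\s*", "", text, flags=re.IGNORECASE
    )
    # Keep only the first line; anything after it is treated as explanation.
    text = text.splitlines()[0] if text else ""
    # Remove surrounding quotes and a trailing period.
    text = text.strip().strip("\"'").rstrip(".").strip()
    # Collapse runs of whitespace into single spaces.
    return re.sub(r"\s+", " ", text)


print(format_answer("FINAL ANSWER: Paris."))  # prints: Paris
```

In a LangGraph pipeline, a function like this would typically run as the last node, taking the LLM node's output from the graph state and writing back the cleaned string.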
Good luck!