title: Template Final Assignment
emoji: 🕵🏻‍♂️
colorFrom: indigo
colorTo: indigo
sdk: gradio
sdk_version: 5.25.2
app_file: app.py
pinned: false
hf_oauth: true
hf_oauth_expiration_minutes: 480
short_description: Gemini Agent for GAIA Evaluation
🧠 Gemini Agent for GAIA Evaluation
This project contains a Gemini-powered CodeAgent built with smolagents for use in the GAIA Unit 4 Evaluation on Hugging Face Spaces.
🧑‍🏫 This app is part of the Hugging Face Courses series and was developed as an educational project to showcase how machine learning can be used in real-world event scenarios. Learn more at Hugging Face Courses.
Features
- Uses the Gemini 2.0 Flash model via LiteLLMModel
- Equipped with essential tools:
  - DuckDuckGoSearchTool for quick lookups
  - RunPythonFileTool for executing .py scripts
  - ReverseTextTool for decoding reversed questions
  - download_server for fetching files from URLs
  - Base tools (math, string manipulation, etc.)
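As an illustration of what decoding a reversed question involves, here is a minimal sketch in plain Python. The function name `reverse_text` is hypothetical, and this README does not show how ReverseTextTool is actually implemented, so treat this only as the underlying idea.

```python
def reverse_text(text: str) -> str:
    """Return the characters of `text` in reverse order."""
    return text[::-1]


# A question written backwards becomes readable once reversed.
scrambled = "?ecnetnes siht daer uoy naC"
print(reverse_text(scrambled))  # prints "Can you read this sentence?"
```

Reversing twice returns the original string, so a helper like this is safe to apply when testing whether a prompt reads more naturally backwards.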
Evaluation Strategy
The agent reads questions from the GAIA evaluation endpoint, applies reasoning using a system prompt with strict guidelines, and submits answers back for scoring.
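The read, solve, and submit cycle described above might be sketched as follows, with no network calls. The helper names (`answer_all`, `build_submission`) and the field names (`task_id`, `question`, `submitted_answer`, `username`, `agent_code`) are assumptions made for illustration; check app.py and the evaluation endpoint for the actual schema.

```python
from typing import Callable


def answer_all(questions: list[dict], solve: Callable[[str], str]) -> list[dict]:
    """Run `solve` on every question and collect one answer per task."""
    answers = []
    for item in questions:
        try:
            answer = solve(item["question"])
        except Exception as exc:
            # Record the failure and continue so one bad task does not stop the run.
            answer = f"ERROR: {exc}"
        answers.append({"task_id": item["task_id"], "submitted_answer": str(answer)})
    return answers


def build_submission(username: str, agent_code: str, answers: list[dict]) -> dict:
    """Bundle the answers into a single payload (assumed field names)."""
    return {"username": username, "agent_code": agent_code, "answers": answers}


# Stand-in solver; the real agent call would go here.
demo_questions = [{"task_id": "t1", "question": "What is 2 + 2?"}]
payload = build_submission("demo-user", "demo-code-link", answer_all(demo_questions, lambda q: "4"))
print(payload)
```

Catching exceptions per task is a design choice in this sketch: a partial submission is usually more useful than none when a single question makes the agent fail.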
🛠️ Setup
Clone this repository or Space
Set your environment variables:
GEMINI_API_KEY=your_api_key_here
SPACE_ID=your_hf_space_id
Install dependencies:
pip install -r requirements.txt
Run locally:
python app.py
Or launch directly via Hugging Face Spaces.
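Before running locally, it can help to confirm that the environment variables from the setup steps are present. The snippet below is a small sketch using only the standard library. The helper `missing_settings` is hypothetical, and it assumes the app needs both GEMINI_API_KEY and SPACE_ID, which may not hold for every deployment.

```python
import os
from typing import Mapping

# Variable names taken from the setup steps in this README.
REQUIRED_VARS = ("GEMINI_API_KEY", "SPACE_ID")


def missing_settings(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the names of required variables that are unset or empty."""
    return [name for name in REQUIRED_VARS if not env.get(name)]


missing = missing_settings()
if missing:
    print("Missing environment variables:", ", ".join(missing))
else:
    print("Environment looks complete.")
```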
🧪 Evaluation Flow
- Log in to Hugging Face through the UI
- Click “Run Evaluation & Submit All Answers”
- The agent will fetch tasks, solve them, and submit results
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference