Spaces:
Sleeping
Sleeping
metadata
title: Template Final Assignment
emoji: ๐ต๐ปโโ๏ธ
colorFrom: indigo
colorTo: indigo
sdk: gradio
sdk_version: 5.25.2
app_file: app.py
pinned: false
hf_oauth: true
hf_oauth_expiration_minutes: 480
short_description: Gemini Agent for GAIA Evaluation
๐ง Gemini Agent for GAIA Evaluation
This project contains a Gemini-powered CodeAgent built with smolagents for use in the GAIA Unit 4 Evaluation on Hugging Face Spaces.
๐ Features
Uses the Gemini 2.0 Flash model via
LiteLLMModelEquipped with essential tools:
DuckDuckGoSearchToolfor quick lookupsRunPythonFileToolfor executing.pyscriptsReverseTextToolfor decoding reversed questionsdownload_serverfor fetching files from URLs- Base tools (math, string manipulation, etc.)
๐ Evaluation Strategy
The agent reads questions from the GAIA evaluation endpoint, applies reasoning using a system prompt with strict guidelines, and submits answers back for scoring.
๐ ๏ธ Setup
Clone this repository or Space
Set your environment variables:
GEMINI_API_KEY=your_api_key_here SPACE_ID=your_hf_space_idInstall dependencies:
pip install -r requirements.txtRun locally:
python app.py
Or launch directly via Hugging Face Spaces.
๐งช Evaluation Flow
- Log in to Hugging Face through the UI
- Click โRun Evaluation & Submit All Answersโ
- The agent will fetch tasks, solve them, and submit results
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference