GAIA-Agent-Unit4 / README.md
VoicesColeby's picture
fix: bump gradio to 5.49 (5.0 imports removed HfFolder)
5e2f3c3 verified
|
Raw
History Blame Contribute Delete
1.81 kB

A newer version of the Gradio SDK is available: 6.19.0

Upgrade
metadata
title: GAIA Unit 4 Agent
emoji: πŸ¦‡
colorFrom: indigo
colorTo: red
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
python_version: '3.12'
hf_oauth: true
hf_oauth_expiration_minutes: 480
pinned: false
license: apache-2.0
tags:
  - agent
  - smolagents
  - agents-course
  - unit-4
  - gaia

Unit 4 β€” GAIA Level-1 Final Project

Gradio Space that fetches the 20 official GAIA-Level-1 questions from agents-course-unit4-scoring.hf.space, runs a smolagents CodeAgent on each, and submits the answers to the Students leaderboard.

Agent

CodeAgent (Qwen 2.5-Coder-32B via HF Inference Providers) with:

Tool Purpose
web_search (DDG) Fetch external information.
visit_webpage Read & strip a specific URL.
python_interpreter Run sandboxed Python for math / parsing.
download_task_file Pull files associated with a task_id.
final_answer Emit the short, exact-match answer.

additional_authorized_imports covers stdlib + pandas / numpy for Level-1 spreadsheet-y questions.

Output format reminder

The submit endpoint compares answers as EXACT MATCH. The agent's system hint enforces:

  • No "FINAL ANSWER:" prefix.
  • Numbers as digits, no units unless asked.
  • Lists comma-separated.
  • Dates as the question requests.

Deploy

# After cloning the Space repo:
huggingface-cli login
# upload via HfApi.upload_file or git push

Then click Sign in with Hugging Face, then πŸš€ Run + Submit. The score and per-question transcript appear inline.

Scoring tier

Score Status
< 30% Try again β€” debug failure cases in the transcript.
β‰₯ 30% πŸŽ“ Certificate of Excellence eligible β€” claim at https://huggingface.co/spaces/agents-course/Unit4-Final-Certificate