derkaal
/

FinalSubmission

Model card Files Files and versions

FinalSubmission / README.md

derkaal's picture

Upload folder using huggingface_hub

c7eca3d verified 8 months ago

|

history blame contribute delete

3.52 kB

	---
	title: GAIA Benchmark Agent
	emoji: 🧠
	colorFrom: blue
	colorTo: indigo
	sdk: gradio
	sdk_version: 5.25.2
	app_file: app.py
	pinned: false
	hf_oauth: true
	hf_oauth_expiration_minutes: 480
	---

	# GAIA Benchmark Agent

	This Hugging Face Space hosts a GAIA (General AI Assistant) benchmark agent designed to solve certification challenges across various domains of AI and machine learning.

	## Features

	- Processes questions from the GAIA benchmark
	- Uses LangChain and OpenAI's language models
	- Analyzes questions and identifies their types
	- Retrieves relevant context when needed
	- Generates accurate, well-reasoned answers
	- Integrates with external information sources:
	- SerpAPI for real-time web search capabilities
	- YouTube for video content search and transcript analysis
	- Tavily for AI-optimized search results
	- Audio processing for speech-to-text conversion and analysis

	## Usage

	1. Log in to your Hugging Face account using the button
	2. Click 'Run Evaluation & Submit All Answers' to:
	- Fetch questions from the GAIA benchmark
	- Run the agent on all questions
	- Submit answers and see your score

	## Implementation Details

	The agent uses a modular architecture with specialized handlers for different question types:
	- Factual knowledge questions
	- Technical implementation questions
	- Mathematical questions
	- Context-based analysis questions
	- Ethical/societal impact questions
	- Media content questions (videos, podcasts, audio recordings)
	- Current events questions
	- Categorization questions with enhanced botanical classification

	### Botanical Classification

	The agent has been enhanced with comprehensive botanical classification capabilities, allowing it to:
	- Accurately distinguish between botanical fruits and vegetables
	- Provide detailed explanations of botanical classifications
	- Correctly identify commonly misclassified items (tomatoes, bell peppers, cucumbers, etc.)
	- Explain the difference between botanical and culinary classifications

	### External Information Sources

	The agent can access external information to provide more accurate and up-to-date answers:

	- SerpAPI Integration: Enables real-time web search capabilities for current events and factual information
	- YouTube Integration:
	- Search for relevant videos on specific topics
	- Extract and analyze video transcripts for information
	- Tavily Search: AI-optimized search engine that provides relevant results for complex queries

	### Audio Processing Capabilities

	The agent has been enhanced with audio processing capabilities, allowing it to:
	- Transcribe audio files using OpenAI's Whisper API with Google Speech Recognition fallback
	- Extract ingredients from recipe audio recordings
	- Process and analyze spoken content from various audio formats
	- Format responses according to user requests for audio content

	### API Keys Configuration

	To use the external information sources, you need to set the following API keys in your environment:
	- `SERPAPI_API_KEY`: For web search capabilities
	- `YOUTUBE_API_KEY`: For YouTube video search and transcript analysis
	- `TAVILY_API_KEY`: For AI-optimized search results
	- `WHISPER_API_KEY`: For audio transcription (defaults to OPENAI_API_KEY if not set)

	## Repository

	The code for this agent is available at: https://huggingface.co/derkaal/GAIA-agent

	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference