# Usage Guide

## Quick Start (5 minutes)

### Step 1: Install Dependencies

```bash
pip install -r requirements.txt
```

### Step 2: Configure API Keys

Copy `.env.example` to `.env` and fill in your API keys:

```bash
cp .env.example .env
# Edit the .env file with your keys
```

Minimum required:

- `HUGGINGFACEHUB_API_TOKEN` (for the LLM)
- `TAVILY_API_KEY` (for web search)
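For reference, a filled-in `.env` looks like this (placeholder values shown):

```
HUGGINGFACEHUB_API_TOKEN=hf_xxxxx
TAVILY_API_KEY=tvly-xxxxx
```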
### Step 3: Set Up the Vector Database

```bash
python setup_chromadb.py
```

This will:

- Download the embedding model (~90 MB)
- Load `metadata.jsonl`
- Create a local ChromaDB database
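If you want to sanity-check the result, you can open the database directly. A minimal sketch, assuming the default `chroma_db/` directory referenced in the troubleshooting section below:

```python
import chromadb

# Open the database persisted by setup_chromadb.py
client = chromadb.PersistentClient(path="chroma_db")

# List the collections that were created (names depend on the setup script)
print(client.list_collections())
```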
### Step 4: Run the Agent

#### Option A: Gradio UI (Recommended)

```bash
python app.py
```

Then:

1. Log in with your HuggingFace account
2. Click "Run Evaluation & Submit All Answers"
3. Wait for the results

#### Option B: Command Line

```bash
python agent.py
```

Edit the question in `agent.py` at line 208.
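That line holds a hardcoded test question; it looks roughly like this (illustrative, check the file for the exact variable name):

```python
# agent.py, line 208 (illustrative)
question = "Who invented the telephone?"
```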
## Understanding the Output

### Agent Messages

The agent produces several message types:

1. **SystemMessage**: Instructions and guidelines
2. **HumanMessage**: Your question, plus a similar example retrieved from the vector database (RAG)
3. **AIMessage**: The agent's reasoning and tool calls
4. **ToolMessage**: Tool execution results
5. **Final AIMessage**: The answer

Example:

```
SystemMessage: You are Alfred, an intelligent research assistant...
HumanMessage: What is quantum computing?
HumanMessage: Here is a similar question... (from vector DB)
AIMessage: I will use the deep_research tool
  tool_calls: [{"name": "deep_research", "args": {"query": "..."}}]
ToolMessage: DEEP RESEARCH REPORT: ...
AIMessage: FINAL ANSWER: Quantum computing is...
```
### Deep Research Report Structure

```
DEEP RESEARCH REPORT: [Query]
=====================================

OVERVIEW
- Total sources: X
- Wikipedia: Y articles
- Web: Z pages
- Academic: W papers

WIKIPEDIA FINDINGS
[Source 1] ...
[Source 2] ...

WEB FINDINGS
[Source 1] ...
[Source 2] ...

ACADEMIC FINDINGS
[Source 1] ...
[Source 2] ...

ALL SOURCES
[1] https://...
[2] https://...
```
## Customization

### Changing the LLM Provider

Edit `agent.py` line 217:

```python
# Option 1: HuggingFace (free, slower)
graph = build_graph(provider="huggingface")

# Option 2: Groq (fast, free tier available)
graph = build_graph(provider="groq")

# Option 3: Google Gemini (balanced, requires payment)
graph = build_graph(provider="google")
```

### Adjusting Deep Research Behavior

Edit `deep_research_tool.py`:

```python
# Line 54: Wikipedia results
WikipediaLoader(query=query, load_max_docs=2)  # Change to 1-5

# Line 68: Web results
TavilySearchResults(max_results=10)  # Change to 3-15

# Line 85: Academic results
ArxivLoader(query=query, load_max_docs=5)  # Change to 1-10

# Lines 58, 75, 89: Content truncation
"content": doc.page_content[:2000]  # Change to 500-3000
```

### Modifying the System Prompt

Edit `system_prompt.txt` (sketch below) to:

- Change the tool selection strategy
- Adjust the reasoning guidelines
- Modify the output format
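As a sketch, tool-selection rules in `system_prompt.txt` might read like this (wording is illustrative, not the shipped prompt; tool names and the `FINAL ANSWER:` format are taken from the examples above):

```
Use wiki_search for simple factual lookups.
Use deep_research for complex or multi-part questions.
Always end your reply with "FINAL ANSWER: <answer>".
```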
## Testing Different Question Types

### Mathematical Questions

```python
question = "What is 15 multiplied by 23?"
# Expected: uses multiply tool → direct answer
```

### Simple Factual Questions

```python
question = "Who invented the telephone?"
# Expected: uses wiki_search → quick answer
```

### Complex Conceptual Questions

```python
question = "Explain quantum entanglement and its applications"
# Expected: uses deep_research → comprehensive answer
```

### Recent Events

```python
question = "What are the latest AI developments in 2025?"
# Expected: uses web_search or deep_research
```
## Troubleshooting

### Issue: "No module named 'sentence_transformers'"

```bash
pip install sentence-transformers
```

### Issue: "TAVILY_API_KEY not found"

Make sure the `.env` file exists and contains:

```
TAVILY_API_KEY=tvly-xxxxx
```

### Issue: ChromaDB not working

Delete and recreate the database:

```bash
rm -rf chroma_db/
python setup_chromadb.py
```

### Issue: Agent not using deep_research

Check `system_prompt.txt` and make sure it describes when to use deep_research. Or be explicit in your question:

```python
question = "Use deep research to analyze quantum computing"
```
## Performance Optimization

### Speed vs. Quality Trade-offs

**Faster** (for testing):

```python
# deep_research_tool.py
WikipediaLoader(load_max_docs=1)
TavilySearchResults(max_results=3)
ArxivLoader(load_max_docs=1)
"content": doc.page_content[:500]
```

**Balanced** (recommended):

```python
WikipediaLoader(load_max_docs=2)
TavilySearchResults(max_results=10)
ArxivLoader(load_max_docs=5)
"content": doc.page_content[:2000]
```

**Comprehensive** (slower but thorough):

```python
WikipediaLoader(load_max_docs=5)
TavilySearchResults(max_results=15)
ArxivLoader(load_max_docs=10)
"content": doc.page_content[:5000]
```
## Advanced Usage

### Adding Custom Tools

1. Define the tool in `agent.py`:

```python
@tool
def my_custom_tool(query: str) -> str:
    """Description of what this tool does."""
    # Your implementation
    return result
```

2. Add it to the tools list:

```python
tools = [
    multiply, add, subtract,
    wiki_search, web_search,
    deep_research,
    my_custom_tool,  # Add here
]
```

3. Update `system_prompt.txt` to mention when to use it (example below).
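For instance, you might append a line like this (illustrative wording):

```
Use my_custom_tool when the question involves your domain's data.
```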
### Batch Processing

```python
from agent import build_graph
from langchain_core.messages import HumanMessage

graph = build_graph(provider="huggingface")

questions = [
    "Question 1",
    "Question 2",
    "Question 3",
]

for q in questions:
    result = graph.invoke({"messages": [HumanMessage(content=q)]})
    answer = result["messages"][-1].content
    print(f"Q: {q}\nA: {answer}\n")
```
### Logging and Debugging

Add logging to track agent behavior:

```python
# In agent.py, modify the assistant function:
def assistant(state: MessagesState):
    """Assistant node"""
    print(f"\n{'='*60}")
    print("Assistant Input:")
    for msg in state["messages"]:
        print(f"  - {type(msg).__name__}: {msg.content[:100]}...")
    result = llm_with_tools.invoke(state["messages"])
    print("\nAssistant Output:")
    if hasattr(result, "tool_calls") and result.tool_calls:
        print(f"  Tool calls: {[tc['name'] for tc in result.tool_calls]}")
    print(f"{'='*60}\n")
    return {"messages": [result]}
```
## FAQ

**Q: How do I know which tool was used?**

A: Check the AIMessage for the `tool_calls` field, or add logging as shown above.
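For a programmatic check, a minimal sketch that walks the message history after a run (reusing `result` from the batch-processing example):

```python
# Print every tool call made during the run
for msg in result["messages"]:
    calls = getattr(msg, "tool_calls", None)
    if calls:
        print(f"{type(msg).__name__} -> {[tc['name'] for tc in calls]}")
```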
**Q: Can I use this without the Tavily API?**

A: Yes, but web_search will fail outright and deep_research will be missing its web findings. Consider removing both from the tools list.
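A sketch of that change, keeping only the tools that do not call Tavily (tool names taken from the list in "Adding Custom Tools" above):

```python
# agent.py: drop the Tavily-backed tools
tools = [
    multiply, add, subtract,
    wiki_search,
]
```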
**Q: How long does setup take?**

A: About 5 minutes, most of it spent downloading the embedding model.

**Q: Can I run this offline?**

A: No. It requires API calls to the LLM and search services.

**Q: How much does it cost?**

A: The HuggingFace Inference API is free (with rate limits), and Tavily has a free tier (1,000 queries/month).

## Next Steps

1. Test with different question types
2. Optimize performance for your use case
3. Customize the system prompt
4. Add domain-specific tools
5. Integrate the agent with your application

For more details, see the full documentation in the main repository.