Commit History

debug: log full transcript content for debugging
4496081

mangubee Claude commited on

feat: add transcript caching and upgrade synthesis model
b68b317

mangubee Claude commited on

debug: log full transcript content for debugging
a1d5de5

mangubee Claude commited on

fix: vision tool cannot process videos, clarify image-only capability
3ee1e2c

mangubee Claude commited on

fix: clarify tool descriptions to prevent LLM selecting vision for YouTube
61ecfdb

mangubee Claude commited on

fix: typo WISPER_MODEL -> WHISPER_MODEL in audio.py
7bb1d7c

mangubee Claude commited on

fix: YouTubeTranscriptApi.fetch is instance method not class method
92346a5

mangubee Claude commited on

fix: youtube transcript API and missing datasets dependency
a170083

mangubee Claude commited on

fix: remove duplicate huggingface-hub, update tenacity version
c9e941b

mangubee Claude commited on

fix: correct google-generativeai package name
fdd1cbb

mangubee Claude commited on

deps: add audio/video processing to pyproject.toml
7a850db

mangubee Claude commited on

ui: move Test & Debug tab after Full Evaluation
c8247b8

mangubee Claude commited on

fix: add youtube_transcript and transcribe_audio to TOOL_FUNCTIONS
40db96d

mangubee Claude commited on

docs: update CHANGELOG for Phase 1 completion
7b6de93

mangubee Claude commited on

feat: Phase 1 - YouTube transcript + Whisper audio transcription
38cc8e4

mangubee Claude commited on

feat: phase1 planning and video processing research
0d77f39

mangubee Claude commited on

Update README.md
2577d6f
verified

mangubee commited on

feat: system error field, calculator fix, target task IDs, course vs GAIA docs
41ac444

mangubee Claude commited on

Change UI default LLM provider from Groq to HuggingFace
7d0cc73

mangubee Claude commited on

Add LLM_PROVIDER to .env.example
7e5d63e

mangubee Claude commited on

Phase 0-2: HF Vision Integration Complete - Google Gemma 3 Selected
2bb2d3d

mangubee Claude commited on

Remove binary test images and update .gitignore
c86df49

mangubee commited on

Add initial implementation for Phase 0 validation of HF Inference API with vision models
630f609

mangubee commited on

Phase 0 Prep: Cache restructure, Vision investigation, HF integration plan
9fb579f

mangubee Claude commited on

Update Dev
d93842c

mangubee commited on

Update Groq model to openai/gpt-oss-120b
44d1862

mangubee commited on

Add Groundtruth Answer Column
dc583a7

mangubee commited on

UI Control for Question Limit
65a5dc6

mangubee commited on

Integrate benchmark dataset with results from HF as groundtruth
9fb23b8

mangubee commited on

Fix Correct-Column in Report
0292109

mangubee commited on

Update JSON export with execution time and correct flags
ff5bca5

mangubee commited on

Async Implementation
94965d6

mangubee commited on

Update CHANGELOG: Document runtime config bug fix
38116d3

mangubee commited on

Fix: Read LLM provider config at runtime for UI updates
91f84b2

mangubee commited on

Add UI-based LLM provider selection for cloud testing
ac73681

mangubee Claude Sonnet 4.5 commited on

Groq Model Update
6e73cae

mangubee commited on

Stage 5: Performance optimization - retry logic, Groq integration, improved prompts
5890f66

mangubee Claude Sonnet 4.5 commited on

Docs: Update CHANGELOG with correct session date and dev records
a98db29

mangubee Claude Sonnet 4.5 commited on

Docs: Recover Stage 4 complete documentation structure
06fc271

mangubee Claude Sonnet 4.5 commited on

Plan: Stage 5 performance optimization strategy
456c236

mangubee Claude Sonnet 4.5 commited on

Docs: Complete Stage 4 wrap-up in dev log
8b043d1

mangubee Claude Sonnet 4.5 commited on

Docs: Update dev log to reflect JSON export implementation
ac31506

mangubee Claude Sonnet 4.5 commited on