Update default Scientist model to claude-haiku-4-5-20251001 3c84c29 Running Ayush Ojha commited on 5 days ago
Fix health endpoint to report env=real (built-in env is fully functional) 0508316 Ayush Ojha commited on 5 days ago
Fix TypeScript build error: remove undefined suggestScientistAction call f24a42e Ayush Ojha commited on 6 days ago
Merge pull request #14 from Ayush10/feature/scoring-improvements cd197ac unverified Ayush Ojha commited on 6 days ago
Add 12 scoring & environment improvements with full test coverage 2b98daa ayushozha Claude Opus 4.6 commited on 6 days ago
Fix gpt-5.4 incompatibility: use max_completion_tokens instead of max_tokens b1cce18 maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Add fixed GRPO training notebook (grpo_training2.ipynb) 21e1fa1 maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Switch Oracle judge to OpenAI (gpt-5.4), support both backends 9ee3fcd maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Fix scientist inference and wire Oracle LLM judge af6803d maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Wire fine-tuned Qwen3.5 LoRA checkpoint into scientist suggest endpoint 2d56484 maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Add clean training summary cell to GRPO notebook cb80a59 maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Add GRPO fine-tuning notebook for Qwen3.5-0.8B a85a0ef maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Use generic regex for CORS instead of hardcoded deployment URLs d3db98a maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Update HF Space: final architecture diagram, frontend, and README 65e91b3 ayushozha Claude Opus 4.6 commited on 7 days ago
Add Northflank CORS origins for production deployment 39ecbed maxxie114 Claude Sonnet 4.6 commited on 7 days ago
Close all Person D tasks: README enhancements, docs, UI close-out (152/152 = 100%) f0d1d76 ayushozha Claude Opus 4.6 commited on 7 days ago
Close DOC 08: repo hygiene verified, Max 100% complete (41/41) 7a753de ayushozha commited on 7 days ago
Add GPU training infrastructure and 50 research paper corpus 93b73ad ayushozha Claude Opus 4.6 commited on 7 days ago
Add API 19 /web fallback route, merge Kush frontend, close UI 07 b001a03 ayushozha commited on 7 days ago
Add ENV 09 disk persistence, OBS 07/09, TST 11 audit tests, close 10 Max tasks 11faa95 ayushozha commited on 7 days ago
Add MOD 08 schema tests, V2 training stack, and close MOD 08/JDG 07/API 01/OBS 02 685783a ayushozha commited on 7 days ago
Add JDG 07: reward breakdown logging to CSV and JSONL per episode 82805bf ayushozha commited on 7 days ago
Add Person D docs, README results, episode ID, demo script, and checklists 32737e6 Kush commited on 7 days ago
Add deterministic judge scoring engine (JDG 01-03) e50dca9 ayushozha Claude Opus 4.6 commited on 8 days ago
Add living project map documenting all modules and relationships 0ed9084 ayushozha Claude Opus 4.6 commited on 8 days ago
Fix baseline scientist false revision on accepted protocols 20510e3 ayushozha Claude Opus 4.6 commited on 8 days ago
Add AGT 04/05/07 implementations, server integration, and doc updates 7c2246c ayushozha Claude Opus 4.6 commited on 8 days ago