Update default Scientist model to claude-haiku-4-5-20251001 3c84c29 Ayush Ojha committed on Mar 10
Fix health endpoint to report env=real (built-in env is fully functional) 0508316 Ayush Ojha committed on Mar 10
Fix TypeScript build error: remove undefined suggestScientistAction call f24a42e Ayush Ojha committed on Mar 10
Merge pull request #14 from Ayush10/feature/scoring-improvements cd197ac unverified Ayush Ojha committed on Mar 9
Add 12 scoring & environment improvements with full test coverage 2b98daa ayushozha Claude Opus 4.6 committed on Mar 9
Fix gpt-5.4 incompatibility: use max_completion_tokens instead of max_tokens b1cce18 maxxie114 Claude Sonnet 4.6 committed on Mar 8
Add fixed GRPO training notebook (grpo_training2.ipynb) 21e1fa1 maxxie114 Claude Sonnet 4.6 committed on Mar 8
Switch Oracle judge to OpenAI (gpt-5.4), support both backends 9ee3fcd maxxie114 Claude Sonnet 4.6 committed on Mar 8
Fix scientist inference and wire Oracle LLM judge af6803d maxxie114 Claude Sonnet 4.6 committed on Mar 8
Wire fine-tuned Qwen3.5 LoRA checkpoint into scientist suggest endpoint 2d56484 maxxie114 Claude Sonnet 4.6 committed on Mar 8
Add clean training summary cell to GRPO notebook cb80a59 maxxie114 Claude Sonnet 4.6 committed on Mar 8
Use generic regex for CORS instead of hardcoded deployment URLs d3db98a maxxie114 Claude Sonnet 4.6 committed on Mar 8
Update HF Space: final architecture diagram, frontend, and README 65e91b3 ayushozha Claude Opus 4.6 committed on Mar 8
Add Northflank CORS origins for production deployment 39ecbed maxxie114 Claude Sonnet 4.6 committed on Mar 8
Close all Person D tasks: README enhancements, docs, UI close-out (152/152 = 100%) f0d1d76 ayushozha Claude Opus 4.6 committed on Mar 8
Add GPU training infrastructure and 50 research paper corpus 93b73ad ayushozha Claude Opus 4.6 committed on Mar 8
Add ENV 09 disk persistence, OBS 07/09, TST 11 audit tests, close 10 Max tasks 11faa95 ayushozha committed on Mar 8
Add MOD 08 schema tests, V2 training stack, and close MOD 08/JDG 07/API 01/OBS 02 685783a ayushozha committed on Mar 8
Add JDG 07: reward breakdown logging to CSV and JSONL per episode 82805bf ayushozha committed on Mar 8
Add Person D docs, README results, episode ID, demo script, and checklists 32737e6 Kush committed on Mar 8
Add deterministic judge scoring engine (JDG 01-03) e50dca9 ayushozha Claude Opus 4.6 committed on Mar 8
Add living project map documenting all modules and relationships 0ed9084 ayushozha Claude Opus 4.6 committed on Mar 8
Fix baseline scientist false revision on accepted protocols 20510e3 ayushozha Claude Opus 4.6 committed on Mar 8
Add AGT 04/05/07 implementations, server integration, and doc updates 7c2246c ayushozha Claude Opus 4.6 committed on Mar 8