update: FAL results report with final eval numbers + conclusions 99c09f5 verified rtferraz commited on May 12
add: FAL demo results report (preliminary β eval crash pending fix) f46c90b verified rtferraz commited on May 11
add: Modal deployment lessons learned β TRL dependency hell postmortem 81195a7 verified rtferraz commited on May 11
add: ADR-003 Future-as-Label demo β detailed implementation plan with research validation d75cbbf verified rtferraz commited on May 11
add: V4.2 Final Report β complete project retrospective with evidence-based analysis 22cca8b verified rtferraz commited on May 3
docs: add V4.1 run report β detailed evaluation with per-task analysis and V4.2 roadmap 482efc4 verified rtferraz commited on Apr 28
docs: add V4 run assessment with lessons learned and improvement roadmap cfaf49c verified rtferraz commited on Apr 27
ADR-002: V4 Instruct-Only GRPO β revises dual-model plan based on model repo audit 50e0e4d verified rtferraz commited on Apr 25
Add comprehensive investigation report β performance audit, unexplored alternatives, literature-backed recommendations 4312bfd verified rtferraz commited on Apr 25
Add session checkpoint: v3 launch decision with full context bead5cb verified rtferraz commited on Apr 24
Add v3 thinking control patch - task-aware system prompts + think efficiency reward 0f39df7 verified rtferraz commited on Apr 23
docs: add ADR-001 next steps with detailed execution plans b47b36b verified rtferraz commited on Apr 23