blog: reword env reward and HOME wording; clarify worker roles (no simulator/synthetic phrasing). 6025ea6 hiitsesh committed on Apr 26
Update blog.md (rubric crosswalk); add outputs/agent_project_knowledge.json for reviewers. 0bae8be hiitsesh committed on Apr 26
README: GitHub raw image URLs (Hub blocks plain PNG git); gitignore images/*.png 60c96ed hiitsesh committed on Apr 26
Set TORCHINDUCTOR cache dirs before torch import (HF Space no passwd uids) 20b13c9 hiitsesh committed on Apr 26
Work around torch mega-cache double registration on Space imports 6f0a93d hiitsesh committed on Apr 26
fix: durable outputs under /data, unbuffered training logs, relaxed deps 1348e4a hiitsesh committed on Apr 26
fix: Update eval API to support subfolder and fix default model ID 3bfb797 hiitsesh committed on Apr 26
Enhance TorchGenerator to support optional subfolder for model loading 8c33b70 hiitsesh committed on Apr 26
Post-training push to Hugging Face Hub: --hub-model-repo and pilot query params ebcefc4 hiitsesh committed on Apr 25
write_test: explain repo vs container; add download_url and list_files 6787f1b hiitsesh committed on Apr 25
Add /outputs/write_test to create simpllll.csv and report hf_token env presence ff9e20d hiitsesh committed on Apr 25
Document HF token via Space secrets, ENV.example, and operational curl for pilot and push_to_hub 1737549 hiitsesh committed on Apr 25
Add 8-bit Adam + gradient checkpointing for bf16 training to fit 1.7B on L4 a5ea00e hiitsesh committed on Apr 25
Allow model_name override on /train/pilot and raise num_generations cap b05b4e3 hiitsesh committed on Apr 25
Force Qwen3 chat template to disable thinking mode via tokenizer patch 95fada9 hiitsesh committed on Apr 25
Nuclear reward shaping: silent rollouts get flat -3, active rollouts floor at +1 31334fa hiitsesh committed on Apr 25
Fix reward collapse: floor exploration loss, bonus per-call and per-terminal 7078c21 hiitsesh committed on Apr 25
Live training visualization + aggressive reward shaping to prevent 'do nothing' collapse 4dde8b9 hiitsesh committed on Apr 25