Commit History

sync training files
ea9c69b
verified

Pathikreet commited on

fix: OOM — NUM_GENERATIONS 32→16, max_completion_length 300→200, expandable_segments
c0d3d54
verified

Pathikreet commited on

fix: add 3 missing hard tasks to _TASK_DIFFICULTY (322 prompts)
a6c22c8
verified

Pathikreet commited on

Fix: kl_coeff -> beta (correct TRL GRPOConfig param name)
a47b370
verified

Pathikreet commited on

Auto-detect username from token for adapter + run folder upload
0f17c96
verified

Pathikreet commited on

Graceful stop: save weights on /app/stop_requested flag
65ac9f8
verified

Pathikreet commited on

Fix hard_currency_conversion task ID in TRAIN_TASKS and EVAL_TASKS
e27253e
verified

Pathikreet commited on

Add oversight eval + bump seeds medium×8 hard/long×20
4c61d1e
verified

Pathikreet commited on

Bump seeds: medium×8, hard/long×20 (322 prompts total)
abf8676
verified

Pathikreet commited on

Full metrics in live JSON: format/diff/ep_len history
3869f27
verified

Pathikreet commited on

Add loss tracking callback + reward/loss PNG savers
8877328
verified

Pathikreet commited on

Run 3: temp=0.7, kl=0.1, format±0.15, no curriculum, 20 tasks, G=32
0bfa536
verified

Pathikreet commited on

Fix bf16 crash + 17 tasks / 160 prompts dataset
6912151
verified

Pathikreet commited on

Upload train.py with huggingface_hub
a9752a6
verified

Pathikreet commited on

Upload train.py with huggingface_hub
d826205
verified

Pathikreet commited on

Upload train.py with huggingface_hub
2865f24
verified

Pathikreet commited on

Upload train.py with huggingface_hub
16366ba
verified

Pathikreet commited on

Upload train.py with huggingface_hub
da5a42f
verified

Pathikreet commited on

Upload train.py with huggingface_hub
6913549
verified

Pathikreet commited on

Upload train.py with huggingface_hub
3ea12ec
verified

Pathikreet commited on