Commit History

Add Training Evidence: real plots, logs, updated README
29675aa
verified

Aswini-Kumar commited on

Grounded description text
9d5010f
verified

Aswini-Kumar commited on

Upgrade landing page: inference-locked production model
ea1d0e2
verified

Aswini-Kumar commited on

Fix num_generations=2
5d27dfe
verified

Aswini-Kumar commited on

SFT 200->350 steps for better format compliance
def99af
verified

Aswini-Kumar commited on

Fix curriculum window=50 and threshold=0.80
c27aa07
verified

Aswini-Kumar commited on

GRPO speed fix: 50 steps, 1 generation
4c55113
verified

Aswini-Kumar commited on

Add training logs and fix gitignore
f1fbb08
verified

Aswini-Kumar commited on

Add training logs and fix gitignore
dfc933b
verified

Aswini-Kumar commited on

Add training logs and fix gitignore
1f9ba78
verified

Aswini-Kumar commited on

Remove emojis from notebook
ac0332f
verified

Aswini-Kumar commited on

Add WHY comments + GRPO/SFT speed caps
8843277
verified

Aswini-Kumar commited on

Speed fix: cap SFT at 200 steps
49d8e89
verified

Aswini-Kumar commited on

Fix URL typos
fd6ed18
verified

Aswini-Kumar commited on

Fix URL typos
96e3282
verified

Aswini-Kumar commited on

Fix HF Space URL typo in README and BLOG
fb69011
verified

Aswini-Kumar commited on

Update plots\training_dashboard.png
c528568
verified

Aswini-Kumar commited on

Update plots\reward_curve.png
73d47f8
verified

Aswini-Kumar commited on

Update plots\baseline_comparison.png
cace6f2
verified

Aswini-Kumar commited on

Update .gitattributes
cbb57d4
verified

Aswini-Kumar commited on

Update submit_job.py
320f23d
verified

Aswini-Kumar commited on

Update hf_job_train.py
2c07073
verified

Aswini-Kumar commited on

Update eval_data_centric.py
7af30cc
verified

Aswini-Kumar commited on

Update train_colab.ipynb
e15f4a8
verified

Aswini-Kumar commited on

Update train_data_centric.py
b572d19
verified

Aswini-Kumar commited on

Update BLOG.md
b8a9982
verified

Aswini-Kumar commited on

Update README.md
222f593
verified

Aswini-Kumar commited on

Audit fixes: remove duplicate torch import, add metadata field, fix stale strings, fix test assertions, update reward docs
36f4bdf

Aswini-Kumar commited on

Clean final training notebook: no demo, no Drive, all steps including plots + eval + download
f096486

Aswini-Kumar commited on

Redesign reward for discrimination: efficiency multiplier, strict penalties, stretch bonus, start at level 1
46f0850

Aswini-Kumar commited on

Fix demo mode crash: use max_steps param instead of unpicklable local class
3f7380e

Aswini-Kumar commited on

Optimize for fast iteration: 1.5B model, LoRA r=8, GRPO batch=2/gen=2, seq=512
3807e67

Aswini-Kumar commited on

Switch experiment tracking from W&B to TensorBoard (no API key required)
b80a8b2

Aswini-Kumar commited on

Enable W&B experiment tracking in SFT+GRPO phases (required by hackathon)
ffbb7d8

Aswini-Kumar commited on

Pin torchao version to 0.6.1 for stability
35c049e

Aswini-Kumar commited on

refactor: extract agent_utils.py (shared prompt/commands/server utils), simplify reward to env+format, add audit.py
51a79ee

Aswini-Kumar commited on

feat: rewrite hf_job_train.py + add submit_job.py for HF Jobs training
b6d5e7d

Aswini-Kumar commited on

Data-Centric AI RL Environment — OpenEnv Hackathon Submission
71dc210

Aswini-Kumar commited on