Commit History

Add loss curve plot (required for automated validation)
de1f57d

Roopalgn commited on

Update blog + Run 2 real data
0c8ca1b

Roopalgn commited on

Submission ready: narrative README, real reward plot, clean notebook, remove internal files
89811cf

Roopalgn commited on

docs: add competitive analysis of ~250 OpenEnv hackathon repos
5a2448e

Roopalgn commited on

Submission: clean public repo - remove internal docs, add notebook and results
079b390

Roopalgn commited on

Submission: notebook, README, blog post, reward plot
f1a5429

Roopalgn commited on

V4: steep reward slope - full-episode eval + stronger milestones + progress bonus
f885d09

Roopalgn commited on

fixes : rewards and training
d68729f

Coding Ninja commited on

Fix GRPO parse collapse for notebook training
2f3aa1e

Coding Ninja commited on

Fix reward-stationary reset controls and notebook training setup
2a5fb9e

Coding Ninja commited on

fix file structure
d233ac8

Roopalgn commited on

V4: fix 5 critical + 8 major + 2 moderate issues
d148dd5

Roopalgn commited on

Update prompt.md
c280445

Roopalgn commited on

Add evaluation results and warnings to prompt.md
77371be

Roopalgn commited on

Add GRPO V3 training guide: explains bug, fix, and testing steps
1eb8c81

Roopalgn commited on

Add detailed training issues and fixes documentation
e1e5bc0
unverified

Roopalgn commited on

fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses
d8168e4

Roopalgn commited on

docs: capture colab validation run and hf plan
54c5378

Roopalgn commited on

docs: prep training intake and handoff workflow
cb22297

Roopalgn commited on

Add comprehensive RL book for beginners (16 chapters, docs/internal/rl_book)
c7bd258

Roopalgn commited on

fix: add root route so HF Space doesn't show 404
6b6624d

Roopalgn commited on

merge PR#15 (S1-S7), integrate extra_info into hack_info+ROADMAP, reset reward CSV
08f6e71

Roopalgn commited on

Merge branch 'Roopalgn:main' into main
2da6021
unverified

Suyash Kumar commited on

[Push 8 Pre-25] S1-S7: action space docs, onsite checklist patch, eval key rename, 7b lora_r fix, dry-run outputs
5c57225

Coding Ninja commited on

Create extra_info.md
a65ebde
unverified

Roopalgn commited on

pre-25th R1-R8: add innovation argument, training-failure fallback, notebook validation outputs
9a019f2

Roopalgn commited on

hack_info: restructure with proper markdown; roadmap: add pre-25th task list
57d30f2

Roopalgn commited on

roadmap: add competitor analysis, validate 6 code gaps, add P0 fix plan for onsite
257348b

Roopalgn commited on

Update hack_info.md
0ccb5ed
unverified

Roopalgn commited on

docs: consolidate 28->16 md files, refine README, add eval report
2646e27

Roopalgn commited on

fix: train_colab.ipynb - correct API, DRY_RUN, MODEL_PRESETS, reward handling; mark Suyash tasks done
13ccd3d

Roopalgn commited on

prep: onsite artifacts - kaggle notebook, checklist, templates with [FILL ONSITE] placeholders
20fbcfb

Roopalgn commited on

docs: add pre-onsite checklist + update KnowledgeBase
0ab5277

Roopalgn commited on

fix: remove all ROADMAP contradictions - training is onsite only
ed8ee76

Roopalgn commited on

Update ROADMAP: mark branch merge complete, update Phase B checklist
112396c

Roopalgn commited on

Update ROADMAP: re-enable pre-training on Kaggle, Push 8 becomes H100 refinement
242e13c

Roopalgn commited on

Add internal resources document
90def51

Roopalgn commited on

[Push 7] Roopal: fix HF Space card metadata, align port to 7860, add training_log.md
38974f7

Roopalgn commited on

Push 7 Phase A: grounding + kaggle notebook + docs
62441e1

Roopalgn commited on

Update ROADMAP: pre-training on Kaggle before onsite, Push 8 becomes H100 refinement
2f22004

Roopalgn commited on

Polish repo for judges: move internal docs, clean winner refs, improve dashboard
95c4fd1

Roopalgn commited on

[Push 7-8] Add G16-G22 post-merge gaps, Push 7 (pre-onsite) and Push 8 (onsite training), updated checklists and Definition of Done
d5992c2

Roopalgn commited on

post-merge: add .gitignore, update project status with integration test results
6470e2f

Roopalgn commited on

[Push 6] Roopal: storytelling assets, pitch notes, mini-blog finalized, ARCHITECTURE.md finalized, docs proofread
418d871

Roopalgn commited on

[Push 5] Roopal: reward tuning, adaptive difficulty spec, dashboard UI, improved prompts
e96bfea

Roopalgn commited on

[Push 4] Roopal: training runbook, Colab notebook, evaluation template, mini-blog draft
6daa3e9

Roopalgn commited on

[Push 3] Roopal: curriculum policy, verification spec, benchmark protocol, dashboard metrics, phase scoring
40c42e9

Roopalgn commited on

[Push 2] Roopal: reward spec, scenario cards, milestone map, shaping function
d2176de

Roopalgn commited on

[Push 1] Roopal: enhance all docs with explicit winner-inspired patterns (KubeSRE, Bio, VRAM)
a0d682e

Roopalgn commited on

[Push 1] Roopal: rewrite KnowledgeBase as progressive textbook, add change log to project status
9bb4c46

Roopalgn commited on