Commit History
Update blog + Run 2 real data 0c8ca1b
Submission ready: narrative README, real reward plot, clean notebook, remove internal files 89811cf
docs: add competitive analysis of ~250 OpenEnv hackathon repos 5a2448e
Roopalgn committed on
Submission: clean public repo - remove internal docs, add notebook and results 079b390
Submission: notebook, README, blog post, reward plot f1a5429
V4: steep reward slope - full-episode eval + stronger milestones + progress bonus f885d09
fixes : rewards and training d68729f
Coding Ninja committed on
Fix GRPO parse collapse for notebook training 2f3aa1e
Coding Ninja committed on
Fix reward-stationary reset controls and notebook training setup 2a5fb9e
Coding Ninja committed on
fix file structure d233ac8
V4: fix 5 critical + 8 major + 2 moderate issues d148dd5
Roopalgn committed on
Update prompt.md c280445
Add evaluation results and warnings to prompt.md 77371be
Add GRPO V3 training guide: explains bug, fix, and testing steps 1eb8c81
Roopalgn committed on
Add detailed training issues and fixes documentation e1e5bc0 unverified
fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses d8168e4
docs: capture colab validation run and hf plan 54c5378
docs: prep training intake and handoff workflow cb22297
Add comprehensive RL book for beginners (16 chapters, docs/internal/rl_book) c7bd258
Roopalgn committed on
fix: add root route so HF Space doesn't show 404 6b6624d
Roopalgn committed on
merge PR#15 (S1-S7), integrate extra_info into hack_info+ROADMAP, reset reward CSV 08f6e71
Roopalgn committed on
Merge branch 'Roopalgn:main' into main 2da6021 unverified
Suyash Kumar committed on
[Push 8 Pre-25] S1-S7: action space docs, onsite checklist patch, eval key rename, 7b lora_r fix, dry-run outputs 5c57225
Coding Ninja committed on
Create extra_info.md a65ebde unverified
pre-25th R1-R8: add innovation argument, training-failure fallback, notebook validation outputs 9a019f2
Roopalgn committed on
hack_info: restructure with proper markdown; roadmap: add pre-25th task list 57d30f2
Roopalgn committed on
roadmap: add competitor analysis, validate 6 code gaps, add P0 fix plan for onsite 257348b
Roopalgn committed on
Update hack_info.md 0ccb5ed unverified
docs: consolidate 28->16 md files, refine README, add eval report 2646e27
Roopalgn committed on
fix: train_colab.ipynb - correct API, DRY_RUN, MODEL_PRESETS, reward handling; mark Suyash tasks done 13ccd3d
Roopalgn committed on
prep: onsite artifacts - kaggle notebook, checklist, templates with [FILL ONSITE] placeholders 20fbcfb
Roopalgn committed on
docs: add pre-onsite checklist + update KnowledgeBase 0ab5277
Roopalgn committed on
fix: remove all ROADMAP contradictions - training is onsite only ed8ee76
Roopalgn committed on
Update ROADMAP: mark branch merge complete, update Phase B checklist 112396c
Roopalgn committed on
Update ROADMAP: re-enable pre-training on Kaggle, Push 8 becomes H100 refinement 242e13c
Roopalgn committed on
Add internal resources document 90def51
[Push 7] Roopal: fix HF Space card metadata, align port to 7860, add training_log.md 38974f7
Push 7 Phase A: grounding + kaggle notebook + docs 62441e1
Update ROADMAP: pre-training on Kaggle before onsite, Push 8 becomes H100 refinement 2f22004
Roopalgn committed on
Polish repo for judges: move internal docs, clean winner refs, improve dashboard 95c4fd1
Roopalgn committed on
[Push 7-8] Add G16-G22 post-merge gaps, Push 7 (pre-onsite) and Push 8 (onsite training), updated checklists and Definition of Done d5992c2
Roopalgn committed on
post-merge: add .gitignore, update project status with integration test results 6470e2f
Roopalgn committed on
[Push 6] Roopal: storytelling assets, pitch notes, mini-blog finalized, ARCHITECTURE.md finalized, docs proofread 418d871
Roopalgn committed on
[Push 5] Roopal: reward tuning, adaptive difficulty spec, dashboard UI, improved prompts e96bfea
Roopalgn committed on
[Push 4] Roopal: training runbook, Colab notebook, evaluation template, mini-blog draft 6daa3e9
Roopalgn committed on
[Push 3] Roopal: curriculum policy, verification spec, benchmark protocol, dashboard metrics, phase scoring 40c42e9
Roopalgn committed on
[Push 2] Roopal: reward spec, scenario cards, milestone map, shaping function d2176de
Roopalgn committed on
[Push 1] Roopal: enhance all docs with explicit winner-inspired patterns (KubeSRE, Bio, VRAM) a0d682e
Roopalgn committed on
[Push 1] Roopal: rewrite KnowledgeBase as progressive textbook, add change log to project status 9bb4c46
Roopalgn committed on