Spaces:

Roopalgn
/

openenv-clinical-trial

Sleeping

App Files Files Community

openenv-clinical-trial / docs

Commit History

Add loss curve plot (required for automated validation)

de1f57d

Roopalgn commited on Apr 26

Update blog + Run 2 real data

0c8ca1b

Roopalgn commited on Apr 26

Submission ready: narrative README, real reward plot, clean notebook, remove internal files

89811cf

Roopalgn commited on Apr 26

docs: add competitive analysis of ~250 OpenEnv hackathon repos

5a2448e

Roopalgn commited on Apr 26

Submission: clean public repo - remove internal docs, add notebook and results

079b390

Roopalgn commited on Apr 26

Submission: notebook, README, blog post, reward plot

f1a5429

Roopalgn commited on Apr 26

V4: steep reward slope - full-episode eval + stronger milestones + progress bonus

f885d09

Roopalgn commited on Apr 26

fixes : rewards and training

d68729f

Coding Ninja commited on Apr 26

Fix GRPO parse collapse for notebook training

2f3aa1e

Coding Ninja commited on Apr 26

Fix reward-stationary reset controls and notebook training setup

2a5fb9e

Coding Ninja commited on Apr 26

fix file structure

d233ac8

Roopalgn commited on Apr 26

V4: fix 5 critical + 8 major + 2 moderate issues

d148dd5

Roopalgn commited on Apr 25

Update prompt.md

c280445

Roopalgn commited on Apr 25

Add evaluation results and warnings to prompt.md

77371be

Roopalgn commited on Apr 25

Add GRPO V3 training guide: explains bug, fix, and testing steps

1eb8c81

Roopalgn commited on Apr 25

Add detailed training issues and fixes documentation

e1e5bc0
unverified

Roopalgn commited on Apr 25

fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses

d8168e4

Roopalgn commited on Apr 25

docs: capture colab validation run and hf plan

54c5378

Roopalgn commited on Apr 25

docs: prep training intake and handoff workflow

cb22297

Roopalgn commited on Apr 25

Add comprehensive RL book for beginners (16 chapters, docs/internal/rl_book)

c7bd258

Roopalgn commited on Apr 24

fix: add root route so HF Space doesn't show 404

6b6624d

Roopalgn commited on Apr 24

merge PR#15 (S1-S7), integrate extra_info into hack_info+ROADMAP, reset reward CSV

08f6e71

Roopalgn commited on Apr 24

Merge branch 'Roopalgn:main' into main

2da6021
unverified

Suyash Kumar commited on Apr 24

[Push 8 Pre-25] S1-S7: action space docs, onsite checklist patch, eval key rename, 7b lora_r fix, dry-run outputs

5c57225

Coding Ninja commited on Apr 24

Create extra_info.md

a65ebde
unverified

Roopalgn commited on Apr 24

pre-25th R1-R8: add innovation argument, training-failure fallback, notebook validation outputs

9a019f2

Roopalgn commited on Apr 23

hack_info: restructure with proper markdown; roadmap: add pre-25th task list

57d30f2

Roopalgn commited on Apr 23

roadmap: add competitor analysis, validate 6 code gaps, add P0 fix plan for onsite

257348b

Roopalgn commited on Apr 23

Update hack_info.md

0ccb5ed
unverified

Roopalgn commited on Apr 23

docs: consolidate 28->16 md files, refine README, add eval report

2646e27

Roopalgn commited on Apr 22

fix: train_colab.ipynb - correct API, DRY_RUN, MODEL_PRESETS, reward handling; mark Suyash tasks done

13ccd3d

Roopalgn commited on Apr 22

prep: onsite artifacts - kaggle notebook, checklist, templates with [FILL ONSITE] placeholders

20fbcfb

Roopalgn commited on Apr 22

docs: add pre-onsite checklist + update KnowledgeBase

0ab5277

Roopalgn commited on Apr 22

fix: remove all ROADMAP contradictions - training is onsite only

ed8ee76

Roopalgn commited on Apr 22

Update ROADMAP: mark branch merge complete, update Phase B checklist

112396c

Roopalgn commited on Apr 22

Update ROADMAP: re-enable pre-training on Kaggle, Push 8 becomes H100 refinement

242e13c

Roopalgn commited on Apr 22

Add internal resources document

90def51

Roopalgn commited on Apr 22

[Push 7] Roopal: fix HF Space card metadata, align port to 7860, add training_log.md

38974f7

Roopalgn commited on Apr 22

Push 7 Phase A: grounding + kaggle notebook + docs

62441e1

Roopalgn commited on Apr 22

Update ROADMAP: pre-training on Kaggle before onsite, Push 8 becomes H100 refinement

2f22004

Roopalgn commited on Apr 22

Polish repo for judges: move internal docs, clean winner refs, improve dashboard

95c4fd1

Roopalgn commited on Apr 21

[Push 7-8] Add G16-G22 post-merge gaps, Push 7 (pre-onsite) and Push 8 (onsite training), updated checklists and Definition of Done

d5992c2

Roopalgn commited on Apr 21

post-merge: add .gitignore, update project status with integration test results

6470e2f

Roopalgn commited on Apr 21

[Push 6] Roopal: storytelling assets, pitch notes, mini-blog finalized, ARCHITECTURE.md finalized, docs proofread

418d871

Roopalgn commited on Apr 20

[Push 5] Roopal: reward tuning, adaptive difficulty spec, dashboard UI, improved prompts

e96bfea

Roopalgn commited on Apr 20

[Push 4] Roopal: training runbook, Colab notebook, evaluation template, mini-blog draft

6daa3e9

Roopalgn commited on Apr 20

[Push 3] Roopal: curriculum policy, verification spec, benchmark protocol, dashboard metrics, phase scoring

40c42e9

Roopalgn commited on Apr 20

[Push 2] Roopal: reward spec, scenario cards, milestone map, shaping function

d2176de

Roopalgn commited on Apr 20

[Push 1] Roopal: enhance all docs with explicit winner-inspired patterns (KubeSRE, Bio, VRAM)

a0d682e

Roopalgn commited on Apr 20

[Push 1] Roopal: rewrite KnowledgeBase as progressive textbook, add change log to project status

9bb4c46

Roopalgn commited on Apr 20

Commit History

Add loss curve plot (required for automated validation) de1f57d

Update blog + Run 2 real data 0c8ca1b

Submission ready: narrative README, real reward plot, clean notebook, remove internal files 89811cf

docs: add competitive analysis of ~250 OpenEnv hackathon repos 5a2448e

Submission: clean public repo - remove internal docs, add notebook and results 079b390

Submission: notebook, README, blog post, reward plot f1a5429

V4: steep reward slope - full-episode eval + stronger milestones + progress bonus f885d09

fixes : rewards and training d68729f

Fix GRPO parse collapse for notebook training 2f3aa1e

Fix reward-stationary reset controls and notebook training setup 2a5fb9e

fix file structure d233ac8

V4: fix 5 critical + 8 major + 2 moderate issues d148dd5

Update prompt.md c280445

Add evaluation results and warnings to prompt.md 77371be

Add GRPO V3 training guide: explains bug, fix, and testing steps 1eb8c81

Add detailed training issues and fixes documentation e1e5bc0 unverified

fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses d8168e4

docs: capture colab validation run and hf plan 54c5378

docs: prep training intake and handoff workflow cb22297

Add comprehensive RL book for beginners (16 chapters, docs/internal/rl_book) c7bd258

fix: add root route so HF Space doesn't show 404 6b6624d

merge PR#15 (S1-S7), integrate extra_info into hack_info+ROADMAP, reset reward CSV 08f6e71

Merge branch 'Roopalgn:main' into main 2da6021 unverified

[Push 8 Pre-25] S1-S7: action space docs, onsite checklist patch, eval key rename, 7b lora_r fix, dry-run outputs 5c57225

Create extra_info.md a65ebde unverified

pre-25th R1-R8: add innovation argument, training-failure fallback, notebook validation outputs 9a019f2

hack_info: restructure with proper markdown; roadmap: add pre-25th task list 57d30f2

roadmap: add competitor analysis, validate 6 code gaps, add P0 fix plan for onsite 257348b

Update hack_info.md 0ccb5ed unverified

docs: consolidate 28->16 md files, refine README, add eval report 2646e27

fix: train_colab.ipynb - correct API, DRY_RUN, MODEL_PRESETS, reward handling; mark Suyash tasks done 13ccd3d

prep: onsite artifacts - kaggle notebook, checklist, templates with [FILL ONSITE] placeholders 20fbcfb

docs: add pre-onsite checklist + update KnowledgeBase 0ab5277

fix: remove all ROADMAP contradictions - training is onsite only ed8ee76

Update ROADMAP: mark branch merge complete, update Phase B checklist 112396c

Update ROADMAP: re-enable pre-training on Kaggle, Push 8 becomes H100 refinement 242e13c

Add internal resources document 90def51

[Push 7] Roopal: fix HF Space card metadata, align port to 7860, add training_log.md 38974f7

Push 7 Phase A: grounding + kaggle notebook + docs 62441e1

Update ROADMAP: pre-training on Kaggle before onsite, Push 8 becomes H100 refinement 2f22004

Polish repo for judges: move internal docs, clean winner refs, improve dashboard 95c4fd1

[Push 7-8] Add G16-G22 post-merge gaps, Push 7 (pre-onsite) and Push 8 (onsite training), updated checklists and Definition of Done d5992c2

post-merge: add .gitignore, update project status with integration test results 6470e2f

[Push 6] Roopal: storytelling assets, pitch notes, mini-blog finalized, ARCHITECTURE.md finalized, docs proofread 418d871

[Push 5] Roopal: reward tuning, adaptive difficulty spec, dashboard UI, improved prompts e96bfea

[Push 4] Roopal: training runbook, Colab notebook, evaluation template, mini-blog draft 6daa3e9

[Push 3] Roopal: curriculum policy, verification spec, benchmark protocol, dashboard metrics, phase scoring 40c42e9

[Push 2] Roopal: reward spec, scenario cards, milestone map, shaping function d2176de

[Push 1] Roopal: enhance all docs with explicit winner-inspired patterns (KubeSRE, Bio, VRAM) a0d682e

[Push 1] Roopal: rewrite KnowledgeBase as progressive textbook, add change log to project status 9bb4c46

Add loss curve plot (required for automated validation)

de1f57d

Update blog + Run 2 real data

0c8ca1b

Submission ready: narrative README, real reward plot, clean notebook, remove internal files

89811cf

docs: add competitive analysis of ~250 OpenEnv hackathon repos

5a2448e

Submission: clean public repo - remove internal docs, add notebook and results

079b390

Submission: notebook, README, blog post, reward plot

f1a5429

V4: steep reward slope - full-episode eval + stronger milestones + progress bonus

f885d09

fixes : rewards and training

d68729f

Fix GRPO parse collapse for notebook training

2f3aa1e

Fix reward-stationary reset controls and notebook training setup

2a5fb9e

fix file structure

d233ac8

V4: fix 5 critical + 8 major + 2 moderate issues

d148dd5

Update prompt.md

c280445

Add evaluation results and warnings to prompt.md

77371be

Add GRPO V3 training guide: explains bug, fix, and testing steps

1eb8c81

Add detailed training issues and fixes documentation

e1e5bc0
unverified

fix: rebalance rewards for GRPO slope - remove efficiency baseline, add milestone bonuses

d8168e4

docs: capture colab validation run and hf plan

54c5378

docs: prep training intake and handoff workflow

cb22297

Add comprehensive RL book for beginners (16 chapters, docs/internal/rl_book)

c7bd258

fix: add root route so HF Space doesn't show 404

6b6624d

merge PR#15 (S1-S7), integrate extra_info into hack_info+ROADMAP, reset reward CSV

08f6e71

Merge branch 'Roopalgn:main' into main

2da6021
unverified

[Push 8 Pre-25] S1-S7: action space docs, onsite checklist patch, eval key rename, 7b lora_r fix, dry-run outputs

5c57225

Create extra_info.md

a65ebde
unverified

pre-25th R1-R8: add innovation argument, training-failure fallback, notebook validation outputs

9a019f2

hack_info: restructure with proper markdown; roadmap: add pre-25th task list

57d30f2

roadmap: add competitor analysis, validate 6 code gaps, add P0 fix plan for onsite

257348b

Update hack_info.md

0ccb5ed
unverified

docs: consolidate 28->16 md files, refine README, add eval report

2646e27

fix: train_colab.ipynb - correct API, DRY_RUN, MODEL_PRESETS, reward handling; mark Suyash tasks done

13ccd3d

prep: onsite artifacts - kaggle notebook, checklist, templates with [FILL ONSITE] placeholders

20fbcfb

docs: add pre-onsite checklist + update KnowledgeBase

0ab5277

fix: remove all ROADMAP contradictions - training is onsite only

ed8ee76

Update ROADMAP: mark branch merge complete, update Phase B checklist

112396c

Update ROADMAP: re-enable pre-training on Kaggle, Push 8 becomes H100 refinement

242e13c

Add internal resources document

90def51

[Push 7] Roopal: fix HF Space card metadata, align port to 7860, add training_log.md

38974f7

Push 7 Phase A: grounding + kaggle notebook + docs

62441e1

Update ROADMAP: pre-training on Kaggle before onsite, Push 8 becomes H100 refinement

2f22004

Polish repo for judges: move internal docs, clean winner refs, improve dashboard

95c4fd1

[Push 7-8] Add G16-G22 post-merge gaps, Push 7 (pre-onsite) and Push 8 (onsite training), updated checklists and Definition of Done

d5992c2

post-merge: add .gitignore, update project status with integration test results

6470e2f

[Push 6] Roopal: storytelling assets, pitch notes, mini-blog finalized, ARCHITECTURE.md finalized, docs proofread

418d871

[Push 5] Roopal: reward tuning, adaptive difficulty spec, dashboard UI, improved prompts

e96bfea

[Push 4] Roopal: training runbook, Colab notebook, evaluation template, mini-blog draft

6daa3e9

[Push 3] Roopal: curriculum policy, verification spec, benchmark protocol, dashboard metrics, phase scoring

40c42e9

[Push 2] Roopal: reward spec, scenario cards, milestone map, shaping function

d2176de

[Push 1] Roopal: enhance all docs with explicit winner-inspired patterns (KubeSRE, Bio, VRAM)

a0d682e

[Push 1] Roopal: rewrite KnowledgeBase as progressive textbook, add change log to project status

9bb4c46