Openenv / app.py

Commit History

Fix Gradio runtime error by moving theme to gr.Blocks
a593df9

saravanatanjiro commited on

Update with existing environment
d81b76a

saravanatanjiro commited on

Fix GRPO group-loss training and align UI defaults.
5a0c6af

saravanatanjiro commited on

Migrate LLM pipeline to custom GRPO with robust rewards
dfc5996

saravanatanjiro commited on

Multi-model benchmark pipeline: VRAM cleanup + EMA graph + detailed output
af6bbef

kavin57447 commited on

Max GPU utilization: flash-attn2 + grad accumulation + 15 steps/ep + 1024 seq len
93d0171

kavin57447 commited on

Cap LLM iterations at 50 to prevent timeout on 8B models
f20bc34

kavin57447 commited on

Switch to Llama 3.1 8B + fix low-timestep crash (min 5000)
8d95050

kavin57447 commited on

Add LLM RL training with Gemma 7B + LoRA
ee3dfa7

kavin57447 commited on

Fix Gradio 6.0 theme deprecation
1c86d42

kavin57447 commited on

Add Cloud Arena Mathematical Model RL environment
12263fa

kavin57447 commited on