Fix dashboard logging URL to use proxy path, force Docker rebuild a2c523c sissississi Claude Opus 4.6 committed on 2 days ago
Fix MAX_SEQ_LENGTH: 1024 was too small for prompt+completion, bump to 2048 9b2abc6 sissississi Claude Opus 4.6 committed on 3 days ago
Fix Qwen3 thinking mode: add /no_think, increase max_completion_length 9a9721a sissississi Claude Opus 4.6 committed on 3 days ago
Fix API response structure: done/reward are top-level, not in observation 444b086 sissississi Claude Opus 4.6 committed on 3 days ago
Hardcode task definitions in notebook to avoid /tasks API dependency e19247c sissississi Claude Opus 4.6 committed on 3 days ago
Redesign frontend as training dashboard + add live activity feed d662461 sissississi Claude Opus 4.6 committed on 3 days ago
Route rewards through OpenEnv API instead of local computation c0cedb4 sissississi Claude Opus 4.6 committed on 3 days ago
Fix GRPO: remove SFT, multi-task dataset, instruct model only 490094b sissississi Claude Opus 4.6 committed on 3 days ago
Add RL tag to show badge on HF Spaces card 8dc0478 sissississi Claude Opus 4.6 committed on 3 days ago
Fix training: add SFT warmup + switch to instruct model 4edf79e sissississi Claude Opus 4.6 committed on 3 days ago
Fix: copy Node.js 20 from build stage instead of apt install 0f2e319 sissississi Claude Opus 4.6 committed on 3 days ago
Update training notebook: vLLM fast inference, Qwen3-4B, max_steps=300 4859185 sissississi Claude Opus 4.6 committed on 3 days ago
Restore OpenEnv with optimized Docker multi-stage build 1f89afe sissississi Claude Opus 4.6 committed on 3 days ago
Remove openenv-core dep to fix HF build timeout 9ae534f sissississi Claude Opus 4.6 committed on 3 days ago
Fix Dockerfile: use node:20-slim with python3 venv e1e78bb sissississi Claude Opus 4.6 committed on 3 days ago
Add RL training environment with OpenEnv backend bc52096 sissississi Claude Opus 4.6 committed on 3 days ago
Deploy full Next.js origami simulator to HF Space b40f1ec sissississi Claude Opus 4.6 committed on 3 days ago
Initial deploy: OpenEnv RL environment for origami crease patterns 236e665 sissississi Claude Opus 4.6 committed on 3 days ago