Full-screen UI + OpenEnv API tab (reset/step/state/stop) e019ca1 verified Rayugacodes commited on 11 days ago
Redesigned UI: dark theme, Plotly charts, 4 tabs, professional layout 644149b verified Rayugacodes commited on 11 days ago
Fix: simulate action effects on next state so AI wins on latency reduction fb4bf5a verified Rayugacodes commited on 11 days ago
Deploy interactive simulation demo (Gradio, free CPU) 1489940 verified Rayugacodes commited on 11 days ago
Fix merge: fall back to warm-start adapter from HF when GRPO skipped 03140d1 verified Rayugacodes commited on 12 days ago
Fix: batch_size=4 so num_generations=4 divides evenly 278a0ec verified Rayugacodes commited on 12 days ago
Fix: max_length -> max_seq_length for trl 0.15.2 (verified all configs locally) beef760 verified Rayugacodes commited on 12 days ago
Fix: pin trl==0.12.2, verify imports during build 9ff82e1 verified Rayugacodes commited on 12 days ago
Fix: pin trl<0.17 for FSDP compat, skip world model (already done) f4c4a2c verified Rayugacodes commited on 12 days ago
Revert to python:3.10-slim (was working) + health server prevents timeout 2e20db1 verified Rayugacodes commited on 12 days ago
Fix: add health server on port 7860 to prevent timeout cfd9219 verified Rayugacodes commited on 12 days ago
Fix: batch_size=16, 10K samples, unbuffered output, 2 epochs 1572306 verified Rayugacodes commited on 12 days ago
Fix all: writable /tmp cache, no login(), proper permissions 8b8863d verified Rayugacodes commited on 12 days ago
Fix Dockerfile: read HF_TOKEN from env correctly 3b0341d verified Rayugacodes commited on 12 days ago