Commit History

Add simulation disclaimer banner
d056af3
verified

Rayugacodes committed on

Full-screen UI + OpenEnv API tab (reset/step/state/stop)
e019ca1
verified

Rayugacodes committed on

Redesigned UI: dark theme, Plotly charts, 4 tabs, professional layout
644149b
verified

Rayugacodes committed on

Fix: simulate action effects on next state so AI wins on latency reduction
fb4bf5a
verified

Rayugacodes committed on

Deploy interactive simulation demo (Gradio, free CPU)
1489940
verified

Rayugacodes committed on

Fix merge: fall back to warm-start adapter from HF when GRPO skipped
03140d1
verified

Rayugacodes committed on

Skip GRPO: merge warm-start and push to HF
7f52093
verified

Rayugacodes committed on

Fix: batch_size=4 so num_generations=4 divides evenly
278a0ec
verified

Rayugacodes committed on

Fix: max_length -> max_seq_length for trl 0.15.2 (verified all configs locally)
beef760
verified

Rayugacodes committed on

Fix: trl==0.15.2 (has GRPO, no vllm/FSDP dep)
e191204
verified

Rayugacodes committed on

Fix: pin trl==0.12.2, verify imports during build
9ff82e1
verified

Rayugacodes committed on

Fix: pin trl<0.17 for FSDP compat, skip world model (already done)
f4c4a2c
verified

Rayugacodes committed on

Install CUDA PyTorch in slim image for A100 GPU
b3f08a7
verified

Rayugacodes committed on

Revert to python:3.10-slim (was working) + health server prevents timeout
2e20db1
verified

Rayugacodes committed on

Fix: install PyTorch with CUDA 12.1 support
36344b2
verified

Rayugacodes committed on

Fix: use CUDA base image for GPU support
9894974
verified

Rayugacodes committed on

Fix: add health server on port 7860 to prevent timeout
cfd9219
verified

Rayugacodes committed on

Fix: batch_size=16, 10K samples, unbuffered output, 2 epochs
1572306
verified

Rayugacodes committed on

Fix: set HOME/USER/TORCH env vars for uid 1000
0d9e780
verified

Rayugacodes committed on

Fix all: writable /tmp cache, no login(), proper permissions
8b8863d
verified

Rayugacodes committed on

Fix: set HF_HOME to writable directory
74820e1
verified

Rayugacodes committed on

Fix Dockerfile: read HF_TOKEN from env correctly
3b0341d
verified

Rayugacodes committed on

Use HF_TOKEN secret in Dockerfile
6205146
verified

Rayugacodes committed on

Add Dockerfile and training script
97cae42
verified

Rayugacodes committed on

initial commit
92b614c
verified

Rayugacodes committed on