Fix merge: fall back to warm-start adapter from HF when GRPO skipped 03140d1 verified Rayugacodes commited on 17 days ago
Fix: batch_size=4 so num_generations=4 divides evenly 278a0ec verified Rayugacodes commited on 17 days ago
Fix: max_length -> max_seq_length for trl 0.15.2 (verified all configs locally) beef760 verified Rayugacodes commited on 17 days ago
Fix: add health server on port 7860 to prevent timeout cfd9219 verified Rayugacodes commited on 17 days ago
Fix: batch_size=16, 10K samples, unbuffered output, 2 epochs 1572306 verified Rayugacodes commited on 17 days ago
Fix all: writable /tmp cache, no login(), proper permissions 8b8863d verified Rayugacodes commited on 18 days ago