KernelX / train_on_hf.py

Commit History

Fix merge: fall back to warm-start adapter from HF when GRPO skipped
03140d1
verified

Rayugacodes commited on

Fix: batch_size=4 so num_generations=4 divides evenly
278a0ec
verified

Rayugacodes commited on

Fix: max_length -> max_seq_length for trl 0.15.2 (verified all configs locally)
beef760
verified

Rayugacodes commited on

Fix: add health server on port 7860 to prevent timeout
cfd9219
verified

Rayugacodes commited on

Fix: batch_size=16, 10K samples, unbuffered output, 2 epochs
1572306
verified

Rayugacodes commited on

Fix all: writable /tmp cache, no login(), proper permissions
8b8863d
verified

Rayugacodes commited on

Add Dockerfile and training script
97cae42
verified

Rayugacodes commited on