Fix Gradio runtime error by moving theme to gr.Blocks a593df9 saravanatanjiro commited on 17 days ago
Migrate LLM pipeline to custom GRPO with robust rewards dfc5996 saravanatanjiro commited on 17 days ago
Multi-model benchmark pipeline: VRAM cleanup + EMA graph + detailed output af6bbef kavin57447 commited on 18 days ago
Max GPU utilization: flash-attn2 + grad accumulation + 15 steps/ep + 1024 seq len 93d0171 kavin57447 commited on 18 days ago
Switch to Llama 3.1 8B + fix low-timestep crash (min 5000) 8d95050 kavin57447 commited on 18 days ago