Fix resume adapter training (no peft_config with PeftModel) 7aa68cd CreativeEngineer commited on 15 days ago
Fix code block extraction and increase completion length 1ee2461 CreativeEngineer commited on 15 days ago
Switch to correctness-gated GRPO LoRA with persistence 648e193 CreativeEngineer commited on 15 days ago