CodeRM-SFT-Warmup-Selection-8B / tokenizer_config.json

Commit History

SFT warmup LoRA for 8B judge (9367 samples, 1 epoch)
d6f5cbd
verified

t2ance commited on