Commit History

Update README with detailed data pipeline and reproduction steps
2d3e79d
verified

NotoriousH2 commited on

Add model card README
d471665
verified

NotoriousH2 commited on

Add eval.py
48d87be
verified

NotoriousH2 commited on

Add train_rs_sft.py
10c8d20
verified

NotoriousH2 commited on

Add rs_sample.py
1dcce72
verified

NotoriousH2 commited on

Add train_sft.py
1c4102a
verified

NotoriousH2 commited on

SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9%
8e96ac1
verified

NotoriousH2 commited on

initial commit
4f6c780
verified

NotoriousH2 commited on