Commit History

feat: Add SFT training data (filtered, 2.4M samples from 24 sources)
6ebad26
verified

pathcosmos commited on

feat: Add heegyu_orca-math-korean-preference-cleaned.jsonl for ORPO/SFT reproducibility
449ea1c
verified

pathcosmos commited on

feat: Add nayohan_preference-collection-ko-full.jsonl for ORPO/SFT reproducibility
b412bf5
verified

pathcosmos commited on

feat: Add combined_preference.jsonl for ORPO/SFT reproducibility
60d9aab
verified

pathcosmos commited on

feat: Add SFT val + preference data (ORPO training, 630K pairs)
e9af455
verified

pathcosmos commited on

feat: Add data pipeline scripts + phase reports (Tier 3 - reproducibility)
b3d361d
verified

pathcosmos commited on