Phase 2: LR=1e-5, epochs=2, label_smoothing=0.1, warmup=0.15 ad57149 verified fmnxl committed on Jan 12
Phase 1: max_length=768, batch=8, epochs=3, standard Trainer (no class weights) 2f95a9a verified fmnxl committed on Jan 12
Fix: Use fixthemusic namespace for model upload (token has access) 39d86c2 verified fmnxl committed on Jan 12
Fix: Use WeightedTrainer, calculate class weights, add early stopping, use bf16 043bc32 verified fmnxl committed on Jan 12
Fix: Remove data/ copy since loading from HuggingFace Dataset 4ef8db4 verified fmnxl committed on Jan 11
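
Note on 043bc32: the commit message says class weights are calculated but not how. A minimal sketch of one common choice, inverse-frequency ("balanced") weighting, is given below. The function name `balanced_class_weights` and the formula are assumptions for illustration, not the repository's actual code.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return {class: weight} with weight = n_samples / (n_classes * class_count).

    Hypothetical helper: rarer classes receive larger weights, so a weighted
    loss penalizes mistakes on them more heavily.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {
        cls: n_samples / (n_classes * count)
        for cls, count in sorted(counts.items())
    }


# Toy example: class 0 appears three times, class 1 once.
weights = balanced_class_weights([0, 0, 0, 1])
```

Weights like these would typically be passed, ordered by class index, to a weighted cross-entropy loss inside a custom trainer. Phase 1 (2f95a9a) drops them in favor of the standard Trainer.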