remove ema, remove redundant compute for data loading, memory opt for float32, and set float32 by default for finetuning aa7853c klldmofashi commited on Sep 5, 2025
Fix normalize tests and add multi-batch dimension test 6eb4446 kvablack Claude commited on Aug 28, 2025
move option to remove last n in filter ranges to json generator de46c76 verityw commited on Aug 27, 2025
[fix] optimise `_merge_params` to prevent CPU overload (#616) 0fffc72 unverified Kevin Black commited on Aug 27, 2025
Merge branch 'main' into optimise-merge-params 237d886 unverified Varun Edachali commited on Aug 23, 2025
gemma_300m_lora variant is missing in Variant (#555) 51fb06b unverified Kevin Black commited on Aug 9, 2025