ModernBERT workhorses. Collection A collection of powerful - but light - models to annotate data. • 4 items • Updated Sep 23 • 1
Running Featured 1.24k FineWeb: decanting the web for the finest text data at scale 🍷 1.24k Generate high-quality text data for LLMs using FineWeb
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
ModernBERT workhorses. Collection A collection of powerful - but light - models to annotate data. • 4 items • Updated Sep 23 • 1
ModernBERT workhorses. Collection A collection of powerful - but light - models to annotate data. • 4 items • Updated Sep 23 • 1