Running Featured 1.31k FineWeb: decanting the web for the finest text data at scale 🍷 1.31k Read a detailed overview of the FineWeb web‑scale text dataset
MT5 release Collection The MT5 release follows the T5 family, but is pretrained on multilingual data. The update UMT5 models are pretrained on an updated corpus. • 10 items • Updated 9 days ago • 23