FineWeb: decanting the web for the finest text data at scale 🍷 • 1.37k • Explore and download the FineWeb web-scale text dataset
oliverguhr/fullstop-punctuation-multilang-large • Token Classification • Updated Nov 16, 2023 • 658k • 177