FineWeb: decanting the web for the finest text data at scale 🍷
Explore FineWeb: a web‑scale text dataset for LLM training