-
mike-ravkine/rosettacode-parsed
Viewer • Updated • 4.26k • 63 • 12 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 4.03k • 645 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 197k • 2.71k -
FineWeb: decanting the web for the finest text data at scale
🍷1.32kRead a detailed overview of the FineWeb web‑scale text dataset
Gokul Ganesan
Xeiroh
AI & ML interests
None yet
Organizations
Datasets
-
mike-ravkine/rosettacode-parsed
Viewer • Updated • 4.26k • 63 • 12 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 4.03k • 645 -
HuggingFaceFW/fineweb
Viewer • Updated • 52.5B • 197k • 2.71k - RunningFeatured1.32k
FineWeb: decanting the web for the finest text data at scale
🍷1.32kRead a detailed overview of the FineWeb web‑scale text dataset