Gokul Ganesan
Xeiroh
AI & ML interests
None yet
Organizations
Datasets
- mike-ravkine/rosettacode-parsed
  Viewer • Updated • 4.26k • 66 • 12
- nvidia/Llama-Nemotron-Post-Training-Dataset
  Viewer • Updated • 3.91M • 4.79k • 662
- HuggingFaceFW/fineweb
  Viewer • Updated • 52.5B • 923k • 2.79k
- FineWeb: decanting the web for the finest text data at scale
  Running • Featured • 🍷 1.34k
  Explore and download the FineWeb web‑text dataset