Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
nekomeowww
's Collections
Training Datasets (Voice Models)
Training Datasets (Language Models)
Language Models (Diffusion approach)
Evaluations
3D
Robotics
Vision
generative-ui
generative-curves
Training Datasets (Language Models)
updated
May 24, 2025
Upvote
-
allenai/dolma
Updated
Apr 17, 2024
•
4.32k
•
1.04k
OpenCoder-LLM/opc-sft-stage1
Viewer
•
Updated
Nov 24, 2024
•
4.22M
•
1.17k
•
74
OpenCoder-LLM/opc-fineweb-math-corpus
Viewer
•
Updated
Nov 24, 2024
•
5.24M
•
1.13k
•
30
OpenCoder-LLM/opc-fineweb-code-corpus
Viewer
•
Updated
Nov 24, 2024
•
101M
•
4.09k
•
51
mlfoundations/dclm-baseline-1.0
Preview
•
Updated
Jul 22, 2024
•
580k
•
277
Upvote
-
Share collection
View history
Collection guide
Browse collections