EliMC's Collections
Text & Image Datasets
Updated about 19 hours ago
airtrain-ai/fineweb-edu-fortified • Updated Aug 8, 2024 • 322M • 218k • 54
openbmb/Ultra-FineWeb • Updated Dec 10, 2025 • 1.29B • 36.4k • 340
gair-prox/FineWeb-pro • Updated Sep 26, 2024 • 63.1M • 1.03k • 26
LLM360/TxT360-Midas • Updated Dec 9, 2025 • 2.87B • 490 • 13
MiXaiLL76/TextOCR_OCR • Updated Jan 18, 2025 • 113k • 147 • 1
chestnutlzj/LaTeX_OCR_384x384 • Updated Nov 27, 2025 • 76.3k • 217
ducto489/ocr_datasets • Updated May 25, 2025 • 5.65M • 118 • 2
laicsiifes/flickr30k-pt-br • Updated Mar 31, 2025 • 31k • 390 • 4
Rapidata/Flux-2-pro_t2i_human_preference • Updated Dec 2, 2025 • 44.9k • 868 • 12
Open-Bee/Honey-Data-15M • Updated Mar 10 • 14.8M • 33.2k • 117
HuggingFaceM4/FineVision • Updated Oct 21, 2025 • 24.2M • 105k • 484
detection-datasets/coco • Updated Mar 15, 2023 • 122k • 14k • 75
KBlueLeaf/coyo11m-256px-ccrop-latent • Updated Dec 5, 2024 • 9.16M • 295 • 4
HuggingFaceM4/the_cauldron • Updated May 6, 2024 • 1.88M • 422k • 530
lmms-lab/VQAv2 • Updated Jan 26, 2024 • 770k • 25.2k • 34
HSDSLab/TwitterMemes • Updated Jul 10, 2024 • 174k • 13 • 3
BLIP3o/BLIP3o-Pretrain-Long-Caption • Updated Jun 26, 2025 • 27.2M • 8.05k • 62
Pclanglais/Nanochat • Updated Nov 20, 2025 • 97.2M • 5.2k • 7
abisee/cnn_dailymail • Updated Jan 18, 2024 • 936k • 131k • 343
PleIAs/SYNTH • Updated 9 days ago • 68M • 12.1k • 262
nvidia/Nemotron-PII • Updated Dec 17, 2025 • 200k • 4.51k • 97
lightonai/LightOnOCR-mix-0126 • Updated Jan 26 • 16.4M • 386 • 112
OptimalScale/ClimbMix • Updated May 4, 2025 • 395M • 12.8k • 30
karpathy/tinystories-gpt4-clean • Updated Feb 8 • 2.73M • 1.66k • 72
pszemraj/cnn_dailymail-cleaned • Updated Dec 29, 2025 • 350k • 103
mlabonne/smoltalk-flat • Updated Nov 21, 2024 • 1.1M • 126 • 4
olmer/wiki_paragraphs • Updated May 20, 2023 • 44.4M • 16 • 1
omarkamali/wikipedia-monthly • Updated Mar 14 • 195M • 11.3k • 64
Felladrin/ChatML-SlimOrca-Dedup • Updated Feb 23, 2024 • 363k • 35 • 1
BEE-spoke-data/cosmopedia-v2-mincols • Updated Dec 29, 2025 • 39.1M • 3.63k • 3