Bk9x's Collections
Data_Pretrain_NLP
Updated Jan 10
aisingapore/SEA-PILE-v2 • Updated Apr 14, 2025 • 187M • 1.1k • 5
BlossomsAI/vietnamese-corpus • Updated Dec 17, 2024 • 29M • 383 • 8
uonlp/CulturaX • Updated Dec 16, 2024 • 7.18B • 16k • 602
bkai-foundation-models/BKAINewsCorpus • Updated Mar 5, 2024 • 16.8M • 472 • 13