Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Bk9x
's Collections
Data_Pretrain_NLP
Dataset_NLP
Small LM
Dataset_voice
Embedding
Automatic Speech Recognition
SDXL
TTS
LLM
model_NLP
VLM + OCR
Data_Pretrain_NLP
updated
Jan 10
Upvote
-
aisingapore/SEA-PILE-v2
Viewer
•
Updated
Apr 14, 2025
•
187M
•
2.72k
•
6
BlossomsAI/vietnamese-corpus
Viewer
•
Updated
Dec 17, 2024
•
29M
•
333
•
8
uonlp/CulturaX
Viewer
•
Updated
Dec 16, 2024
•
7.18B
•
38.8k
•
622
bkai-foundation-models/BKAINewsCorpus
Viewer
•
Updated
Mar 5, 2024
•
16.8M
•
1.45k
•
14
Upvote
-
Share collection
View history
Collection guide
Browse collections