Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
whr94621
's Collections
multilingual_benchmark
multilingual_domain_datasets
LLM_LongContext
LLM_Eval
LLM_Alignment
LLM_Pretrain
LLM_Multilingual
llm_datasets_japanese
llm_datasets_multi
llm_datasets_arabic
llm_synthesis_data
llm_datasets_id
llm_datasets_translation
llm_models_pretrain
llm_datasets_korean
llm_datasets_vi
llm_datasets_ru
llm_datasets_th
curated_sft_data
multilingual_domain_datasets
updated
Feb 17, 2025
Multilingual datasets. Excluding those which are just a cleaned version of CC.
Upvote
-
nyuuzyou/edutexts
Viewer
•
Updated
Jan 14
•
1.38M
•
132
•
5
llm-jp/AnswerCarefully
Viewer
•
Updated
Feb 16
•
4.62k
•
221
•
51
sander-wood/m4-rag
Viewer
•
Updated
Oct 12, 2025
•
1.04M
•
20
•
13
Upvote
-
Share collection
View history
Collection guide
Browse collections