proxectonos's Collections
Domain Specific Corpora
Updated 18 days ago
A collection of domain-specific corpora, mainly in Galician.
proxectonos/corpus_dominio_legal_administrativo • Preview • Updated 18 days ago • 197
proxectonos/corpus_dominio_periodistico • Viewer • Updated 17 days ago • 280k • 59
proxectonos/corpus_dominio_cientifico • Preview • Updated 17 days ago • 62
proxectonos/corpus_dominio_museistico_patrimonio • Viewer • Updated 15 days ago • 14.5k • 306