MrBERT
  BSC-LT/MrBERT • Fill-Mask • Updated Mar 26 • 637 • 8
  BSC-LT/MrBERT-es • Fill-Mask • 0.2B • Updated 30 days ago • 6.35k • 7
  BSC-LT/MrBERT-ca • Fill-Mask • Updated 18 days ago • 56 • 2
  BSC-LT/MrBERT-legal • Fill-Mask • 0.3B • Updated 30 days ago • 262 • 1
Salamandra 🦎
  BSC-LT/salamandra-7b-instruct • Text Generation • 8B • Updated Oct 22, 2025 • 175k • 79
  BSC-LT/salamandra-7b • Text Generation • 8B • Updated Oct 22, 2025 • 288 • 29
  BSC-LT/salamandra-2b-instruct • Text Generation • 2B • Updated Oct 22, 2025 • 3.3k • 28
  BSC-LT/salamandra-2b • Text Generation • 2B • Updated Oct 22, 2025 • 964 • 25
Speech models
  Models developed by the speech team of the Language Technologies unit.
  BSC-LT/wavenext-encodec • Updated Sep 12, 2024 • 3
  BSC-LT/wavenext-mel • Updated Sep 10, 2024 • 7 • 3
  BSC-LT/vocos-mel-22khz • Updated Aug 27, 2024 • 358 • 7
  BSC-LT/whisper-large-v3-ca-punctuated-3370h • Automatic Speech Recognition • Updated Oct 28, 2025 • 72 • 1
ALIA
  BSC-LT/ALIA-40b-instruct-2601 • Text Generation • 40B • Updated Feb 19 • 4.1k • 12
  BSC-LT/ALIA-40b-instruct-2601-GGUF • Text Generation • 40B • Updated Feb 20 • 411 • 4
MT Datasets
  BSC-LT/NTEU_Multilingual_Evaluation_Dataset • Updated Nov 4, 2025 • 34 • 1
  BSC-LT/ALIA_mixed_authentic_synthetic_MT • Viewer • Updated Dec 17, 2025 • 454M • 138 • 1
  BSC-LT/Catalan-Aranese_Parallel_Corpus • Viewer • Updated Feb 6 • 539k • 21 • 1
  BSC-LT/Spanish-Valencian_Catalan_Parallel_Corpus • Viewer • Updated Mar 4 • 2.16M • 60 • 1
Speech datasets
  Datasets curated by the speech team of the Language Technologies unit.
  BSC-LT/CAESAR-TINY • Viewer • Updated Apr 7, 2025 • 667 • 25
  BSC-LT/CAESAR-TV3 • Viewer • Updated Apr 7, 2025 • 2.96k • 51 • 1
  BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test • Updated Nov 22, 2025 • 25
  BSC-LT/distilled-yodas-spanish • Updated Dec 15, 2025 • 208 • 3