MrBERT BSC-LT/MrBERT Fill-Mask • 0.3B • Updated Mar 26 • 179 • 8 BSC-LT/MrBERT-es Fill-Mask • 0.2B • Updated Apr 9 • 1.99k • • 9 BSC-LT/MrBERT-ca Fill-Mask • 0.1B • Updated Apr 21 • 50 • 2 BSC-LT/MrBERT-legal Fill-Mask • 0.3B • Updated Apr 9 • 73 • 1
Salamandra 🦎 BSC-LT/salamandra-7b-instruct Text Generation • 8B • Updated Oct 22, 2025 • 16.9k • 80 BSC-LT/salamandra-7b Text Generation • 8B • Updated Oct 22, 2025 • 951 • 30 BSC-LT/salamandra-2b-instruct Text Generation • 2B • Updated Oct 22, 2025 • 5.48k • 28 BSC-LT/salamandra-2b Text Generation • 2B • Updated Oct 22, 2025 • 536 • 27
MT Datasets Machine Translation Datasets developed by the MT team of the AI Institute, BSC BSC-LT/BSC_ParaMT_8 Viewer • Updated 13 days ago • 733M • 146 BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus Updated May 21 • 28 BSC-LT/MULTI_corpus Viewer • Updated May 21 • 468k • 42 BSC-LT/geneval_catalan Viewer • Updated Apr 9 • 5.25k • 62
Speech datasets Datasets curated by the speech team of the Language Technologies unit BSC-LT/CAESAR-TINY Viewer • Updated Apr 7, 2025 • 667 • 20 BSC-LT/CAESAR-TV3 Viewer • Updated Apr 7, 2025 • 2.96k • 28 • 1 BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test Updated Nov 22, 2025 • 25 BSC-LT/distilled-yodas-spanish Updated Dec 15, 2025 • 60 • 4
ALIA BSC-LT/ALIA-40b-instruct-2601 Text Generation • 40B • Updated 28 days ago • 683 • 12 BSC-LT/ALIA-40b-instruct-2601-GGUF Text Generation • 40B • Updated Feb 20 • 98 • 5
MT Models Machine Translation Models developed by the MT team of the AI Institute, BSC BSC-LT/salamandraTA-7b-instruct Translation • 8B • Updated 2 days ago • 1.65k • 25 BSC-LT/salamandraTA-7B-instruct-GGUF Translation • 8B • Updated Oct 31, 2025 • 270 • 1 BSC-LT/salamandraTA-2b-instruct Translation • 2B • Updated May 11 • 137 • 2 BSC-LT/salamandraTA-2B-instruct-GGUF Translation • 2B • Updated Aug 19, 2025 • 97 • 2
Speech models Models developed by the speech team of the Language Technologies unit BSC-LT/wavenext-encodec Updated Sep 12, 2024 • 4 BSC-LT/wavenext-mel Updated Sep 10, 2024 • 39 • 3 BSC-LT/vocos-mel-22khz Updated Aug 27, 2024 • 519 • 7 BSC-LT/whisper-large-v3-ca-punctuated-3370h Automatic Speech Recognition • Updated Oct 28, 2025 • 68 • 1
BSC-LT/whisper-large-v3-ca-punctuated-3370h Automatic Speech Recognition • Updated Oct 28, 2025 • 68 • 1
MrBERT BSC-LT/MrBERT Fill-Mask • 0.3B • Updated Mar 26 • 179 • 8 BSC-LT/MrBERT-es Fill-Mask • 0.2B • Updated Apr 9 • 1.99k • • 9 BSC-LT/MrBERT-ca Fill-Mask • 0.1B • Updated Apr 21 • 50 • 2 BSC-LT/MrBERT-legal Fill-Mask • 0.3B • Updated Apr 9 • 73 • 1
ALIA BSC-LT/ALIA-40b-instruct-2601 Text Generation • 40B • Updated 28 days ago • 683 • 12 BSC-LT/ALIA-40b-instruct-2601-GGUF Text Generation • 40B • Updated Feb 20 • 98 • 5
Salamandra 🦎 BSC-LT/salamandra-7b-instruct Text Generation • 8B • Updated Oct 22, 2025 • 16.9k • 80 BSC-LT/salamandra-7b Text Generation • 8B • Updated Oct 22, 2025 • 951 • 30 BSC-LT/salamandra-2b-instruct Text Generation • 2B • Updated Oct 22, 2025 • 5.48k • 28 BSC-LT/salamandra-2b Text Generation • 2B • Updated Oct 22, 2025 • 536 • 27
MT Models Machine Translation Models developed by the MT team of the AI Institute, BSC BSC-LT/salamandraTA-7b-instruct Translation • 8B • Updated 2 days ago • 1.65k • 25 BSC-LT/salamandraTA-7B-instruct-GGUF Translation • 8B • Updated Oct 31, 2025 • 270 • 1 BSC-LT/salamandraTA-2b-instruct Translation • 2B • Updated May 11 • 137 • 2 BSC-LT/salamandraTA-2B-instruct-GGUF Translation • 2B • Updated Aug 19, 2025 • 97 • 2
MT Datasets Machine Translation Datasets developed by the MT team of the AI Institute, BSC BSC-LT/BSC_ParaMT_8 Viewer • Updated 13 days ago • 733M • 146 BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus Updated May 21 • 28 BSC-LT/MULTI_corpus Viewer • Updated May 21 • 468k • 42 BSC-LT/geneval_catalan Viewer • Updated Apr 9 • 5.25k • 62
Speech models Models developed by the speech team of the Language Technologies unit BSC-LT/wavenext-encodec Updated Sep 12, 2024 • 4 BSC-LT/wavenext-mel Updated Sep 10, 2024 • 39 • 3 BSC-LT/vocos-mel-22khz Updated Aug 27, 2024 • 519 • 7 BSC-LT/whisper-large-v3-ca-punctuated-3370h Automatic Speech Recognition • Updated Oct 28, 2025 • 68 • 1
BSC-LT/whisper-large-v3-ca-punctuated-3370h Automatic Speech Recognition • Updated Oct 28, 2025 • 68 • 1
Speech datasets Datasets curated by the speech team of the Language Technologies unit BSC-LT/CAESAR-TINY Viewer • Updated Apr 7, 2025 • 667 • 20 BSC-LT/CAESAR-TV3 Viewer • Updated Apr 7, 2025 • 2.96k • 28 • 1 BSC-LT/BSCs_Code_Switching_CA-ES_ASR_Test Updated Nov 22, 2025 • 25 BSC-LT/distilled-yodas-spanish Updated Dec 15, 2025 • 60 • 4