MT Datasets

updated 6 days ago

Machine Translation Datasets developed by the MT team of the AI Institute, BSC


  • BSC-LT/BSC_ParaMT_8

    Viewer • Updated 17 days ago • 733M • 156

  • BSC-LT/Legal_Catalan_Spanish_Parallel_Corpus

    Updated May 21 • 28

  • BSC-LT/MULTI_corpus

    Viewer • Updated May 21 • 468k • 49

  • BSC-LT/geneval_catalan

    Viewer • Updated Apr 9 • 5.25k • 59

  • BSC-LT/NTEU_Multilingual_Evaluation_Dataset

    Updated Nov 4, 2025 • 89 • 1

  • BSC-LT/Catalan-Aranese_Parallel_Corpus

    Viewer • Updated Feb 6 • 539k • 26 • 1

  • BSC-LT/ALIA_mixed_authentic_synthetic_MT

    Viewer • Updated Dec 17, 2025 • 454M • 198 • 1

  • BSC-LT/Spanish-Valencian_Catalan_Parallel_Corpus

    Viewer • Updated Mar 4 • 2.16M • 32 • 2