MMS-LID 126 (Core ML, 4-bit)
Core ML conversion of MMS-LID covering 126 languages, palettized to 4 bits for a smaller model size and Apple Neural Engine (ANE)-friendly inference on iOS and macOS.
- Base model: facebook/mms-lid-126
- Format: Core ML (.mlpackage), 4-bit palettized
- Languages: 126 (ISO 639-3)
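As a rough illustration of what "4-bit palettized" means: each weight is stored as a 4-bit index into a 16-entry lookup table instead of as a full-precision float. The sketch below is not the conversion pipeline used for this model. It assumes a single palette of 16 evenly spaced values for simplicity (conversion tools commonly pick the palette by clustering instead), and the function names `palettize_4bit` and `depalettize` are hypothetical.

```python
def palettize_4bit(weights):
    """Map each weight to the nearest of 16 evenly spaced palette values.

    Returns (palette, indices): a 16-entry lookup table and one index in
    0..15 per weight. Assumption: a uniform palette, chosen for simplicity.
    """
    lo, hi = min(weights), max(weights)
    # Fall back to a step of 1.0 when all weights are equal (hi == lo).
    step = (hi - lo) / 15 or 1.0
    palette = [lo + i * step for i in range(16)]
    indices = [min(15, max(0, round((w - lo) / step))) for w in weights]
    return palette, indices


def depalettize(palette, indices):
    """Reconstruct approximate weights by looking up each index."""
    return [palette[i] for i in indices]


weights = [-1.0, -0.5, 0.0, 0.5, 1.0]
palette, indices = palettize_4bit(weights)
restored = depalettize(palette, indices)
```

Each index fits in 4 bits, so the stored weights take roughly a quarter of the space of float16 weights (plus the small lookup table), at the cost of a reconstruction error of at most half a palette step in this uniform sketch.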
Contents
- 4-bit palettized Core ML model (.mlpackage)
- Label mapping file(s) from class index to ISO 639-3 language code
Input / Output
- Input: 16 kHz mono float32 audio, 10 seconds (160,000 samples)
- Output: Logits over 126 language classes; take the argmax, then look up the ISO 639-3 code in the labels file.
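The pre- and post-processing around the model can be sketched as follows. This is a minimal sketch, not code shipped with this repo. It assumes that shorter clips are zero-padded and longer clips are truncated to the fixed 160,000-sample window, and that the labels are available as a list of ISO 639-3 codes ordered by class index; neither is stated by the model card. The names `fit_to_window` and `decode_language` are hypothetical, and the Core ML prediction call itself is left out.

```python
SAMPLE_RATE = 16_000                       # Hz, mono
CLIP_SECONDS = 10
NUM_SAMPLES = SAMPLE_RATE * CLIP_SECONDS   # 160,000 samples


def fit_to_window(samples, num_samples=NUM_SAMPLES):
    """Truncate or zero-pad a mono waveform to the fixed input length.

    Assumption: zero-padding at the end is acceptable for short clips.
    """
    clipped = list(samples[:num_samples])
    return clipped + [0.0] * (num_samples - len(clipped))


def decode_language(logits, labels):
    """Return the label at the position of the largest logit.

    Assumption: `labels` is a list of ISO 639-3 codes ordered by class index.
    """
    if len(logits) != len(labels):
        raise ValueError("logits and labels must have the same length")
    best_index = max(range(len(logits)), key=logits.__getitem__)
    return labels[best_index]


# Toy usage with three classes instead of 126.
window = fit_to_window([0.1, -0.2, 0.3])
language = decode_language([0.1, 2.3, -1.0], ["eng", "fra", "spa"])  # "fra"
```

In a real pipeline, `window` would be converted to a float32 array and passed to the Core ML model, and the returned logits would be passed to `decode_language` together with the 126 labels from the mapping file.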
Related
| Variant | Repo |
|---|---|
| Float16 | mms-lid-126-coreml |
| 4-bit | this repo |
| INT8 | mms-lid-126-coreml-int8 |
Citation
@article{pratap2023mms,
title={Scaling Speech Technology to 1,000+ Languages},
author={Vineel Pratap and Andros Tjandra and Bowen Shi and Paden Tomasello and Arun Babu and Sayani Kundu and Ali Elkahky and Zhaoheng Ni and Apoorv Vyas and Maryam Fazel-Zarandi and Alexei Baevski and Yossi Adi and Xiaohui Zhang and Wei-Ning Hsu and Alexis Conneau and Michael Auli},
journal={arXiv},
year={2023}
}
License
CC-BY-NC-4.0 (inherited from MMS-LID).