mms-lid
Collection
12 items โข Updated
Core ML conversion of MMS-LID (Massively Multilingual Speech - Language Identification) for 256 languages. Float16 model for on-device inference on iOS 17+ and macOS.
labels.json or mms_lid_id2label.json โ Index to language code mappingargmax gives the predicted language index. Map to ISO 639-3 using the labels file..mlpackage with Core ML; feed 10 seconds of 16 kHz mono audio.argmax of the logits output and look up the language code in the labels file.| Repo | Description |
|---|---|
| this repo | Float16 Core ML |
| mms-lid-256-coreml-4bit | 4-bit palettized (smaller, ANE-friendly) |
| Languages | ONNX | Core ML |
|---|---|---|
| 256 | mms-lid-256-onnx | this repo |
| 126 | โ | mms-lid-126-coreml |
| 512 | โ | mms-lid-512-coreml |
@article{pratap2023mms,
title={Scaling Speech Technology to 1,000+ Languages},
author={Vineel Pratap and Andros Tjandra and Bowen Shi and Paden Tomasello and Arun Babu and Sayani Kundu and Ali Elkahky and Zhaoheng Ni and Apoorv Vyas and Maryam Fazel-Zarandi and Alexei Baevski and Yossi Adi and Xiaohui Zhang and Wei-Ning Hsu and Alexis Conneau and Michael Auli},
journal={arXiv},
year={2023}
}
CC-BY-NC-4.0 (inherited from MMS-LID).