---
language:
- multilingual
license: cc-by-nc-4.0
tags:
- language-identification
- onnx
- audio
- wav2vec2
- mms-lid
datasets:
- mms-lid
---

# MMS-LID 256 (ONNX)

ONNX export of **MMS-LID** (Massively Multilingual Speech - Language Identification) for **256 languages**, intended for on-device or server inference without a PyTorch dependency.

- **Base model:** [facebook/mms-lid-256](https://huggingface.co/facebook/mms-lid-256)
- **Format:** ONNX
- **Languages:** 256 (ISO 639-3)

## Contents

- ONNX model file(s) for the Wav2Vec2-based LID classifier
- Label mapping (e.g. `labels.json` or `mms_lid_id2label.json`) from class index to language code

## Input / Output

- **Input:** Raw waveform, 16 kHz mono float32, 10 seconds (160,000 samples)
- **Output:** Logits over 256 language classes; `argmax` gives the predicted class index. Map the index to an ISO 639-3 code using the included labels file.

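Getting arbitrary audio into that fixed shape can be sketched as follows. The linear-interpolation resampling is a naive placeholder for brevity; a dedicated resampler (e.g. librosa or torchaudio) is preferable in practice.

```python
import numpy as np

def to_fixed_length_16k(audio: np.ndarray, sr: int) -> np.ndarray:
    """Resample (naively), then pad/trim to exactly 10 s at 16 kHz."""
    target_sr, n_target = 16_000, 160_000
    if sr != target_sr:
        # Linear-interpolation resampling -- fine for a sketch, not production.
        n_out = int(round(len(audio) * target_sr / sr))
        audio = np.interp(
            np.linspace(0, len(audio) - 1, n_out),
            np.arange(len(audio)),
            audio,
        )
    audio = audio.astype(np.float32)
    if len(audio) < n_target:
        audio = np.pad(audio, (0, n_target - len(audio)))  # zero-pad short clips
    return audio[:n_target][None, :]  # shape (1, 160000) for the model
```

Zero-padding short clips and truncating long ones keeps the model input shape constant, which is what a fixed-shape ONNX graph expects.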
## Usage

1. Load the ONNX model with your runtime (e.g. ONNX Runtime, or convert further to Core ML for iOS).
2. Feed 10 seconds of 16 kHz mono float32 audio.
3. Take the `argmax` of the logits output and look up the language code in the labels file.

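The steps above can be sketched with ONNX Runtime (`pip install onnxruntime`). The file names `mms_lid_256.onnx` and `mms_lid_id2label.json` are assumptions for illustration; substitute the actual file names shipped in this repo.

```python
import json
import numpy as np

def logits_to_language(logits: np.ndarray, id2label: dict) -> str:
    """Map (1, 256) logits to an ISO 639-3 code via argmax."""
    idx = int(np.argmax(logits, axis=-1)[0])
    return id2label[str(idx)]

def predict_language(model_path: str, labels_path: str, waveform: np.ndarray) -> str:
    """waveform: float32, shape (1, 160000) -- 10 s of 16 kHz mono audio."""
    import onnxruntime as ort  # lazy import keeps the helper above dependency-free

    session = ort.InferenceSession(model_path)
    with open(labels_path) as f:
        id2label = json.load(f)
    input_name = session.get_inputs()[0].name
    logits = session.run(None, {input_name: waveform})[0]
    return logits_to_language(logits, id2label)

# Usage (file names are assumptions -- check this repo's file list):
# code = predict_language("mms_lid_256.onnx", "mms_lid_id2label.json", waveform)
```

If the labels file maps integer keys rather than string keys, drop the `str(...)` conversion accordingly.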
## Related repos

| Languages | Format  | Repo |
|-----------|---------|------|
| 256       | ONNX    | **this repo** |
| 126       | Core ML | [mms-lid-126-coreml](https://huggingface.co/aoiandroid/mms-lid-126-coreml) |
| 256       | Core ML | [mms-lid-256-coreml](https://huggingface.co/aoiandroid/mms-lid-256-coreml) |
| 512       | Core ML | [mms-lid-512-coreml](https://huggingface.co/aoiandroid/mms-lid-512-coreml) |

## Citation

```bibtex
@article{pratap2023mms,
  title={Scaling Speech Technology to 1,000+ Languages},
  author={Vineel Pratap and Andros Tjandra and Bowen Shi and Paden Tomasello and Arun Babu and Sayani Kundu and Ali Elkahky and Zhaoheng Ni and Apoorv Vyas and Maryam Fazel-Zarandi and Alexei Baevski and Yossi Adi and Xiaohui Zhang and Wei-Ning Hsu and Alexis Conneau and Michael Auli},
  journal={arXiv preprint arXiv:2305.13516},
  year={2023}
}
```

## License

CC-BY-NC-4.0, inherited from the original MMS-LID model.