| --- |
| tags: |
| - mms |
| language: |
| - ab |
| - af |
| - ak |
| - am |
| - ar |
| - as |
| - av |
| - ay |
| - az |
| - ba |
| - bm |
| - be |
| - bn |
| - bi |
| - bo |
| - sh |
| - br |
| - bg |
| - ca |
| - cs |
| - ce |
| - cv |
| - ku |
| - cy |
| - da |
| - de |
| - dv |
| - dz |
| - el |
| - en |
| - eo |
| - et |
| - eu |
| - ee |
| - fo |
| - fa |
| - fj |
| - fi |
| - fr |
| - fy |
| - ff |
| - ga |
| - gl |
| - gn |
| - gu |
| - zh |
| - ht |
| - ha |
| - he |
| - hi |
| - sh |
| - hu |
| - hy |
| - ig |
| - ia |
| - ms |
| - is |
| - it |
| - jv |
| - ja |
| - kn |
| - ka |
| - kk |
| - kr |
| - km |
| - ki |
| - rw |
| - ky |
| - ko |
| - kv |
| - lo |
| - la |
| - lv |
| - ln |
| - lt |
| - lb |
| - lg |
| - mh |
| - ml |
| - mr |
| - ms |
| - mk |
| - mg |
| - mt |
| - mn |
| - mi |
| - my |
| - zh |
| - nl |
| - 'no' |
| - 'no' |
| - ne |
| - ny |
| - oc |
| - om |
| - or |
| - os |
| - pa |
| - pl |
| - pt |
| - ms |
| - ps |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - qu |
| - ro |
| - rn |
| - ru |
| - sg |
| - sk |
| - sl |
| - sm |
| - sn |
| - sd |
| - so |
| - es |
| - sq |
| - su |
| - sv |
| - sw |
| - ta |
| - tt |
| - te |
| - tg |
| - tl |
| - th |
| - ti |
| - ts |
| - tr |
| - uk |
| - ms |
| - vi |
| - wo |
| - xh |
| - ms |
| - yo |
| - ms |
| - zu |
| - za |
| license: cc-by-nc-4.0 |
| datasets: |
| - google/fleurs |
| metrics: |
| - wer |
| --- |
| |
| # Massively Multilingual Speech (MMS) - Finetuned ASR - ALL |
|
|
| This checkpoint is a model fine-tuned for multi-lingual ASR and part of Facebook's [Massive Multilingual Speech project](https://research.facebook.com/publications/scaling-speech-technology-to-1000-languages/). |
| This checkpoint is based on the [Wav2Vec2 architecture](https://huggingface.co/docs/transformers/model_doc/wav2vec2) and makes use of adapter models to transcribe 1000+ languages. |
| The checkpoint consists of **1 billion parameters** and has been fine-tuned from [facebook/mms-1b](https://huggingface.co/facebook/mms-1b) on 1162 languages. |
|
|
| ## Table Of Content |
|
|
| - [Example](#example) |
| - [Supported Languages](#supported-languages) |
| - [Model details](#model-details) |
| - [Additional links](#additional-links) |
|
|
| ## Example |
|
|
| This MMS checkpoint can be used with [Transformers](https://github.com/huggingface/transformers) to transcribe audio of 1107 different |
| languages. Let's look at a simple example. |
|
|
| First, we install transformers and some other libraries |
| ``` |
| pip install torch accelerate torchaudio datasets |
| pip install --upgrade transformers |
| ```` |
|
|
| **Note**: In order to use MMS you need to have at least `transformers >= 4.30` installed. If the `4.30` version |
| is not yet available [on PyPI](https://pypi.org/project/transformers/) make sure to install `transformers` from |
| source: |
| ``` |
| pip install git+https://github.com/huggingface/transformers.git |
| ``` |
|
|
| Next, we load a couple of audio samples via `datasets`. Make sure that the audio data is sampled to 16000 kHz. |
|
|
| ```py |
| from datasets import load_dataset, Audio |
| |
| # English |
| stream_data = load_dataset("mozilla-foundation/common_voice_13_0", "en", split="test", streaming=True) |
| stream_data = stream_data.cast_column("audio", Audio(sampling_rate=16000)) |
| en_sample = next(iter(stream_data))["audio"]["array"] |
| |
| # French |
| stream_data = load_dataset("mozilla-foundation/common_voice_13_0", "fr", split="test", streaming=True) |
| stream_data = stream_data.cast_column("audio", Audio(sampling_rate=16000)) |
| fr_sample = next(iter(stream_data))["audio"]["array"] |
| ``` |
|
|
| Next, we load the model and processor |
|
|
| ```py |
| from transformers import Wav2Vec2ForCTC, AutoProcessor |
| import torch |
| |
| model_id = "facebook/mms-1b-all" |
| |
| processor = AutoProcessor.from_pretrained(model_id) |
| model = Wav2Vec2ForCTC.from_pretrained(model_id) |
| ``` |
|
|
| Now we process the audio data, pass the processed audio data to the model and transcribe the model output, just like we usually do for Wav2Vec2 models such as [facebook/wav2vec2-base-960h](https://huggingface.co/facebook/wav2vec2-base-960h) |
|
|
| ```py |
| inputs = processor(en_sample, sampling_rate=16_000, return_tensors="pt") |
| |
| with torch.no_grad(): |
| outputs = model(**inputs).logits |
| |
| ids = torch.argmax(outputs, dim=-1)[0] |
| transcription = processor.decode(ids) |
| # 'joe keton disapproved of films and buster also had reservations about the media' |
| ``` |
|
|
| We can now keep the same model in memory and simply switch out the language adapters by calling the convenient [`load_adapter()`]() function for the model and [`set_target_lang()`]() for the tokenizer. We pass the target language as an input - "fra" for French. |
|
|
| ```py |
| processor.tokenizer.set_target_lang("fra") |
| model.load_adapter("fra") |
| |
| inputs = processor(fr_sample, sampling_rate=16_000, return_tensors="pt") |
| |
| with torch.no_grad(): |
| outputs = model(**inputs).logits |
| |
| ids = torch.argmax(outputs, dim=-1)[0] |
| transcription = processor.decode(ids) |
| # "ce dernier est volé tout au long de l'histoire romaine" |
| ``` |
|
|
| In the same way the language can be switched out for all other supported languages. Please have a look at: |
| ```py |
| processor.tokenizer.vocab.keys() |
| ``` |
|
|
| For more details, please have a look at [the official docs](https://huggingface.co/docs/transformers/main/en/model_doc/mms). |
|
|
| ## Supported Languages |
|
|
| This model supports 1162 languages. Unclick the following to toogle all supported languages of this checkpoint in [ISO 639-3 code](https://en.wikipedia.org/wiki/ISO_639-3). |
| You can find more details about the languages and their ISO 649-3 codes in the [MMS Language Coverage Overview](https://dl.fbaipublicfiles.com/mms/misc/language_coverage_mms.html). |
| <details> |
| <summary>Click to toggle</summary> |
|
|
| - abi |
| - abk |
| - abp |
| - aca |
| - acd |
| - ace |
| - acf |
| - ach |
| - acn |
| - acr |
| - acu |
| - ade |
| - adh |
| - adj |
| - adx |
| - aeu |
| - afr |
| - agd |
| - agg |
| - agn |
| - agr |
| - agu |
| - agx |
| - aha |
| - ahk |
| - aia |
| - aka |
| - akb |
| - ake |
| - akp |
| - alj |
| - alp |
| - alt |
| - alz |
| - ame |
| - amf |
| - amh |
| - ami |
| - amk |
| - ann |
| - any |
| - aoz |
| - apb |
| - apr |
| - ara |
| - arl |
| - asa |
| - asg |
| - asm |
| - ast |
| - ata |
| - atb |
| - atg |
| - ati |
| - atq |
| - ava |
| - avn |
| - avu |
| - awa |
| - awb |
| - ayo |
| - ayr |
| - ayz |
| - azb |
| - azg |
| - azj-script_cyrillic |
| - azj-script_latin |
| - azz |
| - bak |
| - bam |
| - ban |
| - bao |
| - bas |
| - bav |
| - bba |
| - bbb |
| - bbc |
| - bbo |
| - bcc-script_arabic |
| - bcc-script_latin |
| - bcl |
| - bcw |
| - bdg |
| - bdh |
| - bdq |
| - bdu |
| - bdv |
| - beh |
| - bel |
| - bem |
| - ben |
| - bep |
| - bex |
| - bfa |
| - bfo |
| - bfy |
| - bfz |
| - bgc |
| - bgq |
| - bgr |
| - bgt |
| - bgw |
| - bha |
| - bht |
| - bhz |
| - bib |
| - bim |
| - bis |
| - biv |
| - bjr |
| - bjv |
| - bjw |
| - bjz |
| - bkd |
| - bkv |
| - blh |
| - blt |
| - blx |
| - blz |
| - bmq |
| - bmr |
| - bmu |
| - bmv |
| - bng |
| - bno |
| - bnp |
| - boa |
| - bod |
| - boj |
| - bom |
| - bor |
| - bos |
| - bov |
| - box |
| - bpr |
| - bps |
| - bqc |
| - bqi |
| - bqj |
| - bqp |
| - bre |
| - bru |
| - bsc |
| - bsq |
| - bss |
| - btd |
| - bts |
| - btt |
| - btx |
| - bud |
| - bul |
| - bus |
| - bvc |
| - bvz |
| - bwq |
| - bwu |
| - byr |
| - bzh |
| - bzi |
| - bzj |
| - caa |
| - cab |
| - cac-dialect_sanmateoixtatan |
| - cac-dialect_sansebastiancoatan |
| - cak-dialect_central |
| - cak-dialect_santamariadejesus |
| - cak-dialect_santodomingoxenacoj |
| - cak-dialect_southcentral |
| - cak-dialect_western |
| - cak-dialect_yepocapa |
| - cap |
| - car |
| - cas |
| - cat |
| - cax |
| - cbc |
| - cbi |
| - cbr |
| - cbs |
| - cbt |
| - cbu |
| - cbv |
| - cce |
| - cco |
| - cdj |
| - ceb |
| - ceg |
| - cek |
| - ces |
| - cfm |
| - cgc |
| - che |
| - chf |
| - chv |
| - chz |
| - cjo |
| - cjp |
| - cjs |
| - ckb |
| - cko |
| - ckt |
| - cla |
| - cle |
| - cly |
| - cme |
| - cmn-script_simplified |
| - cmo-script_khmer |
| - cmo-script_latin |
| - cmr |
| - cnh |
| - cni |
| - cnl |
| - cnt |
| - coe |
| - cof |
| - cok |
| - con |
| - cot |
| - cou |
| - cpa |
| - cpb |
| - cpu |
| - crh |
| - crk-script_latin |
| - crk-script_syllabics |
| - crn |
| - crq |
| - crs |
| - crt |
| - csk |
| - cso |
| - ctd |
| - ctg |
| - cto |
| - ctu |
| - cuc |
| - cui |
| - cuk |
| - cul |
| - cwa |
| - cwe |
| - cwt |
| - cya |
| - cym |
| - daa |
| - dah |
| - dan |
| - dar |
| - dbj |
| - dbq |
| - ddn |
| - ded |
| - des |
| - deu |
| - dga |
| - dgi |
| - dgk |
| - dgo |
| - dgr |
| - dhi |
| - did |
| - dig |
| - dik |
| - dip |
| - div |
| - djk |
| - dnj-dialect_blowowest |
| - dnj-dialect_gweetaawueast |
| - dnt |
| - dnw |
| - dop |
| - dos |
| - dsh |
| - dso |
| - dtp |
| - dts |
| - dug |
| - dwr |
| - dyi |
| - dyo |
| - dyu |
| - dzo |
| - eip |
| - eka |
| - ell |
| - emp |
| - enb |
| - eng |
| - enx |
| - epo |
| - ese |
| - ess |
| - est |
| - eus |
| - evn |
| - ewe |
| - eza |
| - fal |
| - fao |
| - far |
| - fas |
| - fij |
| - fin |
| - flr |
| - fmu |
| - fon |
| - fra |
| - frd |
| - fry |
| - ful |
| - gag-script_cyrillic |
| - gag-script_latin |
| - gai |
| - gam |
| - gau |
| - gbi |
| - gbk |
| - gbm |
| - gbo |
| - gde |
| - geb |
| - gej |
| - gil |
| - gjn |
| - gkn |
| - gld |
| - gle |
| - glg |
| - glk |
| - gmv |
| - gna |
| - gnd |
| - gng |
| - gof-script_latin |
| - gog |
| - gor |
| - gqr |
| - grc |
| - gri |
| - grn |
| - grt |
| - gso |
| - gub |
| - guc |
| - gud |
| - guh |
| - guj |
| - guk |
| - gum |
| - guo |
| - guq |
| - guu |
| - gux |
| - gvc |
| - gvl |
| - gwi |
| - gwr |
| - gym |
| - gyr |
| - had |
| - hag |
| - hak |
| - hap |
| - hat |
| - hau |
| - hay |
| - heb |
| - heh |
| - hif |
| - hig |
| - hil |
| - hin |
| - hlb |
| - hlt |
| - hne |
| - hnn |
| - hns |
| - hoc |
| - hoy |
| - hrv |
| - hsb |
| - hto |
| - hub |
| - hui |
| - hun |
| - hus-dialect_centralveracruz |
| - hus-dialect_westernpotosino |
| - huu |
| - huv |
| - hvn |
| - hwc |
| - hye |
| - hyw |
| - iba |
| - ibo |
| - icr |
| - idd |
| - ifa |
| - ifb |
| - ife |
| - ifk |
| - ifu |
| - ify |
| - ign |
| - ikk |
| - ilb |
| - ilo |
| - imo |
| - ina |
| - inb |
| - ind |
| - iou |
| - ipi |
| - iqw |
| - iri |
| - irk |
| - isl |
| - ita |
| - itl |
| - itv |
| - ixl-dialect_sangasparchajul |
| - ixl-dialect_sanjuancotzal |
| - ixl-dialect_santamarianebaj |
| - izr |
| - izz |
| - jac |
| - jam |
| - jav |
| - jbu |
| - jen |
| - jic |
| - jiv |
| - jmc |
| - jmd |
| - jpn |
| - jun |
| - juy |
| - jvn |
| - kaa |
| - kab |
| - kac |
| - kak |
| - kam |
| - kan |
| - kao |
| - kaq |
| - kat |
| - kay |
| - kaz |
| - kbo |
| - kbp |
| - kbq |
| - kbr |
| - kby |
| - kca |
| - kcg |
| - kdc |
| - kde |
| - kdh |
| - kdi |
| - kdj |
| - kdl |
| - kdn |
| - kdt |
| - kea |
| - kek |
| - ken |
| - keo |
| - ker |
| - key |
| - kez |
| - kfb |
| - kff-script_telugu |
| - kfw |
| - kfx |
| - khg |
| - khm |
| - khq |
| - kia |
| - kij |
| - kik |
| - kin |
| - kir |
| - kjb |
| - kje |
| - kjg |
| - kjh |
| - kki |
| - kkj |
| - kle |
| - klu |
| - klv |
| - klw |
| - kma |
| - kmd |
| - kml |
| - kmr-script_arabic |
| - kmr-script_cyrillic |
| - kmr-script_latin |
| - kmu |
| - knb |
| - kne |
| - knf |
| - knj |
| - knk |
| - kno |
| - kog |
| - kor |
| - kpq |
| - kps |
| - kpv |
| - kpy |
| - kpz |
| - kqe |
| - kqp |
| - kqr |
| - kqy |
| - krc |
| - kri |
| - krj |
| - krl |
| - krr |
| - krs |
| - kru |
| - ksb |
| - ksr |
| - kss |
| - ktb |
| - ktj |
| - kub |
| - kue |
| - kum |
| - kus |
| - kvn |
| - kvw |
| - kwd |
| - kwf |
| - kwi |
| - kxc |
| - kxf |
| - kxm |
| - kxv |
| - kyb |
| - kyc |
| - kyf |
| - kyg |
| - kyo |
| - kyq |
| - kyu |
| - kyz |
| - kzf |
| - lac |
| - laj |
| - lam |
| - lao |
| - las |
| - lat |
| - lav |
| - law |
| - lbj |
| - lbw |
| - lcp |
| - lee |
| - lef |
| - lem |
| - lew |
| - lex |
| - lgg |
| - lgl |
| - lhu |
| - lia |
| - lid |
| - lif |
| - lin |
| - lip |
| - lis |
| - lit |
| - lje |
| - ljp |
| - llg |
| - lln |
| - lme |
| - lnd |
| - lns |
| - lob |
| - lok |
| - lom |
| - lon |
| - loq |
| - lsi |
| - lsm |
| - ltz |
| - luc |
| - lug |
| - luo |
| - lwo |
| - lww |
| - lzz |
| - maa-dialect_sanantonio |
| - maa-dialect_sanjeronimo |
| - mad |
| - mag |
| - mah |
| - mai |
| - maj |
| - mak |
| - mal |
| - mam-dialect_central |
| - mam-dialect_northern |
| - mam-dialect_southern |
| - mam-dialect_western |
| - maq |
| - mar |
| - maw |
| - maz |
| - mbb |
| - mbc |
| - mbh |
| - mbj |
| - mbt |
| - mbu |
| - mbz |
| - mca |
| - mcb |
| - mcd |
| - mco |
| - mcp |
| - mcq |
| - mcu |
| - mda |
| - mdf |
| - mdv |
| - mdy |
| - med |
| - mee |
| - mej |
| - men |
| - meq |
| - met |
| - mev |
| - mfe |
| - mfh |
| - mfi |
| - mfk |
| - mfq |
| - mfy |
| - mfz |
| - mgd |
| - mge |
| - mgh |
| - mgo |
| - mhi |
| - mhr |
| - mhu |
| - mhx |
| - mhy |
| - mib |
| - mie |
| - mif |
| - mih |
| - mil |
| - mim |
| - min |
| - mio |
| - mip |
| - miq |
| - mit |
| - miy |
| - miz |
| - mjl |
| - mjv |
| - mkd |
| - mkl |
| - mkn |
| - mlg |
| - mlt |
| - mmg |
| - mnb |
| - mnf |
| - mnk |
| - mnw |
| - mnx |
| - moa |
| - mog |
| - mon |
| - mop |
| - mor |
| - mos |
| - mox |
| - moz |
| - mpg |
| - mpm |
| - mpp |
| - mpx |
| - mqb |
| - mqf |
| - mqj |
| - mqn |
| - mri |
| - mrw |
| - msy |
| - mtd |
| - mtj |
| - mto |
| - muh |
| - mup |
| - mur |
| - muv |
| - muy |
| - mvp |
| - mwq |
| - mwv |
| - mxb |
| - mxq |
| - mxt |
| - mxv |
| - mya |
| - myb |
| - myk |
| - myl |
| - myv |
| - myx |
| - myy |
| - mza |
| - mzi |
| - mzj |
| - mzk |
| - mzm |
| - mzw |
| - nab |
| - nag |
| - nan |
| - nas |
| - naw |
| - nca |
| - nch |
| - ncj |
| - ncl |
| - ncu |
| - ndj |
| - ndp |
| - ndv |
| - ndy |
| - ndz |
| - neb |
| - new |
| - nfa |
| - nfr |
| - nga |
| - ngl |
| - ngp |
| - ngu |
| - nhe |
| - nhi |
| - nhu |
| - nhw |
| - nhx |
| - nhy |
| - nia |
| - nij |
| - nim |
| - nin |
| - nko |
| - nlc |
| - nld |
| - nlg |
| - nlk |
| - nmz |
| - nnb |
| - nno |
| - nnq |
| - nnw |
| - noa |
| - nob |
| - nod |
| - nog |
| - not |
| - npi |
| - npl |
| - npy |
| - nso |
| - nst |
| - nsu |
| - ntm |
| - ntr |
| - nuj |
| - nus |
| - nuz |
| - nwb |
| - nxq |
| - nya |
| - nyf |
| - nyn |
| - nyo |
| - nyy |
| - nzi |
| - obo |
| - oci |
| - ojb-script_latin |
| - ojb-script_syllabics |
| - oku |
| - old |
| - omw |
| - onb |
| - ood |
| - orm |
| - ory |
| - oss |
| - ote |
| - otq |
| - ozm |
| - pab |
| - pad |
| - pag |
| - pam |
| - pan |
| - pao |
| - pap |
| - pau |
| - pbb |
| - pbc |
| - pbi |
| - pce |
| - pcm |
| - peg |
| - pez |
| - pib |
| - pil |
| - pir |
| - pis |
| - pjt |
| - pkb |
| - pls |
| - plw |
| - pmf |
| - pny |
| - poh-dialect_eastern |
| - poh-dialect_western |
| - poi |
| - pol |
| - por |
| - poy |
| - ppk |
| - pps |
| - prf |
| - prk |
| - prt |
| - pse |
| - pss |
| - ptu |
| - pui |
| - pus |
| - pwg |
| - pww |
| - pxm |
| - qub |
| - quc-dialect_central |
| - quc-dialect_east |
| - quc-dialect_north |
| - quf |
| - quh |
| - qul |
| - quw |
| - quy |
| - quz |
| - qvc |
| - qve |
| - qvh |
| - qvm |
| - qvn |
| - qvo |
| - qvs |
| - qvw |
| - qvz |
| - qwh |
| - qxh |
| - qxl |
| - qxn |
| - qxo |
| - qxr |
| - rah |
| - rai |
| - rap |
| - rav |
| - raw |
| - rej |
| - rel |
| - rgu |
| - rhg |
| - rif-script_arabic |
| - rif-script_latin |
| - ril |
| - rim |
| - rjs |
| - rkt |
| - rmc-script_cyrillic |
| - rmc-script_latin |
| - rmo |
| - rmy-script_cyrillic |
| - rmy-script_latin |
| - rng |
| - rnl |
| - roh-dialect_sursilv |
| - roh-dialect_vallader |
| - rol |
| - ron |
| - rop |
| - rro |
| - rub |
| - ruf |
| - rug |
| - run |
| - rus |
| - sab |
| - sag |
| - sah |
| - saj |
| - saq |
| - sas |
| - sat |
| - sba |
| - sbd |
| - sbl |
| - sbp |
| - sch |
| - sck |
| - sda |
| - sea |
| - seh |
| - ses |
| - sey |
| - sgb |
| - sgj |
| - sgw |
| - shi |
| - shk |
| - shn |
| - sho |
| - shp |
| - sid |
| - sig |
| - sil |
| - sja |
| - sjm |
| - sld |
| - slk |
| - slu |
| - slv |
| - sml |
| - smo |
| - sna |
| - snd |
| - sne |
| - snn |
| - snp |
| - snw |
| - som |
| - soy |
| - spa |
| - spp |
| - spy |
| - sqi |
| - sri |
| - srm |
| - srn |
| - srp-script_cyrillic |
| - srp-script_latin |
| - srx |
| - stn |
| - stp |
| - suc |
| - suk |
| - sun |
| - sur |
| - sus |
| - suv |
| - suz |
| - swe |
| - swh |
| - sxb |
| - sxn |
| - sya |
| - syl |
| - sza |
| - tac |
| - taj |
| - tam |
| - tao |
| - tap |
| - taq |
| - tat |
| - tav |
| - tbc |
| - tbg |
| - tbk |
| - tbl |
| - tby |
| - tbz |
| - tca |
| - tcc |
| - tcs |
| - tcz |
| - tdj |
| - ted |
| - tee |
| - tel |
| - tem |
| - teo |
| - ter |
| - tes |
| - tew |
| - tex |
| - tfr |
| - tgj |
| - tgk |
| - tgl |
| - tgo |
| - tgp |
| - tha |
| - thk |
| - thl |
| - tih |
| - tik |
| - tir |
| - tkr |
| - tlb |
| - tlj |
| - tly |
| - tmc |
| - tmf |
| - tna |
| - tng |
| - tnk |
| - tnn |
| - tnp |
| - tnr |
| - tnt |
| - tob |
| - toc |
| - toh |
| - tom |
| - tos |
| - tpi |
| - tpm |
| - tpp |
| - tpt |
| - trc |
| - tri |
| - trn |
| - trs |
| - tso |
| - tsz |
| - ttc |
| - tte |
| - ttq-script_tifinagh |
| - tue |
| - tuf |
| - tuk-script_arabic |
| - tuk-script_latin |
| - tuo |
| - tur |
| - tvw |
| - twb |
| - twe |
| - twu |
| - txa |
| - txq |
| - txu |
| - tye |
| - tzh-dialect_bachajon |
| - tzh-dialect_tenejapa |
| - tzj-dialect_eastern |
| - tzj-dialect_western |
| - tzo-dialect_chamula |
| - tzo-dialect_chenalho |
| - ubl |
| - ubu |
| - udm |
| - udu |
| - uig-script_arabic |
| - uig-script_cyrillic |
| - ukr |
| - umb |
| - unr |
| - upv |
| - ura |
| - urb |
| - urd-script_arabic |
| - urd-script_devanagari |
| - urd-script_latin |
| - urk |
| - urt |
| - ury |
| - usp |
| - uzb-script_cyrillic |
| - uzb-script_latin |
| - vag |
| - vid |
| - vie |
| - vif |
| - vmw |
| - vmy |
| - vot |
| - vun |
| - vut |
| - wal-script_ethiopic |
| - wal-script_latin |
| - wap |
| - war |
| - waw |
| - way |
| - wba |
| - wlo |
| - wlx |
| - wmw |
| - wob |
| - wol |
| - wsg |
| - wwa |
| - xal |
| - xdy |
| - xed |
| - xer |
| - xho |
| - xmm |
| - xnj |
| - xnr |
| - xog |
| - xon |
| - xrb |
| - xsb |
| - xsm |
| - xsr |
| - xsu |
| - xta |
| - xtd |
| - xte |
| - xtm |
| - xtn |
| - xua |
| - xuo |
| - yaa |
| - yad |
| - yal |
| - yam |
| - yao |
| - yas |
| - yat |
| - yaz |
| - yba |
| - ybb |
| - ycl |
| - ycn |
| - yea |
| - yka |
| - yli |
| - yor |
| - yre |
| - yua |
| - yue-script_traditional |
| - yuz |
| - yva |
| - zaa |
| - zab |
| - zac |
| - zad |
| - zae |
| - zai |
| - zam |
| - zao |
| - zaq |
| - zar |
| - zas |
| - zav |
| - zaw |
| - zca |
| - zga |
| - zim |
| - ziw |
| - zlm |
| - zmz |
| - zne |
| - zos |
| - zpc |
| - zpg |
| - zpi |
| - zpl |
| - zpm |
| - zpo |
| - zpt |
| - zpu |
| - zpz |
| - ztq |
| - zty |
| - zul |
| - zyb |
| - zyp |
| - zza |
| |
| </details> |
| |
| ## Model details |
| |
| - **Developed by:** Vineel Pratap et al. |
| - **Model type:** Multi-Lingual Automatic Speech Recognition model |
| - **Language(s):** 1000+ languages, see [supported languages](#supported-languages) |
| - **License:** CC-BY-NC 4.0 license |
| - **Num parameters**: 1 billion |
| - **Audio sampling rate**: 16,000 kHz |
| - **Cite as:** |
| |
| @article{pratap2023mms, |
| title={Scaling Speech Technology to 1,000+ Languages}, |
| author={Vineel Pratap and Andros Tjandra and Bowen Shi and Paden Tomasello and Arun Babu and Sayani Kundu and Ali Elkahky and Zhaoheng Ni and Apoorv Vyas and Maryam Fazel-Zarandi and Alexei Baevski and Yossi Adi and Xiaohui Zhang and Wei-Ning Hsu and Alexis Conneau and Michael Auli}, |
| journal={arXiv}, |
| year={2023} |
| } |
| |
| ## Additional Links |
| |
| - [Blog post](https://ai.facebook.com/blog/multilingual-model-speech-recognition/) |
| - [Transformers documentation](https://huggingface.co/docs/transformers/main/en/model_doc/mms). |
| - [Paper](https://arxiv.org/abs/2305.13516) |
| - [GitHub Repository](https://github.com/facebookresearch/fairseq/tree/main/examples/mms#asr) |
| - [Other **MMS** checkpoints](https://huggingface.co/models?other=mms) |
| - MMS base checkpoints: |
| - [facebook/mms-1b](https://huggingface.co/facebook/mms-1b) |
| - [facebook/mms-300m](https://huggingface.co/facebook/mms-300m) |
| - [Official Space](https://huggingface.co/spaces/facebook/MMS) |
| |