XPF / README.md
niobures's picture
XPF
4a08ba7 verified
# XPF
Check out our interactive website: [The XPF Corpus](https://cohenpr-xpf.github.io/XPF/)
The preliminary manual of the corpus can be found [here](https://cohenpr-xpf.github.io/XPF/manual/xpf_manual.pdf).
## Repository
* [`./Code`](./Code) contains the various scripts needed to obtain phoneme translation statistics.
* [`./Data`](./Data) contains language specific information in terms of their profiles and phonemic grammars.
* [`./docs`](./docs) contains the files strictly needed for the website.
* [`./Guidelines`](./Guidelines) and [`./Manual`](./Manual) contain relevant documentation pertaining to the corpus and the curation of it.
## Available Languages
| Language Code | Language (click for info) | Comments |
|---------------|----------------------------------------------------------------------------------------------| -------------- |
| aak | [Ankave](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aak.html) | |
| aau | [Abau](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aau.html) | |
| ab | [Abkhaz](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ab.html) | |
| acf | [Saint Lucian Creole French](https://cohenpr-xpf.github.io/XPF/conv_resources/info/acf.html) | lacks lenition |
| aey | [Amele](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aey.html) | |
| agg | [Angor](https://cohenpr-xpf.github.io/XPF/conv_resources/info/agg.html) | |
| aia | [Arosi](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aia.html) | lacks lenition |
| amn | [Amanab](https://cohenpr-xpf.github.io/XPF/conv_resources/info/amn.html) | |
| an | [Aragonese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/an.html) | |
| aom | [Aomie](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aom.html) | lacks lenition |
| apu | [Apurinã](https://cohenpr-xpf.github.io/XPF/conv_resources/info/apu.html) | |
| apy | [Apalaí](https://cohenpr-xpf.github.io/XPF/conv_resources/info/apy.html) | lacks lenition |
| arl | [Arabela](https://cohenpr-xpf.github.io/XPF/conv_resources/info/arl.html) | |
| ast | [Asturian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ast.html) | |
| ata | [Pele-Ata](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ata.html) | |
| avt | [Au](https://cohenpr-xpf.github.io/XPF/conv_resources/info/avt.html) | |
| ay | [Aymara](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ay.html) | |
| az | [Azerbaijani](https://cohenpr-xpf.github.io/XPF/conv_resources/info/az.html) | |
| ba | [Bashkir](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ba.html) | |
| bdd | [Bunama](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bdd.html) | |
| be | [Belarusan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/be.html) | lacks lenition |
| bef | [Benabena](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bef.html) | |
| bg | [Bulgarian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bg.html) | |
| bi | [Bislama](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bi.html) | |
| boa | [Bora](https://cohenpr-xpf.github.io/XPF/conv_resources/info/boa.html) | |
| boj | [Anjam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/boj.html) | lacks lenition |
| bug | [Bugis](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bug.html) | |
| bvr | [Burarra](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bvr.html) | |
| bxr | [Russia Buryat](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bxr.html) | |
| caa | [Ch'orti'](https://cohenpr-xpf.github.io/XPF/conv_resources/info/caa.html) | |
| car | [Carib](https://cohenpr-xpf.github.io/XPF/conv_resources/info/car.html) | |
| cbi | [Cha'palaa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cbi.html) | |
| cbk | [Chavacano](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cbk.html) | |
| cbt | [Chayahuita](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cbt.html) | |
| cbu | [Candoshi Shapra](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cbu.html) | lacks lenition |
| cnm | [Ixtatán Chuj](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cnm.html) | |
| crh | [Crimean Tatar](https://cohenpr-xpf.github.io/XPF/conv_resources/info/crh.html) | |
| cs | [Czech](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cs.html) | |
| ctu | [Chol](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ctu.html) | |
| cv | [Chuvash](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cv.html) | |
| ded | [Dedua](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ded.html) | |
| dgz | [Daga](https://cohenpr-xpf.github.io/XPF/conv_resources/info/dgz.html) | |
| djr | [Djambarrpuyngu](https://cohenpr-xpf.github.io/XPF/conv_resources/info/djr.html) | |
| dv | [Maldivian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/dv.html) | |
| el | [Greek](https://cohenpr-xpf.github.io/XPF/conv_resources/info/el.html) | |
| emi | [Mussau-Emira](https://cohenpr-xpf.github.io/XPF/conv_resources/info/emi.html) | lacks lenition |
| eu | [Basque](https://cohenpr-xpf.github.io/XPF/conv_resources/info/eu.html) | |
| gaw | [Nobonob](https://cohenpr-xpf.github.io/XPF/conv_resources/info/gaw.html) | |
| ghs | [Guhu-Samane](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ghs.html) | |
| gil | [Kiribati](https://cohenpr-xpf.github.io/XPF/conv_resources/info/gil.html) | lacks lenition |
| gn | [Guarani](https://cohenpr-xpf.github.io/XPF/conv_resources/info/gn.html) | |
| guc | [Wayuu](https://cohenpr-xpf.github.io/XPF/conv_resources/info/guc.html) | |
| guo | [Guayabero](https://cohenpr-xpf.github.io/XPF/conv_resources/info/guo.html) | |
| gvn | [Kuku-Yalanji](https://cohenpr-xpf.github.io/XPF/conv_resources/info/gvn.html) | |
| haw | [Hawaiian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/haw.html) | |
| hil | [Hiligaynon](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hil.html) | |
| hmn | [Hmong](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hmn.html) | lacks lenition |
| hsb | [Upper Sorbian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hsb.html) | |
| ht | [Haitian Creole](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ht.html) | |
| hu | [Hungarian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hu.html) | |
| hy | [Armenian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hy.html) | |
| ign | [Ignaciano](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ign.html) | |
| ilo | [Ilocano](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ilo.html) | |
| inb | [Inga](https://cohenpr-xpf.github.io/XPF/conv_resources/info/inb.html) | |
| iu | [Inuktitut](https://cohenpr-xpf.github.io/XPF/conv_resources/info/iu.html) | |
| iws | [Sepik Iwam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/iws.html) | |
| jam | [Jamaican Creole](https://cohenpr-xpf.github.io/XPF/conv_resources/info/jam.html) | |
| jv | [Javanese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/jv.html) | |
| ka | [Georgian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ka.html) | |
| kbd | [Kabardian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kbd.html) | |
| kjb | [Q'anjob'al](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kjb.html) | |
| kki | [Kagulu](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kki.html) | lacks lenition |
| kl | [Kalaallisut](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kl.html) | |
| kn | [Kannada](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kn.html) | |
| ko | [Korean](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ko.html) | |
| kpf | [Komba](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kpf.html) | |
| kpx | [Mountain Koiali](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kpx.html) | |
| krc | [Karachay-Balkar](https://cohenpr-xpf.github.io/XPF/conv_resources/info/krc.html) | |
| ksr | [Borong](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ksr.html) | lacks lenition |
| kup | [Kunimaipa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kup.html) | |
| kv | [Komi](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kv.html) | lacks lenition |
| ky | [Kirghiz](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ky.html) | |
| lem | [Nomaande](https://cohenpr-xpf.github.io/XPF/conv_resources/info/lem.html) | |
| mam | [Mam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mam.html) | |
| mcq | [Ese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mcq.html) | |
| mg | [Malagasy](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mg.html) | |
| mhl | [Mauwake](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mhl.html) | |
| mk | [Macedonian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mk.html) | |
| mqj | [Mamasa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mqj.html) | |
| mto | [Totontepec Mixe](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mto.html) | |
| mva | [Manam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mva.html) | |
| naf | [Nabak](https://cohenpr-xpf.github.io/XPF/conv_resources/info/naf.html) | |
| nan | [Min Nan Chinese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nan.html) | lacks lenition |
| nas | [Naasioi](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nas.html) | |
| nhe | [Nahuatl](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nhe.html) | |
| nhr | [Naro](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nhr.html) | |
| nsn | [Nehan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nsn.html) | lacks lenition |
| nuy | [Nunggubuyu](https://cohenpr-xpf.github.io/XPF/conv_resources/info/nuy.html) | |
| omw | [South Tairora](https://cohenpr-xpf.github.io/XPF/conv_resources/info/omw.html) | |
| pad | [Paumarí](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pad.html) | lacks lenition |
| pau | [Palauan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pau.html) | |
| pio | [Piapoco](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pio.html) | |
| pwg | [Gapapaiwa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pwg.html) | |
| quz | [Cusco Quechua](https://cohenpr-xpf.github.io/XPF/conv_resources/info/quz.html) | |
| rkb | [Rikbaktsa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/rkb.html) | |
| ro | [Romanian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ro.html) | |
| roo | [Rotokas](https://cohenpr-xpf.github.io/XPF/conv_resources/info/roo.html) | |
| shi | [Shilha](https://cohenpr-xpf.github.io/XPF/conv_resources/info/shi.html) | |
| shp | [Shipibo Konibo](https://cohenpr-xpf.github.io/XPF/conv_resources/info/shp.html) | |
| si | [Sinhala](https://cohenpr-xpf.github.io/XPF/conv_resources/info/si.html) | |
| snc | [Sinaugoro](https://cohenpr-xpf.github.io/XPF/conv_resources/info/snc.html) | lacks lenition |
| sq | [Albanian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/sq.html) | |
| ta | [Tamil](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ta.html) | |
| tac | [Western Tarahumara](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tac.html) | |
| te | [Telugu](https://cohenpr-xpf.github.io/XPF/conv_resources/info/te.html) | |
| tee | [Huehuetla Tepehua](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tee.html) | |
| tg | [Tajik](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tg.html) | |
| to | [Tongan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/to.html) | lacks lenition |
| tpi | [Tok Pisin](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tpi.html) | |
| tr | [Turkish](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tr.html) | |
| tt | [Tatar](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tt.html) | |
| tyv | [Tuvan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tyv.html) | |
| tzo | [Tzotzil](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tzo.html) | |
| ug | [Uyghur](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ug.html) | |
| uk | [Ukrainian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/uk.html) | |
| usa | [Usarufa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/usa.html) | |
| uz | [Uzbek](https://cohenpr-xpf.github.io/XPF/conv_resources/info/uz.html) | |
| var | [Huarjío](https://cohenpr-xpf.github.io/XPF/conv_resources/info/var.html) | |
| vi | [Vietnamese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/vi.html) | |
| viv | [Iduna](https://cohenpr-xpf.github.io/XPF/conv_resources/info/viv.html) | |
| way | [Wayana](https://cohenpr-xpf.github.io/XPF/conv_resources/info/way.html) | |
| wbp | [Warlpiri](https://cohenpr-xpf.github.io/XPF/conv_resources/info/wbp.html) | |
| wo | [Wolof](https://cohenpr-xpf.github.io/XPF/conv_resources/info/wo.html) | lacks lenition |
| ycn | [Yucuna](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ycn.html) | |
| yi | [Yiddish](https://cohenpr-xpf.github.io/XPF/conv_resources/info/yi.html) | |
| yua | [Yucatec Maya](https://cohenpr-xpf.github.io/XPF/conv_resources/info/yua.html) | |
| yuz | [Yuracare](https://cohenpr-xpf.github.io/XPF/conv_resources/info/yuz.html) | |
| yva | [Yawa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/yva.html) | lacks lenition |
| zos | [Francisco León Zoque](https://cohenpr-xpf.github.io/XPF/conv_resources/info/zos.html) | |
## Compromised Languages
| Language Code | Language (click for info) | Reason (more thorough explanation in Rmd files) | Comments |
|-----------------|-----------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|----------------|
| acr | [Rabinal Achi'](https://cohenpr-xpf.github.io/XPF/conv_resources/info/acr.html) | suspect marking of vowel length | lacks lenition |
| ake | [Akawaio](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ake.html) | conflation between voiceless and voiced consonants | |
| amp | [Alamblak](https://cohenpr-xpf.github.io/XPF/conv_resources/info/amp.html) | conflation between /ɘ/ and /o/ | |
| aoj | [Mufian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/aoj.html) | conflation among vowels; ambiguity regarding vowel length and labialized consonant clusters | lacks lenition |
| ar | [Arabic](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ar.html) | ambiguous transcription of alif; conflation between vowels and glides | |
| arn | [Mapudungun](https://cohenpr-xpf.github.io/XPF/conv_resources/info/arn.html) | ambiguous orthography; conflation between dental and alveolar consonants | |
| awx | [Awara](https://cohenpr-xpf.github.io/XPF/conv_resources/info/awx.html) | conflation between /nd/, /mb/, /nɡ/ and /d/, /b/, /ɡ/, respectively | |
| bcl | [Central Bikol](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bcl.html) | inconsistent marking of glottal stops | lacks lenition |
| bmu | [Somba Siawari](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bmu.html) | phonetic alphabet | |
| btx | [Batak Karo](https://cohenpr-xpf.github.io/XPF/conv_resources/info/btx.html) | conflation among /e/, /ɘ/, and /ɯ/ | |
| bzd | [Bribri](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bzd.html) | phonetic alphabet; contradicting documentation | |
| bzh | [Mapos Buang](https://cohenpr-xpf.github.io/XPF/conv_resources/info/bzh.html) | conflation between /ɛ/ and other vowels | |
| ca | [Catalan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ca.html) | conflation among vowels and glides; ambiguous phonological interpretations | |
| cav | [Cavineña](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cav.html) | ambiguity whether a digraph represents one phoneme or two, depending on syllable structure | lacks lenition |
| chf | [Tabasco Chontal](https://cohenpr-xpf.github.io/XPF/conv_resources/info/chf.html) | conflation between ejectives and stop-glottal stop sequences | |
| chm | [Mari](https://cohenpr-xpf.github.io/XPF/conv_resources/info/chm.html) | conflation with some palatalized and non-palatalized consonants; some vowels not always represented orthographically | lacks lenition |
| cho | [Choctaw](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cho.html) | phonetic alphabet | |
| cni | [Asháninka](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cni.html) | conflation among nasals | |
| cof | [Colorado](https://cohenpr-xpf.github.io/XPF/conv_resources/info/cof.html) | orthographic ambiguity with glottal stops | |
| con | [Cofan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/con_Cofan.html) | conflation between consonants | |
| crm | [Moose Cree](https://cohenpr-xpf.github.io/XPF/conv_resources/info/crm.html) | /h/ represented only when contrast is required | lacks lenition |
| dyo | [Jola-Fogny](https://cohenpr-xpf.github.io/XPF/conv_resources/info/dyo.html) | uncertainty around the marking of +ATR vowels | lacks lenition |
| es | [Spanish](https://cohenpr-xpf.github.io/XPF/conv_resources/info/es.html) | non-transparent transcription of diphthongs | |
| fuv | [Nigerian Fulfulde](https://cohenpr-xpf.github.io/XPF/conv_resources/info/fuv.html) | inconsistent marking of glottal stops; unclear transcription of palatalized glottal stop | |
| hi | [Hindi](https://cohenpr-xpf.github.io/XPF/conv_resources/info/hi.html) | conflation between /æ/ and /ɛ/; vowel nasalization ambiguity; unreliable marking of some consonants | |
| id | [Indonesian](https://cohenpr-xpf.github.io/XPF/conv_resources/info/id.html) | conflation between /e/ and /ə/ | |
| ixl | [Ixil](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ixl.html) | word-initial glottal stop not always marked; somewhat ambiguous orthography | |
| kea | [Cape Verdean Creole](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kea.html) | possible conflation between /a/ and /ɐ/, /e/ and /ɛ/, and /ɾ/ and /ʀ/ | lacks lenition |
| kek | [Qeqchi](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kek.html) | ambiguity between ejective stops and stop-glottal stop sequences | |
| kk | [Kazakh](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kk.html) | conflation between vowels and glides; widely contradicting phonological accounts of the language | |
| kmo | [Kwoma](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kmo.html) | non-transparent transcription of glottal stops | |
| kyz | [Kayabí](https://cohenpr-xpf.github.io/XPF/conv_resources/info/kyz.html) | conflation between /i/ and /j/ | lacks lenition |
| mcf | [Matsés](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mcf.html) | conflation between alveolar and retroflex consonants; conflation between vowels | |
| mek | [Mekeo](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mek.html) | non-transparent transcription of glottal stops | |
| mfe | [Morisyen](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mfe.html) | highly suspect orthography; conflation among consonants | |
| ml | [Malayalam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ml.html) | conflation between dental and alveolar /n/ | |
| mlp | [Bargam](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mlp.html) | conflation between /n/ and /ŋ/ | lacks lenition |
| mnb | [Muna](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mnb.html) | suspect orthography | |
| mpx | [Misima-Panaeati](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mpx.html) | conflation between /e/ and /ɛ/ and between /o/ and /ɔ/ | lacks lenition |
| mt | [Maltese](https://cohenpr-xpf.github.io/XPF/conv_resources/info/mt.html) | conflation between /ts/ and /dz/ and between /ʃ/ and /ʒ/ | |
| myv | [Erzya](https://cohenpr-xpf.github.io/XPF/conv_resources/info/myv.html) | conflation between /n/ and /ŋ/ | lacks lenition |
| ne | [Nepali](https://cohenpr-xpf.github.io/XPF/conv_resources/info/ne.html) | certain diacritics used interchangeably and inconsistently marked | |
| not | [Nomatsiguenga](https://cohenpr-xpf.github.io/XPF/conv_resources/info/not.html) | conflation among nasals | |
| or | [Oriya](https://cohenpr-xpf.github.io/XPF/conv_resources/info/or.html) | certain diacritics used interchangeably and inconsistently marked | |
| os | [Ossetic](https://cohenpr-xpf.github.io/XPF/conv_resources/info/os.html) | conflation among /u/, /w/, and /ʷ/; inconsistent marking of consonant gemination | |
| pag | [Pangasinan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pag.html) | possible conflation between /ŋ/ and /nɡ/ | |
| pib | [Yine](https://cohenpr-xpf.github.io/XPF/conv_resources/info/pib.html) | conflation between /n/ and /h̃/ | lacks lenition |
| plu | [Palikúr](https://cohenpr-xpf.github.io/XPF/conv_resources/info/plu.html) | conflation between /ɡ/ and /ɣ/ | |
| qub | [Huallaga Huanuco Quechua](https://cohenpr-xpf.github.io/XPF/conv_resources/info/qub.html) | suspect orthography; conflation between vowels and glides | |
| rwo | [Rawa](https://cohenpr-xpf.github.io/XPF/conv_resources/info/rwo.html) | conflation between /l/ and /r/ | |
| sah | [Yakut](https://cohenpr-xpf.github.io/XPF/conv_resources/info/sah.html) | conflation between /j/ and /j̃/ | |
| sk | [Slovak](https://cohenpr-xpf.github.io/XPF/conv_resources/info/sk.html) | non-transparent transcription of palatal consonants; ambiguity whether digraphs represent one phoneme or two | |
| sm | [Samoan](https://cohenpr-xpf.github.io/XPF/conv_resources/info/sm.html) | marking of long vowels and glottal stops is suspect | |
| suz | [Sunwar](https://cohenpr-xpf.github.io/XPF/conv_resources/info/suz.html) | conflation between /ɾ/, /ɭ/, and possibly /l̪/; inconsistent marking of glottal stops | |
| sw | [Swahili](https://cohenpr-xpf.github.io/XPF/conv_resources/info/sw.html) | conflation between syllabic nasals and non-syllabic counterparts | |
| too | [Xicotepec de Juárez Totonac](https://cohenpr-xpf.github.io/XPF/conv_resources/info/too.html) | suspect transcription due to unclear documentation | |
| tpp | [Pisaflores Tepehua](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tpp.html) | suspect marking of vowel length | |
| tzj | [Tz'utujil](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tzj.html) | uncertainty around the marking of the glottal stop and the orthography | |
| tzm | [Central Atlas Tamazight](https://cohenpr-xpf.github.io/XPF/conv_resources/info/tzm.html) | conflation between /l̪/ and /l̪ˤ/, and between /ʒ/ and /ʒˀ/ | |
| wmw | [Mwani](https://cohenpr-xpf.github.io/XPF/conv_resources/info/wmw.html) | conflation between syllabic nasals and prenasalized stops | lacks lenition |
| zsm | [Standard Malay](https://cohenpr-xpf.github.io/XPF/conv_resources/info/zsm.html) | conflation between /e/ and /ə/; conflicting orthographies | |
| zza | [Zaza](https://cohenpr-xpf.github.io/XPF/conv_resources/info/zza.html) | conflicting orthographies; conflation among vowels | |
## Abandoned Languages
| Language Code | Language | Reason |
|---------------|--------------------------------|--------------------------------------------------------------------------|
| ace | Acehnese | non-transparent transcription of vowel nasalization |
| ach | Acholi | non-transparent transcription of tones |
| acu | Achuar-Shiwiar | non-transparent transcription of vowel nasalization |
| adh | Adhola | non-transparent transcription of tones |
| af | Afrikaans | non-transparent transcription of vowels, vowel length, and diphthongs |
| agd | Agarabi | non-transparent transcription of tones |
| agm | Angaataha | non-transparent transcription of tones |
| agr | Aguaruna | non-transparent transcription of vowel nasalization |
| ak | Akan | non-transparent transcription of tones |
| alq | Algonquin | non-transparent transcription of vowel length |
| am | Amharic | non-transparent transcription of consonant gemination |
| anv | Denya | non-transparent transcription of tones |
| as | Assamese | non-transparent transcription of vowels |
| aso | Dano | non-transparent transcription of tones |
| avt | Avar | non-transparent transcription of consonant gemination |
| ban | Bali | non-standardized orthography |
| bem | Bemba | non-transparent transcription of tones |
| bba | Bariba | non-transparent transcription of tones |
| bcw | Bana | non-transparent transcription of tones |
| bhl | Bimin | non-transparent transcription of tones |
| bm | Bambara | non-transparent transcription of tones |
| bmr | Muinane | non-transparent transcription of tones |
| bs | Bosnian | non-transparent transcription of vowel length and tones |
| bsn | Barasana-Eduria | non-transparent transcription of tones |
| bua | Buryat | non-transparent transcription of palatalization |
| byr | Baruya | non-transparent transcription of tones |
| cao | Chácobo | non-transparent transcription of tones |
| cax | Chiquitano | non-transparent transcription of vowel nasalization |
| cbc | Carapan | non-transparent transcription of tones |
| ce | Chechen | non-transparent transcription of vowel length |
| ceb | Cebuano | non-transparent transcription of vowel length |
| chr | Cherokee | non-transparent transcription of vowel length |
| cwk | Western Kaqchikel | non-transparent transcription of vowels |
| cnh | Haka Chin | non-transparent transcription of tones |
| coe | Koreguaja | non-transparent transcription of tones |
| ctd | Tedim Chin | non-transparent transcription of tones |
| cub | Cubeo | non-transparent transcription of tones |
| cuk | San Blas Kuna | non-transparent transcription |
| cy | Welsh | non-transparent transcription of vowel length |
| da | Danish | non-transparent transcription of vowels |
| daa | Dangaléat | non-transparent transcription of tones |
| des | Desano | non-transparent transcription of tones |
| dgo | Dogri | non-transparent transcription of tones |
| din | Dinka | non-transparent transcription of tones |
| dts | Toro So Dogon | non-transparent transcription of tones |
| dz | Dzongkha | non-transparent transcription |
| ee | Ewe | non-transparent transcription of tones |
| efi | Efik | non-transparent transcription of tones |
| emp | Northern Emberá | non-transparent transcription |
| enb | Markweeta | non-transparent transcription of tones |
| enq | Enga | non-transparent transcription of tones |
| et | Estonian | non-transparent transcription of contrastive syllable length |
| faa | Fasu | non-transparent transcription of tones |
| fi | Finnish | non-transparent transcription |
| fj | Fijian | non-transparent transcription of vowel length |
| fo | Faroese | non-transparent transcription of vowels |
| for | Fore | non-transparent transcription of tones |
| fur | Friulian | non-transparent transcription of vowels |
| fy | Frisian | non-transparent transcription of vowels |
| ga | Irish | non-transparent transcription |
| gah | Alekano | non-transparent transcription of tones |
| gd | Scottish Gaelic | non-transparent transcription of consonants and vowels |
| gl | Galician | non-transparent transcription |
| gmo | Gamo-Gofa-Dawro | three languages understood to be linguistically separate |
| grb | Grebo | non-transparent transcription of tones |
| grt | Garo | non-transparent transcription of vowels |
| gub | Guajajara | non-transparent transcription of vowel nasalization |
| gum | Guambiano | non-standardized orthography |
| gur | Farefare | non-transparent transcription of tones |
| gv | Manx Gaelic | non-transparent transcription of consonants and vowels |
| ha | Hausa | non-transparent transcription of vowel length |
| hbs | Serbo-Croatian | non-transparent transcription of tones |
| hch | Huichol | non-transparent transcription of tones |
| heh | Hehe | non-transparent transcription of tones |
| hr | Croatian | non-transparent transcription of vowel length |
| hub | Huambisa | non-transparent transcription of vowel nasalization |
| hui | Huli | non-transparent transcription of tones |
| huv | Huave | inconsistent phonological documentation |
| hz | Herero | non-transparent transcription of tones |
| ig | Igbo | non-transparent transcription of tones |
| ik | Inupiaq | insufficient tokens |
| is | Icelandic | non-transparent transcription of vowel length |
| jiv | Shuar | non-transparent transcription of vowel nasalization |
| kab | Kabyle | non-transparent transcription of consonants |
| kac | Jingpho | non-transparent transcription of tones |
| kaq | Capanahua | non-transparent transcription of tones |
| kbc | Kadiweu | non-transparent transcription of consonant gemination |
| kbr | Kafa | non-transparent transcription of tones |
| kha | Khasi | non-transparent transcription of vowel length |
| khk | Khalkha Mongolian | non-transparent transcription of vowels |
| ki | Gikuyu | non-transparent transcription of tones |
| kj | Kwanyama | non-transparent transcription of tones |
| kjs | East Kewa | non-transparent transcription of tones |
| kew | West Kewa | non-transparent transcription of tones |
| kmr | Northern Kurdish | non-transparent transcription of consonants |
| kmu | Kanite | non-transparent transcription of tones |
| ksd | Kuanua | non-transparent transcription of vowel length |
| kus | Kusaal | non-transparent transcription of tones and vowel length |
| kw | Cornish | non-transparent transcription of vowel length |
| lac | Lacandon | non-transparent transcription of vowel length |
| lb | Luxembourgish | non-transparent transcription of vowels |
| lef | Lelemi | non-transparent transcription of tones |
| lg | Luganda | non-transparent transcription of tones |
| ln | Lingala | non-transparent transcription of tones |
| loz | Lozi | non-transparent transcription of tones |
| lt | Lithuanian | non-transparent transcription of tones |
| luo | Dholuo | non-transparent transcription of tones |
| lus | Mizo | non-transparent transcription of tones |
| lv | Latvian | non-transparent transcription of tones |
| lvs | Standard Latvian | non-transparent transcription of tones |
| lwo | Luwo | non-transparent transcription of tones and breathy vowels |
| man | Mandingo | non-transparent transcription of tones |
| mas | Maasai | insufficient tokens |
| mcb | Machiguenga | non-transparent transcription of tones |
| mcd | Sharanahua | non-transparent transcription of tones |
| meu | Motu | non-transparent transcription of vowel length |
| mfi | Wandala | non-transparent transcription of tones |
| mfz | Mabaan | non-transparent transcription of tones |
| mhr | Eastern Mari | non-transparent transcription of palatalization |
| mi | Maori | non-transparent transcription of vowel length |
| miq | Miskito | non-transparent transcription of vowel nasalization and length |
| mni | Meitei | non-transparent transcription of tones |
| mos | Mossi | non-transparent transcription of tones |
| mps | Dadibi | non-transparent transcription of tones and vowel nasalization |
| mpt | Mian | non-transparent transcription of tones |
| ms | Malay | non-transparent transcription of vowels |
| my | Burmese | non-transparent transcription of tones |
| myu | Mundurukú | non-transparent transcription of tones and creaky vowels |
| myy | Macuna | non-transparent transcription of tones |
| nd | Northern Ndebele | insufficient tokens |
| nds | Low Saxon | non-transparent transcription |
| nfr | Nafaanra | non-transparent transcription of tones |
| nhg | Tetelcingo Nahuatl | non-transparent transcription of vowel length |
| no | Norwegian | non-transparent transcription of tones and vowel length |
| ntp | Northern Tepehuan | non-transparent transcription of tones |
| nv | Navajo | non-transparent transcription of vowel nasalization |
| ny | Chichewa | non-transparent transcription of tones |
| nyn | Nyankore | non-transparent transcription of tones |
| om | Oromo | non-transparent transcription of tones |
| opm | Oksapmin | non-transparent transcription of vowels |
| ood | Tohono O'odham | non-transparent transcription |
| ots | Estado de México Otomi | non-transparent transcription of tones |
| pab | Parecís | non-transparent transcription of vowel length and nasalization |
| pao | Northern Paiute | non-transparent transcription of vowel length |
| pap | Papiamentu | non-transparent transcription of vowels |
| pir | Wanano | non-transparent transcription of tones |
| pl | Polish | non-transparent transcription |
| pms | Piedmontese | non-transparent transcription |
| poh | Poqomchi' | insufficient documentation |
| rw | Kinyarwanda | non-transparent transcription of tones and vowel length |
| sd | Sindhi | non-transparent transcription of vowels |
| se | Northern Sami | non-transparent transcription |
| sg | Sango | non-transparent transcription of tones |
| sim | Mende | non-transparent transcription of tones |
| sll | Salt-Yui | non-transparent transcription of tones |
| sn | Shona | non-transparent transcription of tones |
| so | Somali | non-transparent transcription of tones |
| soq | Kanasi | non-transparent transcription of glottal stops |
| spp | Supyire Senoufo | non-transparent transcription of tones |
| ss | Swati | non-transparent transcription of tones |
| st | Sesotho | non-transparent transcription of tones |
| sv | Swedish | non-transparent transcription |
| swp | Suau | non-transparent transcription |
| sxb | Suba | non-transparent transcription of tones |
| tav | Tatuyo | non-transparent transcription of tones |
| tcc | Datooga | non-transparent transcription of tones |
| tcy | Tulu | non-transparent transcription of vowels |
| tcz | Thadou Chin | non-transparent transcription of tones |
| ti | Tigrinya | non-transparent transcription of gemination |
| tk | Turkmen | non-transparent transcription of vowel length |
| tl | Tagalog | non-transparent spalling of vowel length |
| tn | Tswana | non-transparent transcription of tones |
| toi | Tonga | non-transparent transcription of tones |
| trp | Kok Borok | non-transparent transcription of tones |
| ts | Tsonga | non-transparent transcription of tones |
| ttc | Tekiteko | non-transparent transcription of vowel length |
| tuf | Central Tunebo | non-transparent transcription of contrastive features (first syllable) |
| tw | Twi | non-transparent transcription of tones |
| ubu | Umbu-Ungu | non-transparent transcription of tones |
| udu | Uduk | non-transparent transcription of tones |
| ur | Urdu | non-transparent transcription of vowels |
| ura | Urarina | non-transparent transcription of tones |
| usp | Uspanteko | non-transparent transcription of tones |
| ve | Venda | non-transparent transcription of tones |
| vro | Võro | non-transparent transcription of vowels and palatalization |
| wa | Walloon | non-transparent transcription |
| wal | Wolaytta | non-transparent transcription of tones |
| war | Waray-Waray | insufficient documentation |
| wiu | Wiru | non-transparent transcription of tones |
| xal | Kalmyk-Oirat | non-transparent transcription of vowels |
| xav | Xavánte | non-transparent transcription of vowel length |
| xbi | Kombio | non-transparent transcription of vowels |
| xh | Xhosa | non-transparent transcription of tones |
| xla | Kamula | non-transparent transcription of vowels and tones |
| xsr | Sherpa | insufficient documentation |
| yaa | Yaminahua | non-transparent transcription of tones |
| yad | Yagua | non-transparent transcription of tones |
| yby | Yaweyuha | non-transparent transcription of tones |
| yo | Yoruba | non-transparent transcription of tones |
| zai | Zapotec | non-transparent transcription of tones |
| zca | Coatecas Altas Zapotec | non-transparent transcription of tones |
| zpi | Santa María Quiegolani Zapotec | non-transparent transcription of tones |
| zpq | Zoogocho Zapotec | non-transparent transcription of tones |
| zu | Zulu | non-transparent transcription of tones |