diff --git a/.gitattributes b/.gitattributes index 910e23c9cef19f45a3d70a4c998d84dc72178022..f957351a2e319537be5dce16ece7143a71113260 100644 --- a/.gitattributes +++ b/.gitattributes @@ -36,3 +36,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text visualizations/embedding_similarity.png filter=lfs diff=lfs merge=lfs -text visualizations/performance_dashboard.png filter=lfs diff=lfs merge=lfs -text visualizations/position_encoding_comparison.png filter=lfs diff=lfs merge=lfs -text +visualizations/embedding_tsne_multilingual.png filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md index cf4b0134a8a8a55a3e0b667f5d51bc605627ef4f..8ae5f87347a122cc299c005e42787dfb45037306 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ --- language: cr -language_name: CR +language_name: Cree language_family: american_algonquian tags: - wikilangs @@ -10,11 +10,21 @@ tags: - n-gram - markov - wikipedia + - feature-extraction + - sentence-similarity + - tokenization + - n-grams + - markov-chain + - text-mining + - fasttext + - babelvec + - vocabulous + - vocabulary - monolingual - family-american_algonquian license: mit library_name: wikilangs -pipeline_tag: feature-extraction +pipeline_tag: text-generation datasets: - omarkamali/wikipedia-monthly dataset_info: @@ -23,20 +33,20 @@ dataset_info: metrics: - name: best_compression_ratio type: compression - value: 3.182 + value: 3.238 - name: best_isotropy type: isotropy - value: 0.0381 + value: 0.0354 - name: vocabulary_size type: vocab value: 0 generated: 2026-01-03 --- -# CR - Wikilangs Models +# Cree - Wikilangs Models ## Comprehensive Research Report & Full Ablation Study -This repository contains NLP models trained and evaluated by Wikilangs, specifically on **CR** Wikipedia data. +This repository contains NLP models trained and evaluated by Wikilangs, specifically on **Cree** Wikipedia data. We analyze tokenizers, n-gram models, Markov chains, vocabulary statistics, and word embeddings. 
## 📋 Repository Contents @@ -60,7 +70,7 @@ We analyze tokenizers, n-gram models, Markov chains, vocabulary statistics, and - [3. Markov Chain Evaluation](#3-markov-chain-evaluation) - [4. Vocabulary Analysis](#4-vocabulary-analysis) - [5. Word Embeddings Evaluation](#5-word-embeddings-evaluation) -- [6. Morphological Analysis (Experimental)](#6-morphological-analysis) +- [6. Morphological Analysis (Experimental)](#6--morphological-analysis-experimental) - [7. Summary & Recommendations](#7-summary--recommendations) - [Metrics Glossary](#appendix-metrics-glossary--interpretation-guide) - [Visualizations Index](#visualizations-index) @@ -80,35 +90,35 @@ We analyze tokenizers, n-gram models, Markov chains, vocabulary statistics, and | Vocab Size | Compression | Avg Token Len | UNK Rate | Total Tokens | |------------|-------------|---------------|----------|--------------| -| **8k** | 3.182x 🏆 | 3.19 | 2.9567% | 6,629 | +| **8k** | 3.238x 🏆 | 3.24 | 2.7764% | 6,267 | ### Tokenization Examples Below are sample sentences tokenized with each vocabulary size: -**Sample 1:** `ᐊᓐ ᐊᒋᐦᑖᓱᓐ ᐯᔭᒄ ᑲ ᐃᔑᓂᐦᑳᑌᒡ, ᐋᐸᑎᓐ ᒉ ᒌ ᐃᑣᓅᐦᒡ ᐯᔭᒄ ᒉᒀᓐ ᒫᒃ ᐊᐌᓐ᙮ ᐊᓐ ᒫᒃ ᐊᒋᐦᑖᓱᓐ ᐯᔭᒄ, ᐁᐅᑯᓐ ᓃ...` +**Sample 1:** `ᓀᐦᐃᔭᐁᐧᐃᐧᐣ ᑕᐣᓯ ᑲ ᐃᓯᐲᑭᐢᑫᐧᕁ ᓵᓴᕀ ᐳᓂ ᐱᑭᐢᑫᐧᐃᐧᐣ ᐱᐦᒑᔨᕁ ᑳᓇᑕ. ᓵᓴᕀ ᐳᓂ ᐱᑭᐢᑫᐧᐃᐧᐣ ᓇᐊᐧᐨ ᐳᑯ ᒌᑳᐦᑕ...` | Vocab | Tokens | Count | |-------|--------|-------| -| 8k | `▁ᐊᓐ ▁ᐊᒋᐦᑖᓱᓐ ▁ᐯᔭᒄ ▁ᑲ ▁ᐃᔑᓂᐦᑳᑌᒡ , ▁ᐋᐸᑎᓐ ▁ᒉ ▁ᒌ ▁ᐃᑣᓅᐦᒡ ... (+19 more)` | 29 | +| 8k | `▁ᓀᐦᐃᔭᐁᐧᐃᐧᐣ ▁ᑕᐣᓯ ▁ᑲ ▁ᐃᓯᐲᑭᐢᑫᐧᕁ ▁ᓵᓴᕀ ▁ᐳᓂ ▁ᐱᑭᐢᑫᐧᐃᐧᐣ ▁ᐱᐦᒑᔨᕁ ▁ᑳᓇᑕ . ... (+11 more)` | 21 | -**Sample 2:** `ᓀᐦᐃᔭᐁᐧᐃᐧᐣ ᑕᐣᓯ ᑲ ᐃᓯᐲᑭᐢᑫᐧᕁ ᓵᓴᕀ ᐳᓂ ᐱᑭᐢᑫᐧᐃᐧᐣ ᐱᐦᒑᔨᕁ ᑳᓇᑕ. ᓵᓴᕀ ᐳᓂ ᐱᑭᐢᑫᐧᐃᐧᐣ ᓇᐊᐧᐨ ᐳᑯ ᒌᑳᐦᑕ...` +**Sample 2:** `ᐊᓐ ᐊᒋᐦᑖᓱᓐ ᐯᔭᒄ ᑲ ᐃᔑᓂᐦᑳᑌᒡ, ᐋᐸᑎᓐ ᒉ ᒌ ᐃᑣᓅᐦᒡ ᐯᔭᒄ ᒉᒀᓐ ᒫᒃ ᐊᐌᓐ᙮ ᐊᓐ ᒫᒃ ᐊᒋᐦᑖᓱᓐ ᐯᔭᒄ, ᐁᐅᑯᓐ ᓃ...` | Vocab | Tokens | Count | |-------|--------|-------| -| 8k | `▁ᓀᐦᐃᔭᐁᐧᐃᐧᐣ ▁ᑕᐣᓯ ▁ᑲ ▁ᐃᓯᐲᑭᐢᑫᐧᕁ ▁ᓵᓴᕀ ▁ᐳᓂ ▁ᐱᑭᐢᑫᐧᐃᐧᐣ ▁ᐱᐦᒑᔨᕁ ▁ᑳᓇᑕ . ... (+11 more)` | 21 | +| 8k | `▁ᐊᓐ ▁ᐊᒋᐦᑖᓱᓐ ▁ᐯᔭᒄ ▁ᑲ ▁ᐃᔑᓂᐦᑳᑌᒡ , ▁ᐋᐸᑎᓐ ▁ᒉ ▁ᒌ ▁ᐃᑣᓅᐦᒡ ... (+19 more)` | 29 | -**Sample 3:** `ᒨᔅ, Muus, Mush ( ; ) n.a. 
ᐊᐧᐁᓰᔅ ᐆ᙮ ᒨᔅ ᒥᐦᒑᐱᔅᒋᓲ᙮ ᓂᒥᑕᐦᐊᒻ ᑲᔦᐦ᙮ ᐸᐹᒦᒋᓲ᙮ ᒨᒥᓀᐤ᙮ ᒨᔅ ᒦᒎ ᓂᐦ...` +**Sample 3:** `ᒦᒃᓰᖂ (english : Mexico) ᐊᐢᑭᐩ ᑮᐍᑎᐣ ᐊᒣᕒᐃᑲ ᐆᐦᒋ᙮ ᐊᔨᓯᔨᓂᐘᐠ ᐑᑭᐘᐠ ᐆᒪ ᐊᐢᑭᔭᕽ᙮ ` | Vocab | Tokens | Count | |-------|--------|-------| -| 8k | `▁ᒨᔅ , ▁muus , ▁mush ▁( ▁; ▁) ▁n . ... (+17 more)` | 27 | +| 8k | `▁ᒦᒃᓰᖂ ▁( english ▁: ▁mexico ) ▁ᐊᐢᑭᐩ ▁ᑮᐍᑎᐣ ▁ᐊᒣᕒᐃᑲ ▁ᐆᐦᒋ᙮ ... (+7 more)` | 17 | ### Key Findings -- **Best Compression:** 8k achieves 3.182x compression -- **Lowest UNK Rate:** 8k with 2.9567% unknown tokens +- **Best Compression:** 8k achieves 3.238x compression +- **Lowest UNK Rate:** 8k with 2.7764% unknown tokens - **Trade-off:** Larger vocabularies improve compression but increase model size - **Recommendation:** 32k vocabulary provides optimal balance for production use @@ -126,11 +136,13 @@ Below are sample sentences tokenized with each vocabulary size: | N-gram | Variant | Perplexity | Entropy | Unique N-grams | Top-100 Coverage | Top-1000 Coverage | |--------|---------|------------|---------|----------------|------------------|-------------------| | **2-gram** | Word | 16 | 4.04 | 17 | 100.0% | 100.0% | -| **2-gram** | Subword | 492 | 8.94 | 848 | 48.2% | 100.0% | +| **2-gram** | Subword | 473 | 8.89 | 812 | 49.1% | 100.0% | | **3-gram** | Word | 15 🏆 | 3.88 | 16 | 100.0% | 100.0% | -| **3-gram** | Subword | 1,528 | 10.58 | 1,986 | 19.4% | 75.4% | -| **4-gram** | Word | 163 | 7.35 | 166 | 62.1% | 100.0% | -| **4-gram** | Subword | 3,131 | 11.61 | 3,878 | 11.9% | 50.9% | +| **3-gram** | Subword | 1,468 | 10.52 | 1,902 | 19.8% | 76.9% | +| **4-gram** | Word | 157 | 7.29 | 160 | 64.3% | 100.0% | +| **4-gram** | Subword | 2,988 | 11.54 | 3,702 | 12.2% | 52.2% | +| **5-gram** | Word | 137 | 7.10 | 138 | 73.1% | 100.0% | +| **5-gram** | Subword | 2,771 | 11.44 | 3,264 | 12.2% | 51.7% | ### Top 5 N-grams by Size @@ -162,23 +174,33 @@ Below are sample sentences tokenized with each vocabulary size: | 2 | `in standard roman orthography` | 5 | | 3 | `written in standard roman` | 5 | | 4 | `ᑎᐸᐦᐄᑲᓐ 
ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ` | 4 | -| 5 | `of articles some articles` | 3 | +| 5 | `center for global nonkilling` | 3 | + +**5-grams (Word):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `written in standard roman orthography` | 5 | +| 2 | `list of articles some articles` | 3 | +| 3 | `of articles some articles in` | 3 | +| 4 | `dialect list of articles some` | 3 | +| 5 | `ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ` | 3 | **2-grams (Subword):** | Rank | N-gram | Count | |------|--------|-------| -| 1 | `i n` | 215 | -| 2 | `, _` | 213 | -| 3 | `_ ᐊ` | 179 | -| 4 | `i k` | 168 | -| 5 | `n _` | 165 | +| 1 | `i n` | 207 | +| 2 | `, _` | 202 | +| 3 | `i k` | 169 | +| 4 | `_ ᐊ` | 164 | +| 5 | `i s` | 159 | **3-grams (Subword):** | Rank | N-gram | Count | |------|--------|-------| -| 1 | `i n _` | 61 | +| 1 | `i n _` | 58 | | 2 | `a n i` | 49 | | 3 | `w i n` | 48 | | 4 | `_ k i` | 47 | @@ -190,16 +212,26 @@ Below are sample sentences tokenized with each vocabulary size: |------|--------|-------| | 1 | `w a k _` | 33 | | 2 | `w i n _` | 27 | -| 3 | `t i o n` | 24 | -| 4 | `k a n i` | 23 | -| 5 | `i k a n` | 22 | +| 3 | `k a n i` | 23 | +| 4 | `t i o n` | 23 | +| 5 | `_ o f _` | 22 | + +**5-grams (Subword):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `_ a n d _` | 22 | +| 2 | `a t i o n` | 21 | +| 3 | `p î s i m` | 20 | +| 4 | `- p î s i` | 19 | +| 5 | `a r t i c` | 19 | ### Key Findings - **Best Perplexity:** 3-gram (word) with 15 - **Entropy Trend:** Decreases with larger n-grams (more predictable) -- **Coverage:** Top-1000 patterns cover ~51% of corpus +- **Coverage:** Top-1000 patterns cover ~52% of corpus - **Recommendation:** 4-gram or 5-gram for best predictive performance --- @@ -215,14 +247,14 @@ Below are sample sentences tokenized with each vocabulary size: | Context | Variant | Avg Entropy | Perplexity | Branching Factor | Unique Contexts | Predictability | 
|---------|---------|-------------|------------|------------------|-----------------|----------------| -| **1** | Word | 0.2827 | 1.216 | 1.47 | 1,787 | 71.7% | -| **1** | Subword | 1.9100 | 3.758 | 10.53 | 273 | 0.0% | -| **2** | Word | 0.0424 | 1.030 | 1.05 | 2,607 | 95.8% | -| **2** | Subword | 0.6919 | 1.615 | 2.63 | 2,872 | 30.8% | -| **3** | Word | 0.0178 | 1.012 | 1.02 | 2,724 | 98.2% | -| **3** | Subword | 0.3559 | 1.280 | 1.57 | 7,557 | 64.4% | -| **4** | Word | 0.0086 🏆 | 1.006 | 1.01 | 2,765 | 99.1% | -| **4** | Subword | 0.1591 | 1.117 | 1.21 | 11,842 | 84.1% | +| **1** | Word | 0.2841 | 1.218 | 1.47 | 1,711 | 71.6% | +| **1** | Subword | 1.8933 | 3.715 | 10.31 | 271 | 0.0% | +| **2** | Word | 0.0442 | 1.031 | 1.05 | 2,501 | 95.6% | +| **2** | Subword | 0.6883 | 1.611 | 2.62 | 2,789 | 31.2% | +| **3** | Word | 0.0186 | 1.013 | 1.02 | 2,617 | 98.1% | +| **3** | Subword | 0.3514 | 1.276 | 1.56 | 7,299 | 64.9% | +| **4** | Word | 0.0089 🏆 | 1.006 | 1.01 | 2,657 | 99.1% | +| **4** | Subword | 0.1579 | 1.116 | 1.21 | 11,392 | 84.2% | ### Generated Text Samples (Word-based) @@ -230,27 +262,27 @@ Below are text samples generated from each word-based Markov chain model: **Context Size 1:** -1. `ᐁ ᐊᐧᐃᐢᑮᐦᐃᑲᐣ ᐃᐧᐊ ᐁᔥᐃᐦᑕᒧᐃᐧᐣ ᑳ ᐋᐸᐦᐄᔥᑌᒡ english and montana some articles in iyuw iyimuun natuashish dia...` -2. `e kašcihot e wîcit e iskwewit mâk atimwa wes namawîy nataweyihtam cecî cisceyihtâkwaniyic ekw wenâpe...` -3. `of nonkilling channel on l nehirâmowin qc r s t u v w ᐌ ᐎ ᐒ` +1. `ᐁ ᐃ ᐅ ᐊ ᐄ ᐆ ᐋ p q r s ᓭ ᓯ ᓱ ᓴ ᓰ` +2. `e kiskatcik e tašitwâw awesîsac sašimuve nîštam atim nâpeštimw išinihkâtâkaniwiw simpohanin âtayôhkâ...` +3. `of articles in ininiwi išikišwēwin eastern dialect western montagnais iso 639 crk location québec an...` **Context Size 2:** -1. `some articles in ininiwi išikišwēwin eastern dialect la romaine mingan natashquan pakuashipi and she...` -2. 
`articles in lehlueun western dialect list of articles some articles in nīhithawīwin list of articles...` -3. `ēkwa mīna otaskānitik e ka naskahtamēw nikiskihcēta anihi tahki itēhk kā pēhtahkik tānpahtiwin ē mic...` +1. `some articles in nēhiyawēwin âpihtâkosisânak kâ isiwepahki maskisin ᐸᐦᑵᓯᑲᐣ pimîhkân tipahikan itasin...` +2. `articles in iyuw iyimuun natuashish dialect list of articles ᐃᔨᔨᐤ ᐊᔨᒧᐧᐃᓐ iyyû ayimuwin nēhiyawēwin p...` +3. `list of articles ᐃᔨᔨᐤ ᐊᔨᒧᐧᐃᓐ iyyû ayimuwin northern dialect chisasibi eastmain waskaganish wemindji ...` **Context Size 3:** -1. `some articles in nēhiyawēwin âpihtâkosisânak kâ isiwepahki maskisin ᐸᐦᑵᓯᑲᐣ pimîhkân tipahikan itasin...` -2. `list of articles wikipedias in other native american languages atikamekw avañe ẽ aymar choctaw ꮳꮃꭹ c...` -3. `dialect list of articles some articles in ililîmowin list of articles ᐃᓕᓖᒧᐎᓐ ililîmowin ililîmowin p...` +1. `some articles in lehlueun western dialect betsiamites mashteuiatsh matimekosh and uashat maliotenam ...` +2. `list of articles ᐃᓕᓖᒧᐎᓐ ililîmowin ililîmowin portal english name woods cree iso 639 crk location sa...` +3. `dialect list of articles ᐃᓕᓖᒧᐎᓐ ililîmowin ililîmowin portal english name moose cree iso 639 csw loc...` **Context Size 4:** -1. `dialect list of articles some articles in iyuw iyimuun kawawachikamach dialect list of articles nīhi...` +1. `dialect list of articles nīhithawīwin portal english name woods cree iso 639 cwd location manitoba a...` 2. `written in standard roman orthography` -3. `ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᐋᐱᐦᑖᒌᔑᑳᐤ ᐋᐱᐦᑖᑎᐱᔅᑳᐤ 1 05 ᐯᔭᒄ ᑎᐸᐦᐄᑲᓐ ᒦᓐ ᓂᔮᔪ ᒥᓂᑯᔥ ᓂᔮᔪ ᒥᓂᑯᔥ ᒥᔮ...` +3. `ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᑎᐸᐦᐄᑲᓐ ᐋᐱᐦᑖᒌᔑᑳᐤ ᐋᐱᐦᑖᑎᐱᔅᑳᐤ 1 05 ᐯᔭᒄ ᑎᐸᐦᐄᑲᓐ ᒦᓐ ᓂᔮᔪ ᒥᓂᑯᔥ ᓂᔮᔪ ᒥᓂᑯᔥ ᒥᔮᐧᐃᐸᔩᐤ ᐯᔭᒄ 1 30` ### Generated Text Samples (Subword-based) @@ -259,34 +291,34 @@ Below are text samples generated from each subword-based Markov chain model: **Context Size 1:** -1. `_ᒉᒀᓐᓂᓕᐅᕝᕙᓪᓗ_ᐃᓐᓂᓂ` -2. `ik;_ᑲᐤ_(_ᑕᐦᐁᐧᔭᐍᑎ` -3. `am_ᐁr_ē-nata_ost` +1. `_ck.._ntahkwiwre` +2. 
`iw._ey_îskānakat` +3. `asuét):_ᓅᐦᑭᑫᓂᐤ..` **Context Size 2:** -1. `inēhiyiy-âyot_ayi` -2. `,_ᐱᔪᓐᓇᖅ_ᖂᑉ_ᒪᓕᒋᐊᓕᖕ` -3. `_ᐊᑕᐦᑐᒥᒃ_ᑐᒃᓯᓪᓗᓂ_ᐊᓯ` +1. `initahtâw._ᑭᒋᒧᐏᐣ_` +2. `,_miyis_nawamēwik` +3. `ikawahtawāt_kin_o` **Context Size 3:** -1. `in_nešt_mâk_ekwa_a` -2. `anininisiniw._pask` -3. `win_okiskān_tipēna` +1. `in_itakwa_é-nipaho` +2. `anitināw_ōnahkân_a` +3. `winaka_kikamîw-sîp` **Context Size 4:** -1. `wak_tāpihikan_ᐆᒪ_ᐊᐢ` -2. `win_(statistics_(10` -3. `tion_métis_federati` +1. `wak_*` +2. `win_ᐊᑎᒽ_ᐯᔭᒄ_ᓀᐦᐃᔭᐍᐏᐣ` +3. `tion:_saskapi_qc_y_` ### Key Findings - **Best Predictability:** Context-4 (word) with 99.1% predictability - **Branching Factor:** Decreases with context size (more deterministic) -- **Memory Trade-off:** Larger contexts require more storage (11,842 contexts) +- **Memory Trade-off:** Larger contexts require more storage (11,392 contexts) - **Recommendation:** Context-3 or Context-4 for text generation --- @@ -302,9 +334,9 @@ Below are text samples generated from each subword-based Markov chain model: | Metric | Value | |--------|-------| -| Vocabulary Size | 489 | -| Total Tokens | 1,731 | -| Mean Frequency | 3.54 | +| Vocabulary Size | 468 | +| Total Tokens | 1,673 | +| Mean Frequency | 3.57 | | Median Frequency | 2 | | Frequency Std Dev | 3.40 | @@ -312,11 +344,11 @@ Below are text samples generated from each subword-based Markov chain model: | Rank | Word | Frequency | |------|------|-----------| -| 1 | ᐁ | 34 | +| 1 | ᐁ | 31 | | 2 | e | 30 | | 3 | and | 22 | -| 4 | in | 22 | -| 5 | of | 22 | +| 4 | of | 22 | +| 5 | in | 21 | | 6 | pîsim | 19 | | 7 | articles | 18 | | 8 | cree | 16 | @@ -327,39 +359,39 @@ Below are text samples generated from each subword-based Markov chain model: | Rank | Word | Frequency | |------|------|-----------| -| 1 | ᐸᑦᑕᖕᓂᑦ | 2 | -| 2 | ordinateur | 2 | -| 3 | demandez | 2 | -| 4 | le | 2 | -| 5 | programme | 2 | -| 6 | eurêka | 2 | -| 7 | culture | 2 | -| 8 | 18 | 2 | -| 9 | août | 2 | +| 1 | ᑯᓐᓄᑦ | 2 | +| 2 | ᐊᒻᒪᐃᓛᒃ | 2 | +| 3 
| ᐊᑎᕐᒥᒃ | 2 | +| 4 | ᖃᕆᑕᐅᔭᕐᒧᑦ | 2 | +| 5 | ᐅᖃᐅᓯᕐᒥᒃ | 2 | +| 6 | ᐊᔾᔨᐅᖏᑦᑐᒥᒃ | 2 | +| 7 | ᑖᓐᓇ | 2 | +| 8 | ᑕᐃᓐᓇ | 2 | +| 9 | ᖃᕆᑕᐅᔭᒃᑯᑦ | 2 | | 10 | ᖃᐅᔨᓴᖅᑎᐅᔪᓄᑦ | 2 | ### Zipf's Law Analysis | Metric | Value | |--------|-------| -| Zipf Coefficient | 0.5522 | -| R² (Goodness of Fit) | 0.947702 | +| Zipf Coefficient | 0.5578 | +| R² (Goodness of Fit) | 0.947960 | | Adherence Quality | **excellent** | ### Coverage Analysis | Top N Words | Coverage | |-------------|----------| -| Top 100 | 47.6% | +| Top 100 | 48.8% | | Top 1,000 | 0.0% | | Top 5,000 | 0.0% | | Top 10,000 | 0.0% | ### Key Findings -- **Zipf Compliance:** R²=0.9477 indicates excellent adherence to Zipf's law -- **High Frequency Dominance:** Top 100 words cover 47.6% of corpus -- **Long Tail:** -9,511 words needed for remaining 100.0% coverage +- **Zipf Compliance:** R²=0.9480 indicates excellent adherence to Zipf's law +- **High Frequency Dominance:** Top 100 words cover 48.8% of corpus +- **Long Tail:** -9,532 words needed for remaining 100.0% coverage --- ## 5. 
Word Embeddings Evaluation @@ -375,37 +407,38 @@ Below are text samples generated from each subword-based Markov chain model: ### 5.1 Cross-Lingual Alignment -> *Note: Multilingual alignment visualization not available for this language.* +![Multilingual t-SNE](visualizations/embedding_tsne_multilingual.png) ### 5.2 Model Comparison | Model | Dimension | Isotropy | Semantic Density | Alignment R@1 | Alignment R@10 | |-------|-----------|----------|------------------|---------------|----------------| -| **mono_32d** | 32 | 0.0381 🏆 | 0.0000 | N/A | N/A | -| **mono_64d** | 64 | 0.0033 | 0.0000 | N/A | N/A | +| **mono_32d** | 32 | 0.0354 | 0.0000 | N/A | N/A | +| **mono_64d** | 64 | 0.0038 | 0.0000 | N/A | N/A | | **mono_128d** | 128 | 0.0000 | 0.0000 | N/A | N/A | +| **aligned_32d** | 32 | 0.0354 🏆 | 0.0000 | 0.0000 | 0.0000 | +| **aligned_64d** | 64 | 0.0038 | 0.0000 | 0.0000 | 0.0000 | +| **aligned_128d** | 128 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | ### Key Findings -- **Best Isotropy:** mono_32d with 0.0381 (more uniform distribution) +- **Best Isotropy:** aligned_32d with 0.0354 (more uniform distribution) - **Semantic Density:** Average pairwise similarity of 0.0000. Lower values indicate better semantic separation. -- **Alignment Quality:** No aligned models evaluated in this run. +- **Alignment Quality:** Aligned models evaluated but achieved 0% recall. - **Recommendation:** 128d aligned for best cross-lingual performance --- ## 6. Morphological Analysis (Experimental) -> ⚠️ **Warning:** This language shows low morphological productivity. The statistical signals used for this analysis may be noisy or less reliable than for morphologically rich languages. - This section presents an automated morphological analysis derived from the statistical divergence between word-level and subword-level models. By analyzing where subword predictability spikes and where word-level coverage fails, we can infer linguistic structures without supervised data. 
### 6.1 Productivity & Complexity | Metric | Value | Interpretation | Recommendation | |--------|-------|----------------|----------------| -| Productivity Index | **0.000** | Low morphological productivity | ⚠️ Likely unreliable | -| Idiomaticity Gap | **-1.000** | Low formulaic content | - | +| Productivity Index | **5.000** | High morphological productivity | Reliable analysis | +| Idiomaticity Gap | **0.933** | High formulaic/idiomatic content | - | ### 6.2 Affix Inventory (Productive Units) @@ -438,7 +471,9 @@ Using **Recursive Hierarchical Substitutability**, we decompose complex words in ### 6.6 Linguistic Interpretation > **Automated Insight:** -The language CR appears to be more isolating or has a highly fixed vocabulary. Word-level models perform nearly as well as subword models, indicating fewer productive morphological processes. +The language Cree shows high morphological productivity. The subword models are significantly more efficient than word models, suggesting a rich system of affixation or compounding. + +> **Note on Idiomaticity:** The high Idiomaticity Gap suggests a large number of frequent multi-word expressions or formulaic sequences that are statistically distinct from their component parts. --- ## 7. Summary & Recommendations @@ -449,7 +484,7 @@ The language CR appears to be more isolating or has a highly fixed vocabulary. W | Component | Recommended | Rationale | |-----------|-------------|-----------| -| Tokenizer | **8k BPE** | Best compression (3.18x) | +| Tokenizer | **8k BPE** | Best compression (3.24x) | | N-gram | **3-gram** | Lowest perplexity (15) | | Markov | **Context-4** | Highest predictability (99.1%) | | Embeddings | **100d** | Balanced semantic capture and isotropy | @@ -665,4 +700,4 @@ MIT License - Free for academic and commercial use. 
--- *Generated by Wikilangs Models Pipeline* -*Report Date: 2026-01-03 10:19:03* +*Report Date: 2026-01-03 20:39:39* diff --git a/models/embeddings/aligned/cr_128d.bin b/models/embeddings/aligned/cr_128d.bin new file mode 100644 index 0000000000000000000000000000000000000000..3c8058c5d0aca2010fae907f843f89aab5854afc --- /dev/null +++ b/models/embeddings/aligned/cr_128d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5dacb587c7d15197d345442364b1889a8b7a457b453f7df9253e97673b4fb352 +size 1024067754 diff --git a/models/embeddings/aligned/cr_128d.meta.json b/models/embeddings/aligned/cr_128d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..3ab38b3b5bc2359f90667912834e3139ecdefccd --- /dev/null +++ b/models/embeddings/aligned/cr_128d.meta.json @@ -0,0 +1 @@ +{"lang": "cr", "dim": 128, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/cr_128d.projection.npy b/models/embeddings/aligned/cr_128d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..2acb4582e9a9e7248b93a99d1426bff18251e1f7 --- /dev/null +++ b/models/embeddings/aligned/cr_128d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2b6ebec184c4af672ce0f571958b634809f46f6a1ea2dc7e6f8f2e7f53555387 +size 65664 diff --git a/models/embeddings/aligned/cr_128d_metadata.json b/models/embeddings/aligned/cr_128d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..0ac3287395f757e6c89c3812c4512459aaca8eb0 --- /dev/null +++ b/models/embeddings/aligned/cr_128d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "cr", + "dimension": 128, + "version": "aligned", + "hub_language": "en", + "seed_vocab_size": 39, + "vocab_size": 65 +} \ No newline at end of file diff --git a/models/embeddings/aligned/cr_32d.bin b/models/embeddings/aligned/cr_32d.bin new file mode 100644 index 
0000000000000000000000000000000000000000..d3068724288d2906823e64b69d63547a3f961a03 --- /dev/null +++ b/models/embeddings/aligned/cr_32d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6e6a077873b64cad1912b66853d07d4f890bc6606e6b08083d4bfd3153121129 +size 256017834 diff --git a/models/embeddings/aligned/cr_32d.meta.json b/models/embeddings/aligned/cr_32d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..0bbfec7f027ac284a62129baae08c4e62e1d518f --- /dev/null +++ b/models/embeddings/aligned/cr_32d.meta.json @@ -0,0 +1 @@ +{"lang": "cr", "dim": 32, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/cr_32d.projection.npy b/models/embeddings/aligned/cr_32d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..a9bffe616ff1542a9afc486de72b31b47673c9c4 --- /dev/null +++ b/models/embeddings/aligned/cr_32d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b14cf327498f9c92f06cc00df8284bfbdfd03069966ad20b3ada72dc5ce02488 +size 4224 diff --git a/models/embeddings/aligned/cr_32d_metadata.json b/models/embeddings/aligned/cr_32d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..ca440bcd14645e4663acc8c36c51e72f02bd0852 --- /dev/null +++ b/models/embeddings/aligned/cr_32d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "cr", + "dimension": 32, + "version": "aligned", + "hub_language": "en", + "seed_vocab_size": 39, + "vocab_size": 65 +} \ No newline at end of file diff --git a/models/embeddings/aligned/cr_64d.bin b/models/embeddings/aligned/cr_64d.bin new file mode 100644 index 0000000000000000000000000000000000000000..0ea7c1d7a48818811ea2e85ea1d838fb3f045351 --- /dev/null +++ b/models/embeddings/aligned/cr_64d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fd5960af306a49bc16d12139a057de3daae86c9d43e31642ee7dbd085eb553b7 +size 512034474 diff --git 
a/models/embeddings/aligned/cr_64d.meta.json b/models/embeddings/aligned/cr_64d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..6e2ff3e036916367c0fc5d6738b7a7489c000254 --- /dev/null +++ b/models/embeddings/aligned/cr_64d.meta.json @@ -0,0 +1 @@ +{"lang": "cr", "dim": 64, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/cr_64d.projection.npy b/models/embeddings/aligned/cr_64d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..e1eb64fe866f15ae9c63501844f6691c9eb9e35a --- /dev/null +++ b/models/embeddings/aligned/cr_64d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d846b3fae902e708babece2e6c2e3c62275450ddb7096ca337af53dcf63a96b6 +size 16512 diff --git a/models/embeddings/aligned/cr_64d_metadata.json b/models/embeddings/aligned/cr_64d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..74c7a6b5374dd4a5dccc8a1a13993f23adeb2f86 --- /dev/null +++ b/models/embeddings/aligned/cr_64d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "cr", + "dimension": 64, + "version": "aligned", + "hub_language": "en", + "seed_vocab_size": 39, + "vocab_size": 65 +} \ No newline at end of file diff --git a/models/embeddings/monolingual/cr_128d.bin b/models/embeddings/monolingual/cr_128d.bin index 455deccb7f2c6542abc5eeef346b9e6a11ad5f14..3c8058c5d0aca2010fae907f843f89aab5854afc 100644 --- a/models/embeddings/monolingual/cr_128d.bin +++ b/models/embeddings/monolingual/cr_128d.bin @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:322799eab6ee84e7d28a0f76de24efb5b70f9e1fcd0eef88c9b61829ce272397 +oid sha256:5dacb587c7d15197d345442364b1889a8b7a457b453f7df9253e97673b4fb352 size 1024067754 diff --git a/models/embeddings/monolingual/cr_32d.bin b/models/embeddings/monolingual/cr_32d.bin index 6fc586ab738c751274965f08fb144b17f2c8faf3..d3068724288d2906823e64b69d63547a3f961a03 100644 --- 
a/models/embeddings/monolingual/cr_32d.bin +++ b/models/embeddings/monolingual/cr_32d.bin @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:095584a1fc693fcab2c3fd7f0ac831d533933eb58ce565c3b082866e303db9f6 +oid sha256:6e6a077873b64cad1912b66853d07d4f890bc6606e6b08083d4bfd3153121129 size 256017834 diff --git a/models/embeddings/monolingual/cr_64d.bin b/models/embeddings/monolingual/cr_64d.bin index 62487a4041e07e833f2224edeafdd9c9f6bd0b40..0ea7c1d7a48818811ea2e85ea1d838fb3f045351 100644 --- a/models/embeddings/monolingual/cr_64d.bin +++ b/models/embeddings/monolingual/cr_64d.bin @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:fdd5c40a0696cfa8faa26c56bda982ab2a5efbbc0bc7d30fcea9c944b571b542 +oid sha256:fd5960af306a49bc16d12139a057de3daae86c9d43e31642ee7dbd085eb553b7 size 512034474 diff --git a/models/subword_markov/cr_markov_ctx1_subword.parquet b/models/subword_markov/cr_markov_ctx1_subword.parquet index fab5d1105425cb521bc8e2d4833f35ed80734490..e3d49433c90e4e2e326082bbdfc4e23fb0314250 100644 --- a/models/subword_markov/cr_markov_ctx1_subword.parquet +++ b/models/subword_markov/cr_markov_ctx1_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:d336a63f8ad5e082086c17ec41db170270495929a8dd340a6abcd2e5998bc0e6 -size 19362 +oid sha256:d58185c9c750453fec2b1be4bea145ca19c8154e9c1851c770fcfe60e0945619 +size 19062 diff --git a/models/subword_markov/cr_markov_ctx1_subword_metadata.json b/models/subword_markov/cr_markov_ctx1_subword_metadata.json index 4991a2a5284988848c502ff6264907ec3ea1748f..7194a76f154cec92918d288e28a3907fa313c190 100644 --- a/models/subword_markov/cr_markov_ctx1_subword_metadata.json +++ b/models/subword_markov/cr_markov_ctx1_subword_metadata.json @@ -2,6 +2,6 @@ "context_size": 1, "variant": "subword", "language": "cr", - "unique_contexts": 273, - "total_transitions": 21066 + "unique_contexts": 271, + "total_transitions": 20269 } \ No newline at end of file diff --git 
a/models/subword_markov/cr_markov_ctx2_subword.parquet b/models/subword_markov/cr_markov_ctx2_subword.parquet index d662c9ebb1f08134ddff69b229c324a3685fe7fd..c8087bb3246f02f15aae03d3e97792ce29f57d90 100644 --- a/models/subword_markov/cr_markov_ctx2_subword.parquet +++ b/models/subword_markov/cr_markov_ctx2_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:88cd25c549a06a864e6af9c0a79d29d91815a704038e65f2ce7d2c821349c095 -size 54769 +oid sha256:3f22b06562e4d08f3c27092ecc548a0cae5a9446cc2ae3e54f2073e8b8900e95 +size 52808 diff --git a/models/subword_markov/cr_markov_ctx2_subword_metadata.json b/models/subword_markov/cr_markov_ctx2_subword_metadata.json index b93a810a58f66ea24e1b17a0a4998c324635c272..f2c8b54a13ad47c36cf28a2b6c478e9411a2bf2b 100644 --- a/models/subword_markov/cr_markov_ctx2_subword_metadata.json +++ b/models/subword_markov/cr_markov_ctx2_subword_metadata.json @@ -2,6 +2,6 @@ "context_size": 2, "variant": "subword", "language": "cr", - "unique_contexts": 2872, - "total_transitions": 21041 + "unique_contexts": 2789, + "total_transitions": 20244 } \ No newline at end of file diff --git a/models/subword_markov/cr_markov_ctx3_subword.parquet b/models/subword_markov/cr_markov_ctx3_subword.parquet index 1d33c19df6e0d17534ee93deddbb468bb7c5738b..763fb95832399cb107f479c624eab45d008ed151 100644 --- a/models/subword_markov/cr_markov_ctx3_subword.parquet +++ b/models/subword_markov/cr_markov_ctx3_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:493e48b8388399c0693ca1644064c6aff29ebd8768e02dc362bd13a9beb9923c -size 109170 +oid sha256:f2ad975814a144c1719a36efe4086171171e4415b69c07c0cfc7589d7b44408b +size 105153 diff --git a/models/subword_markov/cr_markov_ctx3_subword_metadata.json b/models/subword_markov/cr_markov_ctx3_subword_metadata.json index f49cc20b038a50ad39da1114c0daa6d36515fdc4..61a95af4d3a69d0854d038428f939ba9c1ca0f9f 100644 --- 
a/models/subword_markov/cr_markov_ctx3_subword_metadata.json +++ b/models/subword_markov/cr_markov_ctx3_subword_metadata.json @@ -2,6 +2,6 @@ "context_size": 3, "variant": "subword", "language": "cr", - "unique_contexts": 7557, - "total_transitions": 21016 + "unique_contexts": 7299, + "total_transitions": 20219 } \ No newline at end of file diff --git a/models/subword_markov/cr_markov_ctx4_subword.parquet b/models/subword_markov/cr_markov_ctx4_subword.parquet index c709a267bbece03397409834f0cd471f00d8ff4d..462a2446ac5c7e0987bb2a271a0bf25a3b35af99 100644 --- a/models/subword_markov/cr_markov_ctx4_subword.parquet +++ b/models/subword_markov/cr_markov_ctx4_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:77ef47f052b70e08ed38cb704ef21e780c36aa884a719e5d4963f672dd6af637 -size 165104 +oid sha256:fbe0924ddd06b05b7652bd329b4be5d86de19b757a42631954cd9abc0a1910f8 +size 158381 diff --git a/models/subword_markov/cr_markov_ctx4_subword_metadata.json b/models/subword_markov/cr_markov_ctx4_subword_metadata.json index fef471d63f2758632ff4e1b878ede61d0dbde114..b3c1393a88375bbe7c32e58c045459968deada6a 100644 --- a/models/subword_markov/cr_markov_ctx4_subword_metadata.json +++ b/models/subword_markov/cr_markov_ctx4_subword_metadata.json @@ -2,6 +2,6 @@ "context_size": 4, "variant": "subword", "language": "cr", - "unique_contexts": 11842, - "total_transitions": 20991 + "unique_contexts": 11392, + "total_transitions": 20194 } \ No newline at end of file diff --git a/models/subword_ngram/cr_2gram_subword.parquet b/models/subword_ngram/cr_2gram_subword.parquet index dcc9512e4a319c726e4c8128abb8ee4b6350f8f1..67c529a3c84f13446b5851ad8bf4c0b1f6fce10e 100644 --- a/models/subword_ngram/cr_2gram_subword.parquet +++ b/models/subword_ngram/cr_2gram_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:4ec4f4bbb6a07a46393df0aae6d9e7264c3913c983048c199b187ccf1637509f -size 10150 +oid 
sha256:437302b4fde849c0752032e34257d718792b347ef32757c168db42d3025abc26 +size 9822 diff --git a/models/subword_ngram/cr_2gram_subword_metadata.json b/models/subword_ngram/cr_2gram_subword_metadata.json index 2d880626a4c032248b6610ede7bb916439e07537..0746b00d519daa593013a6b92752b0bac1d00c40 100644 --- a/models/subword_ngram/cr_2gram_subword_metadata.json +++ b/models/subword_ngram/cr_2gram_subword_metadata.json @@ -2,6 +2,6 @@ "n": 2, "variant": "subword", "language": "cr", - "unique_ngrams": 848, - "total_ngrams": 21066 + "unique_ngrams": 812, + "total_ngrams": 20269 } \ No newline at end of file diff --git a/models/subword_ngram/cr_3gram_subword.parquet b/models/subword_ngram/cr_3gram_subword.parquet index 255349a1ff8f1db8a610231774ece0509a776ae0..4c0c35549eb5112406164d88b4427469161f180a 100644 --- a/models/subword_ngram/cr_3gram_subword.parquet +++ b/models/subword_ngram/cr_3gram_subword.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:3a341e935103c45c5dd7be03811b9df507bbbcb941614f0a4d3949665defe332 -size 22181 +oid sha256:7980b5aac9cba90ebe083cb0bfd2b3416465c2a52b83cbcf3a73434e722de7a0 +size 21356 diff --git a/models/subword_ngram/cr_3gram_subword_metadata.json b/models/subword_ngram/cr_3gram_subword_metadata.json index 60c6acd2ab90d1c73ee474e9df1367d8e014352b..201e345d8bff3f8524d067cd3bba099c3b815e44 100644 --- a/models/subword_ngram/cr_3gram_subword_metadata.json +++ b/models/subword_ngram/cr_3gram_subword_metadata.json @@ -2,6 +2,6 @@ "n": 3, "variant": "subword", "language": "cr", - "unique_ngrams": 1986, - "total_ngrams": 21041 + "unique_ngrams": 1902, + "total_ngrams": 20244 } \ No newline at end of file diff --git a/models/subword_ngram/cr_4gram_subword.parquet b/models/subword_ngram/cr_4gram_subword.parquet index 7a3ddd5bfd8c4477233442723c0c1cdcf99bdf31..261b7c0844a509d830f7c6c94e1c2615efceaee5 100644 --- a/models/subword_ngram/cr_4gram_subword.parquet +++ b/models/subword_ngram/cr_4gram_subword.parquet @@ -1,3 +1,3 @@ 
version https://git-lfs.github.com/spec/v1 -oid sha256:36e06026aa7388f2d7e58a4f337a1ff541dac38696cfa56eaaaec940804c17a1 -size 46333 +oid sha256:b2bcb275f3772e5f6b8bc913b0b382afa84dac97f36e76b7d03ee8a6db106e56 +size 44325 diff --git a/models/subword_ngram/cr_4gram_subword_metadata.json b/models/subword_ngram/cr_4gram_subword_metadata.json index 5d1e7b4b6cf8acf894813d77ceaf7ab0edcfa8fa..1fd8a87a8e3b7ceb41c597e379ab6e32f0262fdc 100644 --- a/models/subword_ngram/cr_4gram_subword_metadata.json +++ b/models/subword_ngram/cr_4gram_subword_metadata.json @@ -2,6 +2,6 @@ "n": 4, "variant": "subword", "language": "cr", - "unique_ngrams": 3878, - "total_ngrams": 21016 + "unique_ngrams": 3702, + "total_ngrams": 20219 } \ No newline at end of file diff --git a/models/subword_ngram/cr_5gram_subword.parquet b/models/subword_ngram/cr_5gram_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..fb59f2ac4a8b2afcea1ac1b5d37c5e4c153b749f --- /dev/null +++ b/models/subword_ngram/cr_5gram_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:18b2af82413c2d5e0b7124bc18273388641db7427c102a2465c939e7d69ffc67 +size 42412 diff --git a/models/subword_ngram/cr_5gram_subword_metadata.json b/models/subword_ngram/cr_5gram_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..812b65501d87993ce902339c43e863c6e77f6d6a --- /dev/null +++ b/models/subword_ngram/cr_5gram_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 5, + "variant": "subword", + "language": "cr", + "unique_ngrams": 3264, + "total_ngrams": 20194 +} \ No newline at end of file diff --git a/models/tokenizer/cr_tokenizer_8k.model b/models/tokenizer/cr_tokenizer_8k.model index 1a5573877a954065160fdbc3445e2717be72358b..74c97dc83be20fcd09c8526fd4c41012e637e000 100644 --- a/models/tokenizer/cr_tokenizer_8k.model +++ b/models/tokenizer/cr_tokenizer_8k.model @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid 
sha256:a0973f38231d8be6cc17f5bb11b4adcec87ca7e994a2d91d59f4a7f15ea4655f -size 379309 +oid sha256:aafdacc6f2d991f561954af6f998ab1116a7904c0d6408e0e6f25d2ce7b6f625 +size 379259 diff --git a/models/tokenizer/cr_tokenizer_8k.vocab b/models/tokenizer/cr_tokenizer_8k.vocab index 8c6821b6c7c241b48ebe7352b882787c337ac8e3..89a9cd253a1023f404cdebef12e1362cd41c6ba6 100644 --- a/models/tokenizer/cr_tokenizer_8k.vocab +++ b/models/tokenizer/cr_tokenizer_8k.vocab @@ -9,7754 +9,7754 @@ an -3 wa -4 ▁k -5 ▁n -6 -▁a -7 -it -8 -hk -9 -▁m -10 -▁ᐊ -11 -▁( -12 +it -7 +▁a -8 +▁m -9 +hk -10 +▁( -11 +▁ᐊ -12 ta -13 im -14 ▁c -15 iw -16 -▁ᐁ -17 +▁o -17 sk -18 -▁o -19 +▁ᐁ -19 ▁p -20 ih -21 ▁s -22 ▁e -23 es -24 -ti -25 -iy -26 -ᐃᐧ -27 -wâ -28 +iy -25 +ti -26 +wâ -27 +ᐃᐧ -28 ▁ᐅ -29 on -30 er -31 -os -32 -ᐦᐃ -33 -am -34 -▁ᐃ -35 -win -36 -wak -37 -▁t -38 -▁w -39 -pa -40 -îs -41 -as -42 -or -43 -▁ᑭ -44 -▁ᑲ -45 -▁an -46 -na -47 -al -48 -âw -49 -ᑯᓯ -50 -ec -51 -▁l -52 -▁in -53 -aw -54 -en -55 -om -56 -ᐊᐧ -57 -ᐱᐦ -58 -▁ᒥ -59 -isk -60 -âk -61 -▁d -62 -cik -63 +am -32 +os -33 +win -34 +wak -35 +ᐦᐃ -36 +▁w -37 +pa -38 +îs -39 +▁ᐃ -40 +or -41 +▁ᑭ -42 +▁an -43 +as -44 +na -45 +▁t -46 +al -47 +ec -48 +en -49 +âw -50 +▁l -51 +ᑯᓯ -52 +aw -53 +▁ᒥ -54 +▁in -55 +ᐱᐦ -56 +om -57 +ᐊᐧ -58 +▁d -59 +▁ᑲ -60 +isk -61 +cik -62 +âk -63 ihk -64 -ak -65 -ᐸᐦ -66 -ᐃᐧᐣ -67 -ot -68 +ᐸᐦ -65 +ak -66 +ot -67 +▁f -68 ask -69 -ân -70 +ci -70 ᑲᐣ -71 -▁f -72 -ci -73 -▁ē -74 -▁ᓀ -75 +▁ē -72 +▁ᓀ -73 +ân -74 +▁r -75 ▁ᓂ -76 -▁r -77 +ᐃᐧᐣ -77 ic -78 ati -79 kwa -80 -tic -81 -▁of -82 -▁and -83 -at -84 -ht -85 -▁ᐱ -86 -▁ᒪ -87 -▁cr -88 -ation -89 -ee -90 -ew -91 -il -92 -un -93 -▁b -94 -îsim -95 -gl -96 -ow -97 -ᐦᒋ -98 -ᑲᐧ -99 -▁- -100 -▁ᓇ -101 -ect -102 -les -103 -ter -104 -▁ar -105 -▁th -106 -ikan -107 -pîsim -108 -âp -109 -tim -110 -îhk -111 -owin -112 -ticles -113 -▁articles -114 -ad -115 -ar -116 -ce -117 -ch -118 -ᐦᑖ -119 -ᐧᐃ -120 -▁ᐯ -121 -▁ᑎ -122 -iht -123 -ᑕᑯᓯ -124 -▁ᐊᔨ -125 -▁ᓀᐦᐃ -126 -.. 
-127 -th -128 -ât -129 -ᐢᑭ -130 -ᑲᓐ -131 -▁ᑳ -132 -kîs -133 -isiw -134 -kîsik -135 -▁cree -136 -pi -137 -âs -138 +▁of -81 +▁and -82 +at -83 +ee -84 +tic -85 +▁cr -86 +ation -87 +ew -88 +gl -89 +il -90 +un -91 +▁b -92 +▁ᐱ -93 +îsim -94 +ow -95 +▁- -96 +▁ᓇ -97 +ect -98 +les -99 +ter -100 +▁ar -101 +ikan -102 +pîsim -103 +ad -104 +ht -105 +ᐦᒋ -106 +ᑲᐧ -107 +▁ᒪ -108 +tim -109 +owin -110 +ticles -111 +▁articles -112 +ar -113 +âp -114 +▁ᑎ -115 +îhk -116 +ᑕᑯᓯ -117 +▁ᓀᐦᐃ -118 +.. -119 +ac -120 +th -121 +ât -122 +ᐢᑭ -123 +ᑲᓐ -124 +▁ᐯ -125 +▁ᑳ -126 +iht -127 +ish -128 +kîs -129 +▁ᐊᔨ -130 +isiw -131 +kîsik -132 +▁cree -133 +eh -134 +pi -135 +âs -136 +ᐦᑖ -137 +ᐧᐃ -138 ▁ᐋ -139 -▁ᒫ -140 -ial -141 -ish -142 -ist -143 -rit -144 -ᐸᐦᐄ -145 -▁dial -146 -▁ᓀᐦᐃᔭ -147 -kîsikâw -148 -▁dialect -149 -eh -150 -ig -151 -um -152 -us -153 -îp -154 -▁y -155 -▁ᐦ -156 -ing -157 -tam -158 -▁ka -159 -êwak -160 -▁pim -161 -ᐱᐦᑕᑯᓯ -162 -ᐸᐦᐄᑲᓐ -163 -▁ᑎᐸᐦᐄᑲᓐ -164 -ed -165 -oc -166 -pô -167 -āt -168 -ita -169 -wâw -170 -▁iy -171 -tern -172 -▁for -173 -ap -174 -ni -175 -ou -176 -px -177 -êw -178 -ēh -179 -ᐦᐠ -180 -▁q -181 -▁· -182 -▁â -183 -▁ᒌ -184 -hci -185 -tis -186 -ᐊᐧᐠ -187 -▁ᐯᔭ -188 +ial -140 +ist -141 +rit -142 +ᐸᐦᐄ -143 +▁dial -144 +▁ᓀᐦᐃᔭ -145 +kîsikâw -146 +▁dialect -147 +▁ᐦ -148 +▁ᒫ -149 +tam -150 +▁ka -151 +▁th -152 +êwak -153 +▁pim -154 +ᐱᐦᑕᑯᓯ -155 +ᐸᐦᐄᑲᓐ -156 +▁ᑎᐸᐦᐄᑲᓐ -157 +ch -158 +ed -159 +oc -160 +ou -161 +pô -162 +us -163 +āt -164 +ᐟᐨ -165 +▁y -166 +ing -167 +ita -168 +wâw -169 +▁iy -170 +tern -171 +▁for -172 +ey -173 +ni -174 +px -175 +êw -176 +îp -177 +ēh -178 +ᐦᐠ -179 +▁q -180 +▁· -181 +▁â -182 +▁ᒌ -183 +hci -184 +tis -185 +ᐊᐧᐠ -186 +▁ᐯᔭ -187 +▁the -188 ▁ᐁᑲᐧ -189 ᐱᐦᑕᑯᓯᓴ -190 ▁ᐊᐱᐦᑕᑯᓯᓴ -191 -ey -192 -ob -193 -ol -194 -îw -195 -ān -196 -ᐏᐣ -197 -ᓇᐠ -198 -ᔨᓂ -199 -wâk -200 -ᐦᑖᓱ -201 -▁ay -202 -▁ta -203 -▁ᐊᐧ -204 -iwak -205 -▁nam -206 -▁the -207 -▁atim -208 -▁ēkwa -209 -): -210 -ac -211 -kw -212 -oh -213 -ok -214 -ᐁᐧ -215 -ᐟᐨ -216 -ᐢᑫ -217 -▁h -218 -▁ᐸ -219 -▁ᒨ -220 -hki -221 -hpô -222 -ies 
-223 -ome -224 -pah -225 -pay -226 -▁gl -227 -▁kâ -228 -▁mé -229 -▁on -230 -▁ᐸᐦ -231 -aniw -232 -inos -233 -isin -234 -▁can -235 -▁kik -236 -▁ᐯᔭᒄ -237 -▁ahpô -238 -▁some -239 -▁métis -240 -ab -241 -el -242 -iš -243 -āk -244 -āw -245 -▁ᑌ -246 -▁ᒉ -247 -▁ᒣ -248 -▁ᒦ -249 -▁ᓴ -250 -ill -251 -imu -252 -iwa -253 -onk -254 -way -255 -ᓯᑲᐣ -256 -▁en -257 -▁is -258 -▁kā -259 -▁ᐊᒋ -260 -ikot -261 -ᒧᐃᐧᐣ -262 -▁ᐅᐦᒋ -263 -▁nonk -264 -▁writ -265 -illing -266 -▁canad -267 -▁pimîhk -268 -▁nonkilling -269 -), -270 -ay -271 -eš -272 -gr -273 -le -274 -ma -275 -sh -276 -ᐘᐠ -277 -ᐦᑕ -278 -ᒋᐠ -279 -ᓇᐤ -280 -▁* -281 -▁ᑕ -282 -▁ᒧ -283 -... -284 -cih -285 -umb -286 -īna -287 -▁kī -288 -▁wâ -289 -▁ᑭᒋ -290 -ikam -291 -iniw -292 -▁ask -293 -▁cen -294 -▁loc -295 -âpiht -296 -▁list -297 -▁mīna -298 -▁ohci -299 -aniwiw -300 -ihikan -301 -▁kinos -302 -▁ᐊᒋᐦᑖᓱ -303 -▁location -304 -ah -305 -em -306 -et -307 -kī -308 -êh -309 -îm -310 -ôt -311 -ēw -312 -ᐋᐧ -313 -ᐘᐣ -314 -ᐢᑌ -315 -ᐢᑎ -316 -ᑳᐧ -317 -ᓂᐠ -318 -ᓯᒼ -319 -ᕒᐃ -320 -▁ᐆ -321 -▁ᐘ -322 -▁ᓃ -323 -▁ᓵ -324 -ast -325 -ata -326 -iso -327 -tak -328 -ten -329 -wah -330 -wes -331 -âna -332 -ᐃᐧᓇ -333 -ᐢᑫᐧ -334 -ᑯᓯᐤ -335 -▁it -336 -▁or -337 -▁pa -338 -▁st -339 -▁ᐃᔨ -340 -▁ᑲᔦ -341 -kisk -342 -obal -343 -ēwin -344 -ᐦᐃᑲᐣ -345 -ᑵᓯᑲᐣ -346 -glish -347 -payik -348 -▁mask -349 -▁name -350 -astern -351 -▁canada -352 -▁global -353 -▁english -354 -▁written -355 -ek -356 -ll -357 -op -358 -st -359 -up -360 -ēk -361 -ēn -362 -ēs -363 -ēy -364 -īh -365 -ᑌᐤ -366 -ᑭᓂ -367 -ᒥᐦ -368 -ᓇᑕ -369 -ᓯᐤ -370 -▁j -371 -▁v -372 -▁ê -373 -▁ᑫ -374 -▁ᓯ -375 -aph -376 -ard -377 -ces -378 -ith -379 -iya -380 -ogr -381 -owi -382 -tal -383 -▁ch -384 -▁cî -385 -▁ki -386 -▁qc -387 -▁ᐃᐧ -388 -▁ᐃᓯ -389 -▁ᐅᓂ -390 -▁ᑭᑭ -391 -anit -392 -osiw -393 -tahk -394 -ther -395 -▁awa -396 -▁com -397 -▁isk -398 -▁neh -399 -▁nik -400 -▁nēh -401 -▁por -402 -▁rom -403 -▁kita -404 -awēwin -405 -ograph -406 -▁thumb -407 -▁nation -408 -▁portal -409 -▁ᐊᒋᐦᑖᓱᓐ -410 -▁ᐸᐦᑵᓯᑲᐣ -411 -âkaniwiw -412 -▁kinosêw -413 
-▁maskisin -414 -▁pimîhkân -415 -▁ᐊᐱᐦᑕᑯᓯᓴᐣ -416 -▁ᐊᐱᐦᑕᑯᓯᓴᓇᐠ -417 -ag -418 -tu -419 -ul -420 -ᐎᓐ -421 -ᐢᑯ -422 -ᐢᑲ -423 -ᐦᐅ -424 -ᐦᑳ -425 -ᐦᒡ -426 -ᐧᒥ -427 -ᑌᒡ -428 -ᑕᐦ -429 -ᑭᓯ -430 -ᑲᔭ -431 -ᒁᐘ -432 -ᒥᐣ -433 -▁ô -434 -▁ᐄ -435 -▁ᐳ -436 -▁ᑮ -437 -▁ᒋ -438 -ana -439 -and -440 -awî -441 -awī -442 -ihi -443 -ina -444 -ite -445 -mun -446 -oos -447 -ota -448 -oun -449 -sus -450 -tay -451 -wân -452 -yll -453 -âhk -454 -âsk -455 -âwi -456 -ᐁᐧᐃ -457 -ᑲᔭᓯ -458 -ᓯᓇᐤ -459 -▁ak -460 -▁kî -461 -▁pr -462 -▁wî -463 -▁ᐃᔑ -464 -▁ᐊᐦ -465 -▁ᐊᓂ -466 -▁ᒥᑕ -467 -▁ᒨᔅ -468 -▁ᒫᒃ -469 -▁ᓂᑕ -470 -ight -471 -ihci -472 -inak -473 -paht -474 -wâpi -475 -ᐁᐧᐃᐣ -476 -ᑭᐢᑫᐧ -477 -ᑭᓂᑲᐣ -478 -▁mon -479 -▁mâk -480 -▁pak -481 -▁pik -482 -▁way -483 -▁ᐊᔨᒧ -484 -▁ᐋᐱᐦ -485 -▁ᑲᔦᐦ -486 -ihtam -487 -munit -488 -thern -489 -yllab -490 -▁aniy -491 -▁east -492 -▁iskw -493 -▁kika -494 -▁kisk -495 -▁orth -496 -andard -497 -imuwin -498 -îmowin -499 -ᒁᐘᑭᓂᑲᐣ -500 -▁anihi -501 -▁roman -502 -▁ᓀᐦᐃᔭᐍ -503 -eastern -504 -ihkêwak -505 -ography -506 -western -507 -▁census -508 -▁syllab -509 -▁ispayik -510 -▁communit -511 -▁standard -512 -▁orthography -513 -cî -514 -ef -515 -hi -516 -ia -517 -kâ -518 -lu -519 -nu -520 -pē -521 -si -522 -ué -523 -yû -524 -ák -525 -îk -526 -ᐢᑕ -527 -ᐢᑮ -528 -ᐤ᙮ -529 -ᐦᑯ -530 -ᐧᐁ -531 -ᐧᐊ -532 -ᐨᑕ -533 -ᑎᑯ -534 -ᑖᓂ -535 -ᑭᐌ -536 -ᑲᑌ -537 -ᑲᓂ -538 -ᑳᐤ -539 -ᒪᓇ -540 -ᓂᐤ -541 -ᓂᑯ -542 -ᓱᐣ -543 -ᔨᐤ -544 -ᔪᐘ -545 -ᕒᐅ -546 -▁i -547 -▁é -548 -▁ō -549 -▁ᐲ -550 -▁ᑯ -551 -▁ᓰ -552 -awê -553 -bec -554 -enn -555 -eun -556 -gna -557 -ian -558 -kic -559 -ohk -560 -osk -561 -ost -562 -otā -563 -ple -564 -rad -565 -sîp -566 -taw -567 -wan -568 -ôhk -569 -ᐢᑌᐠ -570 -ᐢᑭᐤ -571 -ᐢᑭᑌ -572 -ᐦᒋᑫ -573 -ᐧᐃᓐ -574 -ᑳᐧᐤ -575 -ᒧᐏᐣ -576 -ᓅᐦᒡ -577 -▁ab -578 -▁at -579 -▁mā -580 -▁wa -581 -▁ᐁᑿ -582 -▁ᐃᓕ -583 -▁ᐅᒪ -584 -▁ᐊᓐ -585 -▁ᐱᓯ -586 -▁ᓴᑲ -587 -apān -588 -aska -589 -atch -590 -ewan -591 -igin -592 -inik -593 -itaw -594 -iyin -595 -osis -596 -owiy -597 -âkan -598 -ânis -599 -îwah -600 -ēsta -601 -ᐢᑎᑯᓯ -602 -ᔨᓂᐘᐣ -603 -ᔪᐘᐦᐠ -604 
-▁ekw -605 -▁mas -606 -▁mis -607 -▁neš -608 -▁nâp -609 -▁nīh -610 -▁pro -611 -▁pâs -612 -▁qué -613 -▁ref -614 -▁tān -615 -▁ᐁᐧᒥ -616 -▁ᐊᐢᑭ -617 -▁ᐊᐧᐁ -618 -▁ᑭᓱᐣ -619 -▁ᑲᓇᑕ -620 -▁ᓀᐢᑕ -621 -gnais -622 -isiwe -623 -lueun -624 -pahki -625 -▁ekwa -626 -▁innu -627 -▁iyyû -628 -▁nata -629 -▁nita -630 -▁piko -631 -▁âtay -632 -awîtim -633 -awīwin -634 -iyawak -635 -âkosis -636 -▁monta -637 -▁nēhiy -638 -▁nēsta -639 -▁tahki -640 -▁ᐃᓯᐦᒋᑫ -641 -▁ᐃᔨᓂᐘᐣ -642 -▁ᐱᓯᐢᑭᐤ -643 -▁ᒥᔪᐘᐦᐠ -644 -askatch -645 -▁atimwa -646 -▁nīhith -647 -▁québec -648 -▁ᓀᐦᐃᔭᐃᐧ -649 -ayimuwin -650 -iyiniwak -651 -îwahikan -652 -▁aniyiwa -653 -▁ᓀᐦᐃᔭᐍᐏᐣ -654 -▁ᓯᒁᐘᑭᓂᑲᐣ -655 -awîtimihk -656 -▁nitawâpi -657 -askatchewan -658 -âpihtâkosis -659 -▁montagnais -660 -▁communities -661 -▁nēhiyawēwin -662 -▁nīhithawīwin -663 -▁wayawîtimihk -664 -ds -665 -fa -666 -gu -667 -hc -668 -io -669 -ip -670 -iv -671 -lb -672 -mp -673 -ps -674 -qu -675 -uk -676 -uw -677 -va -678 -ve -679 -ya -680 -ây -681 -êy -682 -āc -683 -ēm -684 -ēt -685 -ōt -686 -ᐅᓐ -687 -ᐟᒋ -688 -ᐟᒐ -689 -ᐤᐸ -690 -ᐦᐄ -691 -ᐧᐋ -692 -ᐸᐸ -693 -ᐸᒥ -694 -ᑕᓂ -695 -ᑖᐤ -696 -ᑭᒋ -697 -ᑯᐣ -698 -ᑲᔮ -699 -ᒣᐤ -700 -ᒣᔅ -701 -ᒧᐏ -702 -ᔭᐠ -703 -ᔭᐤ -704 -ᔮᔪ -705 -▁u -706 -▁z -707 -▁ᐏ -708 -▁ᐑ -709 -▁ᐹ -710 -▁ᒐ -711 -▁ᓅ -712 -▁ᓈ -713 -ach -714 -ain -715 -ann -716 -ash -717 -cas -718 -ers -719 -ewa -720 -ige -721 -ihc -722 -ikā -723 -ion -724 -kin -725 -kīs -726 -mpy -727 -nic -728 -oba -729 -osâ -730 -pat -731 -pin -732 -rio -733 -ses -734 -ski -735 -vin -736 -wâs -737 -áká -738 -êst -739 -êwi -740 -îwi -741 -īhk -742 -ᐋᐧᐠ -743 -ᐎᓂᐠ -744 -ᐟᒋᐠ -745 -ᐟᒐᓂ -746 -ᐢᑳᐧ -747 -ᑧᑭᐌ -748 -ᑭᐦᐅ -749 -ᑯᐟᐨ -750 -ᑲᔮᓰ -751 -ᒧᐅᓐ -752 -ᓂᐦᑳ -753 -ᔥᑎᑯ -754 -▁ac -755 -▁ed -756 -▁fr -757 -▁il -758 -▁iš -759 -▁mw -760 -▁os -761 -▁sh -762 -▁sk -763 -▁ti -764 -▁ᐅᑌ -765 -▁ᐅᑎ -766 -▁ᐅᒋ -767 -▁ᐊᑭ -768 -▁ᐋᐸ -769 -▁ᐯᐃ -770 -▁ᐱᒥ -771 -▁ᒣᑲ -772 -▁ᒥᓇ -773 -▁ᒫᑲ -774 -▁ᓇᑕ -775 -amik -776 -angu -777 -ence -778 -eren -779 -ewak -780 -ical -781 -ikât -782 -inoh -783 -isik -784 -isit -785 -ites -786 -iwāt -787 -kask 
-788 -kīsk -789 -nahk -790 -niwa -791 -olit -792 -omin -793 -oose -794 -osik -795 -pisk -796 -skaw -797 -tahi -798 -tihk -799 -timw -800 -wach -801 -wina -802 -âcik -803 -ânak -804 -âwak -805 -âwew -806 -ēcik -807 -ᐃᐧᓯᐤ -808 -ᐢᑯᑌᐤ -809 -ᐦᑯᒪᓇ -810 -ᐸᐸᐤᐸ -811 -ᑭᐊᐧᐠ -812 -ᔨᓂᐘᐠ -813 -▁ekâ -814 -▁eth -815 -▁itē -816 -▁kîw -817 -▁lab -818 -▁māt -819 -▁res -820 -▁sam -821 -▁sel -822 -▁sim -823 -▁ter -824 -▁wik -825 -▁wor -826 -▁ôma -827 -▁ᐃᐧᐊ -828 -▁ᐅᐦᐃ -829 -▁ᑭᐢᑫ -830 -▁ᒥᐨᑕ -831 -▁ᒥᓂᑯ -832 -▁ᓇᒣᔅ -833 -atcik -834 -cikan -835 -ected -836 -imuun -837 -inawê -838 -inihk -839 -itcik -840 -itwâw -841 -opîwi -842 -ories -843 -rador -844 -right -845 -tario -846 -wampy -847 -wâkam -848 -wânâs -849 -îpihk -850 -ᓂᐦᑳᑌᒡ -851 -▁ayis -852 -▁ilil -853 -▁iyuw -854 -▁kask -855 -▁mihk -856 -▁māce -857 -▁nešt -858 -▁wâsk -859 -▁êkwa -860 -▁ᐃᔨᔨᐤ -861 -▁ᐊᐧᐁᓰ -862 -▁ᐊᑲᔭᓯ -863 -▁ᐋᐱᐦᑖ -864 -▁ᐸᐦᑭᓯ -865 -anguag -866 -eštimw -867 -iginal -868 -ikamik -869 -ikâtew -870 -inoham -871 -niwahk -872 -owiyâs -873 -tahkik -874 -takwâk -875 -wawach -876 -ânisâw -877 -âwisiw -878 -▁glenn -879 -▁kihci -880 -▁manit -881 -▁paige -882 -▁âskaw -883 -▁ᐃᔨᒧᐅᓐ -884 -▁ᐅᓂᐟᒐᓂ -885 -▁ᑌᐸᐸᐤᐸ -886 -▁ᑲᓇᑕᐦᐠ -887 -▁ᒪᐢᑯᑌᐤ -888 -erences -889 -ikamach -890 -orthern -891 -ᑭᐢᑫᐧᐃᐧᐣ -892 -ᑭᐦᐅᐁᐧᐃᐣ -893 -▁center -894 -▁ethnic -895 -▁iskwew -896 -▁provin -897 -▁sample -898 -▁territ -899 -▁ᐃᐧᑭᐊᐧᐠ -900 -▁ᐊᔨᒧᐧᐃᓐ -901 -▁ᓀᐦᐃᔭᐁᐧ -902 -▁ᓇᒣᔅᑧᑭᐌ -903 -original -904 -▁mistahi -905 -▁ontario -906 -▁pânisâw -907 -▁âtayôhk -908 -▁ᐁᐧᒥᐢᑎᑯᓯ -909 -kicihikan -910 -ákániwahk -911 -▁labrador -912 -▁manitoba -913 -▁selected -914 -▁syllabar -915 -âtâkaniwiw -916 -âwewikamik -917 -îwahikanak -918 -▁nehiyawak -919 -▁pâsikâtew -920 -âkanihkêwak -921 -▁ililîmowin -922 -▁references -923 -▁territories -924 -wawachikamach -925 -▁ayisiyiniwak -926 -▁saskatchewan -927 -âpihtâkosisânak -928 -'← -929 -). 
-930 -bi -931 -bo -932 -ex -933 -ff -934 -ge -935 -hu -936 -id -937 -ir -938 -iz -939 -ka -940 -kn -941 -ly -942 -mô -943 -oo -944 -pl -945 -rd -946 -sw -947 -to -948 -tâ -949 -tā -950 -ut -951 -wê -952 -ác -953 -âm -954 -êk -955 -ôp -956 -ûn -957 -ās -958 -āy -959 -īk -960 -ōh -961 -ᐁᐁ -962 -ᐁᔪ -963 -ᐅᑯ -964 -ᐅᒡ -965 -ᐊᓯ -966 -ᐋᐣ -967 -ᐌᐎ -968 -ᐌᓐ -969 -ᐎᓯ -970 -ᐑᑎ -971 -ᐚᐱ -972 -ᐟᒉ -973 -ᐢᑐ -974 -ᐢᒋ -975 -ᐣᓯ -976 -ᐦᐨ -977 -ᐦᑲ -978 -ᐦᔪ -979 -ᐯᔨ -980 -ᐱᐣ -981 -ᐱᐩ -982 -ᐱᔅ -983 -ᐱᔖ -984 -ᐲᔨ -985 -ᐴᓵ -986 -ᑌᐦ -987 -ᑎᐣ -988 -ᑎᒻ -989 -ᑎᓐ -990 -ᑎᓰ -991 -ᑎᓱ -992 -ᑐᒥ -993 -ᑕᒻ -994 -ᑕᓇ -995 -ᑖᕁ -996 -ᑭᐱ -997 -ᑭᓄ -998 -ᑭᔨ -999 -ᑮᓯ -1000 -ᑯᒋ -1001 -ᑰᐤ -1002 -ᑲᐠ -1003 -ᑲᓄ -1004 -ᑲᓇ -1005 -ᑳᐣ -1006 -ᑵᐠ -1007 -ᑿᐤ -1008 -ᒀᓐ -1009 -ᒋᓲ -1010 -ᒌᔑ -1011 -ᒡᐦ -1012 -ᒥᓱ -1013 -ᒥᕽ -1014 -ᒨᓐ -1015 -ᒪᑯ -1016 -ᒫᐤ -1017 -ᓀᐤ -1018 -ᓂᒥ -1019 -ᓇᐗ -1020 -ᓖᒧ -1021 -ᓯᓂ -1022 -ᓯᓇ -1023 -ᓯᓭ -1024 -ᓴᐦ -1025 -ᓴᕀ -1026 -ᔑᒡ -1027 -ᔨᔫ -1028 -ᔪᐤ -1029 -ᔭᐨ -1030 -ᖬᐑ -1031 -▁' -1032 -▁g -1033 -▁x -1034 -▁î -1035 -▁ᐎ -1036 -▁ᐤ -1037 -▁ᐧ -1038 -▁ᐴ -1039 -▁ᑏ -1040 -▁ᑐ -1041 -▁ᑑ -1042 -▁ᑖ -1043 -▁ᑰ -1044 -▁ᒍ -1045 -▁ᒎ -1046 -▁ᒑ -1047 -▁ᓄ -1048 -▁ᓚ -1049 -▁ᓭ -1050 -▁ᓱ -1051 -▁ᓲ -1052 -▁ᔕ -1053 -▁ᔥ -1054 -▁ᔦ -1055 -▁ᔨ -1056 -▁ᔩ -1057 -▁ᔪ -1058 -▁ᔫ -1059 -▁ᔭ -1060 -▁ᔮ -1061 -aim -1062 -ako -1063 -asi -1064 -ate -1065 -cak -1066 -can -1067 -cho -1068 -cil -1069 -ecl -1070 -ecî -1071 -ekw -1072 -els -1073 -erg -1074 -ics -1075 -ito -1076 -ity -1077 -kac -1078 -kim -1079 -lac -1080 -mar -1081 -mât -1082 -môs -1083 -nîp -1084 -ock -1085 -ohc -1086 -ohn -1087 -osh -1088 -out -1089 -pak -1090 -per -1091 -pim -1092 -piw -1093 -pon -1094 -pôs -1095 -ral -1096 -shi -1097 -sim -1098 -sou -1099 -sta -1100 -upi -1101 -use -1102 -wap -1103 -wec -1104 -wiy -1105 -êwo -1106 -îht -1107 -îna -1108 -îso -1109 -āhk -1110 -šiš -1111 -ᐁᐁ᙮ -1112 -ᐁᔪᑯ -1113 -ᐃᐧᑭ -1114 -ᐃᐧᒧ -1115 -ᐅᑯᓐ -1116 -ᐊᐧᐤ -1117 -ᐊᐧᐱ -1118 -ᐊᓇᐠ -1119 -ᐋᐧᐤ -1120 -ᐌᐎᐣ -1121 -ᐌᓐ᙮ -1122 -ᐢᑌᓄ -1123 -ᐢᑯᓯ -1124 -ᐢᑲᐤ -1125 -ᐢᒋᑫ -1126 -ᐦᐃᑭ -1127 -ᐦᐠ᙮ -1128 -ᐦᐱᐦ -1129 -ᐦᑲᓐ 
-1130 -ᐧᐁᒥ -1131 -ᐧᐁᔨ -1132 -ᐧᐃᐤ -1133 -ᐧᐃᐸ -1134 -ᐧᐊᒡ -1135 -ᐧᑖᐤ -1136 -ᐸᒣᐤ -1137 -ᑐᒥᑕ -1138 -ᑕᐦᐃ -1139 -ᑕᓂᓴ -1140 -ᑮᓯᓵ -1141 -ᑯᓯᓴ -1142 -ᑲᐧᓂ -1143 -ᑲᑌᐠ -1144 -ᒋᓲ᙮ -1145 -ᒋᕒᐃ -1146 -ᒥᐦᐊ -1147 -ᒧᐏᓂ -1148 -ᒨᑖᕁ -1149 -ᒫᑎᓰ -1150 -ᓂᓂᐤ -1151 -ᓄᐦᐨ -1152 -ᓇᐊᐧ -1153 -ᓰᐱᐦ -1154 -ᔥᑌᒡ -1155 -ᔥᑕᒻ -1156 -ᔨᐦᑕ -1157 -ᔭᐑᑎ -1158 -▁ap -1159 -▁bo -1160 -▁bu -1161 -▁gr -1162 -▁ht -1163 -▁mb -1164 -▁ni -1165 -▁nô -1166 -▁oc -1167 -▁oh -1168 -▁ot -1169 -▁to -1170 -▁tā -1171 -▁up -1172 -▁âp -1173 -▁ēk -1174 -▁ᐁᑮ -1175 -▁ᐁᑯ -1176 -▁ᐁᑾ -1177 -▁ᐁᔥ -1178 -▁ᐃᑣ -1179 -▁ᐅᑳ -1180 -▁ᐅᓴ -1181 -▁ᐅᓵ -1182 -▁ᐆᒪ -1183 -▁ᐊᒻ -1184 -▁ᐘᐱ -1185 -▁ᐦᐁ -1186 -▁ᐦᐃ -1187 -▁ᐦᐄ -1188 -▁ᐦᐅ -1189 -▁ᐦᐆ -1190 -▁ᐦᐊ -1191 -▁ᐦᐋ -1192 -▁ᐱᐦ -1193 -▁ᐳᓂ -1194 -▁ᐸᓵ -1195 -▁ᑭᓄ -1196 -▁ᑲᐅ -1197 -▁ᑲᓂ -1198 -▁ᒣᑿ -1199 -▁ᒥᐦ -1200 -▁ᒥᔮ -1201 -▁ᒦᓇ -1202 -▁ᒪᒋ -1203 -▁ᒪᓂ -1204 -▁ᒫᑕ -1205 -▁ᓂᑐ -1206 -▁ᓇᐯ -1207 -▁ᓵᑳ -1208 -acik -1209 -agan -1210 -ains -1211 -amih -1212 -amil -1213 -anin -1214 -asin -1215 -asiw -1216 -atim -1217 -awāt -1218 -ayâs -1219 -book -1220 -case -1221 -dies -1222 -ench -1223 -erta -1224 -etin -1225 -etsi -1226 -face -1227 -hkek -1228 -hkân -1229 -imin -1230 -imit -1231 -imot -1232 -ināw -1233 -iped -1234 -isto -1235 -itik -1236 -iwin -1237 -iwāw -1238 -iyik -1239 -iyîk -1240 -kned -1241 -kwan -1242 -kway -1243 -main -1244 -natā -1245 -oods -1246 -otam -1247 -oups -1248 -pahk -1249 -pask -1250 -pâhk -1251 -skēk -1252 -stēn -1253 -tics -1254 -uman -1255 -upus -1256 -used -1257 -âpem -1258 -îpeh -1259 -îpis -1260 -ôpâk -1261 -ôtin -1262 -ēkok -1263 -ᐁᐧᐊᐧ -1264 -ᐃᐧᐦᔪ -1265 -ᐃᐧᔨᓂ -1266 -ᐊᐧᐠ᙮ -1267 -ᐊᑲᔭᓯ -1268 -ᐊᓯᓯᒼ -1269 -ᐋᑲᔮᓰ -1270 -ᐏᓯᓇᐤ -1271 -ᐢᑎᑳᐣ -1272 -ᐢᑯᒥᐣ -1273 -ᐢᑯᓯᐦ -1274 -ᐦᐃᖬᐑ -1275 -ᐦᐱᐦᑲ -1276 -ᐦᑕᑯᓯ -1277 -ᐦᑖᑭᔨ -1278 -ᐦᑖᓱᐣ -1279 -ᐯᔨᒥᓱ -1280 -ᐱᐦᑰᐤ -1281 -ᐱᐦᑵᐠ -1282 -ᐱᔖᔑᒡ -1283 -ᐲᔨᓯᒼ -1284 -ᑲᐢᑭᑌ -1285 -ᒌᔑᑳᐤ -1286 -ᓂᒥᑎᓱ -1287 -ᓅᐦᒡ᙮ -1288 -ᓇᐌᐎᐣ -1289 -ᓇᐗᐸᒥ -1290 -ᓖᒧᐎᓐ -1291 -ᓯᓂᐘᐠ -1292 -ᓯᓭᐘᐠ -1293 -ᔥᑎᑯᔒ -1294 -ᔭᒥᐦᐊ -1295 -▁alb -1296 -▁ava -1297 -▁col -1298 -▁iht -1299 -▁isi -1300 -▁ita -1301 -▁kin -1302 -▁leh 
-1303 -▁mah -1304 -▁mok -1305 -▁new -1306 -▁nêh -1307 -▁ota -1308 -▁pêy -1309 -▁sci -1310 -▁sîw -1311 -▁tip -1312 -▁vow -1313 -▁wes -1314 -▁wīk -1315 -▁âpa -1316 -▁ᐁᑌᐦ -1317 -▁ᐃᔪᐤ -1318 -▁ᐄᔨᔫ -1319 -▁ᐆᐦᒋ -1320 -▁ᐊᐦᐳ -1321 -▁ᐊᐦᐴ -1322 -▁ᐊᐧᐱ -1323 -▁ᐊᑕᐦ -1324 -▁ᐊᓂᒌ -1325 -▁ᐊᔨᒥ -1326 -▁ᐊᔨᓯ -1327 -▁ᐱᒥᐩ -1328 -▁ᐱᓯᒼ -1329 -▁ᑭᐦᒋ -1330 -▁ᑳᓇᑕ -1331 -▁ᒉᒀᓐ -1332 -▁ᒥᑕᑐ -1333 -▁ᒥᕒᐅ -1334 -▁ᒫᐅᒡ -1335 -▁ᓂᐱᐩ -1336 -▁ᓂᔮᔪ -1337 -▁ᓇᐦᑲ -1338 -▁ᓰᐱᐦ -1339 -▁ᓵᓴᕀ -1340 -aimûn -1341 -asibi -1342 -cakin -1343 -capān -1344 -cewâk -1345 -chool -1346 -eclar -1347 -ights -1348 -ihcik -1349 -ihtak -1350 -ihtaw -1351 -ihtâk -1352 -imate -1353 -isiwê -1354 -ition -1355 -itâsk -1356 -ivers -1357 -kimâw -1358 -kinos -1359 -kisik -1360 -nakac -1361 -nipah -1362 -nipat -1363 -nîpin -1364 -ostos -1365 -otcik -1366 -simôt -1367 -sîpiy -1368 -tamik -1369 -ticik -1370 -tural -1371 -umbia -1372 -wayân -1373 -wâwân -1374 -âtahk -1375 -êhohk -1376 -îhkân -1377 -ᐢᑌᓄᐦᐠ -1378 -ᐢᑭᐦᐠ᙮ -1379 -ᐢᑳᐧᔭᐨ -1380 -ᐦᑕᑯᓯᐤ -1381 -ᑐᒥᑕᓇᐤ -1382 -ᑮᓯᓵᒋᐠ -1383 -ᓈᓅᐦᒡ᙮ -1384 -ᔭᐑᑎᒥᕽ -1385 -▁bann -1386 -▁blac -1387 -▁brit -1388 -▁cecî -1389 -▁char -1390 -▁chis -1391 -▁coun -1392 -▁data -1393 -▁htos -1394 -▁inin -1395 -▁itēw -1396 -▁iyin -1397 -▁john -1398 -▁kikî -1399 -▁kikī -1400 -▁kotā -1401 -▁mist -1402 -▁mwác -1403 -▁mîna -1404 -▁nisk -1405 -▁noun -1406 -▁nêwo -1407 -▁nîso -1408 -▁oski -1409 -▁wask -1410 -▁with -1411 -▁word -1412 -▁ōhci -1413 -▁ᐁᐅᑯᓐ -1414 -▁ᐃᐢᑫᐧ -1415 -▁ᐅᑕᓂᓴ -1416 -▁ᐅᑯᓯᓴ -1417 -▁ᐊᐌᓐ᙮ -1418 -▁ᐊᓄᐦᐨ -1419 -▁ᐊᔨᒨᓐ -1420 -▁ᐋᐱᐦᑕ -1421 -▁ᐋᐸᑎᓐ -1422 -▁ᐯᐃᑿᐤ -1423 -▁ᐱᒫᑎᓰ -1424 -▁ᑫᑯᐟᐨ -1425 -▁ᑲᐢᑭᑌ -1426 -▁ᒥᑕᑕᐦ -1427 -▁ᒥᓂᑯᔥ -1428 -▁ᓂᑕᐚᐱ -1429 -▁ᓃᔥᑕᒻ -1430 -amites -1431 -aniwit -1432 -ashish -1433 -asinâs -1434 -asiwât -1435 -askapi -1436 -ihkwâw -1437 -ikisiw -1438 -ikotik -1439 -imitis -1440 -iwâcik -1441 -mation -1442 -osâwâw -1443 -otawâk -1444 -owinik -1445 -pēnahk -1446 -skēkot -1447 -âkayâs -1448 -âpihtâ -1449 -ᐃᐧᒧᐢᒋᑫ -1450 -ᐃᐧᔨᓂᐊᐧ -1451 -ᐊᐱᐦᑕᑯᓯ -1452 -ᐦᐃᐁᐧᐃᐣ -1453 -ᐦᐃᖬᐑᐏᐣ -1454 -ᐦᑖᑭᔨᒡᐦ -1455 -ᑭᒋᐱᐦᑵᐠ -1456 -ᑲᐧᓂᐃᐧᓇ -1457 
-ᓰᐱᐦᑳᐧᐤ -1458 -▁anite -1459 -▁askiy -1460 -▁askōt -1461 -▁atihk -1462 -▁atimw -1463 -▁awask -1464 -▁betsi -1465 -▁capān -1466 -▁famil -1467 -▁human -1468 -▁kitaw -1469 -▁lupus -1470 -▁masīh -1471 -▁moose -1472 -▁māceh -1473 -▁nikot -1474 -▁nisto -1475 -▁nēhin -1476 -▁okisk -1477 -▁pakit -1478 -▁polit -1479 -▁upper -1480 -▁ᐃᑣᓅᐦᒡ -1481 -▁ᐅᑌᓇᐊᐧ -1482 -▁ᐊᐧᐁᓰᔅ -1483 -▁ᐊᐱᔖᔑᒡ -1484 -▁ᐊᔨᒧᐎᓐ -1485 -▁ᐸᐊᓯᓯᒼ -1486 -▁ᑭᐢᑎᑳᐣ -1487 -▁ᑭᐢᑯᓯᐦ -1488 -▁ᑮᓯᓭᐘᐠ -1489 -▁ᑲᓇᐗᐸᒥ -1490 -▁ᒥᔮᐧᐃᐸ -1491 -▁ᒧᐦᑯᒪᓇ -1492 -▁ᒫᑕᐢᑌᐠ -1493 -▁ᓂᐱᐦᑰᐤ -1494 -▁ᓂᑐᐧᐁᔨ -1495 -▁ᓵᑳᐢᑌᐠ -1496 -aganish -1497 -amihêwi -1498 -cikêwak -1499 -eyihtam -1500 -ikamekw -1501 -ispayik -1502 -kaskite -1503 -nipatht -1504 -pahikan -1505 -upright -1506 -wâcakin -1507 -wâcikan -1508 -wâkamîw -1509 -îpehtak -1510 -ᐃᐧᐦᔪᐃᐧᐣ -1511 -ᐋᑲᔮᓰᒧᐏᐣ -1512 -ᐦᐱᐦᑲᓯᑲᐣ -1513 -ᐧᐁᒥᔥᑎᑯᔒ -1514 -ᐯᔨᒥᓱᐊᐧᐠ -1515 -ᓂᒥᑎᓱᐎᓂᐠ -1516 -ᔭᒥᐦᐊᐃᐧᐣ -1517 -▁ayisin -1518 -▁declar -1519 -▁french -1520 -▁groups -1521 -▁ininiw -1522 -▁kānatā -1523 -▁mahîhk -1524 -▁misiwe -1525 -▁namêst -1526 -▁nikisk -1527 -▁nâpewa -1528 -▁okimâw -1529 -▁pahkek -1530 -▁rights -1531 -▁school -1532 -▁swampy -1533 -▁sîwîht -1534 -▁vowels -1535 -▁âpacih -1536 -▁ᐃᐢᑳᐧᔭᐨ -1537 -▁ᐃᓕᓖᒧᐎᓐ -1538 -▁ᐅᓂᐟᒐᓂᒐ -1539 -▁ᐅᓂᐲᔨᓯᒼ -1540 -▁ᐊᑭᐦᑖᓱᐣ -1541 -▁ᐊᑲᔭᓯᐊᐧ -1542 -▁ᐘᔭᐑᑎᒥᕽ -1543 -▁ᑭᐢᑫᔨᐦᑕ -1544 -▁ᓀᐦᐃᔭᐊᐧ -1545 -▁ᓇᐯᐊᐧᐠ᙮ -1546 -▁ᓇᑕᐁᐧᐊᐧ -1547 -▁ᓰᐱᐦᑯᓯᐤ -1548 -▁ᓴᑲᐦᐃᑲᐣ -1549 -anguages -1550 -etinôpâk -1551 -itâskost -1552 -mâtinawê -1553 -nakacīhk -1554 -northern -1555 -simôtâhk -1556 -southern -1557 -takwâkin -1558 -âpihtâwi -1559 -ēnimitis -1560 -ᐊᑲᔭᓯᒧᐃᐧᐣ -1561 -▁akwâwân -1562 -▁alberta -1563 -▁awahkân -1564 -▁bannock -1565 -▁british -1566 -▁council -1567 -▁eastern -1568 -▁iyimuun -1569 -▁išinihk -1570 -▁naskapi -1571 -▁natawah -1572 -▁science -1573 -▁wikiped -1574 -▁writing -1575 -▁wîcewâk -1576 -▁wīkiwak -1577 -▁âpihtaw -1578 -▁ᐃᐢᑫᐧᐊᐧᐠ -1579 -▁ᐃᔑᓂᐦᑳᑌᒡ -1580 -▁ᐸᐦᑭᓯᒨᑖᕁ -1581 -▁ᓀᐦᐃᓇᐌᐎᐣ -1582 -▁ᓃᐦᐃᖬᐑᐏᐣ -1583 -▁ᓴᑲᐢᑌᓄᐦᐠ -1584 -iskinoham -1585 -iyîkopîwi -1586 -kiskatcik -1587 -osikwânâs -1588 -êhohkêwak -1589 -îpihkosiw 
-1590 -ᐃᐧᒧᐢᒋᑫᐃᐧᓇ -1591 -ᐊᐱᐦᑕᑯᓯᐊᓇᐠ -1592 -▁blackned -1593 -▁canadian -1594 -▁columbia -1595 -▁eastmain -1596 -▁kīskēkot -1597 -▁lehlueun -1598 -▁nehlueun -1599 -▁nikotwâs -1600 -▁ᐃᓯᐦᒋᑫᐃᐧᐣ -1601 -▁ᐃᓯᐦᒋᑫᐃᐧᓇ -1602 -▁ᐅᑭᐦᐅᐁᐧᐃᐣ -1603 -▁ᐋᐱᐦᑖᒌᔑᑳᐤ -1604 -▁ᐱᑭᐢᑫᐧᐃᐧᐣ -1605 -▁ᑭᐢᑯᓯᐦᐋᐧᐠ -1606 -anitotawâk -1607 -asinâsowin -1608 -isiwepahki -1609 -isiwêpahki -1610 -▁chisasibi -1611 -▁languages -1612 -▁maskisina -1613 -▁political -1614 -▁provinces -1615 -▁syllabary -1616 -▁syllabics -1617 -▁âtayôhkân -1618 -▁ᐁᐧᒥᐢᑎᑯᓯᐊᐧ -1619 -▁ᐅᑎᐯᔨᒥᓱᐊᐧᐠ -1620 -▁ᒥᑕᑕᐦᑐᒥᑕᓇᐤ -1621 -▁ᓀᐦᐃᔭᐁᐧᐃᐧᐣ -1622 -▁ᓇᐦᑲᐃᐧᔨᓂᐊᐧ -1623 -▁aboriginal -1624 -▁ᐊᒋᐦᑖᓱᓈᓅᐦᒡ᙮ -1625 -▁ᒧᐦᑯᒪᓇᐢᑭᐦᐠ᙮ -1626 -âkayâsîmowin -1627 -âtâwewikamik -1628 -▁betsiamites -1629 -▁declaration -1630 -▁kîwetinôpâk -1631 -▁mihkwâkamîw -1632 -▁nēhinawēwin -1633 -▁pimitâskost -1634 -▁pânisâwêwak -1635 -▁waskaganish -1636 -▁yîwahikanak -1637 -▁ᐅᒪᐃᐧᒧᐢᒋᑫᐃᐧᓇ -1638 -▁ᓂᑐᐧᐁᔨᐦᑖᑭᔨᒡᐦ -1639 -ēnimitisowinik -1640 -▁akwâwânihkêwak -1641 -▁âpacihâkaniwiw -1642 -nipathtákániwahk -1643 -▁kawawachikamach -1644 -▁namêstêhohkêwak -1645 -▁âpihtâkosisânak -1646 -*, -1647 -aš -1648 -ca -1649 -cr -1650 -dj -1651 -eô -1652 -gc -1653 -go -1654 -ja -1655 -má -1656 -of -1657 -ož -1658 -se -1659 -sî -1660 -sā -1661 -te -1662 -tk -1663 -ur -1664 -wd -1665 -we -1666 -wî -1667 -ám -1668 -êm -1669 -ēr -1670 -īm -1671 -še -1672 -ᎳᎩ -1673 -ᐃᐱ -1674 -ᐃᓄ -1675 -ᐃᔑ -1676 -ᐄᓂ -1677 -ᐅᐠ -1678 -ᐅᒋ -1679 -ᐊᐨ -1680 -ᐌᓂ -1681 -ᐎᑭ -1682 -ᐓᑭ -1683 -ᐚᐤ -1684 -ᐠᓯ -1685 -ᐦᐋ -1686 -ᐦᑭ -1687 -ᐧᑭ -1688 -ᐨᑫ -1689 -ᐯᑯ -1690 -ᐯᒃ -1691 -ᐯᔭ -1692 -ᐱᑭ -1693 -ᐱᒥ -1694 -ᐳᐣ -1695 -ᐳᑕ -1696 -ᐸᐣ -1697 -ᑌᑭ -1698 -ᑌ᙮ -1699 -îš -1700 -ôm -1701 -ᐏᐱ -1702 -ᐧᓭ -1703 -ᑎᐠ -1704 -ᑎᓇ -1705 -ᑎᓯ -1706 -ᑎᔭ -1707 -ᑐᐸ -1708 -ᑐᑦ -1709 -ᑐᓀ -1710 -ᑐᖅ -1711 -ᑑᐸ -1712 -ᑕᒧ -1713 -ᑕᒫ -1714 -ᑖᐧ -1715 -ᑫᐏ -1716 -ᑫᐤ -1717 -ᑫᑕ -1718 -ᑭᑫ -1719 -ᑭᓇ -1720 -ᑭ᙮ -1721 -ᑲᐨ -1722 -ᑲᒋ -1723 -ᑲᒥ -1724 -ᑳᒋ -1725 -ᑴᓯ -1726 -ᑾᐣ -1727 -ᒋᐘ -1728 -ᒋᔦ -1729 -ᒌᐤ -1730 -ᒑᔨ -1731 -ᒡᑋ -1732 -ᒧᒥ -1733 -ᒧᔭ -1734 -ᒨᒫ -1735 -ᒫᒋ -1736 -ᓀᐅ -1737 -ᓂᔥ -1738 -ᓄᐁ -1739 -ᓇᐨ 
-1740 -ᓇᑖ -1741 -ᓐᑌ -1742 -ᓗᒃ -1743 -ᓯᐌ -1744 -ᓯᐢ -1745 -ᓯᐣ -1746 -ᓯᒡ -1747 -ᓯᔅ -1748 -ᓴᐠ -1749 -ᔕᐠ -1750 -ᔦᔨ -1751 -ᔨᐣ -1752 -ᔨᓄ -1753 -ᔩᐤ -1754 -ᔭᐣ -1755 -▁: -1756 -▁; -1757 -▁Ꮳ -1758 -▁ᐒ -1759 -▁ᐔ -1760 -▁ᐕ -1761 -▁ᐙ -1762 -▁ᐟ -1763 -▁ᐠ -1764 -▁ᐢ -1765 -▁ᐣ -1766 -▁ᐨ -1767 -▁ᐩ -1768 -▁ᑉ -1769 -▁ᑊ -1770 -▁ᒃ -1771 -▁ᓐ -1772 -▁ᓕ -1773 -▁ᓘ -1774 -▁ᓛ -1775 -▁ᓬ -1776 -▁ᔅ -1777 -▁ᔐ -1778 -▁ᔓ -1779 -▁ᔾ -1780 -▁ᕃ -1781 -▁ᕆ -1782 -▁ᕇ -1783 -▁ᕈ -1784 -▁ᕉ -1785 -▁ᕌ -1786 -▁ᕐ -1787 -▁ᕒ -1788 -▁ᕓ -1789 -▁ᕘ -1790 -▁ᕚ -1791 -▁ᕞ -1792 -▁ᕢ -1793 -▁ᕤ -1794 -▁ᕥ -1795 -▁ᕦ -1796 -▁ᕧ -1797 -▁ᕽ -1798 -▁ᖧ -1799 -., -1800 -be -1801 -cí -1802 -jo -1803 -jé -1804 -ld -1805 -tō -1806 -ui -1807 -âh -1808 -ēp -1809 -ģī -1810 -ᑌᐠ -1811 -ᑎᐤ -1812 -ᑐᓇ -1813 -ᓇᑌ -1814 -ᔭᓐ -1815 -▁< -1816 -▁ᑦ -1817 -▁ᔑ -1818 -▁ᕖ -1819 -▁ᖩ -1820 -▁ᖬ -1821 -aki -1822 -ano -1823 -ans -1824 -atu -1825 -cit -1826 -ciw -1827 -cîy -1828 -dji -1829 -eci -1830 -ely -1831 -est -1832 -eui -1833 -hat -1834 -hin -1835 -hmá -1836 -hua -1837 -iar -1838 -ick -1839 -ika -1840 -ini -1841 -ins -1842 -iné -1843 -inā -1844 -isc -1845 -isi -1846 -iss -1847 -isê -1848 -iwe -1849 -iwn -1850 -iwî -1851 -iwā -1852 -kan -1853 -lfa -1854 -lis -1855 -mag -1856 -mic -1857 -māc -1858 -ohp -1859 -oke -1860 -oll -1861 -ona -1862 -oya -1863 -oám -1864 -pan -1865 -pli -1866 -poh -1867 -ruk -1868 -sām -1869 -tit -1870 -tum -1871 -tôm -1872 -ujé -1873 -ult -1874 -vsk -1875 -wac -1876 -wēt -1877 -xof -1878 -áhk -1879 -îki -1880 -îsi -1881 -āya -1882 -ēna -1883 -ēni -1884 -ēwi -1885 -ōsk -1886 -ᐃᑑᐸ -1887 -ᐃᓄᒃ -1888 -ᐃᔭᓐ -1889 -ᐊᐧᐣ -1890 -ᐊᑎᒻ -1891 -ᐋᐸᐣ -1892 -ᐋᔭᐤ -1893 -ᐍᑎᐣ -1894 -ᐎᑭᐟ -1895 -ᐎᓯᐎ -1896 -ᐎᓯᐟ -1897 -ᐏᐟᐨ -1898 -ᐗᑭᐱ -1899 -ᐁᐤ -1900 -ᑎᒽ -1901 -ᓇᓱ -1902 -ᔅᒌ -1903 -ᔅᒡ -1904 -ᔮᐤ -1905 -▁ᕕ -1906 -▁ᖨ -1907 -anē -1908 -ēta -1909 -ᐘᒌᐤ -1910 -ᐟᐨ᙮ -1911 -ᐠᓯᑯ -1912 -ᐢᑐᐢ -1913 -ᐢᑐᑰ -1914 -ᐣᒋᐠ -1915 -ᐦᐁᐤ -1916 -ᐧᐴᓵ -1917 -ᐧᑭᐣ -1918 -ᐧᑳᐤ -1919 -ᐨᐱᐣ -1920 -ᐯᔭᒄ -1921 -ᐱᒥᐦ -1922 -ᐳᓂᐠ -1923 -ᑎᐱᔅ -1924 -ᑎᑐᑦ -1925 -ᑎᓇᒪ -1926 -ᑐᓀᓂ -1927 -ᑕᐦᐊ -1928 -ᑕᒧᐎ -1929 
-ᑕᒫᑐ -1930 -ᑖᐧᓯ -1931 -ᑭᓇᓭ -1932 -ᑯᒋᑲ -1933 -ᑲᓄ᙮ -1934 -ᑲᓇᐣ -1935 -ᑲᓇᓐ -1936 -ᑲᔨᐣ -1937 -ᑴᓯᑐ -1938 -ᒋᐘ᙮ -1939 -ᒑᐱᔅ -1940 -ᒑᔨᕁ -1941 -ᒡᑋᐃ -1942 -ᒥᔭᐤ -1943 -ᒦᐧᐃ -1944 -ᒧᒥᒋ -1945 -ᒨᐧᓭ -1946 -ᒪᑲᓂ -1947 -ᓀᐤ᙮ -1948 -ᓂᑲᑌ -1949 -ᓂᓴᐠ -1950 -ᓂᔥᑐ -1951 -ᓂᔮᔪ -1952 -ᓄᑕᓇ -1953 -ᓇᓀᐤ -1954 -ᓈᐃᐧ -1955 -ᓈᓀᐅ -1956 -ᓐᑌᕇ -1957 -ᓭᐁᐧ -1958 -ᔨᒨᒫ -1959 -ᔭᐚᐤ -1960 -ᔭᐦᐋ -1961 -▁af -1962 -▁am -1963 -▁be -1964 -▁ca -1965 -▁cl -1966 -▁cī -1967 -▁hk -1968 -▁iw -1969 -▁ke -1970 -▁me -1971 -▁mu -1972 -▁mí -1973 -▁ne -1974 -▁nl -1975 -▁nt -1976 -▁nâ -1977 -▁ob -1978 -▁oy -1979 -▁pl -1980 -▁pp -1981 -▁se -1982 -▁ts -1983 -▁wh -1984 -▁āc -1985 -▁ās -1986 -▁še -1987 -▁ᐁᐘ -1988 -▁ᐁᐱ -1989 -▁ᐁᔑ -1990 -▁ᐅᐱ -1991 -▁ᐊᑲ -1992 -▁ᐊᒣ -1993 -▁ᐊᒥ -1994 -▁ᐊᓅ -1995 -▁ᐊᔭ -1996 -▁ᐑᑭ -1997 -▁ᐧᐋ -1998 -▁ᐧᒣ -1999 -oe -2000 -ᐃᐢ -2001 -ᑰᓇ -2002 -ᒋᒫ -2003 -ᓰᖂ -2004 -▁, -2005 -▁ᐍ -2006 -▁ᒼ -2007 -▁ᓓ -2008 -▁ᓗ -2009 -▁ᔒ -2010 -▁ᖪ -2011 -ole -2012 -ors -2013 -ᐊᐧᒋ -2014 -ᐦᑰᓇ -2015 -ᐸᐧᐋ -2016 -ᑲᐧᐤ -2017 -ᑲᓄᐟ -2018 -ᒃᓰᖂ -2019 -ᓂᐧᐊ -2020 -▁ᐁᑭ -2021 -▁ᐎᒋ -2022 -▁ᐱᑳ -2023 -▁ᐸᑭ -2024 -▁ᐹᔅ -2025 -▁ᑲᐃ -2026 -▁ᒉᐱ -2027 -▁ᒦᒎ -2028 -▁ᒫᓇ -2029 -▁ᓂᓂ -2030 -▁ᓇᐨ -2031 -▁ᓚᐯ -2032 -▁ᓴᓯ -2033 -▁ᓴᖑ -2034 -akie -2035 -ames -2036 -amin -2037 -amêk -2038 -amēr -2039 -apiy -2040 -atun -2041 -awîy -2042 -awāy -2043 -cess -2044 -chem -2045 -cîhc -2046 -dian -2047 -emik -2048 -emin -2049 -enož -2050 -ergy -2051 -ersh -2052 -erģī -2053 -etic -2054 -ewin -2055 -ewit -2056 -ffir -2057 -goum -2058 -hipi -2059 -hkik -2060 -huat -2061 -igid -2062 -ihik -2063 -ikit -2064 -iliz -2065 -ingl -2066 -inwe -2067 -ipak -2068 -itah -2069 -iton -2070 -itêm -2071 -itōh -2072 -iwît -2073 -iyân -2074 -ištâ -2075 -kwēs -2076 -lfav -2077 -lish -2078 -mist -2079 -nask -2080 -nava -2081 -náhk -2082 -ohke -2083 -olid -2084 -otaw -2085 -otey -2086 -otōt -2087 -outu -2088 -poni -2089 -psis -2090 -pôsâ -2091 -quan -2092 -quec -2093 -sest -2094 -shiu -2095 -skwê -2096 -tana -2097 -tatâ -2098 -tcan -2099 -aq -2100 -âc -2101 -ᐠ᙮ -2102 -ᐹᐣ -2103 -ᑌᒋ -2104 -ᒁᐃ -2105 -ᒧᑫ -2106 -▁) 
-2107 -▁ᓖ -2108 -▁ᓪ -2109 -ckw -2110 -itē -2111 -oht -2112 -ēhk -2113 -ᐨᑕᑎ -2114 -ᑌᒋᓂ -2115 -ᒋᐦᐃ -2116 -ckwē -2117 -enam -2118 -esen -2119 -kīsi -2120 -orth -2121 -ougl -2122 -sion -2123 -tiff -2124 -tkad -2125 -towa -2126 -ulti -2127 -ults -2128 -unsw -2129 -vans -2130 -wapê -2131 -wâni -2132 -âcim -2133 -âpet -2134 -âpic -2135 -âyey -2136 -êhiy -2137 -îpon -2138 -ñupi -2139 -ôtse -2140 -ākēm -2141 -āsih -2142 -ēyih -2143 -ᐃᐧᐣ᙮ -2144 -ᐃᐧᔭᐠ -2145 -ᐃᐱᓗᒃ -2146 -ᐃᔑᐳᑕ -2147 -ᐄᓂᐯᑯ -2148 -ᐅᒋᐦᒋ -2149 -ᐊᐟᒋᐠ -2150 -ᐊᐧᐃᐢ -2151 -ᐊᐧᓯᐣ -2152 -ᐋᐧᐴᓵ -2153 -ᐎᓯᐎᓇ -2154 -ᐓᑭᕒᐃ -2155 -ᐟᒉᑎᐤ -2156 -ᐟᒋᐤ᙮ -2157 -ᐢᑭᓯᐤ -2158 -ᐢᑮᐦᐃ -2159 -ᐦᐄᑲᓂ -2160 -ᐦᐅᐱᑭ -2161 -ᐦᑳᔦᔨ -2162 -ᐧᐃᒋᒫ -2163 -ᐱᐦᐋᐣ -2164 -ᐱᐦᑭᒋ -2165 -ᐱᒥᑌᐤ -2166 -ᐸᐧᑳᐤ -2167 -ᐸᑲᑌᐠ -2168 -ᑌᑭᒪᑯ -2169 -ᑎᔥᑌᒡ -2170 -ᑐᓀᓂᒋ -2171 -ᑐᓇᐸᒥ -2172 -ᑑᒨᐧᓭ -2173 -ᑕᐦᐊᒻ -2174 -ᑖᐧᓯᐠ -2175 -ᑖᓂᒧᑫ -2176 -ᑫᐏᔭᐣ -2177 -ᑫᑕᐌᓂ -2178 -ᑭᐃᐧᐣ -2179 -ᑭᒧᐏᓂ -2180 -ᑭᓄᐘᐣ -2181 -ᑯᐎᓯᐟ -2182 -ᑯᓯᑖᐤ -2183 -ᑿᐟᒋᐠ -2184 -ᒁᐃᑲᐣ -2185 -ᒋᐟᐨ᙮ -2186 -ᒋᕒᐃᐤ -2187 -ᒥᓀᐤ᙮ -2188 -ᒦᒋᓲ᙮ -2189 -ᒧᐏᓇᐠ -2190 -ᒧᒥᒋᐣ -2191 -ᒪᑯᔭᐠ -2192 -ᓂᐧᐃᐤ -2193 -ᓂᑯᑎᐣ -2194 -ᓈᐦᑰᓇ -2195 -ᓐᑌᕇᐤ -2196 -ᓯᐢᑳᒋ -2197 -ᓯᒧᐏᓂ -2198 -ᓵᐋᐧᐤ -2199 -nû -2200 -wy -2201 -ôh -2202 -ōc -2203 -ᓃᔓ -2204 -ᔅᐧ -2205 -ᔭᕽ -2206 -▁ᐐ -2207 -▁ᐗ -2208 -▁ᒡ -2209 -▁ᕝ -2210 -ahc -2211 -iza -2212 -kiy -2213 -kāc -2214 -ory -2215 -osi -2216 -res -2217 -tiv -2218 -wês -2219 -ᐊᐢᑭ -2220 -ᑐᐏᐣ -2221 -▁tō -2222 -▁ᑲᐱ -2223 -▁ᒪᐢ -2224 -bles -2225 -ilab -2226 -iles -2227 -irâm -2228 -iter -2229 -kiyo -2230 -mark -2231 -môhk -2232 -nati -2233 -skic -2234 -tory -2235 -wyom -2236 -ôhce -2237 -ᔅᐧᑫᐤ -2238 -ᔅᒌᐧᐊ -2239 -ᔑᑕᐦᐃ -2240 -ᕒᐃᐠ᙮ -2241 -▁age -2242 -▁all -2243 -▁bay -2244 -▁cis -2245 -▁civ -2246 -▁csw -2247 -▁cwd -2248 -▁cēm -2249 -▁dem -2250 -▁fac -2251 -▁her -2252 -▁išm -2253 -▁kan -2254 -▁kic -2255 -▁kul -2256 -▁kîm -2257 -▁kîn -2258 -▁mac -2259 -▁moe -2260 -▁mor -2261 -▁nan -2262 -▁niw -2263 -▁nâm -2264 -▁nîš -2265 -▁nôš -2266 -▁osī -2267 -▁pis -2268 -▁pop -2269 -▁saš -2270 -▁sev -2271 -▁sex -2272 -▁spi -2273 -▁sur -2274 -▁tex -2275 -▁teô -2276 -▁tēp -2277 
-▁wāt -2278 -▁âta -2279 -▁šep -2280 -▁ᏣᎳᎩ -2281 -▁ᐁᐦᐄ -2282 -▁ᐁᑭᐱ -2283 -▁ᐁᔥᐃ -2284 -▁ᐁᔥᑭ -2285 -▁ᐃᐦᐃ -2286 -▁ᐃᓇᑌ -2287 -▁ᐃᔨᐤ -2288 -▁ᐅᑳᐦ -2289 -▁ᐅᓇᐨ -2290 -▁ᐅᓯᓇ -2291 -▁ᐅᓵᒼ -2292 -▁ᐊᑎᒻ -2293 -▁ᐊᑎᒽ -2294 -▁ᐊᕒᐅ -2295 -▁ᐏᕒᐃ -2296 -▁ᐘᐨᑫ -2297 -▁ᐱᓇᓱ -2298 -▁ᐲᐳᐣ -2299 -ub -2300 -▁ᐚ -2301 -▁ᔔ -2302 -▁ᕋ -2303 -ace -2304 -att -2305 -ciy -2306 -don -2307 -erb -2308 -išw -2309 -ond -2310 -onf -2311 -pos -2312 -son -2313 -ākā -2314 -ᐃᐧᓃ -2315 -ᐦᐃᑲ -2316 -ᐦᐃᓇ -2317 -▁hi -2318 -▁ᒦᓐ -2319 -awēw -2320 -batt -2321 -ciyê -2322 -donn -2323 -eder -2324 -ican -2325 -kerb -2326 -ower -2327 -ᑭᓂᔭᐤ -2328 -ᒋᕒᐃᐘ -2329 -ᓇᐃᑑᐸ -2330 -▁cul -2331 -▁kwâ -2332 -▁kây -2333 -▁pub -2334 -▁ᐊᐧᐃ -2335 -▁ᐸᓵᐣ -2336 -▁ᐸᔅᒡ -2337 -▁ᑌᑲᒋ -2338 -▁ᑕᐣᓯ -2339 -▁ᑭᐟᒋ -2340 -▁ᑭᑭᓄ -2341 -▁ᑯᐯᒃ -2342 -▁ᑳᑭᑫ -2343 -▁ᑳᓇᑖ -2344 -▁ᒥᓯᐌ -2345 -▁ᒨᓯᔅ -2346 -▁ᒪᐟᒋ -2347 -▁ᒪᐦᐃ -2348 -▁ᒪᐦᐄ -2349 -▁ᒪᑲᐠ -2350 -▁ᒪᑲᑌ -2351 -▁ᓂᐦᑭ -2352 -▁ᓃᐱᐣ -2353 -▁ᓅᐦᒋ -2354 -▁ᓇᑕᐤ -2355 -▁ᓇᒧᔭ -2356 -▁ᓴᓯᑭ -2357 -▁ᔥᑭᓄ -2358 -allis -2359 -apmag -2360 -ashat -2361 -astēs -2362 -ative -2363 -aykan -2364 -canis -2365 -cihot -2366 -ckerb -2367 -cíhki -2368 -dians -2369 -econd -2370 -ectiv -2371 -ehona -2372 -ekosh -2373 -erson -2374 -eshat -2375 -esteh -2376 -euiat -2377 -eyako -2378 -eyenn -2379 -hmáht -2380 -huatl -2381 -iaris -2382 -ikišw -2383 -ikotā -2384 -ikāhk -2385 -imuve -2386 -ingle -2387 -iscus -2388 -isina -2389 -iskoc -2390 -isîsi -2391 -itors -2392 -itowi -2393 -iwihk -2394 -iwniw -2395 -iyahk -2396 -izaad -2397 -iñupi -2398 -išihc -2399 -îy -2400 -ᐁᐟ -2401 -▁ð -2402 -▁ᕠ -2403 -▁ᖫ -2404 -ala -2405 -ali -2406 -eig -2407 -for -2408 -ias -2409 -mîy -2410 -una -2411 -ᐁᐟᐦ -2412 -ᐊᑲᐧ -2413 -ᐏᑯᓯ -2414 -ᐦᑖᐏ -2415 -ᑖᐤ᙮ -2416 -ᓄᑌᐤ -2417 -ᓴᐦᐊ -2418 -▁la -2419 -▁le -2420 -▁ᑌᐟ -2421 -▁ᑲᑕ -2422 -asîs -2423 -ecik -2424 -eign -2425 -hkom -2426 -ihti -2427 -itam -2428 -iyic -2429 -mali -2430 -osal -2431 -âwih -2432 -āwak -2433 -ᐏᐱᓄᐁ -2434 -ᐏᑯᓯᓵ -2435 -▁piy -2436 -▁wiy -2437 -▁ᐊᐦᒐ -2438 -▁ᐯᔭᐠ -2439 -▁ᓂᐦᑖ -2440 -kihci -2441 -kinēh -2442 -kinēy -2443 -kitik -2444 
-lexof -2445 -micin -2446 -moose -2447 -môsos -2448 -occas -2449 -ohcin -2450 -ohtey -2451 -omahc -2452 -ominâ -2453 -opask -2454 -oshon -2455 -ospin -2456 -osâwa -2457 -otahc -2458 -otamë -2459 -ounds -2460 -panik -2461 -piney -2462 -pinâw -2463 -piwēh -2464 -plied -2465 -pâhko -2466 -senož -2467 -skîhk -2468 -stēna -2469 -tawin -2470 -tepak -2471 -ticle -2472 -titut -2473 -tiwak -2474 -tōtam -2475 -udies -2476 -wapês -2477 -watôm -2478 -wesîs -2479 -wiyak -2480 -woods -2481 -wâson -2482 -wâwak -2483 -wâšiš -2484 -wîpis -2485 -wētin -2486 -ânisi -2487 -âniwa -2488 -âyiwa -2489 -îmiwa -2490 -ôhcet -2491 -ēhcik -2492 -ētisk -2493 -ētwin -2494 -ōcāhk -2495 -ōhtam -2496 -šišit -2497 -ᐁᐧᐋᔭᐤ -2498 -ᐅᓵᐋᐧᐤ -2499 -ñe -2500 -ᒋᐯ -2501 -▁ᕪ -2502 -akē -2503 -omw -2504 -ēko -2505 -ᐊᐧᐨ -2506 -ᑕᐦᐠ -2507 -▁br -2508 -▁ᐱᑭ -2509 -▁ᒌᑳ -2510 -▁ᒥᒋ -2511 -cree -2512 -ikos -2513 -itak -2514 -ᐧᒥᑕᓇ -2515 -ᑲᓂᓂᐤ -2516 -ᒋᐯᐦᑕ -2517 -ᒧᑕᐦᐠ -2518 -▁ihk -2519 -▁ish -2520 -▁kom -2521 -▁ᐱᑭᐢ -2522 -▁ᑲᓂᑲ -2523 -▁ᒪᐟᒉ -2524 -acter -2525 -creek -2526 -ihkâk -2527 -ikani -2528 -nêhiy -2529 -pîpon -2530 -ᐊᐢᑭᐦᑕ -2531 -ᐋᐧᐸᒣᐤ -2532 -ᐎᓯᐎᓇ᙮ -2533 -ᐓᑭᕒᐃᐘ -2534 -ᐗᑭᐱᑎᔭ -2535 -ᐢᑎᑯᓯᐤ -2536 -ᐢᑐᑰᒫᐤ -2537 -ᐢᑯᒥᐣ᙮ -2538 -ᐦᐃᑐᐏᐣ -2539 -ᐦᐃᑲᓂᐠ -2540 -ᐦᐄᑲᓂᐣ -2541 -ᐦᑕᐃᐧᑭ -2542 -ᐦᑖᐏᓂᐠ -2543 -ᐦᑳᔦᔨᒫ -2544 -ᐧᐋᐸᒣᐤ -2545 -ᐱᒥᐦᐋᐣ -2546 -ᑌᕒᐃᐠ᙮ -2547 -ᑎᐱᔅᑳᐤ -2548 -ᑎᓇᒪᒋᐠ -2549 -ᑎᓯᐃᐧᐣ -2550 -ᑕᐦᐃᐁᐧ -2551 -ᑕᑯᓯᐅᐠ -2552 -ᑕᓂᐦᐃᑭ -2553 -ᑭᐢᑫᐧᕁ -2554 -ᑯᑖᐧᓯᐠ -2555 -ᑯᒋᑲᑌᐤ -2556 -ᑲᒥᐃᐧᐣ -2557 -ᒥᐢᑎᑯᓯ -2558 -ᒥᑕᐦᐊᒻ -2559 -ᒦᐧᐃᔨᓂ -2560 -ᓀᐏᓯᓇᐤ -2561 -ᓂᓯᓂᐘᐠ -2562 -ᓯᓇᐣᒋᐠ -2563 -ᔅᒌᐧᐊᒋ -2564 -ᔐᐧᐃᒋᒫ -2565 -ᔥᑎᑯᔮᐤ -2566 -ᔨᓄᐊᐧᐠ -2567 -ᕒᐅᐟᒋᐠ -2568 -▁acad -2569 -▁acik -2570 -▁acqu -2571 -▁akwa -2572 -▁amer -2573 -▁ases -2574 -▁asām -2575 -▁awas -2576 -▁choc -2577 -▁clos -2578 -▁conf -2579 -▁cîhk -2580 -▁diné -2581 -▁foll -2582 -▁from -2583 -▁here -2584 -▁ihtô -2585 -▁ishi -2586 -▁ispi -2587 -▁iynû -2588 -▁joám -2589 -▁kakē -2590 -▁kala -2591 -▁kiki -2592 -▁kisê -2593 -▁kitē -2594 -▁kosk -2595 -▁lead -2596 -▁ming -2597 -▁more -2598 -▁mosk -2599 -▁ᐌ -2600 -▁ᕗ -2601 -aks 
-2602 -ᐏᔨᓂ -2603 -ᑲᐣ᙮ -2604 -▁bc -2605 -▁by -2606 -▁un -2607 -▁ᐱᑯ -2608 -▁ᒪᑲ -2609 -▁ᓀᒪ -2610 -▁ᓂᐨ -2611 -▁ᓂᔥ -2612 -anak -2613 -askā -2614 -awīy -2615 -cial -2616 -htik -2617 -iciw -2618 -ikēy -2619 -onee -2620 -umāw -2621 -ôski -2622 -ēnāw -2623 -ᐏᔨᓂᐤ -2624 -ᓂᐧᐊᐠ -2625 -▁crj -2626 -▁crk -2627 -▁crl -2628 -▁dot -2629 -▁kân -2630 -▁not -2631 -▁nân -2632 -▁see -2633 -▁sou -2634 -▁taš -2635 -▁ᐊᐧᒪ -2636 -ahtam -2637 -htisk -2638 -isask -2639 -kinak -2640 -ᒥᐦᐋᐧᐠ -2641 -ᕒᐃᑲᐣ᙮ -2642 -▁muse -2643 -▁muus -2644 -▁mvsk -2645 -▁mâka -2646 -▁mína -2647 -▁mōsk -2648 -▁newo -2649 -▁nâma -2650 -▁oaks -2651 -▁ocēh -2652 -▁only -2653 -▁onto -2654 -▁osik -2655 -▁oujé -2656 -▁page -2657 -▁pask -2658 -▁peci -2659 -▁proj -2660 -▁prop -2661 -▁pâsk -2662 -▁pēhk -2663 -▁resp -2664 -▁role -2665 -▁runa -2666 -▁sima -2667 -▁simi -2668 -▁sour -2669 -▁swec -2670 -▁sākā -2671 -▁text -2672 -▁trad -2673 -▁tsêh -2674 -▁used -2675 -▁ᐁᐘᐦᑕ -2676 -▁ᐁᑮᒫᒋ -2677 -▁ᐁᔭᐦᐋ -2678 -▁ᐃᐦᐃᒪ -2679 -▁ᐃᑲᔨᐣ -2680 -▁ᐃᓂᓂᐤ -2681 -▁ᐃᓄᑌᐤ -2682 -▁ᐃᓯᐦᑖ -2683 -▁ᐄᔨᔫᒡ -2684 -▁ᐅᑌᐦᐃ -2685 -▁ᐅᒡᑋᐃ -2686 -▁ᐅᒥᔭᐤ -2687 -▁ᐊᐢᑭᐩ -2688 -▁ᐊᐢᑭᕀ -2689 -▁ᐊᐧᐁᔑ -2690 -▁ᐊᐧᔭᐠ -2691 -▁ᐊᒋᐟᐨ -2692 -▁ᐊᒣᕒᐃ -2693 -▁ᐊᓂᐦᐃ -2694 -▁ᐊᓂᐦᑖ -2695 -▁ᐊᓂᑌ᙮ -2696 -▁ᐊᓅᒥᐣ -2697 -▁ᐋᐢᑲᐤ -2698 -▁ᐋᐧᐴᓵ -2699 -ᓭᐢ -2700 -▁ᐓ -2701 -cey -2702 -ipi -2703 -pôn -2704 -ush -2705 -âsi -2706 -ᐢᑮᕽ -2707 -ᐦᑕᐤ -2708 -ᑐᒋᐠ -2709 -ᑲᐧᐣ -2710 -ᓂᐘᐣ -2711 -▁ᑌᑦ -2712 -▁ᓇᐱ -2713 -ance -2714 -cher -2715 -ehci -2716 -ikow -2717 -ᑐᒋᐠ᙮ -2718 -▁are -2719 -▁ham -2720 -▁kaš -2721 -▁wen -2722 -▁ᐁᑯᓯ -2723 -▁ᒧᓭᐢ -2724 -▁ᒨᔅ᙮ -2725 -ihkow -2726 -ᑖᓂᓯᓇᐤ -2727 -ᑭᓯᓂᐘᐣ -2728 -▁face -2729 -▁inuk -2730 -▁kist -2731 -▁mush -2732 -▁māna -2733 -▁paku -2734 -▁pimâ -2735 -▁show -2736 -▁ᐋᐸᐦᐄ -2737 -▁ᐋᑲᔮᓰ -2738 -▁ᐏᕒᐃᓄ -2739 -▁ᐑᑭᐘᐠ -2740 -▁ᐑᔭᐚᐤ -2741 -▁ᐘᐱᑯᐣ -2742 -▁ᐘᐱᓴᐦ -2743 -▁ᐘᒪᑲᓂ -2744 -▁ᐧᐋᒋᔦ -2745 -▁ᐧᒣᐦᒡ -2746 -▁ᐯᐃᑯᐣ -2747 -▁ᐱᐳᓂᐠ -2748 -▁ᐱᒥᐢᑲ -2749 -▁ᐱᒥᐩ᙮ -2750 -▁ᐸᐢᑳᐧ -2751 -▁ᑕᐃᔭᓐ -2752 -▁ᑫᐨᐱᐣ -2753 -▁ᑭᐃᐧᑭ -2754 -▁ᑭᒋᑲᐨ -2755 -▁ᑭᓄᑖᓂ -2756 -▁ᑮᐍᑎᐣ -2757 -▁ᑯᐨᑕᑎ -2758 -▁ᑲᐃᐧᐣ -2759 -▁ᑲᐊᑲᐧ -2760 -▁ᑲᓂᐹᐣ -2761 -▁ᑲᔦᐦ᙮ -2762 
-▁ᒐᑯᐟᐨ -2763 -▁ᒣᐠᓯᑯ -2764 -▁ᒣᑿᐟᐨ -2765 -▁ᒥᐸᐧᐋ -2766 -▁ᒦᓂᓴᐠ -2767 -▁ᒧᐢᑐᐢ -2768 -▁ᒧᓄᑕᓇ -2769 -▁ᒪᐊᐧᒋ -2770 -▁ᒪᐢᑲᐣ -2771 -▁ᒪᒋᐁᐧ -2772 -▁ᒪᓂᑐᐸ -2773 -▁ᓇᐊᐧᐨ -2774 -▁ᓇᐘᒌᐤ -2775 -▁ᓇᐱᐟᐨ -2776 -▁ᓈᑕᒫᑐ -2777 -▁ᓴᑭᓇᓭ -2778 -ahêwak -2779 -akāwak -2780 -amacik -2781 -amikow -2782 -amêkiw -2783 -anawap -2784 -ascîhc -2785 -ashipi -2786 -askāni -2787 -atatum -2788 -battle -2789 -cessed -2790 -cherok -2791 -cihiwe -2792 -donnel -2793 -emaska -2794 -enitam -2795 -ership -2796 -erģīja -2797 -ewiniw -2798 -eyenne -2799 +ap -192 +ig -193 +ob -194 +ol -195 +îw -196 +ᓇᐠ -197 +ᐦᑖᓱ -198 +▁ay -199 +▁on -200 +▁ta -201 +iwak -202 +▁nam -203 +▁atim -204 +▁ēkwa -205 +): -206 +ce -207 +oh -208 +ok -209 +ān -210 +ᐏᐣ -211 +ᐢᑫ -212 +▁ᐸ -213 +▁ᑌ -214 +▁ᒨ -215 +hki -216 +hpô -217 +ome -218 +pah -219 +pay -220 +wâk -221 +▁gl -222 +▁kâ -223 +▁mé -224 +▁ᐃᔨ -225 +▁ᐸᐦ -226 +aniw -227 +inos -228 +▁can -229 +▁kik -230 +▁ᐯᔭᒄ -231 +▁ahpô -232 +▁some -233 +▁métis -234 +ab -235 +el -236 +iš -237 +kw -238 +āk -239 +āw -240 +ᐁᐧ -241 +▁h -242 +▁ᒉ -243 +▁ᒦ -244 +▁ᓴ -245 +ies -246 +ill -247 +imu -248 +iwa -249 +onk -250 +way -251 +ᓯᑲᐣ -252 +▁is -253 +▁kā -254 +▁ᐊᒋ -255 +ikot -256 +iniw -257 +ᒧᐃᐧᐣ -258 +▁ᐅᐦᒋ -259 +▁nonk -260 +▁writ -261 +illing -262 +▁canad -263 +▁pimîhk -264 +▁nonkilling -265 +ag -266 +ay -267 +eš -268 +gr -269 +le -270 +ᐧᐁ -271 +ᓇᐤ -272 +▁* -273 +▁ᒧ -274 +... 
-275 +cih -276 +īna -277 +▁en -278 +▁it -279 +▁kī -280 +▁wâ -281 +▁ᑭᒋ -282 +ikam -283 +isin -284 +▁cen -285 +▁loc -286 +glish -287 +▁list -288 +▁mīna -289 +▁ohci -290 +aniwiw -291 +ihikan -292 +▁kinos -293 +▁ᐊᒋᐦᑖᓱ -294 +▁location -295 +), -296 +ah -297 +et -298 +hi -299 +kī -300 +ma -301 +tu -302 +êh -303 +îm -304 +ēw -305 +ᐋᐧ -306 +ᐢᑌ -307 +ᐢᑎ -308 +ᐦᑕ -309 +ᑳᐧ -310 +ᒋᐠ -311 +ᓯᒼ -312 +ᕒᐃ -313 +▁ᐘ -314 +▁ᑕ -315 +▁ᑫ -316 +▁ᒣ -317 +▁ᓃ -318 +▁ᓵ -319 +ast -320 +ata -321 +ces -322 +iso -323 +ten -324 +wah -325 +wes -326 +âna -327 +ᐃᐧᓇ -328 +ᐢᑫᐧ -329 +▁or -330 +▁ᑲᔦ -331 +obal -332 +ēwin -333 +ᐦᐃᑲᐣ -334 +ᑵᓯᑲᐣ -335 +▁ask -336 +payik -337 +âpiht -338 +▁mask -339 +▁name -340 +astern -341 +▁canada -342 +▁global -343 +▁english -344 +▁written -345 +ek -346 +em -347 +ll -348 +op -349 +st -350 +um -351 +ôt -352 +ēk -353 +ēn -354 +ēs -355 +ēy -356 +īh -357 +ᐘᐣ -358 +ᑌᐤ -359 +ᓯᐤ -360 +ᔨᓂ -361 +▁j -362 +▁v -363 +▁ê -364 +▁ᐆ -365 +▁ᓯ -366 +aph -367 +ard -368 +ith -369 +iya -370 +ogr -371 +omm -372 +owi -373 +tak -374 +tal -375 +tan -376 +ᑯᓯᐤ -377 +▁ch -378 +▁cî -379 +▁ki -380 +▁pa -381 +▁qc -382 +▁ᐃᐧ -383 +▁ᐃᓯ -384 +▁ᐅᓂ -385 +▁ᑭᑭ -386 +anit -387 +kisk -388 +osiw -389 +tahk -390 +ther -391 +▁awa -392 +▁isk -393 +▁neh -394 +▁nik -395 +▁nēh -396 +▁por -397 +▁rom -398 +▁comm -399 +▁kita -400 +awēwin -401 +ograph -402 +▁nation -403 +▁portal -404 +▁ᐊᒋᐦᑖᓱᓐ -405 +▁ᐸᐦᑵᓯᑲᐣ -406 +âkaniwiw -407 +▁kinosêw -408 +▁maskisin -409 +▁pimîhkân -410 +▁ᐊᐱᐦᑕᑯᓯᓴᐣ -411 +▁ᐊᐱᐦᑕᑯᓯᓴᓇᐠ -412 +ue -413 +ul -414 +āc -415 +ᐎᓐ -416 +ᐘᐠ -417 +ᐘᑭ -418 +ᐢᑲ -419 +ᐦᐊ -420 +ᐦᑳ -421 +ᐦᒡ -422 +ᐧᒥ -423 +ᑌᒡ -424 +ᑭᓯ -425 +ᑲᔭ -426 +ᓂᐠ -427 +ᓇᑕ -428 +▁u -429 +▁ô -430 +▁ᐄ -431 +▁ᐳ -432 +▁ᑮ -433 +▁ᒋ -434 +awî -435 +awī -436 +ihi -437 +ina -438 +ite -439 +oos -440 +ota -441 +oun -442 +sus -443 +tay -444 +wân -445 +yll -446 +âhk -447 +âsk -448 +âwi -449 +ᐘᑭᓂ -450 +ᑲᔭᓯ -451 +ᓯᓇᐤ -452 +▁kî -453 +▁pr -454 +▁wî -455 +▁ᐃᔑ -456 +▁ᐊᐦ -457 +▁ᒥᑕ -458 +▁ᒨᔅ -459 +▁ᒫᒃ -460 +dard -461 +ihci -462 +inak -463 +paht -464 +unit -465 +wâpi -466 +ᑭᐢᑫᐧ 
-467 +ᒁᐘᑭᓂ -468 +▁mon -469 +▁mâk -470 +▁pak -471 +▁pik -472 +▁way -473 +▁ᐊᔨᒧ -474 +▁ᐋᐱᐦ -475 +▁ᑲᔦᐦ -476 +ihtam -477 +thern -478 +yllab -479 +▁aniy -480 +▁east -481 +▁iskw -482 +▁kika -483 +▁kisk -484 +▁orth -485 +▁stan -486 +imuwin -487 +îmowin -488 +ᒁᐘᑭᓂᑲᐣ -489 +▁anihi -490 +▁roman -491 +▁ᓀᐦᐃᔭᐍ -492 +eastern -493 +ihkêwak -494 +ography -495 +western -496 +▁census -497 +▁syllab -498 +▁ispayik -499 +▁communit -500 +▁standard -501 +▁orthography -502 +bi -503 +cî -504 +ef -505 +gu -506 +kâ -507 +nu -508 +pē -509 +ué -510 +yû -511 +ák -512 +ᐢᑕ -513 +ᐢᑯ -514 +ᐤ᙮ -515 +ᐦᑯ -516 +ᐧᐊ -517 +ᐨᑕ -518 +ᑎᑯ -519 +ᑖᓂ -520 +ᑭᐌ -521 +ᑲᑌ -522 +ᑲᓂ -523 +ᑳᐤ -524 +ᒥᐣ -525 +ᒪᓇ -526 +ᓂᐤ -527 +ᓂᑯ -528 +ᓱᐣ -529 +ᔨᐤ -530 +ᔪᐘ -531 +ᕒᐅ -532 +▁i -533 +▁é -534 +▁ō -535 +▁ᐲ -536 +▁ᑯ -537 +▁ᓰ -538 +ana -539 +awê -540 +bec -541 +enn -542 +ers -543 +gna -544 +ian -545 +iki -546 +kic -547 +kīs -548 +lue -549 +ohk -550 +osk -551 +ost -552 +ple -553 +rad -554 +taw -555 +wan -556 +ôhk -557 +āhk -558 +ᐢᑌᐠ -559 +ᐢᑭᐤ -560 +ᐢᑭᑌ -561 +ᐦᒋᑫ -562 +ᐧᐃᓐ -563 +ᑯᐟᐨ -564 +ᑳᐧᐤ -565 +ᒧᐏᐣ -566 +ᓂᐘᐣ -567 +ᓅᐦᒡ -568 +▁ab -569 +▁ak -570 +▁at -571 +▁ᐁᑿ -572 +▁ᐅᒪ -573 +▁ᐊᐧ -574 +▁ᐊᓂ -575 +▁ᐊᓐ -576 +▁ᐱᓯ -577 +▁ᓂᑕ -578 +▁ᓴᑲ -579 +angu -580 +aska -581 +atch -582 +ewan -583 +igin -584 +inik -585 +itaw -586 +iyin -587 +owiy -588 +âkan -589 +ânis -590 +îwah -591 +ēsta -592 +ᐢᑎᑯᓯ -593 +ᔪᐘᐦᐠ -594 +▁ekw -595 +▁mas -596 +▁mis -597 +▁neš -598 +▁nâp -599 +▁nīh -600 +▁pro -601 +▁pâs -602 +▁qué -603 +▁ref -604 +▁tān -605 +▁ᐁᐧᒥ -606 +▁ᐊᐢᑭ -607 +▁ᐊᐧᐁ -608 +▁ᑭᓱᐣ -609 +▁ᓀᐢᑕ -610 +gnais -611 +isiwe -612 +lueun -613 +pahki -614 +îpihk -615 +▁ayis -616 +▁ekwa -617 +▁innu -618 +▁iyyû -619 +▁nata -620 +▁nita -621 +▁piko -622 +▁âtay -623 +anguag -624 +awîtim -625 +awīwin -626 +iyawak -627 +▁monta -628 +▁nēhiy -629 +▁nēsta -630 +▁tahki -631 +▁ᐃᓯᐦᒋᑫ -632 +▁ᐱᓯᐢᑭᐤ -633 +▁ᒥᔪᐘᐦᐠ -634 +askatch -635 +▁atimwa -636 +▁nīhith -637 +▁québec -638 +▁ᓀᐦᐃᔭᐃᐧ -639 +ayimuwin -640 +iyiniwak -641 +îwahikan -642 +▁aniyiwa -643 +▁ᓀᐦᐃᔭᐍᐏᐣ -644 +▁ᓯᒁᐘᑭᓂᑲᐣ -645 +awîtimihk 
-646 +▁nitawâpi -647 +askatchewan -648 +▁montagnais -649 +▁communities -650 +▁nēhiyawēwin -651 +▁nīhithawīwin -652 +▁wayawîtimihk -653 +ds -654 +ex -655 +hc -656 +io -657 +lb -658 +mp -659 +pp -660 +ps -661 +qu -662 +sh -663 +uk -664 +uw -665 +va -666 +ve -667 +ya -668 +ây -669 +êy -670 +îk -671 +ēm -672 +ēt -673 +ōt -674 +ᐅᓐ -675 +ᐟᒋ -676 +ᐟᒐ -677 +ᐤᐸ -678 +ᐦᐄ -679 +ᐦᐅ -680 +ᐧᐋ -681 +ᐸᐸ -682 +ᐸᒥ -683 +ᑕᐦ -684 +ᑕᓂ -685 +ᑖᐤ -686 +ᑭᒋ -687 +ᑲᔮ -688 +ᒣᐤ -689 +ᒣᔅ -690 +ᒥᐦ -691 +ᒥᑕ -692 +ᒧᐏ -693 +ᔭᐠ -694 +ᔮᔪ -695 +▁z -696 +▁ᐏ -697 +▁ᐑ -698 +▁ᐹ -699 +▁ᒐ -700 +▁ᓅ -701 +▁ᓈ -702 +ace -703 +ach -704 +ain -705 +ann -706 +ash -707 +cas -708 +ewa -709 +ige -710 +ihc -711 +ion -712 +mpy -713 +nic -714 +oba -715 +osâ -716 +pat -717 +pin -718 +rio -719 +ses -720 +ski -721 +vin -722 +wâs -723 +áká -724 +êst -725 +êwi -726 +îwi -727 +īhk -728 +ᐁᐧᐃ -729 +ᐋᐧᐠ -730 +ᐎᓂᐠ -731 +ᐟᒋᐠ -732 +ᐟᒐᓂ -733 +ᐢᑳᐧ -734 +ᑧᑭᐌ -735 +ᑲᔮᓰ -736 +ᒧᐅᓐ -737 +ᓂᐦᑳ -738 +ᔥᑎᑯ -739 +▁ac -740 +▁ed -741 +▁fr -742 +▁il -743 +▁iš -744 +▁mw -745 +▁os -746 +▁sh -747 +▁sk -748 +▁ti -749 +▁to -750 +▁wa -751 +▁ᐃᓕ -752 +▁ᐅᑌ -753 +▁ᐅᒋ -754 +▁ᐅᓴ -755 +▁ᐊᑭ -756 +▁ᐋᐸ -757 +▁ᐱᒥ -758 +▁ᒥᓇ -759 +▁ᓇᑕ -760 +amik -761 +apān -762 +ence -763 +eren -764 +ewak -765 +ikât -766 +inoh -767 +isik -768 +isit -769 +ites -770 +iwāt -771 +kask -772 +kīsk -773 +nahk -774 +niwa -775 +olit -776 +omin -777 +oose -778 +osik -779 +osis -780 +pisk -781 +skaw -782 +tahi -783 +tihk -784 +timw -785 +wach -786 +wina -787 +âcik -788 +ânak -789 +âwak -790 +âwew -791 +ēcik -792 +ᐁᐧᐃᐣ -793 +ᐃᐧᓯᐤ -794 +ᐢᑯᑌᐤ -795 +ᐦᑯᒪᓇ -796 +ᐸᐸᐤᐸ -797 +ᑭᐊᐧᐠ -798 +ᓇᑕᐦᐠ -799 +▁dec -800 +▁ekâ -801 +▁eth -802 +▁itē -803 +▁kîw -804 +▁lab -805 +▁māt -806 +▁res -807 +▁rig -808 +▁sam -809 +▁sel -810 +▁sim -811 +▁ter -812 +▁wor -813 +▁ôma -814 +▁ᐃᐧᐊ -815 +▁ᑭᐢᑫ -816 +▁ᒥᐨᑕ -817 +▁ᒥᓂᑯ -818 +▁ᓇᒣᔅ -819 +atcik -820 +cikan -821 +ected -822 +imuun -823 +inawê -824 +inihk -825 +itcik -826 +itwâw -827 +opîwi -828 +ories -829 +rador -830 +tario -831 +wampy -832 +wâkam -833 +wânâs -834 +ᓂᐦᑳᑌᒡ -835 
+▁ilil -836 +▁iyuw -837 +▁kask -838 +▁mihk -839 +▁nešt -840 +▁wiki -841 +▁wâsk -842 +▁êkwa -843 +▁ᐃᔨᔨᐤ -844 +▁ᐊᐧᐁᓰ -845 +▁ᐊᑲᔭᓯ -846 +▁ᐋᐱᐦᑖ -847 +▁ᐸᐦᑭᓯ -848 +▁ᑫᑯᐟᐨ -849 +eštimw -850 +iginal -851 +ikamik -852 +ikâtew -853 +inoham -854 +niwahk -855 +owiyâs -856 +tahkik -857 +wawach -858 +âkosis -859 +ânisâw -860 +âwisiw -861 +▁glenn -862 +▁kihci -863 +▁manit -864 +▁paige -865 +▁âskaw -866 +▁ᐃᔨᒧᐅᓐ -867 +▁ᐃᔨᓂᐘᐣ -868 +▁ᐅᓂᐟᒐᓂ -869 +▁ᑌᐸᐸᐤᐸ -870 +▁ᑲᓇᑕᐦᐠ -871 +▁ᒪᐢᑯᑌᐤ -872 +erences -873 +ikamach -874 +orthern -875 +ᑭᐢᑫᐧᐃᐧᐣ -876 +▁center -877 +▁ethnic -878 +▁iskwew -879 +▁provin -880 +▁sample -881 +▁territ -882 +▁ᐃᐧᑭᐊᐧᐠ -883 +▁ᐊᔨᒧᐧᐃᓐ -884 +▁ᓀᐦᐃᔭᐁᐧ -885 +▁ᓇᒣᔅᑧᑭᐌ -886 +original -887 +▁languag -888 +▁mistahi -889 +▁ontario -890 +▁pânisâw -891 +▁âtayôhk -892 +▁ᐁᐧᒥᐢᑎᑯᓯ -893 +kicihikan -894 +ákániwahk -895 +▁labrador -896 +▁manitoba -897 +▁selected -898 +▁syllabar -899 +âkosisânak -900 +âtâkaniwiw -901 +âwewikamik -902 +îwahikanak -903 +▁nehiyawak -904 +▁pâsikâtew -905 +âkanihkêwak -906 +▁ililîmowin -907 +▁references -908 +▁territories -909 +wawachikamach -910 +▁ayisiyiniwak -911 +▁saskatchewan -912 +âpihtâkosisânak -913 +'← -914 +bo -915 +hu -916 +ia -917 +id -918 +ir -919 +iv -920 +iz -921 +ka -922 +kn -923 +ly -924 +mô -925 +oo -926 +rd -927 +si -928 +sw -929 +to -930 +tâ -931 +tā -932 +up -933 +ut -934 +wê -935 +ác -936 +âm -937 +êk -938 +ôp -939 +ûn -940 +ās -941 +āy -942 +īk -943 +ōh -944 +ᐁᐁ -945 +ᐁᔪ -946 +ᐃᑿ -947 +ᐅᑯ -948 +ᐅᒡ -949 +ᐊᓯ -950 +ᐋᐣ -951 +ᐌᐎ -952 +ᐌᓐ -953 +ᐎᓯ -954 +ᐏᓯ -955 +ᐑᑎ -956 +ᐚᐱ -957 +ᐟᒉ -958 +ᐢᑐ -959 +ᐢᑮ -960 +ᐢᒋ -961 +ᐣᓯ -962 +ᐦᐨ -963 +ᐦᑲ -964 +ᐦᔪ -965 +ᐯᔨ -966 +ᐱᐣ -967 +ᐱᐩ -968 +ᐱᔅ -969 +ᐱᔖ -970 +ᐲᔨ -971 +ᐴᓵ -972 +ᑌᐦ -973 +ᑎᐣ -974 +ᑎᒻ -975 +ᑎᓐ -976 +ᑎᓰ -977 +ᑎᓱ -978 +ᑕᒻ -979 +ᑕᓇ -980 +ᑖᕁ -981 +ᑭᐱ -982 +ᑭᓄ -983 +ᑭᔨ -984 +ᑮᓯ -985 +ᑯᒋ -986 +ᑰᐤ -987 +ᑲᐠ -988 +ᑲᒋ -989 +ᑲᓄ -990 +ᑲᓇ -991 +ᑳᐣ -992 +ᑵᐠ -993 +ᒀᓐ -994 +ᒋᓲ -995 +ᒌᔑ -996 +ᒡᐦ -997 +ᒥᓱ -998 +ᒥᕽ -999 +ᒨᓐ -1000 +ᒪᑯ -1001 +ᒫᐤ -1002 +ᓀᐤ -1003 +ᓂᒥ -1004 +ᓇᐗ -1005 +ᓖᒧ -1006 +ᓯᓭ -1007 +ᓴᕀ -1008 +ᔑᒡ -1009 +ᔨᔫ 
-1010 +ᔪᐤ -1011 +ᔭᐨ -1012 +ᔭᒥ -1013 +ᖬᐑ -1014 +▁' -1015 +▁: -1016 +▁< -1017 +▁g -1018 +▁x -1019 +▁î -1020 +▁ᐎ -1021 +▁ᐤ -1022 +▁ᐧ -1023 +▁ᐴ -1024 +▁ᑏ -1025 +▁ᑐ -1026 +▁ᑑ -1027 +▁ᑖ -1028 +▁ᑰ -1029 +▁ᒍ -1030 +▁ᒎ -1031 +▁ᒑ -1032 +▁ᓄ -1033 +▁ᓚ -1034 +▁ᓭ -1035 +▁ᓱ -1036 +▁ᓲ -1037 +▁ᔕ -1038 +▁ᔥ -1039 +▁ᔦ -1040 +▁ᔨ -1041 +▁ᔩ -1042 +▁ᔪ -1043 +▁ᔫ -1044 +▁ᔭ -1045 +▁ᔮ -1046 +aim -1047 +ako -1048 +asi -1049 +ate -1050 +bia -1051 +cak -1052 +can -1053 +cho -1054 +cil -1055 +ecî -1056 +ekw -1057 +els -1058 +hts -1059 +ics -1060 +ikā -1061 +ito -1062 +ity -1063 +kac -1064 +kin -1065 +lac -1066 +lar -1067 +mar -1068 +mât -1069 +môs -1070 +ock -1071 +ohc -1072 +ohn -1073 +otā -1074 +out -1075 +pak -1076 +ped -1077 +pim -1078 +piw -1079 +pon -1080 +pôs -1081 +ral -1082 +shi -1083 +sim -1084 +sou -1085 +sta -1086 +sîp -1087 +upi -1088 +use -1089 +wap -1090 +wec -1091 +wiy -1092 +êwo -1093 +îht -1094 +îna -1095 +îso -1096 +šiš -1097 +ᐁᐁ᙮ -1098 +ᐁᔪᑯ -1099 +ᐃᐧᑭ -1100 +ᐃᐧᒧ -1101 +ᐃᑿᐤ -1102 +ᐅᑯᓐ -1103 +ᐊᐧᐤ -1104 +ᐊᐧᐱ -1105 +ᐊᓇᐠ -1106 +ᐋᐧᐤ -1107 +ᐌᐎᐣ -1108 +ᐌᓐ᙮ -1109 +ᐟᐨ᙮ -1110 +ᐢᑌᓄ -1111 +ᐢᑯᓯ -1112 +ᐢᑲᐤ -1113 +ᐢᒋᑫ -1114 +ᐦᐃᑭ -1115 +ᐦᐠ᙮ -1116 +ᐦᐱᐦ -1117 +ᐦᑲᓐ -1118 +ᐧᐁᒥ -1119 +ᐧᐁᔨ -1120 +ᐧᐃᐤ -1121 +ᐧᐃᐸ -1122 +ᐧᐊᒡ -1123 +ᐧᑖᐤ -1124 +ᐸᒣᐤ -1125 +ᑎᐯᔨ -1126 +ᑐᒥᑕ -1127 +ᑕᐦᐃ -1128 +ᑕᓂᓴ -1129 +ᑮᓯᓵ -1130 +ᑯᓯᓴ -1131 +ᑲᐧᓂ -1132 +ᑲᑌᐠ -1133 +ᒋᓲ᙮ -1134 +ᒋᕒᐃ -1135 +ᒧᐏᓂ -1136 +ᒨᑖᕁ -1137 +ᒫᑎᓰ -1138 +ᓂᓂᐤ -1139 +ᓄᐦᐨ -1140 +ᓇᐊᐧ -1141 +ᓰᐱᐦ -1142 +ᓴᐦᐊ -1143 +ᔥᑌᒡ -1144 +ᔥᑕᒻ -1145 +ᔨᐦᑕ -1146 +ᔭᐑᑎ -1147 +ᕒᐃᑲ -1148 +▁bu -1149 +▁by -1150 +▁gr -1151 +▁ht -1152 +▁mb -1153 +▁me -1154 +▁mā -1155 +▁ni -1156 +▁nô -1157 +▁oc -1158 +▁oh -1159 +▁ot -1160 +▁tā -1161 +▁âp -1162 +▁ēk -1163 +▁ᐁᑮ -1164 +▁ᐁᑾ -1165 +▁ᐁᔥ -1166 +▁ᐃᑣ -1167 +▁ᐅᓵ -1168 +▁ᐆᒪ -1169 +▁ᐊᒻ -1170 +▁ᐘᐱ -1171 +▁ᐦᐁ -1172 +▁ᐦᐃ -1173 +▁ᐦᐄ -1174 +▁ᐦᐅ -1175 +▁ᐦᐆ -1176 +▁ᐦᐊ -1177 +▁ᐦᐋ -1178 +▁ᐳᓂ -1179 +▁ᐸᓵ -1180 +▁ᑭᓄ -1181 +▁ᒣᑿ -1182 +▁ᒥᐦ -1183 +▁ᒥᔮ -1184 +▁ᒦᓇ -1185 +▁ᒫᑕ -1186 +▁ᒫᑲ -1187 +▁ᓂᑐ -1188 +▁ᓇᐯ -1189 +▁ᓵᑳ -1190 +acik -1191 +agan -1192 +ains -1193 +amih -1194 
+amil -1195 +anin -1196 +asin -1197 +asiw -1198 +atim -1199 +awāt -1200 +ayâs -1201 +book -1202 +case -1203 +ench -1204 +erta -1205 +etin -1206 +etsi -1207 +face -1208 +hkek -1209 +hkân -1210 +ical -1211 +icik -1212 +imin -1213 +imit -1214 +imot -1215 +ināw -1216 +isto -1217 +itik -1218 +iwin -1219 +iwāw -1220 +iyik -1221 +iyîk -1222 +kned -1223 +kway -1224 +main -1225 +natā -1226 +olum -1227 +oods -1228 +otam -1229 +oups -1230 +pahk -1231 +pask -1232 +pper -1233 +pâhk -1234 +skēk -1235 +stēn -1236 +tics -1237 +uman -1238 +upus -1239 +used -1240 +âpem -1241 +îpeh -1242 +îpis -1243 +ôpâk -1244 +āceh -1245 +ēkok -1246 +ᐁᐧᐊᐧ -1247 +ᐃᐧᐦᔪ -1248 +ᐃᐧᔨᓂ -1249 +ᐊᐧᐠ᙮ -1250 +ᐊᑲᔭᓯ -1251 +ᐊᓯᓯᒼ -1252 +ᐋᑲᔮᓰ -1253 +ᐏᓯᓇᐤ -1254 +ᐢᑎᑳᐣ -1255 +ᐢᑯᓯᐦ -1256 +ᐦᐃᖬᐑ -1257 +ᐦᐱᐦᑲ -1258 +ᐦᑕᑯᓯ -1259 +ᐦᑖᑭᔨ -1260 +ᐦᑖᓱᐣ -1261 +ᐱᐦᑰᐤ -1262 +ᐱᐦᑵᐠ -1263 +ᐱᔖᔑᒡ -1264 +ᐲᔨᓯᒼ -1265 +ᑲᐢᑭᑌ -1266 +ᒌᔑᑳᐤ -1267 +ᓂᒥᑎᓱ -1268 +ᓅᐦᒡ᙮ -1269 +ᓇᐌᐎᐣ -1270 +ᓇᐗᐸᒥ -1271 +ᓖᒧᐎᓐ -1272 +ᓯᓭᐘᐠ -1273 +ᔥᑎᑯᔒ -1274 +ᔨᓂᐘᐠ -1275 +ᔭᒥᐦᐊ -1276 +▁alb -1277 +▁ava -1278 +▁iht -1279 +▁isi -1280 +▁ita -1281 +▁kin -1282 +▁leh -1283 +▁mah -1284 +▁mok -1285 +▁new -1286 +▁nêh -1287 +▁ota -1288 +▁pêy -1289 +▁sci -1290 +▁sîw -1291 +▁tip -1292 +▁vow -1293 +▁wes -1294 +▁wīk -1295 +▁âpa -1296 +▁ᐁᑌᐦ -1297 +▁ᐃᔪᐤ -1298 +▁ᐄᔨᔫ -1299 +▁ᐅᐦᐃ -1300 +▁ᐊᐦᐳ -1301 +▁ᐊᐦᐴ -1302 +▁ᐊᐧᐱ -1303 +▁ᐊᓂᒌ -1304 +▁ᐊᔨᒥ -1305 +▁ᐊᔨᓯ -1306 +▁ᐱᒥᐩ -1307 +▁ᐱᓯᒼ -1308 +▁ᑌᑲᒋ -1309 +▁ᑭᐦᒋ -1310 +▁ᑳᓇᑕ -1311 +▁ᒉᒀᓐ -1312 +▁ᒥᑕᑐ -1313 +▁ᒥᕒᐅ -1314 +▁ᒫᐅᒡ -1315 +▁ᓂᐱᐩ -1316 +▁ᓂᔮᔪ -1317 +▁ᓇᐦᑲ -1318 +▁ᓰᐱᐦ -1319 +▁ᓵᓴᕀ -1320 +aimûn -1321 +asibi -1322 +cakin -1323 +cewâk -1324 +chool -1325 +eyiht -1326 +ihcik -1327 +ihtak -1328 +ihtaw -1329 +imate -1330 +inito -1331 +isiwê -1332 +itâsk -1333 +kinos -1334 +kisik -1335 +lains -1336 +nakac -1337 +nipah -1338 +nipat -1339 +ostos -1340 +otcik -1341 +simôt -1342 +tamik -1343 +ticik -1344 +tural -1345 +wayân -1346 +wâwân -1347 +âtahk -1348 +êhohk -1349 +ᐢᑌᓄᐦᐠ -1350 +ᐢᑭᐦᐠ᙮ -1351 +ᐢᑳᐧᔭᐨ -1352 +ᐦᑕᑯᓯᐤ -1353 +ᑎᐯᔨᒥᓱ -1354 +ᑐᒥᑕᓇᐤ -1355 +ᑮᓯᓵᒋᐠ -1356 +ᓈᓅᐦᒡ᙮ -1357 +ᔭᐑᑎᒥᕽ 
-1358 +▁bann -1359 +▁blac -1360 +▁brit -1361 +▁cecî -1362 +▁char -1363 +▁chis -1364 +▁coun -1365 +▁data -1366 +▁htos -1367 +▁inin -1368 +▁itēw -1369 +▁john -1370 +▁kikî -1371 +▁kikī -1372 +▁kotā -1373 +▁mist -1374 +▁mwác -1375 +▁mîna -1376 +▁nisk -1377 +▁noun -1378 +▁nêwo -1379 +▁nîso -1380 +▁oski -1381 +▁wask -1382 +▁with -1383 +▁word -1384 +▁ōhci -1385 +▁ᐁᐅᑯᓐ -1386 +▁ᐃᐢᑫᐧ -1387 +▁ᐅᑕᓂᓴ -1388 +▁ᐅᑯᓯᓴ -1389 +▁ᐊᐌᓐ᙮ -1390 +▁ᐊᓄᐦᐨ -1391 +▁ᐊᔨᒨᓐ -1392 +▁ᐋᐱᐦᑕ -1393 +▁ᐋᐸᑎᓐ -1394 +▁ᐯᐃᑿᐤ -1395 +▁ᐱᒫᑎᓰ -1396 +▁ᑲᐢᑭᑌ -1397 +▁ᒥᑕᑕᐦ -1398 +▁ᒥᓂᑯᔥ -1399 +▁ᓂᑕᐚᐱ -1400 +▁ᓃᔥᑕᒻ -1401 +amites -1402 +aniwit -1403 +ashish -1404 +asinâs -1405 +asiwât -1406 +askapi -1407 +ikisiw -1408 +ikotik -1409 +imitis -1410 +iwâcik -1411 +mation -1412 +osâwâw -1413 +otawâk -1414 +owinik -1415 +pimîhk -1416 +pēnahk -1417 +skēkot -1418 +takwâk -1419 +âkayâs -1420 +âpihtâ -1421 +ᐃᐧᒧᐢᒋᑫ -1422 +ᐃᐧᔨᓂᐊᐧ -1423 +ᐊᐱᐦᑕᑯᓯ -1424 +ᐦᐃᐁᐧᐃᐣ -1425 +ᐦᐃᖬᐑᐏᐣ -1426 +ᐦᑖᑭᔨᒡᐦ -1427 +ᑭᒋᐱᐦᑵᐠ -1428 +ᑲᐧᓂᐃᐧᓇ -1429 +ᓰᐱᐦᑳᐧᐤ -1430 +▁anite -1431 +▁askiy -1432 +▁askōt -1433 +▁atihk -1434 +▁atimw -1435 +▁awask -1436 +▁betsi -1437 +▁capān -1438 +▁colum -1439 +▁famil -1440 +▁human -1441 +▁kitaw -1442 +▁lupus -1443 +▁masīh -1444 +▁moose -1445 +▁māceh -1446 +▁nikot -1447 +▁nisto -1448 +▁nēhin -1449 +▁pakit -1450 +▁polit -1451 +▁upper -1452 +▁ᐃᑣᓅᐦᒡ -1453 +▁ᐅᑌᓇᐊᐧ -1454 +▁ᐊᐧᐁᓰᔅ -1455 +▁ᐊᐱᔖᔑᒡ -1456 +▁ᐊᔨᒧᐎᓐ -1457 +▁ᐸᐊᓯᓯᒼ -1458 +▁ᑭᐢᑎᑳᐣ -1459 +▁ᑭᐢᑯᓯᐦ -1460 +▁ᑮᓯᓭᐘᐠ -1461 +▁ᑲᓇᐗᐸᒥ -1462 +▁ᒥᔮᐧᐃᐸ -1463 +▁ᒧᐦᑯᒪᓇ -1464 +▁ᒫᑕᐢᑌᐠ -1465 +▁ᓂᐱᐦᑰᐤ -1466 +▁ᓂᑐᐧᐁᔨ -1467 +▁ᓵᑳᐢᑌᐠ -1468 +aganish -1469 +amihêwi -1470 +cikêwak -1471 +eyihtam -1472 +eyihtâk -1473 +ikamekw -1474 +ispayik -1475 +kaskite -1476 +nipatht -1477 +pahikan -1478 +wâcakin -1479 +wâcikan -1480 +wâkamîw -1481 +îpehtak -1482 +ᐃᐧᐦᔪᐃᐧᐣ -1483 +ᐋᑲᔮᓰᒧᐏᐣ -1484 +ᐦᐱᐦᑲᓯᑲᐣ -1485 +ᐧᐁᒥᔥᑎᑯᔒ -1486 +ᓂᒥᑎᓱᐎᓂᐠ -1487 +ᔭᒥᐦᐊᐃᐧᐣ -1488 +▁declar -1489 +▁french -1490 +▁groups -1491 +▁ininiw -1492 +▁kānatā -1493 +▁mahîhk -1494 +▁misiwe -1495 +▁namêst -1496 +▁nikisk -1497 +▁nâpewa -1498 +▁pahkek -1499 +▁rights -1500 +▁school -1501 
+▁swampy -1502 +▁sîwîht -1503 +▁vowels -1504 +▁âpacih -1505 +▁ᐃᐢᑳᐧᔭᐨ -1506 +▁ᐃᓕᓖᒧᐎᓐ -1507 +▁ᐅᑎᐯᔨᒥᓱ -1508 +▁ᐅᓂᐟᒐᓂᒐ -1509 +▁ᐅᓂᐲᔨᓯᒼ -1510 +▁ᐊᑭᐦᑖᓱᐣ -1511 +▁ᐊᑲᔭᓯᐊᐧ -1512 +▁ᐘᔭᐑᑎᒥᕽ -1513 +▁ᑭᐢᑫᔨᐦᑕ -1514 +▁ᓀᐦᐃᔭᐊᐧ -1515 +▁ᓇᐯᐊᐧᐠ᙮ -1516 +▁ᓇᑕᐁᐧᐊᐧ -1517 +▁ᓰᐱᐦᑯᓯᐤ -1518 +▁ᓴᑲᐦᐃᑲᐣ -1519 +etinôpâk -1520 +itâskost -1521 +mâtinawê -1522 +nakacīhk -1523 +northern -1524 +simôtâhk -1525 +southern -1526 +âpihtâwi -1527 +îpihkwâw -1528 +ēnimitis -1529 +ᐊᑲᔭᓯᒧᐃᐧᐣ -1530 +▁akwâwân -1531 +▁alberta -1532 +▁awahkân -1533 +▁bannock -1534 +▁british -1535 +▁council -1536 +▁eastern -1537 +▁iyimuun -1538 +▁išinihk -1539 +▁naskapi -1540 +▁natawah -1541 +▁science -1542 +▁wikiped -1543 +▁writing -1544 +▁wîcewâk -1545 +▁wīkiwak -1546 +▁âpihtaw -1547 +▁ᐃᐢᑫᐧᐊᐧᐠ -1548 +▁ᐃᔑᓂᐦᑳᑌᒡ -1549 +▁ᐸᐦᑭᓯᒨᑖᕁ -1550 +▁ᓀᐦᐃᓇᐌᐎᐣ -1551 +▁ᓃᐦᐃᖬᐑᐏᐣ -1552 +▁ᓴᑲᐢᑌᓄᐦᐠ -1553 +iskinoham -1554 +iyîkopîwi -1555 +kiskatcik -1556 +osikwânâs -1557 +êhohkêwak -1558 +îpihkosiw -1559 +ᐃᐧᒧᐢᒋᑫᐃᐧᓇ -1560 +ᐊᐱᐦᑕᑯᓯᐊᓇᐠ -1561 +▁blackned -1562 +▁canadian -1563 +▁columbia -1564 +▁eastmain -1565 +▁kīskēkot -1566 +▁lehlueun -1567 +▁nehlueun -1568 +▁nikotwâs -1569 +▁ᐃᓯᐦᒋᑫᐃᐧᐣ -1570 +▁ᐃᓯᐦᒋᑫᐃᐧᓇ -1571 +▁ᐋᐱᐦᑖᒌᔑᑳᐤ -1572 +▁ᐱᑭᐢᑫᐧᐃᐧᐣ -1573 +▁ᑭᐢᑯᓯᐦᐋᐧᐠ -1574 +anitotawâk -1575 +asinâsowin -1576 +isiwepahki -1577 +isiwêpahki -1578 +▁chisasibi -1579 +▁languages -1580 +▁maskisina -1581 +▁political -1582 +▁provinces -1583 +▁syllabary -1584 +▁syllabics -1585 +▁âtayôhkân -1586 +▁ᐁᐧᒥᐢᑎᑯᓯᐊᐧ -1587 +▁ᐅᑎᐯᔨᒥᓱᐊᐧᐠ -1588 +▁ᒥᑕᑕᐦᑐᒥᑕᓇᐤ -1589 +▁ᓀᐦᐃᔭᐁᐧᐃᐧᐣ -1590 +▁ᓇᐦᑲᐃᐧᔨᓂᐊᐧ -1591 +▁aboriginal -1592 +▁ᐊᒋᐦᑖᓱᓈᓅᐦᒡ᙮ -1593 +▁ᒧᐦᑯᒪᓇᐢᑭᐦᐠ᙮ -1594 +âkayâsîmowin -1595 +âtâwewikamik -1596 +▁betsiamites -1597 +▁declaration -1598 +▁kîwetinôpâk -1599 +▁mihkwâkamîw -1600 +▁nēhinawēwin -1601 +▁pimitâskost -1602 +▁pânisâwêwak -1603 +▁waskaganish -1604 +▁yîwahikanak -1605 +▁ᐅᒪᐃᐧᒧᐢᒋᑫᐃᐧᓇ -1606 +▁ᓂᑐᐧᐁᔨᐦᑖᑭᔨᒡᐦ -1607 +ēnimitisowinik -1608 +▁akwâwânihkêwak -1609 +▁âpacihâkaniwiw -1610 +nipathtákániwahk -1611 +▁kawawachikamach -1612 +▁namêstêhohkêwak -1613 +▁âpihtâkosisânak -1614 +*, -1615 +av -1616 +aš -1617 +be -1618 
+bs -1619 +ca -1620 +cr -1621 +cí -1622 +dj -1623 +eô -1624 +ff -1625 +gc -1626 +ja -1627 +li -1628 +má -1629 +of -1630 +ož -1631 +se -1632 +sā -1633 +te -1634 +tk -1635 +tō -1636 +ui -1637 +ur -1638 +wd -1639 +we -1640 +wî -1641 +âh -1642 +êm -1643 +ín -1644 +îš -1645 +ôm -1646 +ēp -1647 +ēr -1648 +īm -1649 +ōk -1650 +še -1651 +ᎳᎩ -1652 +ᐃᐱ -1653 +ᐃᑑ -1654 +ᐃᓄ -1655 +ᐃᔑ -1656 +ᐄᓂ -1657 +ᐅᐠ -1658 +ᐅᑭ -1659 +ᐅᒋ -1660 +ᐅᓵ -1661 +ᐊᐨ -1662 +ᐎᑭ -1663 +ᐏᐱ -1664 +ᐓᑭ -1665 +ᐚᐤ -1666 +ᐟᐦ -1667 +ᐠᓯ -1668 +ᐦᐋ -1669 +ᐦᑭ -1670 +ᐧᓭ -1671 +ᐨᑫ -1672 +ᐯᑯ -1673 +ᐯᒃ -1674 +ᐱᑭ -1675 +ᐱᒥ -1676 +ᐳᐣ -1677 +ᐳᑕ -1678 +ᑋᐃ -1679 +ᑌᑭ -1680 +ᑌ᙮ -1681 +ᑎᐠ -1682 +ᑎᓇ -1683 +ᑎᔭ -1684 +ᑐᐸ -1685 +ᑐᑦ -1686 +ᑐᓀ -1687 +ᑐᖅ -1688 +ᑕᐌ -1689 +ᑕᒧ -1690 +ᑕᒫ -1691 +ᑖᐧ -1692 +ᑫᐤ -1693 +ᑭᑫ -1694 +ᑭᓇ -1695 +ᑭ᙮ -1696 +ᑲᐨ -1697 +ᑲᒥ -1698 +ᑳᒋ -1699 +jo -1700 +jé -1701 +lf -1702 +oá -1703 +ģī -1704 +ᐘ᙮ -1705 +ᑌᐠ -1706 +ᑎᐤ -1707 +ᑴᓯ -1708 +ᑾᐣ -1709 +ᒁᐃ -1710 +ᒌᐤ -1711 +ᒑᔨ -1712 +ᒥᓀ -1713 +ᒧᒥ -1714 +ᒧᔭ -1715 +ᒨᒫ -1716 +ᒫᒋ -1717 +ᓀᐅ -1718 +ᓂᐹ -1719 +ᓂᒋ -1720 +ᓂᓴ -1721 +ᓂᔥ -1722 +ᓄᐁ -1723 +ᓇᐨ -1724 +ᓇᑌ -1725 +ᓇᑖ -1726 +ᓐᑌ -1727 +ᓗᒃ -1728 +ᓯᐌ -1729 +ᓯᐢ -1730 +ᓯᑭ -1731 +ᓯᒡ -1732 +ᓯᓇ -1733 +ᓯᔅ -1734 +ᔕᐠ -1735 +ᔦᔨ -1736 +ᔨᓄ -1737 +ᔩᐤ -1738 +ᔭᐤ -1739 +ᕽ᙮ -1740 +▁; -1741 +▁Ꮳ -1742 +▁ᐒ -1743 +▁ᐔ -1744 +▁ᐕ -1745 +▁ᐙ -1746 +▁ᐟ -1747 +▁ᐠ -1748 +▁ᐢ -1749 +▁ᐣ -1750 +▁ᐨ -1751 +▁ᐩ -1752 +▁ᑉ -1753 +▁ᑊ -1754 +▁ᑦ -1755 +▁ᒃ -1756 +▁ᓐ -1757 +▁ᓕ -1758 +▁ᓘ -1759 +▁ᓛ -1760 +▁ᓬ -1761 +▁ᔅ -1762 +▁ᔐ -1763 +▁ᔑ -1764 +▁ᔓ -1765 +▁ᕃ -1766 +▁ᕆ -1767 +▁ᕇ -1768 +▁ᕈ -1769 +▁ᕉ -1770 +▁ᕌ -1771 +▁ᕐ -1772 +▁ᕒ -1773 +▁ᕓ -1774 +▁ᕖ -1775 +▁ᕘ -1776 +▁ᕚ -1777 +▁ᕞ -1778 +▁ᕢ -1779 +▁ᕤ -1780 +▁ᕥ -1781 +▁ᕧ -1782 +▁ᕽ -1783 +▁ᖧ -1784 +▁ᖩ -1785 +▁ᖬ -1786 +ano -1787 +ans -1788 +anē -1789 +apm -1790 +atu -1791 +cit -1792 +ciw -1793 +cîy -1794 +dji -1795 +eci -1796 +ely -1797 +erg -1798 +est -1799 +ît -1800 +ᐁᐤ -1801 +ᑎᒽ -1802 +ᓇᓱ -1803 +ᓇ᙮ -1804 +ᓭᐠ -1805 +ᔅᒌ -1806 +ᔅᒡ -1807 +ᔮᐤ -1808 +▁, -1809 +▁ᒼ -1810 +▁ᓓ -1811 +▁ᓗ -1812 +▁ᕕ -1813 +▁ᖨ -1814 +▁ᖪ -1815 +eui -1816 +hat 
-1817 +hin -1818 +hmá -1819 +hua -1820 +iar -1821 +ick -1822 +ico -1823 +ika -1824 +ini -1825 +ins -1826 +iné -1827 +inā -1828 +isc -1829 +isi -1830 +iss -1831 +isê -1832 +itt -1833 +iwe -1834 +iwn -1835 +iwā -1836 +kim -1837 +lis -1838 +mic -1839 +ohp -1840 +oke -1841 +ole -1842 +oll -1843 +osi -1844 +oám -1845 +pan -1846 +poh -1847 +sêh -1848 +sām -1849 +tit -1850 +tôm -1851 +ujé -1852 +ult -1853 +wac -1854 +xof -1855 +you -1856 +áhk -1857 +âmo -1858 +îsi -1859 +ēna -1860 +ēni -1861 +ēta -1862 +ōsk -1863 +ᐁᐟᐦ -1864 +ᐃᑑᐸ -1865 +ᐃᓄᒃ -1866 +ᐊᐧᐣ -1867 +ᐊᑎᒻ -1868 +ᐍᑎᐣ -1869 +ᐎᑭᐟ -1870 +ᐎᓇ᙮ -1871 +ᐎᓯᐟ -1872 +ᐏᐟᐨ -1873 +ᐗᑭᐱ -1874 +ᐘᒌᐤ -1875 +ᐠᓯᑯ -1876 +ᐢᑐᐢ -1877 +ᐢᑐᑰ -1878 +ᐢᑮᕽ -1879 +ᐦᐁᐤ -1880 +ᐦᐊᒻ -1881 +ᐧᐋᒋ -1882 +ᐧᐴᓵ -1883 +ᐨᐱᐣ -1884 +ᐨᑫᒋ -1885 +ᐱᒥᐦ -1886 +ᐳᓂᐠ -1887 +ᑎᐱᔅ -1888 +ᑎᑐᑦ -1889 +ᑎᓇᒪ -1890 +ᑕᐌᓂ -1891 +ᑕᒧᐎ -1892 +ᑕᒫᑐ -1893 +ᑖᐧᓯ -1894 +ᑯᐏᐱ -1895 +ᑯᒋᑲ -1896 +ᑲᓄᐟ -1897 +ᑲᓄ᙮ -1898 +ᑲᓇᐣ -1899 +., -1900 +aq -1901 +ge -1902 +oe -1903 +âc -1904 +ᐠ᙮ -1905 +ᑌᒋ -1906 +ᑰᓇ -1907 +ᒋᐣ -1908 +ᒋᒫ -1909 +ᒧᑫ -1910 +ᓰᖂ -1911 +▁ᓖ -1912 +▁ᔒ -1913 +ckw -1914 +oht -1915 +ors -1916 +ēhk -1917 +ᐊᐧᒋ -1918 +ᐦᑰᓇ -1919 +ᐨᑕᑎ -1920 +ᑌᒋᓂ -1921 +ᑲᓇᓐ -1922 +ᑴᓯᑐ -1923 +ᒃᓰᖂ -1924 +ᒑᐱᔅ -1925 +ᒑᔨᕁ -1926 +ᒡᑋᐃ -1927 +ᒥᔭᐤ -1928 +ᒦᐧᐃ -1929 +ᒨᐧᓭ -1930 +ᒪᑲᓂ -1931 +ᓂᐧᐊ -1932 +ᓂᐹᐣ -1933 +ᓂᑐᐸ -1934 +ᓂᑲᑌ -1935 +ᓂᓴᐠ -1936 +ᓂᔥᑐ -1937 +ᓂᔮᔪ -1938 +ᓄᑕᓇ -1939 +ᓇᓀᐤ -1940 +ᓈᐃᐧ -1941 +ᓈᓀᐅ -1942 +ᓐᑌᕇ -1943 +ᓭᐁᐧ -1944 +ᔨᒨᒫ -1945 +ᔭᐚᐤ -1946 +ᔭᐦᐋ -1947 +ᔭᕽ᙮ -1948 +▁ -7938 ? 
-7939 ë -7940 ð -7941 diff --git a/models/vocabulary/cr_vocabulary.parquet b/models/vocabulary/cr_vocabulary.parquet index 998e97b8e90b2bbf47b0bf1219af6cf51c02faae..2ce09e422a5a6f13e92857f339ec42e9a3afa2d7 100644 --- a/models/vocabulary/cr_vocabulary.parquet +++ b/models/vocabulary/cr_vocabulary.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:c9a339a558f9a91e507cf4c6a9afd7657b8a650a75b8594311ae35047326f22c -size 10707 +oid sha256:4da6445f4ebee272af36858a529222db9605c42d9f0ad70060cc98b59cfaf5ee +size 10298 diff --git a/models/vocabulary/cr_vocabulary_metadata.json b/models/vocabulary/cr_vocabulary_metadata.json index 7c557d514a454dec2d75be8b101d553f65a78b12..5583b5d63d26198c824d7c1f9639eb512067f15a 100644 --- a/models/vocabulary/cr_vocabulary_metadata.json +++ b/models/vocabulary/cr_vocabulary_metadata.json @@ -1,15 +1,15 @@ { "language": "cr", - "vocabulary_size": 489, + "vocabulary_size": 468, "variant": "full", "statistics": { - "type_token_ratio": 0.5915817165406116, + "type_token_ratio": 0.5884562841530054, "coverage": { - "top_100": 0.2709634988490628, - "top_1000": 0.7372574810917462 + "top_100": 0.2786885245901639, + "top_1000": 0.7530737704918032 }, - "hapax_count": 1310, - "hapax_ratio": 0.7281823235130628, + "hapax_count": 1255, + "hapax_ratio": 0.7283807312826466, "total_documents": 25 } } \ No newline at end of file diff --git a/models/word_markov/cr_markov_ctx1_word.parquet b/models/word_markov/cr_markov_ctx1_word.parquet index 29507891d01081885a3cbff556a2c45ee8bfeff9..d718fe30b28e05fbef4b4bd3d35c8271f6f57676 100644 --- a/models/word_markov/cr_markov_ctx1_word.parquet +++ b/models/word_markov/cr_markov_ctx1_word.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:1b9cad226d4f9247d3628f1a8d45e964b5ac7aee3588940f10b795c7db079363 -size 51656 +oid sha256:67ec91aad046d21a2680c190c002d6bbdf3e246b18b9dffb01c01c60fdd45417 +size 49544 diff --git a/models/word_markov/cr_markov_ctx1_word_metadata.json 
b/models/word_markov/cr_markov_ctx1_word_metadata.json index 11a5cfdd37f9d280aab5c196332d73dbeb14a9cf..07ccfe13a5f5e3e4195c74d2e9b3bc5444bf83ba 100644 --- a/models/word_markov/cr_markov_ctx1_word_metadata.json +++ b/models/word_markov/cr_markov_ctx1_word_metadata.json @@ -2,6 +2,6 @@ "context_size": 1, "variant": "word", "language": "cr", - "unique_contexts": 1787, - "total_transitions": 3016 + "unique_contexts": 1711, + "total_transitions": 2903 } \ No newline at end of file diff --git a/models/word_markov/cr_markov_ctx2_word.parquet b/models/word_markov/cr_markov_ctx2_word.parquet index a1ca92c4791bb739ca04cfdbba08a6d893e47859..53d1c806d04be6b6ea3af27704d03165231b815d 100644 --- a/models/word_markov/cr_markov_ctx2_word.parquet +++ b/models/word_markov/cr_markov_ctx2_word.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:b1f017105092b235e6c58f3c540d7b81c2ec84d6afd12ad9d42b7f06f32711cb -size 68727 +oid sha256:da940dffbcc51f50754424866b5a37358c2ebc38b4236083ea24a6f1806004df +size 65639 diff --git a/models/word_markov/cr_markov_ctx2_word_metadata.json b/models/word_markov/cr_markov_ctx2_word_metadata.json index d886a0860f530d1fd6d637fb15f1c3f21b9e4bbd..5213300799dd0d66c921a2e74b5eb5d99452a3dc 100644 --- a/models/word_markov/cr_markov_ctx2_word_metadata.json +++ b/models/word_markov/cr_markov_ctx2_word_metadata.json @@ -2,6 +2,6 @@ "context_size": 2, "variant": "word", "language": "cr", - "unique_contexts": 2607, - "total_transitions": 2991 + "unique_contexts": 2501, + "total_transitions": 2878 } \ No newline at end of file diff --git a/models/word_markov/cr_markov_ctx3_word.parquet b/models/word_markov/cr_markov_ctx3_word.parquet index 7bcd17cb34ccc3c8051337781465a48beee98a11..59aba068d92bd83e3049ea73ba6450d332b6b5d6 100644 --- a/models/word_markov/cr_markov_ctx3_word.parquet +++ b/models/word_markov/cr_markov_ctx3_word.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid 
sha256:ba438b38b527c374838426729a544c3af06986da131dcd7b6efc96c6174fdf64 -size 78351 +oid sha256:d5fd696554eca1524675c807015f1565335c8e8d03c64184b7283aecaf4877ac +size 75064 diff --git a/models/word_markov/cr_markov_ctx3_word_metadata.json b/models/word_markov/cr_markov_ctx3_word_metadata.json index eae3c24aad8a9c9aecfcecef8fa2cfeb43d05265..fe45a19bf433bdac8e7a0fe592086a1278ce40d5 100644 --- a/models/word_markov/cr_markov_ctx3_word_metadata.json +++ b/models/word_markov/cr_markov_ctx3_word_metadata.json @@ -2,6 +2,6 @@ "context_size": 3, "variant": "word", "language": "cr", - "unique_contexts": 2724, - "total_transitions": 2966 + "unique_contexts": 2617, + "total_transitions": 2853 } \ No newline at end of file diff --git a/models/word_markov/cr_markov_ctx4_word.parquet b/models/word_markov/cr_markov_ctx4_word.parquet index 8e784366eb7376241942007a967749461b9bbb5b..7e95539fad6240ebc94ea2350842c55f74810032 100644 --- a/models/word_markov/cr_markov_ctx4_word.parquet +++ b/models/word_markov/cr_markov_ctx4_word.parquet @@ -1,3 +1,3 @@ version https://git-lfs.github.com/spec/v1 -oid sha256:de165eca955a9457fb7d104a709440b37d8fe11272266a6eaf29d8aca13bcb96 -size 86120 +oid sha256:241b5162742d910baedb70c153556b238227f4b436220e840d48f82f5b69800a +size 82423 diff --git a/models/word_markov/cr_markov_ctx4_word_metadata.json b/models/word_markov/cr_markov_ctx4_word_metadata.json index 5b23b3064bab88fea5601e3764e9b68113df96c0..d10f7f2d4dfe6c6c00ceaa631f217072e7e33593 100644 --- a/models/word_markov/cr_markov_ctx4_word_metadata.json +++ b/models/word_markov/cr_markov_ctx4_word_metadata.json @@ -2,6 +2,6 @@ "context_size": 4, "variant": "word", "language": "cr", - "unique_contexts": 2765, - "total_transitions": 2941 + "unique_contexts": 2657, + "total_transitions": 2828 } \ No newline at end of file diff --git a/models/word_ngram/cr_2gram_word_metadata.json b/models/word_ngram/cr_2gram_word_metadata.json index 
a13f97a74e5febb978a4a15a0763bd98d0ab7c56..3ff21ed3d3515efaca965fd34750555e9982fcab 100644
--- a/models/word_ngram/cr_2gram_word_metadata.json
+++ b/models/word_ngram/cr_2gram_word_metadata.json
@@ -3,5 +3,5 @@
 "variant": "word",
 "language": "cr",
 "unique_ngrams": 17,
- "total_ngrams": 3016
+ "total_ngrams": 2903
 }
\ No newline at end of file
diff --git a/models/word_ngram/cr_3gram_word_metadata.json b/models/word_ngram/cr_3gram_word_metadata.json
index 8a9c0c7b817c3fcea2bc4b778bbe80f4ff0ab717..8369e95f21ee6beb6f45bd7a91a053e6c8fe2041 100644
--- a/models/word_ngram/cr_3gram_word_metadata.json
+++ b/models/word_ngram/cr_3gram_word_metadata.json
@@ -3,5 +3,5 @@
 "variant": "word",
 "language": "cr",
 "unique_ngrams": 16,
- "total_ngrams": 2991
+ "total_ngrams": 2878
 }
\ No newline at end of file
diff --git a/models/word_ngram/cr_4gram_word.parquet b/models/word_ngram/cr_4gram_word.parquet
index 567d491e169df1f3ecde7bccb28c37d7fd246fdb..5c2ec18f53ceb86bdecd83fd66f44f6ef3b140ef 100644
--- a/models/word_ngram/cr_4gram_word.parquet
+++ b/models/word_ngram/cr_4gram_word.parquet
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8c1ac43c0e3b24b9b058c232c3156ef3f4650c2f5098663b05aff7249ed3efd3
-size 5830
+oid sha256:bf88b51861c7b3e616572bab59d3354c7c36ad29a0cdfcbfb5afb5e47dca5e8c
+size 5691
diff --git a/models/word_ngram/cr_4gram_word_metadata.json b/models/word_ngram/cr_4gram_word_metadata.json
index 3a7f5d9c102b3ec32077d2333bc8e01ced7afd6a..0a040cafc4dd219be47f56b99f38fcc7741cd295 100644
--- a/models/word_ngram/cr_4gram_word_metadata.json
+++ b/models/word_ngram/cr_4gram_word_metadata.json
@@ -2,6 +2,6 @@
 "n": 4,
 "variant": "word",
 "language": "cr",
- "unique_ngrams": 166,
- "total_ngrams": 2966
+ "unique_ngrams": 160,
+ "total_ngrams": 2853
 }
\ No newline at end of file
diff --git a/models/word_ngram/cr_5gram_word.parquet b/models/word_ngram/cr_5gram_word.parquet
new file mode 100644
index 0000000000000000000000000000000000000000..effeb7c301c6add3004bc92b3bafd2ba12a5c1ef
--- /dev/null
+++ b/models/word_ngram/cr_5gram_word.parquet
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:70760bb1c7ef26efafbcc96f57e199a10946cfebe8a2d04b08e9a1105d4bfa70
+size 5647
diff --git a/models/word_ngram/cr_5gram_word_metadata.json b/models/word_ngram/cr_5gram_word_metadata.json
new file mode 100644
index 0000000000000000000000000000000000000000..7f78785a95d635526f1d9c01ea96d9e162fbe0e8
--- /dev/null
+++ b/models/word_ngram/cr_5gram_word_metadata.json
@@ -0,0 +1,7 @@
+{
+ "n": 5,
+ "variant": "word",
+ "language": "cr",
+ "unique_ngrams": 138,
+ "total_ngrams": 2828
+}
\ No newline at end of file
diff --git a/visualizations/embedding_isotropy.png b/visualizations/embedding_isotropy.png
index c212b68909629729c7f3d221fa1d35876a653486..a34f7d4859d0accd9e95788d9b0b6ace8b82389e 100644
Binary files a/visualizations/embedding_isotropy.png and b/visualizations/embedding_isotropy.png differ
diff --git a/visualizations/embedding_norms.png b/visualizations/embedding_norms.png
index 38c1aa252f1bef8ec7c017b49d02233f9a97d259..195b9e5c480f5de66293e38bcf2fe0d84caaf761 100644
Binary files a/visualizations/embedding_norms.png and b/visualizations/embedding_norms.png differ
diff --git a/visualizations/embedding_similarity.png b/visualizations/embedding_similarity.png
index d0ae6521f2ccf0fb933165ff8c2e960b9d854bb1..a0628c1dacf070951681a8003835589ceaf980c6 100644
--- a/visualizations/embedding_similarity.png
+++ b/visualizations/embedding_similarity.png
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:732705e6bcdc346987e99c1a4a72a4b8e9b4651fb77957ca2da5c1266364de8f
-size 151699
+oid sha256:3dd3e54f3eb72410e4700e80d77bfcc6534a63a9c7e5ce8f00a9830dadf6345b
+size 153374
diff --git a/visualizations/embedding_tsne_multilingual.png b/visualizations/embedding_tsne_multilingual.png
new file mode 100644
index 0000000000000000000000000000000000000000..ad78d7ab5a966a8ab60f4f473cb0da51c022650d
--- /dev/null
+++ b/visualizations/embedding_tsne_multilingual.png
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0d14969dcb0f433b2743fef0dee2e4b51b0dbf4d9d2a046e28d86bec0d513aec
+size 199939
diff --git a/visualizations/markov_branching.png b/visualizations/markov_branching.png
index 2d00f5bf1edff8faef46a5d9d30e8844deb7474c..edf0ec2aa71455c7693785a3afbd3423af39d011 100644
Binary files a/visualizations/markov_branching.png and b/visualizations/markov_branching.png differ
diff --git a/visualizations/markov_contexts.png b/visualizations/markov_contexts.png
index d49aa744e1a7dc391f5b3939ef9f9a2c36590868..9fce04b42b30e3b5ccb893ef9e01bacfa1362533 100644
Binary files a/visualizations/markov_contexts.png and b/visualizations/markov_contexts.png differ
diff --git a/visualizations/markov_entropy.png b/visualizations/markov_entropy.png
index 58435e1d85cd40cd6d0a1171a436f7e76cf8f894..920c3dc5ef0390a5d8abbd536fcc90b501f67388 100644
Binary files a/visualizations/markov_entropy.png and b/visualizations/markov_entropy.png differ
diff --git a/visualizations/model_sizes.png b/visualizations/model_sizes.png
index aa9db7de8a059ec83f3cac2bf31bfe781c422fa1..7a6024aa65fa30b0c76ecae4492f6fcf8b0de286 100644
Binary files a/visualizations/model_sizes.png and b/visualizations/model_sizes.png differ
diff --git a/visualizations/nearest_neighbors.png b/visualizations/nearest_neighbors.png
index a7362b7817c2b9bdc735c8a61f6802c80da674d8..fb43c7f69b001156e184f348d16d85717a6b4087 100644
Binary files a/visualizations/nearest_neighbors.png and b/visualizations/nearest_neighbors.png differ
diff --git a/visualizations/ngram_coverage.png b/visualizations/ngram_coverage.png
index c407f31a3fa307526a8ff57ff0dcbfa3b094d767..06da7adddffbb6e5f24261f0380e4f675f91ac05 100644
Binary files a/visualizations/ngram_coverage.png and b/visualizations/ngram_coverage.png differ
diff --git a/visualizations/ngram_entropy.png b/visualizations/ngram_entropy.png
index 6326b11d321ecbe94a2ddf68f87935b105556360..192ee8e2567ebbedc106d8082cff2bbbb9c5e8b2 100644
Binary files a/visualizations/ngram_entropy.png and b/visualizations/ngram_entropy.png differ
diff --git a/visualizations/ngram_perplexity.png b/visualizations/ngram_perplexity.png
index b7bde9951185d9795dbafe497363b4b4524780de..6a47b83b614e522686500abbdb124bf2559bab1c 100644
Binary files a/visualizations/ngram_perplexity.png and b/visualizations/ngram_perplexity.png differ
diff --git a/visualizations/ngram_unique.png b/visualizations/ngram_unique.png
index 1cbf0e8bcf237bd2c5e4cff33ce257c68c207b9d..86b3f3215e4aa9b1f7417ff9b9b988576127f47a 100644
Binary files a/visualizations/ngram_unique.png and b/visualizations/ngram_unique.png differ
diff --git a/visualizations/performance_dashboard.png b/visualizations/performance_dashboard.png
index 26b02d4ea12e18412d837af34f131e8b88182126..174716c7c74294e7e25babdd57e1ab413168bc83 100644
--- a/visualizations/performance_dashboard.png
+++ b/visualizations/performance_dashboard.png
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:37a27aec3a8ba503551fc7333c7b3ac282b8ec1c0372f69be56f99a4a852d0a2
-size 259027
+oid sha256:e51327876802232c5f01b6c649c8ac636520f25b5773d0ba556d3ae70cada920
+size 349716
diff --git a/visualizations/position_encoding_comparison.png b/visualizations/position_encoding_comparison.png
index 7674d0ef3a67d86ef8cc31a257f8daf1380179a9..002bc3a6b80922c6d908c5d0d6100965ebc8426c 100644
--- a/visualizations/position_encoding_comparison.png
+++ b/visualizations/position_encoding_comparison.png
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:85ef36936b103151179f21255d94207d5eb77a7091df35c4ca1190440a4358fe
-size 111649
+oid sha256:4f887019f325ced3f69abf5bb3ffb59c4b8768224d251e9947ba8ba581bd4799
+size 112017
diff --git a/visualizations/tokenizer_compression.png b/visualizations/tokenizer_compression.png
index 69f6bd3b0a817018285992284a6b6f5ecbfca5df..13615c69f07fd5410f8e3ba45aa580a6296d2874 100644
Binary files a/visualizations/tokenizer_compression.png and b/visualizations/tokenizer_compression.png differ
diff --git a/visualizations/tokenizer_fertility.png b/visualizations/tokenizer_fertility.png
index 69378eada67d5a29c5ca34a19996c859e8d23273..ca2b46030bbf3a433f66a791ea750b7983f18099 100644
Binary files a/visualizations/tokenizer_fertility.png and b/visualizations/tokenizer_fertility.png differ
diff --git a/visualizations/tokenizer_oov.png b/visualizations/tokenizer_oov.png
index 00463da8fffa75227d1afeb4e19e302a265ce86a..947eb81c987068c0c03a8aa6c7fff6ce6cf82023 100644
Binary files a/visualizations/tokenizer_oov.png and b/visualizations/tokenizer_oov.png differ
diff --git a/visualizations/tokenizer_total_tokens.png b/visualizations/tokenizer_total_tokens.png
index 0ef55b785857c894b20fe844606279d616ea3ef5..fd14beac1900a895bec5536d16224a8f802688d6 100644
Binary files a/visualizations/tokenizer_total_tokens.png and b/visualizations/tokenizer_total_tokens.png differ
diff --git a/visualizations/top20_words.png b/visualizations/top20_words.png
index a399e763adb5e14ff37463e94657c1d4b6d2040c..666eb7c0eacf4e35c48a31804b43051805a9ed63 100644
Binary files a/visualizations/top20_words.png and b/visualizations/top20_words.png differ
diff --git a/visualizations/vocab_coverage.png b/visualizations/vocab_coverage.png
index d5a15cddbb4852bf6c6428fe2be12de5c3b5bfda..730bbb7c571a8ec0b449c191d75e4dd44db48b6e 100644
Binary files a/visualizations/vocab_coverage.png and b/visualizations/vocab_coverage.png differ
diff --git a/visualizations/vocab_freq_dist.png b/visualizations/vocab_freq_dist.png
index 511949f5f34187e10ed56cb05288f3bbe41202eb..187165faa89ae3f3b4b5daff1b4ae9c048c3becc 100644
Binary files a/visualizations/vocab_freq_dist.png and b/visualizations/vocab_freq_dist.png differ
diff --git a/visualizations/zipf_law.png b/visualizations/zipf_law.png
index 1273c177118e298c881d28edd31d3b7fb6a93a69..7d2779618f2b09854d36826ac701fe455de603e3 100644
Binary files a/visualizations/zipf_law.png and b/visualizations/zipf_law.png differ