diff --git a/.gitattributes b/.gitattributes index a6344aac8c09253b3b630fb776ae94478aa0275b..86cd132a87e95bd3676ae4270acf536c2eb446b7 100644 --- a/.gitattributes +++ b/.gitattributes @@ -33,3 +33,7 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text *.zip filter=lfs diff=lfs merge=lfs -text *.zst filter=lfs diff=lfs merge=lfs -text *tfevents* filter=lfs diff=lfs merge=lfs -text +visualizations/embedding_similarity.png filter=lfs diff=lfs merge=lfs -text +visualizations/embedding_tsne_multilingual.png filter=lfs diff=lfs merge=lfs -text +visualizations/performance_dashboard.png filter=lfs diff=lfs merge=lfs -text +visualizations/position_encoding_comparison.png filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000000000000000000000000000000000000..cce0b63bdaaea28b3ea5ce574f35357f2a341977 --- /dev/null +++ b/README.md @@ -0,0 +1,698 @@ +--- +language: ng +language_name: Ndonga +language_family: bantu_central +tags: + - wikilangs + - nlp + - tokenizer + - embeddings + - n-gram + - markov + - wikipedia + - feature-extraction + - sentence-similarity + - tokenization + - n-grams + - markov-chain + - text-mining + - fasttext + - babelvec + - vocabulous + - vocabulary + - monolingual + - family-bantu_central +license: mit +library_name: wikilangs +pipeline_tag: text-generation +datasets: + - omarkamali/wikipedia-monthly +dataset_info: + name: wikipedia-monthly + description: Monthly snapshots of Wikipedia articles across 300+ languages +metrics: + - name: best_compression_ratio + type: compression + value: 2.981 + - name: best_isotropy + type: isotropy + value: 0.0034 + - name: vocabulary_size + type: vocab + value: 0 +generated: 2026-01-10 +--- + +# Ndonga - Wikilangs Models +## Comprehensive Research Report & Full Ablation Study + +This repository contains NLP models trained and evaluated by Wikilangs, specifically on **Ndonga** Wikipedia data. 
+We analyze tokenizers, n-gram models, Markov chains, vocabulary statistics, and word embeddings. + +## 📋 Repository Contents + +### Models & Assets + +- Tokenizers (8k, 16k, 32k, 64k) +- N-gram models (2, 3, 4, 5-gram) +- Markov chains (context of 1, 2, 3, 4 and 5) +- Subword N-gram and Markov chains +- Embeddings in various sizes and dimensions (aligned and unaligned) +- Language Vocabulary +- Language Statistics + +![Performance Dashboard](visualizations/performance_dashboard.png) + +### Analysis and Evaluation + +- [1. Tokenizer Evaluation](#1-tokenizer-evaluation) +- [2. N-gram Model Evaluation](#2-n-gram-model-evaluation) +- [3. Markov Chain Evaluation](#3-markov-chain-evaluation) +- [4. Vocabulary Analysis](#4-vocabulary-analysis) +- [5. Word Embeddings Evaluation](#5-word-embeddings-evaluation) +- [6. Morphological Analysis (Experimental)](#6-morphological-analysis-experimental) +- [7. Summary & Recommendations](#7-summary--recommendations) +- [Metrics Glossary](#appendix-metrics-glossary--interpretation-guide) +- [Visualizations Index](#visualizations-index) + +--- +## 1. Tokenizer Evaluation + +![Tokenizer Compression](visualizations/tokenizer_compression.png) + +![Tokenizer Fertility](visualizations/tokenizer_fertility.png) + +![Tokenizer OOV](visualizations/tokenizer_oov.png) + +![Total Tokens](visualizations/tokenizer_total_tokens.png) + +### Results + +| Vocab Size | Compression | Avg Token Len | UNK Rate | Total Tokens | +|------------|-------------|---------------|----------|--------------| +| **8k** | 2.981x 🏆 | 2.98 | 1.0627% | 13,080 | + +### Tokenization Examples + +Below are sample sentences tokenized with each vocabulary size: + + +### Key Findings + +- **Best Compression:** 8k achieves 2.981x compression +- **Lowest UNK Rate:** 8k with 1.0627% unknown tokens +- **Trade-off:** Only the 8k vocabulary was evaluated, so no compression/size trade-off across vocabulary sizes can be measured here +- **Recommendation:** 8k vocabulary, the only size evaluated on this corpus + +--- +## 2. 
N-gram Model Evaluation + +![N-gram Perplexity](visualizations/ngram_perplexity.png) + +![N-gram Unique](visualizations/ngram_unique.png) + +![N-gram Coverage](visualizations/ngram_coverage.png) + +### Results + +| N-gram | Variant | Perplexity | Entropy | Unique N-grams | Top-100 Coverage | Top-1000 Coverage | +|--------|---------|------------|---------|----------------|------------------|-------------------| +| **2-gram** | Word | 17 | 4.12 | 22 | 100.0% | 100.0% | +| **2-gram** | Subword | 286 | 8.16 | 589 | 60.1% | 100.0% | +| **3-gram** | Word | 13 | 3.74 | 23 | 100.0% | 100.0% | +| **3-gram** | Subword | 1,258 | 10.30 | 2,328 | 29.4% | 80.5% | +| **4-gram** | Word | 16 | 4.02 | 29 | 100.0% | 100.0% | +| **4-gram** | Subword | 2,459 | 11.26 | 4,677 | 22.5% | 61.8% | +| **5-gram** | Word | 9 🏆 | 3.17 | 15 | 100.0% | 100.0% | +| **5-gram** | Subword | 2,457 | 11.26 | 4,586 | 24.2% | 59.4% | + +### Top 5 N-grams by Size + +**2-grams (Word):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `nowy dwór` | 35 | +| 2 | `dwór królewski` | 35 | +| 3 | `na uuthemba` | 31 | +| 4 | `omuntu kehe` | 29 | +| 5 | `oku na` | 29 | + +**3-grams (Word):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `nowy dwór królewski` | 35 | +| 2 | `omuntu kehe oku` | 27 | +| 3 | `kehe oku na` | 27 | +| 4 | `oku na uuthemba` | 26 | +| 5 | `zh min nan` | 12 | + +**4-grams (Word):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `omuntu kehe oku na` | 27 | +| 2 | `kehe oku na uuthemba` | 24 | +| 3 | `nekofungama ar sefala angubo` | 3 | +| 4 | `harranga nekofungama ar sefala` | 3 | +| 5 | `ast harranga nekofungama ar` | 3 | + +**5-grams (Word):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `omuntu kehe oku na uuthemba` | 24 | +| 2 | `harranga nekofungama ar sefala angubo` | 3 | +| 3 | `ast harranga nekofungama ar sefala` | 3 | +| 4 | `nekofungama ar sefala angubo andusat` | 3 | +| 5 | `kape na nando omuntu e` | 3 | + +**2-grams 
(Subword):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `a _` | 1,406 | +| 2 | `a n` | 583 | +| 3 | `e _` | 427 | +| 4 | `n g` | 419 | +| 5 | `e n` | 411 | + +**3-grams (Subword):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `i a _` | 277 | +| 2 | `n a _` | 275 | +| 3 | `e r s` | 197 | +| 4 | `e n _` | 193 | +| 5 | `t e r` | 177 | + +**4-grams (Subword):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `e r s e` | 175 | +| 2 | `t e r s` | 169 | +| 3 | `r s e n` | 169 | +| 4 | `e t e r` | 169 | +| 5 | `u e t e` | 168 | + +**5-grams (Subword):** + +| Rank | N-gram | Count | +|------|--------|-------| +| 1 | `e r s e n` | 169 | +| 2 | `t e r s e` | 169 | +| 3 | `u e t e r` | 168 | +| 4 | `e t e r s` | 168 | +| 5 | `r s e n _` | 167 | + + +### Key Findings + +- **Best Perplexity:** 5-gram (word) with 9 +- **Entropy Trend:** Not monotonic: word-level entropy falls from 4.12 (2-gram) to 3.17 (5-gram) but rises at the 4-gram (4.02), and subword entropy rises from 8.16 to 11.26 +- **Coverage:** Top-1000 subword 5-grams cover 59.4% of occurrences; every word-level model reaches 100% within its top 100 +- **Recommendation:** 5-gram (word), which has the lowest perplexity; note that it rests on only 15 unique n-grams + +--- +## 3. 
Markov Chain Evaluation + +![Markov Entropy](visualizations/markov_entropy.png) + +![Markov Contexts](visualizations/markov_contexts.png) + +![Markov Branching](visualizations/markov_branching.png) + +### Results + +| Context | Variant | Avg Entropy | Perplexity | Branching Factor | Unique Contexts | Predictability | +|---------|---------|-------------|------------|------------------|-----------------|----------------| +| **1** | Word | 0.4936 | 1.408 | 2.30 | 2,515 | 50.6% | +| **1** | Subword | 0.5935 | 1.509 | 3.07 | 1,104 | 40.6% | +| **2** | Word | 0.0333 | 1.023 | 1.06 | 5,756 | 96.7% | +| **2** | Subword | 0.4561 | 1.372 | 2.43 | 3,389 | 54.4% | +| **3** | Word | 0.0092 | 1.006 | 1.02 | 6,060 | 99.1% | +| **3** | Subword | 0.4100 | 1.329 | 1.84 | 8,218 | 59.0% | +| **4** | Word | 0.0036 🏆 | 1.002 | 1.01 | 6,160 | 99.6% | +| **4** | Subword | 0.2372 | 1.179 | 1.40 | 15,074 | 76.3% | + +### Generated Text Samples (Word-based) + +Below are text samples generated from each word-based Markov chain model: + +**Context Size 1:** + +1. `uetersen nds asien li azië nn geografi sw jamhuri ya uvuneka kutya otashi gandja uuthemba wokugamenw...` +2. `wikipedia id turki sq uetersen tl turkiya crh asiya hak asasi manusia io kulturo es uetersen` +3. `na nando omuntu kehe ngoka ha baibûl hak ngùi kî pak khô haw ākia he אנגלית` + +**Context Size 2:** + +1. `nowy dwór królewski tr nowy dwór królewski en nowy dwór królewski nn nowy dwór królewski pt nowy` +2. `dwór królewski en nowy dwór królewski nn nowy dwór królewski en nowy dwór królewski et nowy dwór` +3. `na uuthemba womuthika omwaanawa gwonkalamwenyo memanguluko iya andjagana uuna iilyo yiilongo ya uvun...` + +**Context Size 3:** + +1. `nowy dwór królewski ff nowy dwór królewski tum nowy dwór królewski pl nowy dwór królewski de nowy dw...` +2. `kehe oku na uuthemba womuthika omwaanawa gwonkalamwenyo gwa yeleka uukolele nonkalo ombwanawa ye mwe...` +3. 
`omuntu kehe oku na uuthemba welandulathano iyopankalathano nolyomuuyuni moka uuthemba nemanguluko nd...` + +**Context Size 4:** + +1. `omuntu kehe oku na uuthemba wokutota nokuninga oshilyo shehangano iyaaniilonga opo a gamene uuwanawa...` +2. `kehe oku na uuthemba womafutilo ngele okwa kulupa nenge a mona oshiponga moshilongo she nenge paigwa...` +3. `ar sefala angubo andusat ace bahsa inggréh af engels ak english als englische sprache am እንግሊዝኛ an i...` + + +### Generated Text Samples (Subword-based) + +Below are text samples generated from each subword-based Markov chain model: + +**Context Size 1:** + +1. `_inoghe_अधिकारों_sk:` +2. `a:tesen_ndulidur` +3. `entueneburs'at_b` + +**Context Size 2:** + +1. `a_a_vica_op_an_uu` +2. `an_she:ויקיפדיה_l` +3. `e_papublisencia_k` + +**Context Size 3:** + +1. `ia_bm:hadan_mwl:bi` +2. `na_nga_nomakwa_uvu` +3. `ersen_wu_li:una_oy` + +**Context Size 4:** + +1. `ersen_wuukwa,_a_kut` +2. `etersen_su:wikipiki` +3. `tersele_nokoompumbi` + + +### Key Findings + +- **Best Predictability:** Context-4 (word) with 99.6% predictability +- **Branching Factor:** Decreases with context size (more deterministic) +- **Memory Trade-off:** Larger contexts require more storage (15,074 contexts) +- **Recommendation:** Context-3 or Context-4 for text generation + +--- +## 4. 
Vocabulary Analysis + +![Zipf's Law](visualizations/zipf_law.png) + +![Top Words](visualizations/top20_words.png) + +![Coverage Curve](visualizations/vocab_coverage.png) + +### Statistics + +| Metric | Value | +|--------|-------| +| Vocabulary Size | 648 | +| Total Tokens | 4,436 | +| Mean Frequency | 6.85 | +| Median Frequency | 4 | +| Frequency Std Dev | 10.46 | + +### Most Common Words + +| Rank | Word | Frequency | +|------|------|-----------| +| 1 | uetersen | 168 | +| 2 | wikipedia | 87 | +| 3 | na | 78 | +| 4 | ghana | 71 | +| 5 | uuthemba | 50 | +| 6 | asia | 49 | +| 7 | pigazzano | 47 | +| 8 | zh | 44 | +| 9 | de | 42 | +| 10 | kehe | 37 | + +### Least Common Words (from vocabulary) + +| Rank | Word | Frequency | +|------|------|-----------| +| 1 | turecko | 2 | +| 2 | τουρκία | 2 | +| 3 | tuirc | 2 | +| 4 | तुर्किये | 2 | +| 5 | germany | 2 | +| 6 | ঘানা | 2 | +| 7 | thumb | 2 | +| 8 | italy | 2 | +| 9 | piasensa | 2 | +| 10 | двор | 2 | + +### Zipf's Law Analysis + +| Metric | Value | +|--------|-------| +| Zipf Coefficient | 0.8074 | +| R² (Goodness of Fit) | 0.939699 | +| Adherence Quality | **excellent** | + +### Coverage Analysis + +| Top N Words | Coverage | +|-------------|----------| +| Top 100 | 49.3% | +| Top 1,000 | N/A (vocabulary has 648 words) | +| Top 5,000 | N/A (vocabulary has 648 words) | +| Top 10,000 | N/A (vocabulary has 648 words) | + +### Key Findings + +- **Zipf Compliance:** R²=0.9397 indicates excellent adherence to Zipf's law +- **High Frequency Dominance:** Top 100 words cover 49.3% of corpus +- **Long Tail:** The remaining 548 vocabulary words share at most the other 50.7% of the corpus + +--- +## 5. 
Word Embeddings Evaluation + +![Embedding Isotropy](visualizations/embedding_isotropy.png) + +![Similarity Matrix](visualizations/embedding_similarity.png) + +![t-SNE Words](visualizations/tsne_words.png) + +![t-SNE Sentences](visualizations/tsne_sentences.png) + + +### 5.1 Cross-Lingual Alignment + +![Multilingual t-SNE](visualizations/embedding_tsne_multilingual.png) + + +### 5.2 Model Comparison + +| Model | Dimension | Isotropy | Semantic Density | Alignment R@1 | Alignment R@10 | +|-------|-----------|----------|------------------|---------------|----------------| +| **mono_32d** | 32 | 0.0034 🏆 | 0.0000 | N/A | N/A | +| **mono_64d** | 64 | 0.0001 | 0.0000 | N/A | N/A | +| **mono_128d** | 128 | 0.0000 | 0.0000 | N/A | N/A | +| **aligned_32d** | 32 | 0.0034 | 0.0000 | 0.0000 | 0.0000 | +| **aligned_64d** | 64 | 0.0001 | 0.0000 | 0.0000 | 0.0000 | +| **aligned_128d** | 128 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | + +### Key Findings + +- **Best Isotropy:** mono_32d with 0.0034, the highest of the six models but still close to zero +- **Semantic Density:** Average pairwise similarity of 0.0000. Lower values indicate better semantic separation. +- **Alignment Quality:** Aligned models evaluated but achieved 0% recall. +- **Recommendation:** No aligned model can be recommended for cross-lingual use given 0% recall at both R@1 and R@10; among the evaluated models, mono_32d has the highest isotropy + +--- +## 6. Morphological Analysis (Experimental) + +This section presents an automated morphological analysis derived from the statistical divergence between word-level and subword-level models. By analyzing where subword predictability spikes and where word-level coverage fails, we can infer linguistic structures without supervised data. 
+ +### 6.1 Productivity & Complexity + +| Metric | Value | Interpretation | Recommendation | +|--------|-------|----------------|----------------| +| Productivity Index | **1.124** | High morphological productivity | Reliable analysis | +| Idiomaticity Gap | **0.683** | High formulaic/idiomatic content | - | + +### 6.2 Affix Inventory (Productive Units) + +These are the most productive prefixes and suffixes identified by sampling the vocabulary for global substitutability patterns. A unit is considered an affix if stripping it leaves a valid stem that appears in other contexts. + +#### Productive Prefixes +| Prefix | Examples | +|--------|----------| + +#### Productive Suffixes +| Suffix | Examples | +|--------|----------| +| `-a` | mpoka, ehyia, kaa | + +### 6.3 Bound Stems (Lexical Roots) + +Bound stems are high-frequency subword units that are semantically cohesive but rarely appear as standalone words. These often correspond to the 'core' of a word that requires inflection or derivation to be valid. + +*No significant bound stems detected.* + + +### 6.4 Affix Compatibility (Co-occurrence) + +This table shows which prefixes and suffixes most frequently co-occur on the same stems, revealing the 'stacking' rules of the language's morphology. + +*No significant affix co-occurrences detected.* + + +### 6.5 Recursive Morpheme Segmentation + +Using **Recursive Hierarchical Substitutability**, we decompose complex words into their constituent morphemes. This approach handles nested affixes (e.g., `prefix-prefix-root-suffix`). 
+ +| Word | Suggested Split | Confidence | Stem | +|------|-----------------|------------|------| +| universala | **`universal-a`** | 4.5 | `universal` | +| geografia | **`geografi-a`** | 4.5 | `geografi` | +| republika | **`republik-a`** | 4.5 | `republik` | +| kwatelela | **`kwatelel-a`** | 1.5 | `kwatelel` | +| manguluka | **`manguluk-a`** | 1.5 | `manguluk` | +| wikipedya | **`wikipedy-a`** | 1.5 | `wikipedy` | +| geographia | **`geographi-a`** | 1.5 | `geographi` | + +### 6.6 Linguistic Interpretation + +> **Automated Insight:** +The language Ndonga shows moderate morphological complexity. There is a balanced trade-off between whole-word memorization and subword composition. + +> **Note on Idiomaticity:** The high Idiomaticity Gap suggests a large number of frequent multi-word expressions or formulaic sequences that are statistically distinct from their component parts. + +--- +## 7. Summary & Recommendations + +![Performance Dashboard](visualizations/performance_dashboard.png) + +### Production Recommendations + +| Component | Recommended | Rationale | +|-----------|-------------|-----------| +| Tokenizer | **8k BPE** | Best compression (2.98x) | +| N-gram | **5-gram** | Lowest perplexity (9) | +| Markov | **Context-4** | Highest predictability (99.6%) | +| Embeddings | **32d** | Highest isotropy (0.0034) of the evaluated sizes | + + +--- +## Appendix: Metrics Glossary & Interpretation Guide + +This section provides definitions, intuitions, and guidance for interpreting the metrics used throughout this report. + +### Tokenizer Metrics + +**Compression Ratio** +> *Definition:* The ratio of characters to tokens (chars/token). Measures how efficiently the tokenizer represents text. +> +> *Intuition:* Higher compression means fewer tokens needed to represent the same text, reducing sequence lengths for downstream models. A 3x compression means ~3 characters per token on average. 
+> +> *What to seek:* Higher is generally better for efficiency, but extremely high compression may indicate overly aggressive merging that loses morphological information. + +**Average Token Length (Fertility)** +> *Definition:* Mean number of characters per token produced by the tokenizer. +> +> *Intuition:* Reflects the granularity of tokenization. Longer tokens capture more context but may struggle with rare words; shorter tokens are more flexible but increase sequence length. +> +> *What to seek:* Balance between 2-5 characters for most languages. Arabic/morphologically-rich languages may benefit from slightly longer tokens. + +**Unknown Token Rate (OOV Rate)** +> *Definition:* Percentage of tokens that map to the unknown/UNK token, indicating words the tokenizer cannot represent. +> +> *Intuition:* Lower OOV means better vocabulary coverage. High OOV indicates the tokenizer encounters many unseen character sequences. +> +> *What to seek:* Below 1% is excellent; below 5% is acceptable. BPE tokenizers typically achieve very low OOV due to subword fallback. + +### N-gram Model Metrics + +**Perplexity** +> *Definition:* Measures how "surprised" the model is by test data. Mathematically: 2^(cross-entropy). Lower values indicate better prediction. +> +> *Intuition:* If perplexity is 100, the model is as uncertain as if choosing uniformly among 100 options at each step. A perplexity of 10 means effectively choosing among 10 equally likely options. +> +> *What to seek:* Lower is better. Perplexity decreases with larger n-grams (more context). Values vary widely by language and corpus size. + +**Entropy** +> *Definition:* Average information content (in bits) needed to encode the next token given the context. Related to perplexity: perplexity = 2^entropy. +> +> *Intuition:* High entropy means high uncertainty/randomness; low entropy means predictable patterns. Natural language typically has entropy between 1-4 bits per character. 
+> +> *What to seek:* Lower entropy indicates more predictable text patterns. Entropy should decrease as n-gram size increases. + +**Coverage (Top-K)** +> *Definition:* Percentage of corpus occurrences explained by the top K most frequent n-grams. +> +> *Intuition:* High coverage with few patterns indicates repetitive/formulaic text; low coverage suggests diverse vocabulary usage. +> +> *What to seek:* Depends on use case. For language modeling, moderate coverage (40-60% with top-1000) is typical for natural text. + +### Markov Chain Metrics + +**Average Entropy** +> *Definition:* Mean entropy across all contexts, measuring average uncertainty in next-word prediction. +> +> *Intuition:* Lower entropy means the model is more confident about what comes next. Context-1 has high entropy (many possible next words); Context-4 has low entropy (few likely continuations). +> +> *What to seek:* Decreasing entropy with larger context sizes. Very low entropy (<0.1) indicates highly deterministic transitions. + +**Branching Factor** +> *Definition:* Average number of unique next tokens observed for each context. +> +> *Intuition:* High branching = many possible continuations (flexible but uncertain); low branching = few options (predictable but potentially repetitive). +> +> *What to seek:* Branching factor should decrease with context size. Values near 1.0 indicate nearly deterministic chains. + +**Predictability** +> *Definition:* Derived metric: (1 - normalized_entropy) × 100%. Indicates how deterministic the model's predictions are. +> +> *Intuition:* 100% predictability means the next word is always certain; 0% means completely random. Real text falls between these extremes. +> +> *What to seek:* Higher predictability for text generation quality, but too high (>98%) may produce repetitive output. + +### Vocabulary & Zipf's Law Metrics + +**Zipf's Coefficient** +> *Definition:* The slope of the log-log plot of word frequency vs. rank. 
Zipf's law predicts this should be approximately -1. +> +> *Intuition:* A coefficient near -1 indicates the corpus follows natural language patterns where a few words are very common and most words are rare. +> +> *What to seek:* Values between -0.8 and -1.2 indicate healthy natural language distribution. Deviations may suggest domain-specific or artificial text. + +**R² (Coefficient of Determination)** +> *Definition:* Measures how well the linear fit explains the frequency-rank relationship. Ranges from 0 to 1. +> +> *Intuition:* R² near 1.0 means the data closely follows Zipf's law; lower values indicate deviation from expected word frequency patterns. +> +> *What to seek:* R² > 0.95 is excellent; > 0.99 indicates near-perfect Zipf adherence typical of large natural corpora. + +**Vocabulary Coverage** +> *Definition:* Cumulative percentage of corpus tokens accounted for by the top N words. +> +> *Intuition:* Shows how concentrated word usage is. If top-100 words cover 50% of text, the corpus relies heavily on common words. +> +> *What to seek:* Top-100 covering 30-50% is typical. Higher coverage indicates more repetitive text; lower suggests richer vocabulary. + +### Word Embedding Metrics + +**Isotropy** +> *Definition:* Measures how uniformly distributed vectors are in the embedding space. Computed as the ratio of minimum to maximum singular values. +> +> *Intuition:* High isotropy (near 1.0) means vectors spread evenly in all directions; low isotropy means vectors cluster in certain directions, reducing expressiveness. +> +> *What to seek:* Higher isotropy generally indicates better-quality embeddings. Values > 0.1 are reasonable; > 0.3 is good. Lower-dimensional embeddings tend to have higher isotropy. + +**Average Norm** +> *Definition:* Mean magnitude (L2 norm) of word vectors in the embedding space. +> +> *Intuition:* Indicates the typical "length" of vectors. 
Consistent norms suggest stable training; high variance may indicate some words are undertrained. +> +> *What to seek:* Relatively consistent norms across models. The absolute value matters less than consistency (low std deviation). + +**Cosine Similarity** +> *Definition:* Measures angular similarity between vectors, ranging from -1 (opposite) to 1 (identical direction). +> +> *Intuition:* Words with similar meanings should have high cosine similarity. This is the standard metric for semantic relatedness in embeddings. +> +> *What to seek:* Semantically related words should score > 0.5; unrelated words should be near 0. Synonyms often score > 0.7. + +**t-SNE Visualization** +> *Definition:* t-Distributed Stochastic Neighbor Embedding - a dimensionality reduction technique that preserves local structure for visualization. +> +> *Intuition:* Clusters in t-SNE plots indicate groups of semantically related words. Spread indicates vocabulary diversity; tight clusters suggest semantic coherence. +> +> *What to seek:* Meaningful clusters (e.g., numbers together, verbs together). Avoid over-interpreting distances - t-SNE preserves local, not global, structure. + +### General Interpretation Guidelines + +1. **Compare within model families:** Metrics are most meaningful when comparing models of the same type (e.g., 8k vs 64k tokenizer). +2. **Consider trade-offs:** Better performance on one metric often comes at the cost of another (e.g., compression vs. OOV rate). +3. **Context matters:** Optimal values depend on downstream tasks. Text generation may prioritize different metrics than classification. +4. **Corpus influence:** All metrics are influenced by corpus characteristics. Wikipedia text differs from social media or literature. +5. **Language-specific patterns:** Morphologically rich languages (like Arabic) may show different optimal ranges than analytic languages. 
+ + +### Visualizations Index + +| Visualization | Description | +|---------------|-------------| +| Tokenizer Compression | Compression ratios by vocabulary size | +| Tokenizer Fertility | Average token length by vocabulary | +| Tokenizer OOV | Unknown token rates | +| Tokenizer Total Tokens | Total tokens by vocabulary | +| N-gram Perplexity | Perplexity by n-gram size | +| N-gram Entropy | Entropy by n-gram size | +| N-gram Coverage | Top pattern coverage | +| N-gram Unique | Unique n-gram counts | +| Markov Entropy | Entropy by context size | +| Markov Branching | Branching factor by context | +| Markov Contexts | Unique context counts | +| Zipf's Law | Frequency-rank distribution with fit | +| Vocab Frequency | Word frequency distribution | +| Top 20 Words | Most frequent words | +| Vocab Coverage | Cumulative coverage curve | +| Embedding Isotropy | Vector space uniformity | +| Embedding Norms | Vector magnitude distribution | +| Embedding Similarity | Word similarity heatmap | +| Nearest Neighbors | Similar words for key terms | +| t-SNE Words | 2D word embedding visualization | +| t-SNE Sentences | 2D sentence embedding visualization | +| Position Encoding | Encoding method comparison | +| Model Sizes | Storage requirements | +| Performance Dashboard | Comprehensive performance overview | + +--- +## About This Project + +### Data Source + +Models trained on [wikipedia-monthly](https://huggingface.co/datasets/omarkamali/wikipedia-monthly) - a monthly snapshot of Wikipedia articles across 300+ languages. + +### Project + +A project by **[Wikilangs](https://wikilangs.org)** - Open-source NLP models for every Wikipedia language. 
+ +### Maintainer + +[Omar Kamali](https://omarkamali.com) - [Omneity Labs](https://omneitylabs.com) + +### Citation + +If you use these models in your research, please cite: + +```bibtex +@misc{wikilangs2025, + author = {Kamali, Omar}, + title = {Wikilangs: Open NLP Models for Wikipedia Languages}, + year = {2025}, + doi = {10.5281/zenodo.18073153}, + publisher = {Zenodo}, + url = {https://huggingface.co/wikilangs}, + institution = {Omneity Labs} +} +``` + +### License + +MIT License - Free for academic and commercial use. + +### Links + +- 🌐 Website: [wikilangs.org](https://wikilangs.org) +- 🤗 Models: [huggingface.co/wikilangs](https://huggingface.co/wikilangs) +- 📊 Data: [wikipedia-monthly](https://huggingface.co/datasets/omarkamali/wikipedia-monthly) +- 👤 Author: [Omar Kamali](https://huggingface.co/omarkamali) +- 🤝 Sponsor: [Featherless AI](https://featherless.ai) +--- +*Generated by Wikilangs Models Pipeline* + +*Report Date: 2026-01-10 14:50:35* diff --git a/models/embeddings/aligned/ng_128d.bin b/models/embeddings/aligned/ng_128d.bin new file mode 100644 index 0000000000000000000000000000000000000000..e0adacb77baed02f80859a33f831d17edab063a0 --- /dev/null +++ b/models/embeddings/aligned/ng_128d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34bf2a399fd128262456f17fe7df20a1e629cc9c5143947550b6d3d9cd66c209 +size 1024056189 diff --git a/models/embeddings/aligned/ng_128d.meta.json b/models/embeddings/aligned/ng_128d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..14495945363756689db67fc2e506e5c80185a655 --- /dev/null +++ b/models/embeddings/aligned/ng_128d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 128, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/ng_128d.projection.npy b/models/embeddings/aligned/ng_128d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..e2934e84f65cfd7aaa8bac8d438fce476375c375 --- 
/dev/null +++ b/models/embeddings/aligned/ng_128d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2e051c021f38d27c84ab4d6df44c89e9fbf3706c78ffd6fdf9b8d3763c34e7c5 +size 65664 diff --git a/models/embeddings/aligned/ng_128d_metadata.json b/models/embeddings/aligned/ng_128d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..7452b9b530097b079f00384fea2d91b6f4cce874 --- /dev/null +++ b/models/embeddings/aligned/ng_128d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "ng", + "dimension": 128, + "version": "aligned", + "hub_language": "en", + "seed_vocab_size": 28, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/embeddings/aligned/ng_32d.bin b/models/embeddings/aligned/ng_32d.bin new file mode 100644 index 0000000000000000000000000000000000000000..31bd2358006ad970d5e39bd3d09459494e9b9321 --- /dev/null +++ b/models/embeddings/aligned/ng_32d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fa58d081dda754238780d73dc92ccd9120de5c25f3e7700818762f81f1912421 +size 256014717 diff --git a/models/embeddings/aligned/ng_32d.meta.json b/models/embeddings/aligned/ng_32d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..46206ea58870b300ccd111154faeda554e02fce0 --- /dev/null +++ b/models/embeddings/aligned/ng_32d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 32, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/ng_32d.projection.npy b/models/embeddings/aligned/ng_32d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..7c2e715ce35be70af5a3aff9d9e6f338c69c8930 --- /dev/null +++ b/models/embeddings/aligned/ng_32d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9cf7f47d8f65cbe8641d49e2139bec4461ffe42210102fe78b31e529c0f04a39 +size 4224 diff --git a/models/embeddings/aligned/ng_32d_metadata.json 
b/models/embeddings/aligned/ng_32d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..d3a3a5f36a8d7352d26415ca6f94532ab69d2925 --- /dev/null +++ b/models/embeddings/aligned/ng_32d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "ng", + "dimension": 32, + "version": "aligned", + "hub_language": "en", + "seed_vocab_size": 28, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/embeddings/aligned/ng_64d.bin b/models/embeddings/aligned/ng_64d.bin new file mode 100644 index 0000000000000000000000000000000000000000..a216e409bffa6a3d87cfbf72f09832174c759312 --- /dev/null +++ b/models/embeddings/aligned/ng_64d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9408eae8053e83895873acb3dc744c69420075e2bdbd1f96b4425f7b1ad9e5f1 +size 512028541 diff --git a/models/embeddings/aligned/ng_64d.meta.json b/models/embeddings/aligned/ng_64d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..1d121c04553a79e27f70f52dbb3f9d9699945a69 --- /dev/null +++ b/models/embeddings/aligned/ng_64d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 64, "max_seq_len": 512, "is_aligned": true} \ No newline at end of file diff --git a/models/embeddings/aligned/ng_64d.projection.npy b/models/embeddings/aligned/ng_64d.projection.npy new file mode 100644 index 0000000000000000000000000000000000000000..0d39ff1fa312208fb83e72b249acddf69f852c8e --- /dev/null +++ b/models/embeddings/aligned/ng_64d.projection.npy @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a53262ac6a66e2ba0fb6f696fa834f7c1926c9d514fbceeb93869590b4c9efa4 +size 16512 diff --git a/models/embeddings/aligned/ng_64d_metadata.json b/models/embeddings/aligned/ng_64d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..462fb8984d97d0db8e9f56c2b4688676a2ec1eda --- /dev/null +++ b/models/embeddings/aligned/ng_64d_metadata.json @@ -0,0 +1,8 @@ +{ + "language": "ng", + "dimension": 64, + "version": 
"aligned", + "hub_language": "en", + "seed_vocab_size": 28, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_128d.bin b/models/embeddings/monolingual/ng_128d.bin new file mode 100644 index 0000000000000000000000000000000000000000..e0adacb77baed02f80859a33f831d17edab063a0 --- /dev/null +++ b/models/embeddings/monolingual/ng_128d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34bf2a399fd128262456f17fe7df20a1e629cc9c5143947550b6d3d9cd66c209 +size 1024056189 diff --git a/models/embeddings/monolingual/ng_128d.meta.json b/models/embeddings/monolingual/ng_128d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..916049a45d9aa886f1abeab04cd8054df1212e1e --- /dev/null +++ b/models/embeddings/monolingual/ng_128d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 128, "max_seq_len": 512, "is_aligned": false} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_128d_metadata.json b/models/embeddings/monolingual/ng_128d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..23b4b271f9a5d66a7b1de91c8b71680bd6c2a716 --- /dev/null +++ b/models/embeddings/monolingual/ng_128d_metadata.json @@ -0,0 +1,16 @@ +{ + "language": "ng", + "dimension": 128, + "version": "monolingual", + "training_params": { + "algorithm": "skipgram", + "min_count": 5, + "window": 5, + "negative": 5, + "epochs": 5, + "encoding_method": "rope", + "dim": 128, + "threads": 5 + }, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_32d.bin b/models/embeddings/monolingual/ng_32d.bin new file mode 100644 index 0000000000000000000000000000000000000000..31bd2358006ad970d5e39bd3d09459494e9b9321 --- /dev/null +++ b/models/embeddings/monolingual/ng_32d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fa58d081dda754238780d73dc92ccd9120de5c25f3e7700818762f81f1912421 +size 256014717 diff --git 
a/models/embeddings/monolingual/ng_32d.meta.json b/models/embeddings/monolingual/ng_32d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..3e7d6843ba72e18d0ca5b86b101f707e39487054 --- /dev/null +++ b/models/embeddings/monolingual/ng_32d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 32, "max_seq_len": 512, "is_aligned": false} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_32d_metadata.json b/models/embeddings/monolingual/ng_32d_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..1c00a04de3740763a189bd544ad0cf0c92f38fb3 --- /dev/null +++ b/models/embeddings/monolingual/ng_32d_metadata.json @@ -0,0 +1,16 @@ +{ + "language": "ng", + "dimension": 32, + "version": "monolingual", + "training_params": { + "algorithm": "skipgram", + "min_count": 5, + "window": 5, + "negative": 5, + "epochs": 5, + "encoding_method": "rope", + "dim": 32, + "threads": 5 + }, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_64d.bin b/models/embeddings/monolingual/ng_64d.bin new file mode 100644 index 0000000000000000000000000000000000000000..a216e409bffa6a3d87cfbf72f09832174c759312 --- /dev/null +++ b/models/embeddings/monolingual/ng_64d.bin @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9408eae8053e83895873acb3dc744c69420075e2bdbd1f96b4425f7b1ad9e5f1 +size 512028541 diff --git a/models/embeddings/monolingual/ng_64d.meta.json b/models/embeddings/monolingual/ng_64d.meta.json new file mode 100644 index 0000000000000000000000000000000000000000..e571c7292b0a2add6cf18e8c41c6c3c7ee5a538e --- /dev/null +++ b/models/embeddings/monolingual/ng_64d.meta.json @@ -0,0 +1 @@ +{"lang": "ng", "dim": 64, "max_seq_len": 512, "is_aligned": false} \ No newline at end of file diff --git a/models/embeddings/monolingual/ng_64d_metadata.json b/models/embeddings/monolingual/ng_64d_metadata.json new file mode 100644 index 
0000000000000000000000000000000000000000..f241d6cf2d5ae5f9e5c5da7db9c9ced763c27d69 --- /dev/null +++ b/models/embeddings/monolingual/ng_64d_metadata.json @@ -0,0 +1,16 @@ +{ + "language": "ng", + "dimension": 64, + "version": "monolingual", + "training_params": { + "algorithm": "skipgram", + "min_count": 5, + "window": 5, + "negative": 5, + "epochs": 5, + "encoding_method": "rope", + "dim": 64, + "threads": 5 + }, + "vocab_size": 54 +} \ No newline at end of file diff --git a/models/subword_markov/ng_markov_ctx1_subword.parquet b/models/subword_markov/ng_markov_ctx1_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..f3a470a31e42f012dc1b1d57a1a7ea68d677e92c --- /dev/null +++ b/models/subword_markov/ng_markov_ctx1_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:297baf033d4f418727d23b17b2108aba4409f4ea17bae25bd29f1580df216092 +size 33198 diff --git a/models/subword_markov/ng_markov_ctx1_subword_metadata.json b/models/subword_markov/ng_markov_ctx1_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..7475f3f763825122bc640d24a280bcfe8096a173 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx1_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 1, + "variant": "subword", + "language": "ng", + "unique_contexts": 1104, + "total_transitions": 38341 +} \ No newline at end of file diff --git a/models/subword_markov/ng_markov_ctx2_subword.parquet b/models/subword_markov/ng_markov_ctx2_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..b164e805ace2ed6ca009e4e26b5a3d67ee8b8f59 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx2_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f30b0fa204f3f0a19f033013fa4c7884427471a22cbab856e5e408f33f630da6 +size 75678 diff --git a/models/subword_markov/ng_markov_ctx2_subword_metadata.json b/models/subword_markov/ng_markov_ctx2_subword_metadata.json 
new file mode 100644 index 0000000000000000000000000000000000000000..72e01f2d344c90170f30864fb26368ae0be427c6 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx2_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 2, + "variant": "subword", + "language": "ng", + "unique_contexts": 3389, + "total_transitions": 38324 +} \ No newline at end of file diff --git a/models/subword_markov/ng_markov_ctx3_subword.parquet b/models/subword_markov/ng_markov_ctx3_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..ec8fa843964d06fe77ff60823af9569107d2bba6 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx3_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:47216f438bc6170fc0f32910e081ffaf456b5de4c17ffca2533aa3520fd1facb +size 144981 diff --git a/models/subword_markov/ng_markov_ctx3_subword_metadata.json b/models/subword_markov/ng_markov_ctx3_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..aa754e68e752e0f3af8f7663c2af3192fb21a37d --- /dev/null +++ b/models/subword_markov/ng_markov_ctx3_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 3, + "variant": "subword", + "language": "ng", + "unique_contexts": 8218, + "total_transitions": 38307 +} \ No newline at end of file diff --git a/models/subword_markov/ng_markov_ctx4_subword.parquet b/models/subword_markov/ng_markov_ctx4_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..b21aab52ec3835867a705f808d7852537929d083 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx4_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb3e5cb24aabaaacc0f8e8680e997948f9312aba47d7375aee3edf8ca4e07335 +size 229202 diff --git a/models/subword_markov/ng_markov_ctx4_subword_metadata.json b/models/subword_markov/ng_markov_ctx4_subword_metadata.json new file mode 100644 index 
0000000000000000000000000000000000000000..5ee8af2170095fa1fd471cbdd0d78fe2e1476b37 --- /dev/null +++ b/models/subword_markov/ng_markov_ctx4_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 4, + "variant": "subword", + "language": "ng", + "unique_contexts": 15074, + "total_transitions": 38290 +} \ No newline at end of file diff --git a/models/subword_ngram/ng_2gram_subword.parquet b/models/subword_ngram/ng_2gram_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..d9ecde9c3db2e1d5c07530995da98619b8b8ed67 --- /dev/null +++ b/models/subword_ngram/ng_2gram_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3799c029a3bf65b745ec32c27679ee1a5d3843f53b46739f065ec13d802b1729 +size 7920 diff --git a/models/subword_ngram/ng_2gram_subword_metadata.json b/models/subword_ngram/ng_2gram_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..94a7143a432c4986e87ce267500b347105d84324 --- /dev/null +++ b/models/subword_ngram/ng_2gram_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 2, + "variant": "subword", + "language": "ng", + "unique_ngrams": 589, + "total_ngrams": 38341 +} \ No newline at end of file diff --git a/models/subword_ngram/ng_3gram_subword.parquet b/models/subword_ngram/ng_3gram_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..2d22e0de58f4c3dfddf3a718bfebe7ddc4791e93 --- /dev/null +++ b/models/subword_ngram/ng_3gram_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f090be70859a8466b7bd471c964709db1599df83218c6edddd011139c6ef36f2 +size 24223 diff --git a/models/subword_ngram/ng_3gram_subword_metadata.json b/models/subword_ngram/ng_3gram_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..2219f6752fb41710e0b6e38b9dc711d27400de41 --- /dev/null +++ b/models/subword_ngram/ng_3gram_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 3, + "variant": 
"subword", + "language": "ng", + "unique_ngrams": 2328, + "total_ngrams": 38324 +} \ No newline at end of file diff --git a/models/subword_ngram/ng_4gram_subword.parquet b/models/subword_ngram/ng_4gram_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..e4d5e0dbabe9f83ce34b259a9c988e4e1d23f5c1 --- /dev/null +++ b/models/subword_ngram/ng_4gram_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1c1a682779981406d6788f56d88cae10731da840fbfb06ca0848e91d204759e1 +size 52059 diff --git a/models/subword_ngram/ng_4gram_subword_metadata.json b/models/subword_ngram/ng_4gram_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..bd1b369907adcf99f16715c0f0516ffbb7164421 --- /dev/null +++ b/models/subword_ngram/ng_4gram_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 4, + "variant": "subword", + "language": "ng", + "unique_ngrams": 4677, + "total_ngrams": 38307 +} \ No newline at end of file diff --git a/models/subword_ngram/ng_5gram_subword.parquet b/models/subword_ngram/ng_5gram_subword.parquet new file mode 100644 index 0000000000000000000000000000000000000000..47fa7088de9c994d5197553309a66fbd70df09ad --- /dev/null +++ b/models/subword_ngram/ng_5gram_subword.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:470c9a91a994623fba445802bfc4106d6dcefe3031163d2744104d5977372b72 +size 54535 diff --git a/models/subword_ngram/ng_5gram_subword_metadata.json b/models/subword_ngram/ng_5gram_subword_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..e0d533b8100f8dc1b59387a788d53bc637f216ba --- /dev/null +++ b/models/subword_ngram/ng_5gram_subword_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 5, + "variant": "subword", + "language": "ng", + "unique_ngrams": 4586, + "total_ngrams": 38290 +} \ No newline at end of file diff --git a/models/tokenizer/ng_tokenizer_8k.model b/models/tokenizer/ng_tokenizer_8k.model new file mode 100644 
index 0000000000000000000000000000000000000000..d9e093373ba814f7bb753530bacc75517a066257 --- /dev/null +++ b/models/tokenizer/ng_tokenizer_8k.model @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:004c7de7524ff61123c7c67c10ee7f6aa5781b98f2e0b1a5096bd56e82ddbb50 +size 367292 diff --git a/models/tokenizer/ng_tokenizer_8k.vocab b/models/tokenizer/ng_tokenizer_8k.vocab new file mode 100644 index 0000000000000000000000000000000000000000..73edeebb49b347a662ded6fdaf267b79b590bbfb --- /dev/null +++ b/models/tokenizer/ng_tokenizer_8k.vocab @@ -0,0 +1,8000 @@ + 0 + 0 + 0 + 0 +ia -0 +an -1 +en -2 +ki -3 +te -4 +▁s -5 +ur -6 +ue -7 +rs -8 +rsen -9 +tersen -10 +uetersen -11 +li -12 +▁n -13 +▁t -14 +as -15 +▁k -16 +iki -17 +ikip -18 +tur -19 +ed -20 +in -21 +wikip -22 +▁c -23 +▁d -24 +ra -25 +ana -26 +▁h -27 +le -28 +wikiped -29 +▁a -30 +▁p -31 +▁l -32 +▁e -33 +bi -34 +ga -35 +▁i -36 +wikipedia -37 +og -38 +▁b -39 +▁f -40 +gh -41 +ge -42 +ogra -43 +▁m -44 +ul -45 +is -46 +ghana -47 +ya -48 +bib -49 +pi -50 +ograf -51 +ta -52 +ja -53 +re -54 +▁z -55 +▁g -56 +asia -57 +im -58 +ano -59 +▁v -60 +ang -61 +gaz -62 +gazz -63 +gazzano -64 +ie -65 +ultur -66 +pigazzano -67 +du -68 +▁w -69 +geograf -70 +ro -71 +bli -72 +▁zh -73 +ng -74 +▁o -75 +ing -76 +bur -77 +ski -78 +za -79 +ina -80 +ow -81 +pu -82 +burg -83 +ch -84 +pia -85 +duis -86 +ró -87 +wó -88 +lew -89 +owy -90 +wór -91 +▁kró -92 +▁dwór -93 +lewski -94 +▁królewski -95 +cen -96 +▁ch -97 +nowy -98 +publi -99 +el -100 +duisburg -101 +ar -102 +ka -103 +cenza -104 +▁y -105 +▁li -106 +gana -107 +lia -108 +▁pa -109 +piacenza -110 +qu -111 +▁no -112 +ultura -113 +az -114 +ke -115 +▁j -116 +▁na -117 +republi -118 +lo -119 +ls -120 +ija -121 +▁nd -122 +▁tim -123 +▁la -124 +▁sc -125 +▁timo -126 +ba -127 +la -128 +wan -129 +▁de -130 +lis -131 +▁en -132 +▁fi -133 +▁ka -134 +▁ro -135 +▁fr -136 +sb -137 +us -138 +ko -139 +rm -140 +sa -141 +iya -142 +tai -143 +▁ing -144 +turki -145 +ca -146 +he -147 +ua -148 +▁( -149 
+nga -150 +▁nds -151 +ib -152 +ip -153 +▁china -154 +do -155 +ma -156 +dia -157 +▁ha -158 +▁ta -159 +biblia -160 +na -161 +eng -162 +▁ang -163 +to -164 +ye -165 +▁as -166 +▁es -167 +▁hi -168 +▁it -169 +▁ko -170 +▁te -171 +▁sim -172 +taiwan -173 +cultura -174 +▁timote -175 +eb -176 +je -177 +mo -178 +ti -179 +▁cs -180 +▁eo -181 +▁ja -182 +▁ki -183 +▁nl -184 +▁pl -185 +▁pt -186 +▁sv -187 +turk -188 +da -189 +ür -190 +asi -191 +ple -192 +▁an -193 +▁ca -194 +▁da -195 +▁id -196 +▁st -197 +▁ve -198 +▁ast -199 +▁simple -200 +bo -201 +cl -202 +hi -203 +um -204 +▁к -205 +les -206 +tür -207 +▁et -208 +▁he -209 +▁hu -210 +▁is -211 +▁lt -212 +▁ne -213 +▁nn -214 +▁th -215 +gels -216 +bibli -217 +kultura -218 +geografia -219 +at -220 +ce -221 +ms -222 +tu -223 +िय -224 +▁ar -225 +▁cy -226 +▁di -227 +▁hr -228 +▁ia -229 +▁ku -230 +▁oc -231 +▁qu -232 +▁sk -233 +▁sw -234 +▁tr -235 +▁vi -236 +▁wa -237 +ngua -238 +bibel -239 +vikip -240 +ph -241 +si -242 +th -243 +ía -244 +▁u -245 +min -246 +nan -247 +▁af -248 +▁br -249 +▁el -250 +▁eu -251 +▁gl -252 +▁jv -253 +▁lv -254 +▁ms -255 +▁rm -256 +▁sh -257 +▁sq -258 +▁tl -259 +▁vo -260 +▁ye -261 +▁yo -262 +asya -263 +▁scn -264 +▁war -265 +geografi -266 +wa -267 +xt -268 +र् -269 +kul -270 +ris -271 +vro -272 +yue -273 +▁az -274 +▁bn -275 +▁bs -276 +▁cr -277 +▁io -278 +▁lb -279 +▁mr -280 +▁ny -281 +▁se -282 +▁sl -283 +édia -284 +▁ceb -285 +▁fiu -286 +▁lmo -287 +▁nah -288 +▁nov -289 +▁roa -290 +▁vec -291 +turqu -292 +kultur -293 +ograph -294 +geografie -295 +bí -296 +es -297 +id -298 +ik -299 +ku -300 +mg -301 +nl -302 +pr -303 +uu -304 +zl -305 +asa -306 +key -307 +smg -308 +िया -309 +▁fy -310 +▁ga -311 +▁ht -312 +▁hy -313 +▁ks -314 +▁ln -315 +▁ml -316 +▁sa -317 +▁tk -318 +▁ts -319 +▁uz -320 +▁wo -321 +lish -322 +▁als -323 +▁bat -324 +▁hak -325 +▁lad -326 +▁lij -327 +▁pap -328 +▁stq -329 +▁szl -330 +engels -331 +lingua -332 +republik -333 +al -334 +be -335 +ha -336 +ui -337 +ία -338 +bíb -339 +che -340 +lan -341 +▁dv -342 +▁ep -343 +▁gd -344 
+▁gu -345 +▁gv -346 +▁kl -347 +▁kn -348 +▁kw -349 +▁lo -350 +▁om -351 +▁yi -352 +edia -353 +▁ace -354 +▁arc -355 +▁bar -356 +▁diq -357 +▁ext -358 +▁hsb -359 +▁ilo -360 +▁jbo -361 +▁pms -362 +▁sco -363 +▁tpi -364 +asien -365 +bible -366 +turkia -367 +geograph -368 +wikipédia -369 +et -370 +io -371 +ni -372 +py -373 +rn -374 +ru -375 +vi -376 +wl -377 +ान -378 +▁т -379 +▁ا -380 +ndo -381 +ska -382 +▁aa -383 +▁ak -384 +▁am -385 +▁bi -386 +▁bm -387 +▁fo -388 +▁ie -389 +▁ma -390 +▁mg -391 +▁mt -392 +▁pi -393 +▁sm -394 +▁so -395 +▁su -396 +▁tw -397 +angl -398 +▁bpy -399 +▁crh -400 +▁dsb -401 +▁frp -402 +▁fur -403 +▁mwl -404 +▁nap -405 +▁nrm -406 +▁pam -407 +▁tet -408 +▁wuu -409 +azija -410 +turch -411 +bíblia -412 +cultur -413 +ndonga -414 +turkey -415 +turkiya -416 +vikiped -417 +republika -418 +bu -419 +dc -420 +ea -421 +il -422 +nt -423 +pa -424 +rk -425 +un -426 +vo -427 +ик -428 +мо -429 +ու -430 +יה -431 +ेन -432 +ிய -433 +cal -434 +enk -435 +mba -436 +rki -437 +rup -438 +sia -439 +uro -440 +▁ah -441 +▁ay -442 +▁ee -443 +▁ig -444 +▁iu -445 +▁sg -446 +▁si -447 +asie -448 +assi -449 +baib -450 +kita -451 +kult -452 +onga -453 +quip -454 +turc -455 +▁bcl -456 +▁csb -457 +▁gan -458 +▁haw -459 +▁hif -460 +▁kab -461 +▁pdc -462 +classi -463 +enkuro -464 +▁ingle -465 +ografia -466 +▁ndonga -467 +▁timoteo -468 +classical -469 +▁ahenkuro -470 +co -471 +er -472 +ez -473 +fa -474 +fi -475 +ho -476 +jo -477 +jė -478 +lí -479 +ml -480 +ty -481 +yo -482 +án -483 +та -484 +ول -485 +ܝܐ -486 +एत -487 +कि -488 +गो -489 +पी -490 +भू -491 +वि -492 +য় -493 +▁x -494 +土耳 -495 +azi -496 +kwa -497 +res -498 +van -499 +ėjė -500 +एते -501 +गोल -502 +पीड -503 +र्क -504 +र्स -505 +ेन् -506 +য়া -507 +▁bo -508 +▁co -509 +▁ff -510 +▁gn -511 +▁le -512 +▁nv -513 +▁sp -514 +▁ss -515 +▁tt -516 +▁zu -517 +▁ال -518 +土耳其 -519 +azia -520 +bibb -521 +enen -522 +gris -523 +ingl -524 +tara -525 +tuur -526 +विकि -527 +িয়া -528 +▁cdo -529 +▁ksh -530 +▁new -531 +▁pih -532 +▁rmy -533 +▁srn -534 +blica -535 
+kitab -536 +türki -537 +भूगोल -538 +ograpi -539 +एतेर्स -540 +▁timot -541 +biblija -542 +culture -543 +english -544 +turchia -545 +turquia -546 +विकिपीड -547 +▁ingles -548 +republica -549 +एतेर्सेन् -550 +geografija -551 +wikipedija -552 +ci -553 +ek -554 +fu -555 +gé -556 +ju -557 +ks -558 +lu -559 +ri -560 +rz -561 +va -562 +yk -563 +än -564 +ès -565 +és -566 +ρα -567 +ва -568 +ел -569 +יק -570 +ܬܐ -571 +तु -572 +स् -573 +ია -574 +ურ -575 +▁- -576 +▁– -577 +中華 -578 +地理 -579 +基百 -580 +arr -581 +ase -582 +bah -583 +bel -584 +bms -585 +cip -586 +dam -587 +ens -588 +ita -589 +kas -590 +kei -591 +ker -592 +liz -593 +lop -594 +taj -595 +uis -596 +ulu -597 +İng -598 +▁go -599 +▁kg -600 +▁km -601 +▁lg -602 +▁mi -603 +▁my -604 +▁ty -605 +▁va -606 +▁xh -607 +▁ки -608 +▁мо -609 +中華民 -610 +基百科 -611 +anga -612 +bili -613 +buda -614 +inge -615 +prim -616 +terz -617 +ture -618 +tyrk -619 +ulus -620 +ásia -621 +èdia -622 +शिया -623 +▁eml -624 +▁eng -625 +▁kaa -626 +▁map -627 +▁pag -628 +▁pnt -629 +▁tum -630 +▁vls -631 +▁zea -632 +asiya -633 +तुर्क -634 +▁epis -635 +▁harr -636 +▁kina -637 +▁okwa -638 +bibbia -639 +budaya -640 +englis -641 +ingles -642 +kerase -643 +ografi -644 +tajvan -645 +alkitab -646 +bibliya -647 +turkije -648 +turquía -649 +ograpiya -650 +▁inggris -651 +▁inglesa -652 +▁timothe -653 +geography -654 +▁nykerase -655 +▁timoteus -656 +ऊएतेर्सेन् -657 +विकिपीडिया -658 +▁wikipedia -659 +di -660 +ee -661 +ep -662 +gi -663 +gá -664 +hu -665 +it -666 +jô -667 +kî -668 +ly -669 +ne -670 +nu -671 +oa -672 +ob -673 +pú -674 +rw -675 +se -676 +ug -677 +xe -678 +yn -679 +ze -680 +áz -681 +èd -682 +ís -683 +ýa -684 +ασ -685 +ια -686 +ικ -687 +ου -688 +ан -689 +бл -690 +дв -691 +ес -692 +на -693 +пу -694 +ск -695 +իա -696 +בל -697 +גר -698 +ור -699 +יע -700 +יפ -701 +دس -702 +مق -703 +ین -704 +އި -705 +ंग -706 +कृ -707 +तै -708 +बा -709 +भा -710 +षा -711 +सं -712 +রস -713 +্ক -714 +கி -715 +க் -716 +வி -717 +തി -718 +്ക -719 +යා -720 +ร์ -721 +ẹ̀ -722 +▁р -723 +▁ب -724 +▁ت 
-725 +亞洲 -726 +文化 -727 +聖經 -728 +英語 -729 +ani -730 +cul -731 +ele -732 +hay -733 +hey -734 +iel -735 +ken -736 +pla -737 +raf -738 +rin -739 +ron -740 +sat -741 +sen -742 +tay -743 +ubo -744 +yjo -745 +zyk -746 +ūra -747 +γρα -748 +ουρ -749 +зик -750 +ика -751 +тай -752 +ויק -753 +कृत -754 +घान -755 +तैव -756 +▁fj -757 +▁ik -758 +▁of -759 +▁po -760 +▁rw -761 +▁ti -762 +▁tn -763 +▁to -764 +▁ya -765 +▁za -766 +▁дв -767 +▁ти -768 +adam -769 +basa -770 +bele -771 +bieb -772 +chip -773 +esus -774 +ezik -775 +fala -776 +ibel -777 +ibhe -778 +idio -779 +iwan -780 +kofu -781 +land -782 +repú -783 +taiw -784 +tola -785 +uage -786 +γραφ -787 +еспу -788 +مقدس -789 +घाना -790 +भाषा -791 +संस् -792 +▁chr -793 +▁got -794 +▁ita -795 +▁nga -796 +▁tur -797 +▁wan -798 +中華民國 -799 +aasia -800 +angle -801 +bibia -802 +dusat -803 +enena -804 +engle -805 +idiya -806 +limba -807 +ngama -808 +prime -809 +блика -810 +ויקיפ -811 +एशिया -812 +तैवान -813 +▁chin -814 +▁cina -815 +▁duis -816 +▁dulu -817 +▁epís -818 +▁lang -819 +▁ling -820 +▁spra -821 +▁мова -822 +anglès -823 +bahasa -824 +biebel -825 +idioma -826 +inglis -827 +terzen -828 +turcia -829 +turska -830 +tyrkia -831 +türkei -832 +uiquip -833 +▁jesus -834 +▁jezik -835 +▁респу -836 +bibelen -837 +engelsk -838 +kultuur -839 +ografie -840 +ografía -841 +türkiye -842 +संस्कृत -843 +▁angubo -844 +▁nekofu -845 +▁paulus -846 +▁sefala -847 +▁taiwan -848 +▁timoth -849 +engleski -850 +republic -851 +respubli -852 +wikipedi -853 +▁andusat -854 +▁inglese -855 +▁timoteu -856 +duisburgo -857 +englische -858 +republiek -859 +república -860 +wikipedya -861 +▁duisburg -862 +▁epístola -863 +▁harranga -864 +▁language -865 +▁wanenena -866 +vikipediya -867 +republikken -868 +▁республика -869 +▁nekofungama -870 +:‘ -871 +ad -872 +ai -873 +ak -874 +bk -875 +by -876 +cd -877 +de -878 +eh -879 +fr -880 +fö -881 +gr -882 +gà -883 +iń -884 +ji -885 +ję -886 +kh -887 +ké -888 +kù -889 +lt -890 +lè -891 +mb -892 +mi -893 +mä -894 +nà -895 +ol -896 +or -897 +oz 
-898 +pl -899 +rc -900 +ry -901 +st -902 +sí -903 +tl -904 +tw -905 +tö -906 +tú -907 +yl -908 +yō -909 +yə -910 +zu -911 +ìa -912 +în -913 +ði -914 +üs -915 +čt -916 +šć -917 +ίδ -918 +αγ -919 +γε -920 +γκ -921 +γλ -922 +ιπ -923 +ισ -924 +ντ -925 +ле -926 +ли -927 +ор -928 +ос -929 +ра -930 +ул -931 +ыл -932 +іл -933 +ան -934 +աշ -935 +աս -936 +են -937 +եր -938 +או -939 +אי -940 +אנ -941 +בי -942 +גל -943 +הר -944 +ות -945 +טע -946 +ית -947 +او -948 +لى -949 +لی -950 +مو -951 +ܓܪ -952 +ܩܕ -953 +ނާ -954 +ބަ -955 +ތު -956 +ގާ -957 +ޖު -958 +इब -959 +ংর -960 +ইক -961 +ইব -962 +উট -963 +এশ -964 +গো -965 +ঘা -966 +জি -967 +তু -968 +না -969 +পি -970 +বা -971 +ভূ -972 +ার -973 +েন -974 +েল -975 +্র -976 +યા -977 +ર્ -978 +ાન -979 +ம் -980 +்ப -981 +కీ -982 +యా -983 +ర్ -984 +స్ -985 +ಕಿ -986 +ರ್ -987 +ത് -988 +രം -989 +ഷ് -990 +වි -991 +ทศ -992 +ปร -993 +ะเ -994 +ีย -995 +ກິ -996 +ພີ -997 +ສາ -998 +་ས -999 +ེ་ -1000 +င် -1001 +ဝီ -1002 +ან -1003 +ენ -1004 +ის -1005 +វិ -1006 +ẹ́ -1007 +ọ́ -1008 +▁á -1009 +▁п -1010 +▁х -1011 +▁ј -1012 +▁إ -1013 +▁چ -1014 +亚洲 -1015 +特森 -1016 +英语 -1017 +부르 -1018 +and -1019 +ane -1020 +any -1021 +ari -1022 +ati -1023 +chi -1024 +chû -1025 +cia -1026 +cko -1027 +coğ -1028 +die -1029 +dom -1030 +eed -1031 +ehy -1032 +eli -1033 +ere -1034 +eth -1035 +eto -1036 +etr -1037 +eye -1038 +gje -1039 +hun -1040 +hur -1041 +ief -1042 +ipa -1043 +isa -1044 +ise -1045 +juq -1046 +kan -1047 +lba -1048 +len -1049 +lés -1050 +lẹ̀ -1051 +mat -1052 +nar -1053 +neg -1054 +ngí -1055 +olo -1056 +ong -1057 +pan -1058 +ped -1059 +reo -1060 +rev -1061 +rst -1062 +ske -1063 +ssa -1064 +tan -1065 +tch -1066 +ten -1067 +tia -1068 +tle -1069 +tre -1070 +tut -1071 +uki -1072 +uli -1073 +ulo -1074 +umb -1075 +ura -1076 +usi -1077 +vel -1078 +wen -1079 +wik -1080 +zam -1081 +çin -1082 +èdè -1083 +úra -1084 +ĭng -1085 +ίαν -1086 +ίδε -1087 +βικ -1088 +γεω -1089 +ιπα -1090 +ισμ -1091 +κία -1092 +ани -1093 +вск -1094 +еле -1095 +осл -1096 +ուն -1097 +דיה -1098 +טור -1099 +טער 
-1100 +اوس -1101 +گلی -1102 +ंग् -1103 +िये -1104 +ংরে -1105 +ইকি -1106 +গোল -1107 +পিড -1108 +િયા -1109 +டிய -1110 +ియా -1111 +ക്ക -1112 +തിയ -1113 +යාව -1114 +▁av -1115 +▁bh -1116 +▁ge -1117 +▁ke -1118 +▁ng -1119 +▁or -1120 +▁ra -1121 +▁rn -1122 +▁sn -1123 +▁кр -1124 +▁ܩܕ -1125 +地理学 -1126 +angu -1127 +asen -1128 +ashi -1129 +azië -1130 +bari -1131 +bibl -1132 +clop -1133 +dafr -1134 +edie -1135 +epis -1136 +hunu -1137 +huri -1138 +icip -1139 +idia -1140 +jina -1141 +kara -1142 +kuli -1143 +lelo -1144 +ling -1145 +ličt -1146 +loni -1147 +nuna -1148 +obla -1149 +repu -1150 +rste -1151 +siyo -1152 +tama -1153 +teus -1154 +tius -1155 +tura -1156 +uirc -1157 +yōtl -1158 +ázia -1159 +İngi -1160 +τουρ -1161 +език -1162 +ביבל -1163 +बाइब -1164 +ইংরে -1165 +ইবেল -1166 +উইকি -1167 +ঘানা -1168 +তুরস -1169 +க்கி -1170 +ക്കി -1171 +ประเ -1172 +▁bug -1173 +▁cbk -1174 +▁die -1175 +▁fre -1176 +▁kin -1177 +▁lin -1178 +▁oku -1179 +▁pcd -1180 +▁pia -1181 +▁spr -1182 +▁the -1183 +▁van -1184 +▁кел -1185 +▁چین -1186 +維基百科 -1187 +维基百科 -1188 +angla -1189 +biblí -1190 +bybel -1191 +ehyia -1192 +etrus -1193 +first -1194 +gáana -1195 +hiina -1196 +ibhay -1197 +język -1198 +korin -1199 +narko -1200 +risto -1201 +rkiya -1202 +rmany -1203 +sensa -1204 +tessa -1205 +turke -1206 +uikip -1207 +wenyo -1208 +šćina -1209 +ίδεια -1210 +ভূগোল -1211 +▁chiń -1212 +▁dili -1213 +▁lama -1214 +▁list -1215 +▁mani -1216 +▁spro -1217 +▁tava -1218 +▁tili -1219 +▁tusi -1220 +▁двор -1221 +▁кина -1222 +▁посл -1223 +▁теле -1224 +▁тимо -1225 +▁भाषा -1226 +anglia -1227 +bariik -1228 +bibili -1229 +biblio -1230 +biquip -1231 +coğraf -1232 +duisbo -1233 +ibheli -1234 +ingels -1235 +ingere -1236 +lonika -1237 +língua -1238 +plasen -1239 +taiwán -1240 +taywan -1241 +vichip -1242 +βικιπα -1243 +ইংরেজি -1244 +এশিয়া -1245 +তুরস্ক -1246 +বাইবেল -1247 +ประเทศ -1248 +▁angli -1249 +▁brief -1250 +▁ghana -1251 +▁italy -1252 +▁kalba -1253 +▁thumb -1254 +▁tsina -1255 +▁tuirc -1256 +▁китай -1257 +angličt -1258 +karamba -1259 
+korinto -1260 +kulturo -1261 +kultúra -1262 +kultūra -1263 +landafr -1264 +primera -1265 +turecko -1266 +γεωγραφ -1267 +τουρκία -1268 +উইকিপিড -1269 +▁episto -1270 +▁lingvo -1271 +▁petrus -1272 +▁rahunu -1273 +▁turkey -1274 +▁المقدس -1275 +bibiliya -1276 +géografi -1277 +géograph -1278 +piasensa -1279 +तुर्किये -1280 +संस्कृति -1281 +▁aatessa -1282 +▁epistle -1283 +▁germany -1284 +▁nonarko -1285 +▁omwenyo -1286 +▁sprache -1287 +▁timothy -1288 +▁послани -1289 +geografio -1290 +wikipedie -1291 +wikipèdia -1292 +xeografía -1293 +▁piacenza -1294 +▁respubli -1295 +▁timotheo -1296 +▁timotius -1297 +▁uetersen -1298 +▁vabariik -1299 +angličtina -1300 +geographia -1301 +geographie -1302 +géographie -1303 +heograpiya -1304 +uiquipedia -1305 +wikipediya -1306 +wikipidiya -1307 +▁aakorinto -1308 +▁lokaramba -1309 +heyograpiya -1310 +βικιπαίδεια -1311 +উইকিপিডিয়া -1312 +▁nonarkobele -1313 +▁respublikas -1314 +▁aatessalonika -1315 +bl -1316 +bì -1317 +dd -1318 +dy -1319 +dź -1320 +eq -1321 +gs -1322 +gw -1323 +gì -1324 +gí -1325 +gü -1326 +iä -1327 +iš -1328 +kt -1329 +kô -1330 +kü -1331 +ld -1332 +lă -1333 +ní -1334 +os -1335 +ré -1336 +rî -1337 +rö -1338 +sà -1339 +sé -1340 +së -1341 +sṳ -1342 +ww -1343 +wọ -1344 +wụ -1345 +yd -1346 +zh -1347 +zi -1348 +ân -1349 +ês -1350 +ío -1351 +òl -1352 +óa -1353 +ýð -1354 +ıl -1355 +δη -1356 +ζα -1357 +ιτ -1358 +но -1359 +ур -1360 +ագ -1361 +ալ -1362 +ար -1363 +եդ -1364 +ետ -1365 +շա -1366 +որ -1367 +վա -1368 +քի -1369 +גע -1370 +טר -1371 +יס -1372 +סי -1373 +רב -1374 +دف -1375 +نج -1376 +ܒܫ -1377 +ܠܘ -1378 +ހޫ -1379 +ރާ -1380 +ރީ -1381 +ރޭ -1382 +ކީ -1383 +ވި -1384 +މް -1385 +ފީ -1386 +ގި -1387 +एस -1388 +ची -1389 +জা -1390 +ভা -1391 +ਭਾ -1392 +અં -1393 +ઇબ -1394 +ગો -1395 +ગ્ -1396 +બા -1397 +ભૂ -1398 +સે -1399 +ސަ -1400 +नी -1401 +ংস -1402 +তি -1403 +ন্ -1404 +ਅੰ -1405 +િક -1406 +ତେ -1407 +ஆச -1408 +ாட -1409 +బై -1410 +ಂಗ -1411 +ಘಾ -1412 +ಯು -1413 +ಷ್ -1414 +ಿಯ -1415 +ೀಡ -1416 +ೂಗ -1417 +ೆನ -1418 +್ಲ -1419 +ഗ് -1420 +നാ -1421 +പീ 
-1422 +സ് -1423 +්ක -1424 +กฤ -1425 +ซิ -1426 +ณร -1427 +ภู -1428 +รก -1429 +าน -1430 +ດຍ -1431 +ວິ -1432 +ອາ -1433 +ະຄ -1434 +་ཀ -1435 +་ཏ -1436 +་ར -1437 +ཇི -1438 +དབ -1439 +ན། -1440 +བ། -1441 +ཡ། -1442 +ི། -1443 +ུང -1444 +ུར -1445 +ེན -1446 +ྐད -1447 +ྱི -1448 +ကီ -1449 +ဒိ -1450 +ရှ -1451 +აი -1452 +ბი -1453 +ვი -1454 +ინ -1455 +კი -1456 +კუ -1457 +ლტ -1458 +ንግ -1459 +ዲያ -1460 +ᎵᏏ -1461 +ᏏᎠ -1462 +ᏗᏯ -1463 +ᏫᎩ -1464 +ᐃᐊ -1465 +ᐃᑭ -1466 +ᑎᐊ -1467 +ᑎᑐ -1468 +ᑲᔭ -1469 +ᓄᓇ -1470 +ᔪᖅ -1471 +ប្ -1472 +មិ -1473 +ស៊ -1474 +ីឌ -1475 +ᨅᨔ -1476 +ṣà -1477 +▁ئ -1478 +▁خ -1479 +への -1480 +ィア -1481 +ィキ -1482 +ウエ -1483 +ジア -1484 +チェ -1485 +テモ -1486 +デュ -1487 +トル -1488 +ルク -1489 +伊斯 -1490 +民国 -1491 +聖書 -1492 +ꙑ́ -1493 +뒤스 -1494 +백과 -1495 +영어 -1496 +중화 -1497 +𐌲𐌹 -1498 +𐌹𐌰 -1499 +bû -1500 +bė -1501 +dä -1502 +də -1503 +ll -1504 +mì -1505 +qa -1506 +sh -1507 +tū -1508 +uo -1509 +zá -1510 +ùk -1511 +ùn -1512 +ōl -1513 +ού -1514 +ца -1515 +यब -1516 +্স -1517 +਼ੀ -1518 +ଏସ -1519 +்ச -1520 +สา -1521 +ན་ -1522 +ეტ -1523 +ᑖᓯ -1524 +▁ز -1525 +維基 -1526 +bìo -1527 +bûn -1528 +cht -1529 +ciŭ -1530 +cou -1531 +diw -1532 +dið -1533 +dri -1534 +ear -1535 +eaŋ -1536 +etã -1537 +eze -1538 +eză -1539 +gel -1540 +geò -1541 +ght -1542 +ghy -1543 +gju -1544 +goz -1545 +grá -1546 +gré -1547 +gùk -1548 +hoà -1549 +hoâ -1550 +ibí -1551 +ilî -1552 +ilɛ -1553 +iri -1554 +ità -1555 +jew -1556 +jil -1557 +jop -1558 +jui -1559 +jẹ́ -1560 +khì -1561 +koj -1562 +kso -1563 +kwọ -1564 +kän -1565 +kùl -1566 +kùn -1567 +kül -1568 +lah -1569 +lmi -1570 +lon -1571 +lpa -1572 +lqa -1573 +lýð -1574 +mọ́ -1575 +nci -1576 +nde -1577 +nek -1578 +niy -1579 +niä -1580 +oly -1581 +qad -1582 +qil -1583 +rci -1584 +rea -1585 +rkì -1586 +ryd -1587 +sem -1588 +sho -1589 +sàn -1590 +sṳn -1591 +teo -1592 +tok -1593 +tru -1594 +tōl -1595 +uam -1596 +viq -1597 +vit -1598 +vùn -1599 +nh -1600 +tı -1601 +þē -1602 +ƿu -1603 +يز -1604 +ܐܝ -1605 +ܐܣ -1606 +ܒܐ -1607 +ܓܐ -1608 +ܘܝ -1609 +ܘܪ -1610 +ܝܦ -1611 +ܢܐ -1612 +ސި -1613 +ಬೈ -1614 
+თი -1615 +ሐፍ -1616 +ርክ -1617 +ኢው -1618 +እስ -1619 +ክፔ -1620 +ᖃᓪ -1621 +្យ -1622 +ペデ -1623 +위터 -1624 +gha -1625 +mon -1626 +ngu -1627 +tis -1628 +vní -1629 +wsa -1630 +yîn -1631 +yət -1632 +zim -1633 +zág -1634 +àṣà -1635 +ány -1636 +ánà -1637 +ásí -1638 +äna -1639 +éid -1640 +ëjô -1641 +ínà -1642 +îga -1643 +îkî -1644 +îng -1645 +înt -1646 +òlò -1647 +øði -1648 +ùip -1649 +üse -1650 +ĉen -1651 +īƿu -1652 +ədə -1653 +έντ -1654 +γκά -1655 +δημ -1656 +ιατ -1657 +ική -1658 +κρα -1659 +λιτ -1660 +νασ -1661 +τία -1662 +анг -1663 +ара -1664 +йра -1665 +йыл -1666 +леў -1667 +лис -1668 +оле -1669 +тад -1670 +улс -1671 +улє -1672 +ыла -1673 +ілі -1674 +ագր -1675 +անգ -1676 +աստ -1677 +արհ -1678 +երս -1679 +թու -1680 +իքի -1681 +ծաշ -1682 +կու -1683 +մշա -1684 +ութ -1685 +պոր -1686 +տալ -1687 +փեդ -1688 +քիա -1689 +אגר -1690 +אנה -1691 +גאו -1692 +גרפ -1693 +גרת -1694 +דיס -1695 +דיע -1696 +האי -1697 +הרא -1698 +הרפ -1699 +af -1700 +cy -1701 +ei -1702 +sọ -1703 +όσ -1704 +еј -1705 +ин -1706 +קו -1707 +فس -1708 +ܛܙ -1709 +ރު -1710 +ލް -1711 +કસ -1712 +ங் -1713 +யூ -1714 +ான -1715 +వి -1716 +ూగ -1717 +ഭാ -1718 +ัง -1719 +ພາ -1720 +დუ -1721 +ᎯᏍ -1722 +ᒧᐃ -1723 +手紙 -1724 +提摩 -1725 +𐌰𐍂 -1726 +erd -1727 +miè -1728 +ram -1729 +ula -1730 +yah -1731 +еју -1732 +инҷ -1733 +ובל -1734 +ותי -1735 +טרס -1736 +יקה -1737 +נגל -1738 +נית -1739 +סיה -1740 +קול -1741 +קיה -1742 +תרב -1743 +أول -1744 +دفو -1745 +رول -1746 +سول -1747 +فسك -1748 +ليز -1749 +موت -1750 +موث -1751 +ىلى -1752 +ܓܐܘ -1753 +ܓܪܦ -1754 +ܕܠܘ -1755 +ܘܝܩ -1756 +ܛܘܪ -1757 +ܛܙܢ -1758 +ܝܒܫ -1759 +ܡܬܐ -1760 +ܩܝܐ -1761 +ܫܢܐ -1762 +ܬܒܐ -1763 +ނގި -1764 +ވިލ -1765 +ގާފ -1766 +ޖުޣ -1767 +ުރާ -1768 +ंगल -1769 +झेन -1770 +तान -1771 +यबल -1772 +উটে -1773 +জাত -1774 +ঠার -1775 +ত্র -1776 +প্র -1777 +ভাষ -1778 +সংস -1779 +ৃতি -1780 +ਜ਼ੀ -1781 +ਭਾਸ -1782 +ઇબલ -1783 +કસ્ -1784 +ગોળ -1785 +ગ્ર -1786 +ઘાન -1787 +તાન -1788 +સેન -1789 +િકિ -1790 +ુર્ -1791 +ેર્ -1792 +ଏତେ -1793 +ଏସି -1794 +ஆங் -1795 +கான -1796 +ண்ப -1797 +னக் -1798 +யூட -1799 
+eŋ -1800 +oh -1801 +zî -1802 +ɛl -1803 +сы -1804 +ية -1805 +ର୍ -1806 +ఆస -1807 +ఎత -1808 +కృ -1809 +త్ -1810 +రమ -1811 +กิ -1812 +ภี -1813 +ิล -1814 +မ္ -1815 +პე -1816 +スブ -1817 +asc -1818 +ywa -1819 +ɛlɛ -1820 +աշխ -1821 +ର୍ସ -1822 +ர்ச -1823 +லம் -1824 +லிய -1825 +விய -1826 +ாடு -1827 +ியல -1828 +ியா -1829 +்பீ -1830 +ంస్ -1831 +ఊఎత -1832 +కీప -1833 +కృత -1834 +టర్ -1835 +బైబ -1836 +రము -1837 +ర్స -1838 +ాస్ -1839 +ಆಂಗ -1840 +ಏಷ್ -1841 +ಕಿಪ -1842 +ಘಾನ -1843 +ಬೈಬ -1844 +ಭೂಗ -1845 +ಯುಟ -1846 +ರ್ಸ -1847 +ഗ്ല -1848 +നാം -1849 +പീഡ -1850 +ഷ്യ -1851 +സ്ക -1852 +ാരം -1853 +ാസ് -1854 +ീഷ് -1855 +ോത് -1856 +වික -1857 +විද -1858 +ියා -1859 +กิพ -1860 +ชีย -1861 +ณรั -1862 +ดีย -1863 +ตร์ -1864 +ทร์ -1865 +บิล -1866 +ภูม -1867 +มภี -1868 +สาธ -1869 +านา -1870 +ุรก -1871 +เซิ -1872 +ກິດ -1873 +ມສາ -1874 +ສາອ -1875 +ອາຊ -1876 +ະຄໍ -1877 +ເດຍ -1878 +ཏུར -1879 +ར་ས -1880 +ཤེ་ -1881 +སུང -1882 +ཨེ་ -1883 +ི་ས -1884 +ུ་ཏ -1885 +င်င -1886 +ထဝီ -1887 +ဒိယ -1888 +မ္မ -1889 +ာရှ -1890 +აივ -1891 +ბურ -1892 +დია -1893 +ეთი -1894 +ენა -1895 +ენი -1896 +ეტე -1897 +თურ -1898 +ლია -1899 +dù -1900 +ig -1901 +jz -1902 +sè -1903 +sû -1904 +зе -1905 +رى -1906 +ज़ -1907 +ಲ್ -1908 +ූග -1909 +ธร -1910 +ีน -1911 +ັງ -1912 +გი -1913 +ሊዝ -1914 +គី -1915 +ṳ̄ -1916 +ọc -1917 +▁ಶ -1918 +첸차 -1919 +ent -1920 +зен -1921 +լեր -1922 +رىن -1923 +ज़ी -1924 +භූග -1925 +จีน -1926 +ธรร -1927 +อัง -1928 +ურა -1929 +ური -1930 +ფია -1931 +ሊዝኛ -1932 +ቱርክ -1933 +ኢውተ -1934 +እስያ -1935 +እንግ -1936 +ውክፔ -1937 +ጽሐፍ -1938 +ᎠᏏᎠ -1939 +ᎩᎵᏏ -1940 +ᎯᏍᏗ -1941 +ᏇᏗᏯ -1942 +ᐃᑖᓯ -1943 +ᐃᑭᐱ -1944 +ᐅᔪᖅ -1945 +ᐊᑲᔭ -1946 +ᑎᑐᑦ -1947 +ᒧᐃᐧ -1948 +ᖃᓪᓗ -1949 +គីភ -1950 +ទ្យ -1951 +ប្ប -1952 +ស៊ី -1953 +ីឌា -1954 +▁ac -1955 +▁do -1956 +▁ii -1957 +▁je -1958 +▁lý -1959 +▁mí -1960 +▁ol -1961 +▁on -1962 +▁pā -1963 +▁ug -1964 +▁γλ -1965 +▁ва -1966 +▁кы -1967 +▁на -1968 +▁хя -1969 +▁ја -1970 +▁ان -1971 +▁بى -1972 +▁تی -1973 +▁خە -1974 +▁زب -1975 +▁ಶಾ -1976 +ウエテ -1977 +チェン -1978 +テへの -1979 +デュー -1980 +トルコ -1981 +于特森 -1982 +伊斯堡 -1983 +华民国 -1984 
+地理學 -1985 +手紙一 -1986 +細亞洲 -1987 +維基大 -1988 +ꙁꙑ́ -1989 +드부르 -1990 +부르크 -1991 +아첸차 -1992 +중화민 -1993 +𐌰𐍂𐌰 -1994 +𐌲𐌲𐌹 -1995 +afia -1996 +akwụ -1997 +allu -1998 +alta -1999 +ça -2000 +ée -2001 +σσ -2002 +τη -2003 +लि -2004 +ਰੇ -2005 +ಯಾ -2006 +ವಿ -2007 +മി -2008 +ทว -2009 +ოგ -2010 +რზ -2011 +ᏬᏂ -2012 +អា -2013 +bha -2014 +dka -2015 +ida -2016 +kui -2017 +lte -2018 +uac -2019 +ídi -2020 +σσα -2021 +τησ -2022 +ولس -2023 +ܝܬܐ -2024 +लिश -2025 +੍ਰੇ -2026 +തിമ -2027 +മിശ -2028 +ทวี -2029 +ეოგ -2030 +ᎦᏬᏂ -2031 +▁кя -2032 +▁кі -2033 +aney -2034 +anän -2035 +asis -2036 +asië -2037 +asja -2038 +azio -2039 +ball -2040 +beib -2041 +belé -2042 +beri -2043 +bhay -2044 +biww -2045 +bíbé -2046 +chiu -2047 +cipè -2048 +couh -2049 +drij -2050 +duni -2051 +dźel -2052 +elos -2053 +enee -2054 +ensa -2055 +epin -2056 +erdn -2057 +erin -2058 +fija -2059 +futa -2060 +föld -2061 +förs -2062 +gala -2063 +gals -2064 +geug -2065 +geòg -2066 +ghee -2067 +ghyi -2068 +giel -2069 +gliz -2070 +gráf -2071 +gréh -2072 +gàna -2073 +gĭng -2074 +hlon -2075 +holo -2076 +huac -2077 +huan -2078 +idüs -2079 +igbe -2080 +inar -2081 +inwa -2082 +ipei -2083 +ipho -2084 +ismo -2085 +itia -2086 +jbel -2087 +jztu -2088 +kadd -2089 +kali -2090 +kalt -2091 +kiin -2092 +klop -2093 +koet -2094 +kram -2095 +kron -2096 +ksel -2097 +lann -2098 +laío -2099 +të -2100 +wu -2101 +èn -2102 +ên -2103 +να -2104 +πο -2105 +וס -2106 +िल -2107 +એશ -2108 +પી -2109 +கு -2110 +ాష -2111 +നം -2112 +ഴു -2113 +าร -2114 +เอ -2115 +ཀ་ -2116 +ᐊᓰ -2117 +ធម -2118 +▁é -2119 +▁ޗ -2120 +에게 -2121 +cên -2122 +eda -2123 +edy -2124 +жіл -2125 +בור -2126 +પીડ -2127 +భాష -2128 +ഖനം -2129 +െഴു -2130 +ปเอ -2131 +ᐊᓰᐊ -2132 +ធម៌ -2133 +▁ép -2134 +▁хэ -2135 +▁ޗަ -2136 +▁கு -2137 +於特森 -2138 +오에게 -2139 +ascã -2140 +azía -2141 +bėbl -2142 +edav -2143 +fijô -2144 +hile -2145 +hita -2146 +hièn -2147 +iana -2148 +leea -2149 +leze -2150 +lopä -2151 +läna -2152 +lèng -2153 +meye -2154 +moxt -2155 +mäin -2156 +mìni -2157 +nang -2158 +ngué -2159 +ngṳ̄ -2160 +ning 
-2161 +niät -2162 +nàka -2163 +nīƿu -2164 +oayl -2165 +ohan -2166 +pags -2167 +pere -2168 +piaç -2169 +pigà -2170 +prog -2171 +prva -2172 +pėjė -2173 +qadd -2174 +qita -2175 +razu -2176 +rese -2177 +ripa -2178 +rkie -2179 +roma -2180 +rydh -2181 +röko -2182 +slan -2183 +snek -2184 +sèng -2185 +séng -2186 +tanb -2187 +tava -2188 +tcha -2189 +teli -2190 +teny -2191 +tera -2192 +thai -2193 +tian -2194 +tiki -2195 +tivo -2196 +tlah -2197 +turč -2198 +tuyk -2199 +tã -2200 +tí -2201 +ģe -2202 +īs -2203 +łé -2204 +κί -2205 +фе -2206 +ާތ -2207 +ਈਬ -2208 +อื -2209 +าษ -2210 +ีเ -2211 +ພູ -2212 +ዘን -2213 +ᨕᨗ -2214 +▁і -2215 +▁ભ -2216 +ガー -2217 +ルセ -2218 +아시 -2219 +터키 -2220 +ast -2221 +beb -2222 +dīs -2223 +nes -2224 +roz -2225 +tal -2226 +čen -2227 +ılı -2228 +фею -2229 +ާތް -2230 +ਅੰਗ -2231 +ਈਬਲ -2232 +ภาษ -2233 +อือ -2234 +ᨕᨗᨋ -2235 +▁κί -2236 +▁ін -2237 +▁ભા -2238 +ガーナ -2239 +ルセン -2240 +아시아 -2241 +iezh -2242 +kiel -2243 +polo -2244 +sëin -2245 +tetã -2246 +tork -2247 +tula -2248 +tëre -2249 +ujuq -2250 +ukas -2251 +unde -2252 +ungh -2253 +uraa -2254 +uspa -2255 +vana -2256 +vele -2257 +veto -2258 +wîkî -2259 +yati -2260 +yeti -2261 +zaad -2262 +zeml -2263 +zgoz -2264 +àsia -2265 +ásíà -2266 +çand -2267 +èdie -2268 +édie -2269 +ílẹ̀ -2270 +întâ -2271 +îroz -2272 +ître -2273 +ürki -2274 +ýwan -2275 +ătre -2276 +čens -2277 +čina -2278 +ģeog -2279 +İnci -2280 +ılız -2281 +ǣdia -2282 +ɛlɛ́ -2283 +αγία -2284 +αγγλ -2285 +ασία -2286 +δημο -2287 +ιατσ -2288 +ισμπ -2289 +ντού -2290 +ώσσα -2291 +инҷи -2292 +йрам -2293 +йылм -2294 +рзен -2295 +теју -2296 +թուր -2297 +կույ -2298 +յուն -2299 +āk -2300 +אל -2301 +טי -2302 +נה -2303 +टर -2304 +णर -2305 +এছ -2306 +ਡਿ -2307 +ષા -2308 +സി -2309 +සි -2310 +აზ -2311 +문화 -2312 +kîn -2313 +sur -2314 +անա -2315 +ונה -2316 +טימ -2317 +פיע -2318 +गणर -2319 +ुटर -2320 +ਡਿਆ -2321 +భూగ -2322 +സിന -2323 +ආසි -2324 +າພີ -2325 +ကီပ -2326 +▁al -2327 +▁אל -2328 +aziə -2329 +ākia -2330 +лисӣ -2331 +գանա -2332 +ուետ -2333 +ունչ -2334 +վիքի -2335 +אגרא 
-2336 +אנגל -2337 +בורג -2338 +גאנה -2339 +הראש -2340 +טערז -2341 +טערק -2342 +טרסן -2343 +ידיה -2344 +נגלי -2345 +עדיע -2346 +أولى -2347 +رسول -2348 +رولي -2349 +فسكي -2350 +يموث -2351 +گلیز -2352 +گلیس -2353 +ܐܣܝܐ -2354 +ܓܪܬܐ -2355 +ܕܠܘܬ -2356 +ܝܡܬܐ -2357 +ܟܬܒܐ -2358 +ܡܝܬܐ -2359 +ހޫރީ -2360 +ބައި -2361 +ބަލް -2362 +އިނާ -2363 +ތުރު -2364 +ގާނާ -2365 +ގާފަ -2366 +ޖުމް -2367 +ंग्र -2368 +आंगल -2369 +इंग् -2370 +गणरा -2371 +चीनी -2372 +युटर -2373 +र्की -2374 +ेज़ी -2375 +উটার -2376 +ত্রী -2377 +্সেন -2378 +ਭਾਸ਼ -2379 +ਾਈਬਲ -2380 +ੀਡਿਆ -2381 +ઘાના -2382 +તુર્ -2383 +તેર્ -2384 +વિકિ -2385 +ଊଏତେ -2386 +ଏସିଯ -2387 +ର୍ସେ -2388 +கானா -2389 +டர்ச -2390 +டியர -2391 +டியா -2392 +பண்ப -2393 +ப்பீ -2394 +யூட் -2395 +விவி -2396 +ியல் -2397 +ீனக் -2398 +ுவிய -2399 +fà -2400 +iň -2401 +ză -2402 +ùi -2403 +ਿਕ -2404 +ઊએ -2405 +ന് -2406 +വി -2407 +ᨔᨗ -2408 +▁“ -2409 +ツァ -2410 +가나 -2411 +위키 -2412 +dua -2413 +elv -2414 +geu -2415 +teō -2416 +urg -2417 +xia -2418 +הסי -2419 +ਵਿਕ -2420 +ೆನ್ -2421 +ഒന് -2422 +ანა -2423 +ᨗᨔᨗ -2424 +▁fø -2425 +アジア -2426 +amat -2427 +axia -2428 +baar -2429 +keng -2430 +kinh -2431 +leză -2432 +mara -2433 +ngùi -2434 +tayv -2435 +teki -2436 +teōā -2437 +ulat -2438 +utur -2439 +ਵਿਕਿ -2440 +కీపీ -2441 +డియా -2442 +బైబి -2443 +భూగో -2444 +ర్సె -2445 +శాస్ -2446 +సంస్ -2447 +ಘಾನಾ -2448 +ಭೂಗೋ -2449 +ಯುಟೆ -2450 +ರ್ಕಿ -2451 +ംസ്ക -2452 +ഏഷ്യ -2453 +തിയോ -2454 +ത്രം -2455 +പീഡി -2456 +ൂമിശ -2457 +േഖനം -2458 +භූගෝ -2459 +විකි -2460 +විද් -2461 +กานา -2462 +ฐจีน -2463 +ตุรก -2464 +ธรรม -2465 +ปเอเ -2466 +ภาษา -2467 +ภูมิ -2468 +สตร์ -2469 +อือแ -2470 +ัมภี -2471 +เบิล -2472 +ພະຄໍ -2473 +ວິກິ -2474 +ອາຊີ -2475 +་ཀི། -2476 +་རབ། -2477 +ཀ་ན། -2478 +གསུང -2479 +དབྱི -2480 +འི་ས -2481 +ཨུ་ཏ -2482 +ེར་ས -2483 +ကီပိ -2484 +င်ငံ -2485 +ထဝီဝ -2486 +သမ္မ -2487 +အာရှ -2488 +აზია -2489 +აფია -2490 +განა -2491 +გეოგ -2492 +დუის -2493 +თურქ -2494 +კულტ -2495 +ტაივ -2496 +უეტე -2497 +መጽሐፍ -2498 +ᐃᑖᓯᓐ -2499 +bī -2500 +rg -2501 +sụ -2502 +āz -2503 +ēd -2504 +īn -2505 +ļu -2506 +ǣc 
-2507 +не -2508 +ون -2509 +തു -2510 +ര് -2511 +ിൾ -2512 +ัฒ -2513 +ጋና -2514 +ភូ -2515 +ṣá -2516 +▁ቅ -2517 +ピア -2518 +성경 -2519 +mam -2520 +thì -2521 +thổ -2522 +tio -2523 +uuf -2524 +ĉin -2525 +ķīn -2526 +әне -2527 +ബിൾ -2528 +ര്‍ -2529 +ස්ක -2530 +วัฒ -2531 +ინგ -2532 +ụsụ -2533 +▁ad -2534 +▁به -2535 +▁ቅዱ -2536 +▁ṣá -2537 +提摩太 -2538 +heer -2539 +kolt -2540 +және -2541 +ѩꙁꙑ́ -2542 +ఊఎతె -2543 +ൈബിൾ -2544 +ංස්ක -2545 +วัฒน -2546 +ინგლ -2547 +ᐅᐃᑭᐱ -2548 +ᐊᑲᔭᓯ -2549 +ᒧᐃᐧᐣ -2550 +ᓈᑎᑐᑦ -2551 +ទ្យា -2552 +ភូមិ -2553 +វប្ប -2554 +ẹ̀ẹ́ -2555 +‍යාව -2556 +▁ais -2557 +▁anh -2558 +▁avo -2559 +▁bîn -2560 +▁des -2561 +▁dân -2562 +▁ham -2563 +▁hóa -2564 +▁job -2565 +▁kay -2566 +▁kir -2567 +▁lef -2568 +▁let -2569 +▁lui -2570 +▁oló -2571 +▁ovo -2572 +▁qil -2573 +▁tar -2574 +▁til -2575 +▁îng -2576 +▁ĉin -2577 +▁τησ -2578 +▁анг -2579 +▁мот -2580 +▁улс -2581 +▁хэл -2582 +▁הסי -2583 +▁טימ -2584 +▁إلى -2585 +▁إنج -2586 +▁ئین -2587 +▁الص -2588 +▁اول -2589 +▁خەت -2590 +▁ܩܕܝ -2591 +▁ভাষ -2592 +▁భాష -2593 +▁ಶಾಸ -2594 +▁ენა -2595 +▁ቅዱስ -2596 +▁ᨕᨗᨋ -2597 +▁드부르 -2598 +ィキペデ -2599 +bè -2600 +hĩ -2601 +jé -2602 +nā -2603 +rā -2604 +ál -2605 +âs -2606 +åk -2607 +ëe -2608 +óo -2609 +ġe -2610 +̍k -2611 +ନ୍ -2612 +து -2613 +ತ್ -2614 +ති -2615 +පී -2616 +တန -2617 +တရ -2618 +တ် -2619 +ို -2620 +cem -2621 +chd -2622 +ene -2623 +gye -2624 +kut -2625 +men -2626 +óok -2627 +ānā -2628 +துர -2629 +ತ್ರ -2630 +ഭാഷ -2631 +තිය -2632 +පීඩ -2633 +กฤษ -2634 +တရု -2635 +ბიბ -2636 +▁bé -2637 +위터젠 -2638 +esis -2639 +ha̍k -2640 +imba -2641 +toor -2642 +âsie -2643 +ğana -2644 +துரு -2645 +್ತ್ರ -2646 +ෘතිය -2647 +เซิน -2648 +တနို -2649 +▁học -2650 +▁mál -2651 +▁nhĩ -2652 +▁ঠার -2653 +スブルク -2654 +中华民国 -2655 +中華民国 -2656 +亞細亞洲 -2657 +杜伊斯堡 -2658 +維基大典 -2659 +위키백과 -2660 +중화민국 -2661 +𐌰𐌲𐌲𐌹 -2662 +anglų -2663 +angol -2664 +angļu -2665 +aniel -2666 +aniya -2667 +apere -2668 +asụsụ -2669 +aziýa -2670 +azyjo -2671 +azëjô -2672 +azėjė -2673 +bahsa -2674 +baibû -2675 +beibl -2676 +biblo -2677 +bíbél -2678 +cchip -2679 +cenze -2680 
+chris -2681 +chûng -2682 +cland -2683 +cultú -2684 +daear -2685 +diwyl -2686 +dùgùk -2687 +eedia -2688 +eediä -2689 +ekuli -2690 +engel -2691 +ensim -2692 +enska -2693 +enskt -2694 +esneg -2695 +etoko -2696 +eŋlis -2697 +ganao -2698 +ganän -2699 +hā -2700 +pü -2701 +tē -2702 +uv -2703 +vă -2704 +yê -2705 +ël -2706 +ýe -2707 +đị -2708 +İz -2709 +яз -2710 +ען -2711 +ன் -2712 +ంగ -2713 +ఘన -2714 +ల్ -2715 +్ల -2716 +วิ -2717 +ศา -2718 +ไบ -2719 +前書 -2720 +加納 -2721 +加纳 -2722 +圣经 -2723 +英文 -2724 +迦納 -2725 +cra -2726 +dje -2727 +pir -2728 +pük -2729 +tad -2730 +vhi -2731 +vre -2732 +văn -2733 +địa -2734 +скі -2735 +язы -2736 +ܝܦܕ -2737 +ென் -2738 +ఆంగ -2739 +ఘనా -2740 +ანი -2741 +▁но -2742 +azje -2743 +djey -2744 +gána -2745 +ibël -2746 +ikia -2747 +ipen -2748 +kehā -2749 +ksku -2750 +mbib -2751 +phuv -2752 +raka -2753 +stad -2754 +yeyê -2755 +óteo -2756 +İzge -2757 +вскі -2758 +язык -2759 +אסיה -2760 +رىنچ -2761 +સેન્ -2762 +ร์ไบ -2763 +ན་ཇི -2764 +▁hoa -2765 +▁min -2766 +▁нов -2767 +▁ഒന് -2768 +aazje -2769 +allpa -2770 +bivhi -2771 +geris -2772 +ghyij -2773 +ghánà -2774 +gjuha -2775 +grisa -2776 +huacā -2777 +iaeth -2778 +iaita -2779 +ibili -2780 +idias -2781 +iejui -2782 +ikani -2783 +ilîzî -2784 +ingla -2785 +inglé -2786 +inida -2787 +injil -2788 +ipala -2789 +iriis -2790 +itali -2791 +iňlis -2792 +iškas -2793 +jagha -2794 +jačen -2795 +jopis -2796 +jẹ́ọ́ -2797 +kania -2798 +kiped -2799 +.” -2800 +mu -2801 +tê -2802 +âu -2803 +ìn -2804 +ах -2805 +ль -2806 +тэ -2807 +ان -2808 +زى -2809 +لغ -2810 +ين -2811 +يې -2812 +ܓܠ -2813 +ܫܐ -2814 +ज् -2815 +চী -2816 +சு -2817 +▁à -2818 +▁ю -2819 +▁ك -2820 +▁ܐ -2821 +皮亚 -2822 +bhi -2823 +dur -2824 +ngo -2825 +sko -2826 +vli -2827 +æði -2828 +ìne -2829 +дах -2830 +ܢܓܠ -2831 +ज्य -2832 +চীন -2833 +▁dz -2834 +▁gh -2835 +▁يې -2836 +皮亚琴 -2837 +beur -2838 +bila -2839 +châu -2840 +kitê -2841 +saja -2842 +вски -2843 +زىلغ -2844 +ೀಡಿಯ -2845 +▁kik -2846 +▁mìn -2847 +▁ютэ -2848 +▁চীন -2849 +皮亚琴察 -2850 +durka -2851 +ising -2852 +kawsa -2853 
+kitaa -2854 +ksodu -2855 +kulnu -2856 +kéyah -2857 +laisa -2858 +lenga -2859 +lengh -2860 +lerin -2861 +liant -2862 +likwa -2863 +linga -2864 +lizce -2865 +lteer -2866 +lunga -2867 +lèisa -2868 +maati -2869 +mambí -2870 +melos -2871 +mhuri -2872 +mière -2873 +monum -2874 +naati -2875 +nalis -2876 +natio -2877 +ngaha -2878 +ngísi -2879 +ntivo -2880 +osiyo -2881 +owska -2882 +ozneg -2883 +panhu -2884 +pipir -2885 +plais -2886 +pobla -2887 +první -2888 +qallu -2889 +rafia -2890 +repub -2891 +revet -2892 +rkiet -2893 +rszág -2894 +salmi -2895 +shiya -2896 +siyop -2897 +sjina -2898 +skeje -2899 +mô -2900 +rè -2901 +ેજ -2902 +ốc -2903 +▁臺 -2904 +▁보 -2905 +▁크 -2906 +레브 -2907 +서간 -2908 +스키 -2909 +지리 -2910 +첫째 -2911 +ckô -2912 +ede -2913 +gli -2914 +jen -2915 +kee -2916 +kil -2917 +qia -2918 +que -2919 +quī -2920 +sin -2921 +uer -2922 +ેજી -2923 +▁kỳ -2924 +▁臺灣 -2925 +▁보낸 -2926 +▁서간 -2927 +▁첫째 -2928 +▁크룰 -2929 +aasi -2930 +asav -2931 +doua -2932 +enga -2933 +esia -2934 +mädä -2935 +rkei -2936 +wiki -2937 +دفور -2938 +রসেন -2939 +ვიკი -2940 +▁oul -2941 +▁per -2942 +레브스키 -2943 +atsin -2944 +gliba -2945 +ieina -2946 +kilti -2947 +quīxt -2948 +slani -2949 +spolo -2950 +sprog -2951 +surat -2952 +sveto -2953 +tania -2954 +tchaj -2955 +thuis -2956 +tiede -2957 +tiong -2958 +tirki -2959 +tisch -2960 +trung -2961 +turkä -2962 +tuuri -2963 +twere -2964 +tíreo -2965 +túrkì -2966 +türgi -2967 +türgü -2968 +tōlli -2969 +uamua -2970 +ukadd -2971 +ulika -2972 +unang -2973 +unghv -2974 +utama -2975 +vanas -2976 +vicip -2977 +vithi -2978 +vlika -2979 +wanza -2980 +wicip -2981 +yamba -2982 +yddia -2983 +yesem -2984 +ziman -2985 +ázsia -2986 +çheer -2987 +íobla -2988 +îgarî -2989 +āzija -2990 +ĉenco -2991 +ēdija -2992 +īterz -2993 +ķīnas -2994 +ōlelo -2995 +ščina -2996 +ƿicip -2997 +έντζα -2998 +ασίαν -2999 +sì -3000 +къ -3001 +יי -3002 +ഘാ -3003 +노비 -3004 +dim -3005 +gaf -3006 +ghá -3007 +isi -3008 +khô -3009 +pii -3010 +taï -3011 +yin -3012 +àis -3013 +ánh -3014 +ഘാന -3015 +ྐད། -3016 +▁mo 
-3017 +▁vr -3018 +anaa -3019 +angi -3020 +azja -3021 +doti -3022 +dotu -3023 +qana -3024 +ulul -3025 +ասիա -3026 +ܠܫܢܐ -3027 +ംഗ്ല -3028 +ມສາດ -3029 +▁aan -3030 +▁gen -3031 +▁han -3032 +▁кыв -3033 +aedia -3034 +burch -3035 +gaana -3036 +ghána -3037 +gráfì -3038 +onglu -3039 +prima -3040 +twrci -3041 +zdotu -3042 +àisia -3043 +γκάνα -3044 +γραφή -3045 +ισμόσ -3046 +ουργκ -3047 +πιατσ -3048 +πολιτ -3049 +инҷил -3050 +йылме -3051 +աստվա -3052 +լերեն -3053 +կույթ -3054 +փեդիա -3055 +גרפיה -3056 +נגליש -3057 +תרבות -3058 +رىنچى -3059 +ليزية -3060 +گلیسی -3061 +ܐܓܪܬܐ -3062 +ܐܝܛܙܢ -3063 +ܓܪܦܝܐ -3064 +ܛܝܡܬܐ -3065 +ܝܒܫܬܐ -3066 +ܝܦܕܝܐ -3067 +ܢܓܠܝܐ -3068 +ނގިރޭ -3069 +ކީވިލ -3070 +ުރާފީ -3071 +अंग्र -3072 +आशिया -3073 +एसिया -3074 +टर्की -3075 +बाइबल -3076 +बायबल -3077 +स्तान -3078 +জাতন্ -3079 +সংস্ক -3080 +ਪੀਡਿਆ -3081 +ਬਾਈਬਲ -3082 +ਭਾਸ਼ਾ -3083 +અંગ્ર -3084 +એશિયા -3085 +બાઇબલ -3086 +ભૂગોળ -3087 +ଏସିଯା -3088 +ஆங்கி -3089 +ஆசியா -3090 +சீனக் -3091 +புவிய -3092 +ఆంగ్ల -3093 +ఆసియా -3094 +టర్కీ -3095 +త్రము -3096 +భూగోళ -3097 +ర్సెన -3098 +ಆಂಗ್ಲ -3099 +▁ല -3100 +aar -3101 +ake -3102 +ira -3103 +kob -3104 +kok -3105 +kum -3106 +mad -3107 +ngî -3108 +pre -3109 +sow -3110 +tie -3111 +tiế -3112 +ína -3113 +זיע -3114 +▁pu -3115 +anos -3116 +arla -3117 +azie -3118 +blik -3119 +chia -3120 +ebol -3121 +eija -3122 +eshe -3123 +leko -3124 +leng -3125 +lisc -3126 +lisy -3127 +long -3128 +mbaz -3129 +mədə -3130 +pirm -3131 +resa -3132 +unhu -3133 +ushe -3134 +þēce -3135 +אזיע -3136 +కృతి -3137 +▁chy -3138 +▁nsọ -3139 +▁pak -3140 +aziya -3141 +bibla -3142 +burga -3143 +inlis -3144 +lesch -3145 +longa -3146 +pigaz -3147 +tchie -3148 +tiếng -3149 +turka -3150 +ותיוס -3151 +லியம் -3152 +ಏಷ್ಯಾ -3153 +ಟರ್ಕಿ -3154 +ಬೈಬಲ್ -3155 +ಭೂಗೋಳ -3156 +ವಿಕಿಪ -3157 +ഇംഗ്ല -3158 +തുര്‍ -3159 +പീഡിയ -3160 +ബൈബിൾ -3161 +ഭൂമിശ -3162 +സംസ്ക -3163 +භූගෝල -3164 +සංස්ක -3165 +กิพีเ -3166 +คัมภี -3167 +ตุรกี -3168 +สาธาร -3169 +ພາສາອ -3170 +ພີເດຍ -3171 +ັງກິດ -3172 +ཤེ་ཡ། -3173 +တရုတ် -3174 +ပထဝီဝ -3175 +ბურგი -3176 
+გეოგრ -3177 +ისური -3178 +პედია -3179 +რზენი -3180 +ኢውተዘን -3181 +ውክፔዲያ -3182 +ᏫᎩᏇᏗᏯ -3183 +ᓄᓇᐅᔪᖅ -3184 +វិគីភ -3185 +អាស៊ី -3186 +▁aahe -3187 +▁adam -3188 +▁asia -3189 +▁brev -3190 +▁chib -3191 +▁enci -3192 +▁ency -3193 +▁ensi -3194 +▁eshi -3195 +▁free -3196 +▁hano -3197 +▁hast -3198 +▁hese -3199 +:' -3200 +ܘܣ -3201 +phi -3202 +thú -3203 +tob -3204 +ियः -3205 +ance -3206 +asao -3207 +asía -3208 +edje -3209 +kumo -3210 +lese -3211 +loda -3212 +maan -3213 +pada -3214 +xris -3215 +yeth -3216 +▁aru -3217 +▁cum -3218 +▁kwa -3219 +▁sha -3220 +▁shi -3221 +asiia -3222 +tturk -3223 +turkí -3224 +երսեն -3225 +گلیزی -3226 +▁ilẹ̀ -3227 +▁keel -3228 +▁kiil -3229 +▁kron -3230 +▁kuka -3231 +▁kína -3232 +▁leid -3233 +▁moku -3234 +▁ndel -3235 +▁nowy -3236 +▁okum -3237 +▁omat -3238 +▁omuu -3239 +▁osho -3240 +▁pele -3241 +▁quốc -3242 +▁simi -3243 +▁sìne -3244 +▁tabu -3245 +▁tymo -3246 +▁vedy -3247 +▁vola -3248 +▁vrye -3249 +▁waal -3250 +▁waye -3251 +▁xina -3252 +▁yang -3253 +▁yazu -3254 +▁áise -3255 +▁двур -3256 +▁език -3257 +▁және -3258 +▁кара -3259 +▁келн -3260 +▁кель -3261 +▁кяль -3262 +▁мотт -3263 +▁новы -3264 +▁тили -3265 +▁тыла -3266 +▁тілі -3267 +▁язык -3268 +▁ѩꙁꙑ́ -3269 +▁הראש -3270 +▁بولس -3271 +▁تىلى -3272 +▁دفور -3273 +▁زبون -3274 +▁مقدس -3275 +▁ܕܠܘܬ -3276 +▁गणरा -3277 +▁ভাষা -3278 +▁ભાષા -3279 +▁శాస్ -3280 +▁විද් -3281 +ウィキペデ -3282 +チェンツァ -3283 +テモテへの -3284 +提摩太前書 -3285 +뒤스부르크 -3286 +aasije -3287 +aasiya -3288 +amattu -3289 +angels -3290 +angilɛ -3291 +angles -3292 +anglez -3293 +anglum -3294 +anglés -3295 +anglëe -3296 +angule -3297 +aniyat -3298 +baarle -3299 +baebol -3300 +baibel -3301 +baibûl -3302 +bebele -3303 +beibel -3304 +beurla -3305 +bibbja -3306 +bibeln -3307 +biblie -3308 +bibulu -3309 +biibël -3310 +bijbel -3311 +biwwel -3312 +burgum -3313 +bèibel -3314 +bíbélì -3315 +bībele -3316 +cemānā -3317 +centia -3318 +chhièn -3319 +cultúr -3320 +cunghv -3321 +cência -3322 +domány -3323 +dorydh -3324 +dyuspa -3325 +eediya -3326 +eerste -3327 +fiteny -3328 +földra 
-3329 +första -3330 +gafano -3331 +galati -3332 +gelske -3333 +geogra -3334 +geògra -3335 +giella -3336 +glibau -3337 +glizit -3338 +gwerin -3339 +hitapu -3340 +hololo -3341 +ibheri -3342 +ibhile -3343 +ibíídi -3344 +ielski -3345 +inarum -3346 +ingesi -3347 +ingliz -3348 +inglès -3349 +inglés -3350 +inglês -3351 +inhlon -3352 +inlish -3353 +italia -3354 +italie -3355 +itàlia -3356 +kawsay -3357 +kerazu -3358 +kiinan -3359 +kitêba -3360 +kojska -3361 +komara -3362 +krambu -3363 +kselle -3364 +ksodus -3365 +kuidua -3366 +kumona -3367 +kültür -3368 +lannin -3369 +lenghe -3370 +lengua -3371 +libhay -3372 +luenga -3373 +lèngoa -3374 +lýðvel -3375 +mbangu -3376 +moxtli -3377 +muqadd -3378 +mäinen -3379 +mìnira -3380 +nangan -3381 +nguéra -3382 +niyyət -3383 +orílẹ̀ -3384 +paulus -3385 +pediya -3386 +piibel -3387 +piibli -3388 +pjačen -3389 +rāfija -3390 +sëinsa -3391 +tanbul -3392 +tayvän -3393 +taïwan -3394 +taýwan -3395 +tendom -3396 +teusza -3397 +turcja -3398 +tureke -3399 +turkie -3400 +turkii -3401 +turkio -3402 +turkki -3403 +turkye -3404 +turkäi -3405 +turqia -3406 +tutske -3407 +tuykia -3408 +törkie -3409 +töröko -3410 +türkän -3411 +ululwa -3412 +uraafi -3413 +viquip -3414 +viqùip -3415 +waraka -3416 +wikang -3417 +yacouh -3418 +ywachi -3419 +zimanê -3420 +întâia -3421 +òlòkùn -3422 +ürkiye -3423 +čenské -3424 +İncile -3425 +İngliz -3426 +ılızki -3427 +łéngua -3428 +ɛlɛ́sa -3429 +κρατία -3430 +леўскі -3431 +ագրութ -3432 +աշխարհ -3433 +պորտալ -3434 +אוטרסן -3435 +איטערז -3436 +אנגלית -3437 +ביבליה -3438 +געאגרא -3439 +האיגרת -3440 +הרפובל -3441 +וויקיפ -3442 +טורקיה -3443 +טערקיי -3444 +ענגליש -3445 +קולטור -3446 +زىلغان -3447 +موتاوس -3448 +ܛܘܪܩܝܐ -3449 +ގާފަތު -3450 +तुर्की -3451 +बाइबिल -3452 +এছিয়া -3453 +ਅੰਗ੍ਰੇ -3454 +ઊએતેર્ -3455 +કસ્તાન -3456 +પીડિયા -3457 +ର୍ସେନ୍ -3458 +டியரசு -3459 +விக்கி -3460 +బైబిల్ -3461 +ర్సెన్ -3462 +వికీపీ -3463 +ರ್ಸೆನ್ -3464 +തിമോത് -3465 +വിക്കി -3466 +സിനെഴു -3467 +ආසියාව -3468 +පීඩියා -3469 +ภูมิศา -3470 +อังกฤษ -3471 
+ພູມສາດ -3472 +ེར་སེན -3473 +ဝီကီပိ -3474 +ბიბლია -3475 +እንግሊዝኛ -3476 +ᎦᏬᏂᎯᏍᏗ -3477 +ᐃᐊᐃᑖᓯᓐ -3478 +ᐅᐃᑭᐱᑎᐊ -3479 +ភូមិវិ -3480 +ẹ̀ẹ́sì -3481 +▁accra -3482 +▁bhasa -3483 +▁chine -3484 +▁către -3485 +▁enzyk -3486 +▁freie -3487 +▁godka -3488 +▁jakob -3489 +▁japan -3490 +▁johan -3491 +▁kieli -3492 +▁kinni -3493 +▁kirje -3494 +▁kitob -3495 +▁korea -3496 +▁lefyo -3497 +▁links -3498 +▁livre -3499 +▁lukas -3500 +▁mepin -3501 +▁mímọ́ -3502 +▁ngaha -3503 +▁nyelv -3504 +▁otava -3505 +▁ovana -3506 +▁paulo -3507 +▁pavel -3508 +▁pismo -3509 +▁pîroz -3510 +▁saina -3511 +▁saphi -3512 +▁simin -3513 +▁sjina -3514 +▁språk -3515 +▁sprǣc -3516 +▁sulat -3517 +▁tasav -3518 +▁tashi -3519 +▁thánh -3520 +▁tirki -3521 +▁ĉinio -3522 +▁γραφή -3523 +▁κίνασ -3524 +▁инҷил -3525 +▁йылме -3526 +▁кроле -3527 +▁крулє -3528 +▁кітай -3529 +▁хятад -3530 +▁інжіл -3531 +▁јазик -3532 +▁језик -3533 +▁الصين -3534 +▁تيموث -3535 +▁كرولي -3536 +▁ܛܝܡܬܐ -3537 +▁ܩܕܝܫܐ -3538 +▁ਭਾਸ਼ਾ -3539 +▁ലേഖനം -3540 +▁ṣáínà -3541 +ウエテルセン -3542 +aardrij -3543 +akwụkwọ -3544 +anglais -3545 +baibolo -3546 +baiboly -3547 +baibuli -3548 +baibulo -3549 +bhicipè -3550 +biblijo -3551 +biblían -3552 +biblíya -3553 +biibili -3554 +bivhili -3555 +bìoball -3556 +bíblian -3557 +bėblėjė -3558 +cheascã -3559 +cultoor -3560 +cultură -3561 +cultuur -3562 +diminwa -3563 +douaron -3564 +eaŋgals -3565 +ekulilo -3566 +enenene -3567 +geuogra -3568 +giograf -3569 +iashiya -3570 +iengels -3571 +ingeles -3572 +ingelsk -3573 +inglise -3574 +inglish -3575 +inglüse -3576 +itersen -3577 +iterzen -3578 +jamhuri -3579 +jendźel -3580 +jeograf -3581 +kitaaba -3582 +koltūra -3583 +kskunde -3584 +kultuer -3585 +kulture -3586 +kulturi -3587 +kuttura -3588 +kùltura -3589 +laíocht -3590 +leeaght -3591 +lerineq -3592 +lopädie -3593 +länapük -3594 +mbibeli -3595 +menning -3596 +nàkalan -3597 +paipala -3598 +pigàsàn -3599 +pipiria -3600 +publica -3601 +publika -3602 +républi -3603 +saesneg -3604 +saozneg -3605 +sbrevet -3606 +slanica -3607 +sowsnek -3608 +síngísi -3609 
+taihuan -3610 +taiwana -3611 +taledav -3612 +thuisco -3613 +tierkei -3614 +torkėjė -3615 +turchìa -3616 +turchía -3617 +turcija -3618 +turcyjo -3619 +tureuki -3620 +turkeya -3621 +turkeye -3622 +turkiet -3623 +turkija -3624 +turquie -3625 +turčija -3626 +tyrkiet -3627 +tëreckô -3628 +türchia -3629 +türkiyə -3630 +türkiýe -3631 +tırkiya -3632 +uicchip -3633 +ukaddes -3634 +uraqita -3635 +uturuki -3636 +yterzen -3637 +üterzen -3638 +ġeograf -3639 +İngilis -3640 +αγγλική -3641 +йрамдах -3642 +թուրքիա -3643 +ծաշունչ -3644 +דיסבורג -3645 +ނގިރޭސި -3646 +इंग्लिश -3647 +युटरझेन -3648 +উটেরসেন -3649 +டர்சென் -3650 +பண்பாடு -3651 +ാസ്ത്രം -3652 +ณรัฐจีน -3653 +ทร์เซิน -3654 +ทวีปเอเ -3655 +วิกิพีเ -3656 +ພະຄໍາພີ -3657 +ཏུར་ཀི། -3658 +འི་སྐད། -3659 +ပထဝီဝင် -3660 +თურქეთი -3661 +კულტურა -3662 +ტაივანი -3663 +ᖃᓪᓗᓈᑎᑐᑦ -3664 +វប្បធម៌ -3665 +▁aaroma -3666 +▁aishey -3667 +▁anglés -3668 +▁aveshe -3669 +▁bilong -3670 +▁bizaad -3671 +▁béarla -3672 +▁bíobla -3673 +▁daniel -3674 +▁dinida -3675 +▁dulika -3676 +▁første -3677 +▁ghaney -3678 +▁inglis -3679 +▁inglés -3680 +▁jesaja -3681 +▁kepada -3682 +▁kineze -3683 +▁kwanza -3684 +▁linguo -3685 +▁mabelé -3686 +▁mateus -3687 +▁metoko -3688 +▁muamua -3689 +▁naadam -3690 +▁naanaa -3691 +▁ngrese -3692 +▁ngrisa -3693 +▁nokuli -3694 +▁nomeye -3695 +▁omunhu -3696 +▁onghee -3697 +▁pākehā -3698 +▁qillqa -3699 +▁spraak -3700 +▁sproch -3701 +▁taipei -3702 +▁turkee -3703 +▁ulikwa -3704 +▁vaadam -3705 +▁valoda -3706 +▁waadam -3707 +▁épître -3708 +▁γλώσσα -3709 +▁ѩꙁꙑ́къ -3710 +▁הסינית -3711 +▁الأولى -3712 +▁الرسول -3713 +▁ܐܢܓܠܝܐ -3714 +▁ܩܕܡܝܬܐ -3715 +▁ޗައިނާ -3716 +▁ഒന്നാം -3717 +▁ᨕᨗᨋᨗᨔᨗ -3718 +▁크룰레브스키 -3719 +ウィキペディア -3720 +デュースブルク -3721 +ピアチェンツァ -3722 +angelsko -3723 +angliais -3724 +angliana -3725 +chingere -3726 +clopedia -3727 +clopédia -3728 +coğrafya -3729 +cunghvaz -3730 +duisborg -3731 +dīsburga -3732 +engelska -3733 +englesch -3734 +englisch -3735 +eografia -3736 +episteli -3737 +epistula -3738 +geografy -3739 +huriyeti -3740 +ianayōtl -3741 
+ingereza -3742 +ingiriis -3743 +ingristo -3744 +iningles -3745 +ininglis -3746 +isingesi -3747 +isingisi -3748 +iterzene -3749 +jiograpi -3750 +kipedija -3751 +klopedie -3752 +melosuuf -3753 +monument -3754 +muqaddas -3755 +mädäniät -3756 +naatitut -3757 +nunaujuq -3758 +ografeye -3759 +pagsasao -3760 +phuvipen -3761 +piacenze -3762 +piaçensa -3763 +piaĉenco -3764 +pirmasis -3765 +pjačenca -3766 +plasença -3767 +poblachd -3768 +première -3769 +primeira -3770 +raamattu -3771 +ripablik -3772 +senyesem -3773 +taglizit -3774 +taivanas -3775 +tarakalt -3776 +tohitapu -3777 +turcland -3778 +turtchie -3779 +tyrkland -3780 +vicipéid -3781 +vikipedi -3782 +vikipetã -3783 +vouiquip -3784 +xristian -3785 +yddiaeth -3786 +yinghyij -3787 +ängelske -3788 +īterzens -3789 +ντούισμπ -3790 +անգլերեն -3791 +մշակույթ -3792 +איטערזען -3793 +גאוגרפיה -3794 +ויקיפדיה -3795 +ܓܐܘܓܪܦܝܐ -3796 +ܘܝܩܝܦܕܝܐ -3797 +ބައިބަލް -3798 +ކީވިލާތް -3799 +ސަގާފަތު -3800 +ޖުމްހޫރީ -3801 +ޖުޣުރާފީ -3802 +आंगलभाषा -3803 +संस्कृती -3804 +উটার্সেন -3805 +প্রজাতন্ -3806 +সংস্কৃতি -3807 +અંગ્રેજી -3808 +ஆங்கிலம் -3809 +துருக்கி -3810 +ப்பீடியா -3811 +సంస్కృతి -3812 +ഇംഗ്ലീഷ് -3813 +സംസ്കാരം -3814 +ร์ไบเบิล -3815 +วัฒนธรรม -3816 +གསུང་རབ། -3817 +དབྱིན་ཇི -3818 +ཨེ་ཤེ་ཡ། -3819 +တနိုင်ငံ -3820 +ᐊᑲᔭᓯᒧᐃᐧᐣ -3821 +វិគីភីឌា -3822 +▁anglese -3823 +▁angleze -3824 +▁anglica -3825 +▁anglisy -3826 +▁bikéyah -3827 +▁chineză -3828 +▁chińska -3829 +▁chińsko -3830 +▁ehololo -3831 +▁eksodus -3832 +▁engleză -3833 +▁englisc -3834 +▁english -3835 +▁epsalmi -3836 +▁genesis -3837 +▁gẹ̀ẹ́sì -3838 +▁hambili -3839 +▁iilonga -3840 +▁inggréh -3841 +▁inglexe -3842 +▁inglish -3843 +▁ingresa -3844 +▁itavele -3845 +▁kungaha -3846 +▁laiškas -3847 +▁lettera -3848 +▁minzgoz -3849 +▁ngaashi -3850 +▁okumang -3851 +▁oulunde -3852 +▁pertama -3853 +▁publica -3854 +▁sinarum -3855 +▁sproake -3856 +▁thembaz -3857 +▁timotei -3858 +▁timóteo -3859 +▁tsieina -3860 +▁türkiye -3861 +▁wopanhu -3862 +▁англисӣ -3863 +▁тимофею -3864 +▁ютэрзен -3865 +▁הראשונה -3866 
+▁انگلیسی -3867 +▁بىرىنچى -3868 +▁ܛܝܡܬܐܘܣ -3869 +▁गणराज्य -3870 +▁ಶಾಸ್ತ್ರ -3871 +テモテへの手紙一 -3872 +angalteer -3873 +angilɛkan -3874 +bhaibheri -3875 +bilagáana -3876 +coğrafiya -3877 +duisbourg -3878 +duisburch -3879 +engelsche -3880 +englannin -3881 +erdnîgarî -3882 +eŋlisigbe -3883 +geografía -3884 +geugrafia -3885 +giografia -3886 +giografìa -3887 +gyeografi -3888 +heografia -3889 +heografie -3890 +huacāyōtl -3891 +ibíídiiya -3892 +inglatlah -3893 +jeografia -3894 +jewografi -3895 +jiografia -3896 +juquraafi -3897 +jéografie -3898 +kulttuuri -3899 +lingaedje -3900 +lýðveldið -3901 +maantiede -3902 +maatiidüs -3903 +madaniyat -3904 +piasëinsa -3905 +pigazanos -3906 +placentia -3907 +placência -3908 +plaisance -3909 +plasencia -3910 +republike -3911 +repuvlika -3912 +rypublika -3913 +rèpublica -3914 +turkaland -3915 +turkojska -3916 +turkowska -3917 +uikipitia -3918 +utamaduni -3919 +vikipedio -3920 +wicipedia -3921 +wikipedio -3922 +wikipikia -3923 +yterzenas -3924 +zemljopis -3925 +İngilizce -3926 +İngılızki -3927 +ƿicipǣdia -3928 +γεωγραφία -3929 +ուետերսեն -3930 +վիքիփեդիա -3931 +געאגראפיע -3932 +הרפובליקה -3933 +ויקיפידיה -3934 +އިނގިރޭސި -3935 +अंग्रेज़ी -3936 +ਅੰਗ੍ਰੇਜ਼ੀ -3937 +ਵਿਕਿਪੀਡਿਆ -3938 +புவியியல் -3939 +விவிலியம் -3940 +ವಿಕಿಪೀಡಿಯ -3941 +തുര്‍ക്കി -3942 +സിനെഴുതിയ -3943 +සංස්කෘතිය -3944 +ວິກິພີເດຍ -3945 +တရုတ်သမ္မ -3946 +ဝီကီပိဒိယ -3947 +გეოგრაფია -3948 +დუისბურგი -3949 +ვიკიპედია -3950 +ინგლისური -3951 +უეტერზენი -3952 +▁aagalati -3953 +▁aaheberi -3954 +▁anglèisa -3955 +▁englaisa -3956 +▁epistolo -3957 +▁epistolă -3958 +▁geografi -3959 +▁hastangi -3960 +▁hesekiel -3961 +▁inggeris -3962 +▁ingleise -3963 +▁istanbul -3964 +▁johannes -3965 +▁kikristo -3966 +▁kronkron -3967 +▁mokufuta -3968 +▁mukaddes -3969 +▁ndelenee -3970 +▁nokumona -3971 +▁okuyamba -3972 +▁omatimba -3973 +▁pokrambu -3974 +▁saywachi -3975 +▁timoteju -3976 +▁timoteüs -3977 +▁timothée -3978 +▁waalushe -3979 +▁îngilîzî -3980 +▁послание -3981 +▁тимотеју -3982 +▁טימותיוס -3983 +▁إنجليزية -3984 
+▁ئینگلیزی -3985 +▁تيموثاوس -3986 +▁تیموتاوس -3987 +▁يېزىلغان -3988 +▁குடியரசு -3989 +▁විද්‍යාව -3990 +angleščina -3991 +aperetania -3992 +biblioþēce -3993 +biquipedia -3994 +biquipédia -3995 +cheografía -3996 +chingerezi -3997 +diwylliant -3998 +dorydhyeth -3999 +duisburgas -4000 +engelšćina -4001 +földrajztu -4002 +geograafia -4003 +geografiya -4004 +geografiýa -4005 +geògrafijô -4006 +gjeografia -4007 +gjeografie -4008 +iaitaatsin -4009 +inhlonipho -4010 +jeografiya -4011 +jẹ́ọ́gráfì -4012 +kalinangan -4013 +kiingereza -4014 +landafræði -4015 +landafrøði -4016 +lingɛlɛ́sa -4017 +mədəniyyət -4018 +nalistisch -4019 +repubblica -4020 +respubliko -4021 +république -4022 +tetãnguéra -4023 +teōāmoxtli -4024 +uikipidias -4025 +vichipedia -4026 +vichipedie -4027 +vikipedija -4028 +vikipedėjė -4029 +vikipeedia -4030 +vikipeediä -4031 +vikipidiya -4032 +vikipēdija -4033 +viquipèdia -4034 +viqùipédie -4035 +wikipaedia -4036 +wikipedijô -4037 +wikipediýa -4038 +wikipedyjo -4039 +wîkîpediya -4040 +ġeografija -4041 +ģeogrāfija -4042 +γεωγραφίαν -4043 +δημοκρατία -4044 +πιατσέντζα -4045 +πολιτισμόσ -4046 +ագրություն -4047 +וויקיפעדיע -4048 +उएतेर्सेन् -4049 +तुर्कस्तान -4050 +विकिपीडियः -4051 +ઊએતેર્સેન્ -4052 +તુર્કસ્તાન -4053 +વિકિપીડિયા -4054 +ଊଏତେର୍ସେନ୍ -4055 +ఊఎతెర్సెన్ -4056 +వికీపీడియా -4057 +ಯುಟೆರ್ಸೆನ್ -4058 +തിമോത്തിയോ -4059 +විකිපීඩියා -4060 +ทวีปเอเชีย -4061 +ประเทศกานา -4062 +ภาษาอังกฤษ -4063 +ภูมิศาสตร์ -4064 +วิกิพีเดีย -4065 +ພາສາອັງກິດ -4066 +ཨུ་ཏེར་སེན -4067 +ភូមិវិទ្យា -4068 +▁angielski -4069 +▁chinskeje -4070 +▁kombibeli -4071 +▁kukalunga -4072 +▁olómìnira -4073 +▁omuuvithi -4074 +▁pelekania -4075 +▁pigazzano -4076 +▁poslanica -4077 +▁republika -4078 +▁tarkerazu -4079 +▁tasavalta -4080 +▁timoteovi -4081 +▁timotheum -4082 +▁timotheus -4083 +▁tirkiyeyê -4084 +▁wikipédia -4085 +▁кролевски -4086 +▁крулєвскі -4087 +▁найрамдах -4088 +▁посланица -4089 +▁كروليفسكي -4090 +▁శాస్త్రము -4091 +anglezikani -4092 +bhicipèidia -4093 +christendom -4094 +douaroniezh -4095 
+dùgùkòlòkùn -4096 +ensimmäinen -4097 +geuograpėjė -4098 +gweriniaeth -4099 +huiquipedia -4100 +ibhayibheli -4101 +ibhayibhile -4102 +ingristongo -4103 +linglänapük -4104 +nunalerineq -4105 +oaylleeaght -4106 +senyesemane -4107 +spoločenské -4108 +törökország -4109 +uicchipèdie -4110 +wikipediija -4111 +wikipeediya -4112 +wikkipedija -4113 +யூட்டர்சென் -4114 +വിക്കിപീഡിയ -4115 +ประเทศตุรกี -4116 +อือแทร์เซิน -4117 +▁harrangule -4118 +▁linenenene -4119 +▁metokolelo -4120 +▁nomeyeleko -4121 +▁okudiminwa -4122 +▁timotiejui -4123 +▁tymoteusza -4124 +▁каралеўскі -4125 +djeyografeye -4126 +jendźelšćina -4127 +libhayibheli -4128 +tíreolaíocht -4129 +vouiquipèdia -4130 +աստվածաշունչ -4131 +ތުރުކީވިލާތް -4132 +প্রজাতন্ত্রী -4133 +ഭൂമിശാസ്ത്രം -4134 +สาธารณรัฐจีน -4135 +▁cumhuriyeti -4136 +▁mepingafano -4137 +daearyddiaeth -4138 +eaŋgalsgiella -4139 +qallunaatitut -4140 +quīxtianayōtl -4141 +thuiscoburgum -4142 +wikiibíídiiya -4143 +ντούισμπουργκ -4144 +คัมภีร์ไบเบิล -4145 +▁anglicheascã -4146 +▁enciclopédia -4147 +▁encyclopedia -4148 +▁ensiklopedie -4149 +▁enzyklopädie -4150 +▁respublikasy -4151 +▁respublikası -4152 +▁республикасы -4153 +▁республиката -4154 +aardrijkskunde -4155 +inglatlahtōlli -4156 +விக்கிப்பீடியா -4157 +▁okumangululwa -4158 +▁timoteukselle -4159 +cemānāhuacāyōtl -4160 +nationalistisch -4161 +དབྱིན་ཇིའི་སྐད། -4162 +földrajztudomány -4163 +աշխարհագրություն -4164 +▁timotheosbrevet -4165 +akw -4166 +cic -4167 +föl -4168 +iýa -4169 +ińs -4170 +kîp -4171 +kòl -4172 +mäd -4173 +pob -4174 +rij -4175 +twe -4176 +twr -4177 +uju -4178 +uks -4179 +wyl -4180 +wụk -4181 +yyə -4182 +zne -4183 +ėbl -4184 +ōtl -4185 +αίδ -4186 +ντζ -4187 +πια -4188 +που -4189 +ρατ -4190 +ραφ -4191 +τισ -4192 +του -4193 +ύισ -4194 +ώσσ -4195 +але -4196 +гли -4197 +елн -4198 +лев -4199 +eml -4200 +fij -4201 +fit -4202 +mán -4203 +nez -4204 +wán -4205 +ата -4206 +ита -4207 +най -4208 +рес -4209 +рул -4210 +теј -4211 +ютэ -4212 +іта -4213 +ўск -4214 +հագ -4215 +շակ -4216 +սեն -4217 +սիա 
-4218 +տեր -4219 +տվա -4220 +րեն -4221 +אוט -4222 +איט -4223 +בות -4224 +וגר -4225 +יבל -4226 +יסב -4227 +יפד -4228 +יפי -4229 +יקי -4230 +ליה -4231 +מות -4232 +סינ -4233 +إلى -4234 +ئین -4235 +بول -4236 +تاو -4237 +ثاو -4238 +غان -4239 +قدس -4240 +نجل -4241 +ولي -4242 +ىرى -4243 +یمو -4244 +ینگ -4245 +ېزى -4246 +ܐܝܛ -4247 +ܘܓܪ -4248 +ܠܘܬ -4249 +ܣܝܐ -4250 +ܦܝܐ -4251 +ܩܕܡ -4252 +ބަލ -4253 +ކީވ -4254 +އިނ -4255 +ތުރ -4256 +ޖުމ -4257 +ަގާ -4258 +ިރޭ -4259 +ްހޫ -4260 +ंस् -4261 +आंग -4262 +इंग -4263 +इबल -4264 +इबि -4265 +उएत -4266 +किप -4267 +चीन -4268 +णरा -4269 +बाय -4270 +भाष -4271 +लभा -4272 +सेन -4273 +स्त -4274 +िकि -4275 +ेज़ -4276 +ইংর -4277 +উইক -4278 +উটা -4279 +তন্ -4280 +ন্ত -4281 +রসে -4282 +রস্ -4283 +সেন -4284 +স্ক -4285 +াইব -4286 +িয় -4287 +ুরস -4288 +েজি -4289 +েরস -4290 +্কৃ -4291 +্রজ -4292 +্রী -4293 +ਿਕਿ -4294 +અંગ -4295 +ર્ક -4296 +ર્સ -4297 +ાના -4298 +ତେର -4299 +kg -4300 +km -4301 +nj -4302 +îg -4303 +ये -4304 +ிப -4305 +ಗ್ -4306 +ಿಪ -4307 +ക് -4308 +ມສ -4309 +າສ -4310 +າອ -4311 +ິພ -4312 +ເດ -4313 +იბ -4314 +ტა -4315 +ულ -4316 +វប -4317 +ិវ -4318 +៊ី -4319 +្ប -4320 +스부 -4321 +azí -4322 +hee -4323 +imm -4324 +nay -4325 +nni -4326 +ovi -4327 +und -4328 +unh -4329 +unu -4330 +vet -4331 +yrk -4332 +äni -4333 +ìni -4334 +əni -4335 +мов -4336 +моф -4337 +շու -4338 +ուե -4339 +ույ -4340 +ուր -4341 +वान -4342 +ইবে -4343 +வில -4344 +விவ -4345 +ானா -4346 +ியர -4347 +ியி -4348 +ுக் -4349 +்கி -4350 +స్క -4351 +స్త -4352 +ికీ -4353 +ెర్ -4354 +్కీ -4355 +ಂಗ್ -4356 +ಪೀಡ -4357 +ಷ್ಯ -4358 +ಸೆನ -4359 +ಿಕಿ -4360 +ೆರ್ -4361 +್ತ್ -4362 +തുര -4363 +ത്ര -4364 +ുതി -4365 +ൂമി -4366 +്കാ -4367 +്കി -4368 +്തി -4369 +്നാ -4370 +ූගෝ -4371 +ෘති -4372 +งกฤ -4373 +ทศต -4374 +ร์เ -4375 +ัฒน -4376 +าษา -4377 +ิกิ -4378 +ีร์ -4379 +เทศ -4380 +າສາ -4381 +ິກິ -4382 +་ཀི -4383 +་ཇི -4384 +་རབ -4385 +་སེ -4386 +ང་ར -4387 +དབྱ -4388 +ེ་ཤ -4389 +ྱིན -4390 +တနိ -4391 +သမ္ -4392 +ုတ် -4393 +ვან -4394 +ზენ -4395 +კიპ -4396 +კულ -4397 +ოგრ -4398 +პედ -4399 +hó -4400 +hú -4401 +họ 
-4402 +jb -4403 +kí -4404 +lk -4405 +má -4406 +rd -4407 +uj -4408 +wo -4409 +íb -4410 +öl -4411 +ėb -4412 +лм -4413 +դի -4414 +թո -4415 +խա -4416 +ծա -4417 +յո -4418 +նա -4419 +նչ -4420 +սի -4421 +տա -4422 +տե -4423 +տվ -4424 +րհ -4425 +րո -4426 +րս -4427 +ւն -4428 +փե -4429 +به -4430 +یز -4431 +ާނ -4432 +अं -4433 +ऊए -4434 +एश -4435 +घा -4436 +लभ -4437 +िप -4438 +ीड -4439 +ुर -4440 +ृत -4441 +ছি -4442 +େର -4443 +ิน -4444 +་ཤ -4445 +ბუ -4446 +თუ -4447 +ნგ -4448 +რქ -4449 +სბ -4450 +უი -4451 +ግሊ -4452 +ᑭᐱ -4453 +ụk -4454 +キペ -4455 +ュー -4456 +ahe -4457 +enc -4458 +ien -4459 +iia -4460 +ins -4461 +muu -4462 +thu -4463 +εια -4464 +ܫܬܐ -4465 +ূগো -4466 +ტაი -4467 +ტურ -4468 +ურგ -4469 +ურქ -4470 +ንግሊ -4471 +ፔዲያ -4472 +ᏂᎯᏍ -4473 +ᐅᐃᑭ -4474 +ᐱᑎᐊ -4475 +ᑖᓯᓐ -4476 +ᑲᔭᓯ -4477 +ᓄᓇᐅ -4478 +ᓈᑎᑐ -4479 +ភីឌ -4480 +វប្ -4481 +វិគ -4482 +ẹ̀ẹ -4483 +ẹ́ọ -4484 +‍යා -4485 +▁pc -4486 +▁ан -4487 +▁ин -4488 +▁ты -4489 +▁ті -4490 +▁яз -4491 +▁הר -4492 +▁טי -4493 +▁إن -4494 +▁او -4495 +▁تى -4496 +▁دف -4497 +▁مق -4498 +▁ܐܢ -4499 +br -4500 +fy -4501 +go -4502 +gî -4503 +ið -4504 +lê -4505 +oc -4506 +od -4507 +pä -4508 +rr -4509 +uy -4510 +vr -4511 +we -4512 +wi -4513 +yî -4514 +çh -4515 +éo -4516 +ëj -4517 +ìo -4518 +úr -4519 +ŋl -4520 +́s -4521 +ατ -4522 +βι -4523 +γρ -4524 +ει -4525 +ημ -4526 +κά -4527 +κή -4528 +κι -4529 +μπ -4530 +ργ -4531 +τί -4532 +τζ -4533 +τι -4534 +ύι -4535 +ят -4536 +հա -4537 +ստ -4538 +בו -4539 +וג -4540 +וו -4541 +טו -4542 +יב -4543 +יג -4544 +סב -4545 +עא -4546 +פי -4547 +רפ -4548 +רק -4549 +רת -4550 +ުޣ -4551 +ઘા -4552 +તા -4553 +ના -4554 +ેન -4555 +ોળ -4556 +્ક -4557 +્ર -4558 +્સ -4559 +ทร -4560 +ິກ -4561 +ზი -4562 +百科 -4563 +aga -4564 +ala -4565 +bre -4566 +eke -4567 +ers -4568 +ură -4569 +ypu -4570 +φία -4571 +рзе -4572 +גרא -4573 +ிக் -4574 +་སྐ -4575 +▁भा -4576 +▁চী -4577 +▁වි -4578 +▁ᨕᨗ -4579 +ィキペ -4580 +ースブ -4581 +杜伊斯 -4582 +維基百 -4583 +维基百 -4584 +레브스 -4585 +룰레브 -4586 +anhu -4587 +blia -4588 +cemā -4589 +chie -4590 +cile -4591 +cipé -4592 +dory -4593 
+eagh -4594 +edio -4595 +eedi -4596 +eldi -4597 +elšć -4598 +enco -4599 +ah -4600 +bp -4601 +bs -4602 +bé -4603 +că -4604 +dz -4605 +dâ -4606 +dè -4607 +ev -4608 +ff -4609 +gĭ -4610 +gɛ -4611 +gẹ -4612 +hd -4613 +hh -4614 +ln -4615 +lü -4616 +nc -4617 +oj -4618 +po -4619 +tc -4620 +tt -4621 +ut -4622 +vn -4623 +zd -4624 +zk -4625 +ás -4626 +én -4627 +êb -4628 +îp -4629 +ør -4630 +ći -4631 +ır -4632 +ńs -4633 +αί -4634 +σμ -4635 +аз -4636 +ар -4637 +ву -4638 +לט -4639 +ަގ -4640 +ާފ -4641 +ުމ -4642 +ޭސ -4643 +ްހ -4644 +गल -4645 +ीन -4646 +টে -4647 +பீ -4648 +ాస -4649 +ಟರ -4650 +ಿಕ -4651 +കാ -4652 +ཏེ -4653 +བྱ -4654 +རབ -4655 +སེ -4656 +ིའ -4657 +ုင -4658 +ến -4659 +▁ܩ -4660 +エテ -4661 +르크 -4662 +ber -4663 +hie -4664 +inw -4665 +ius -4666 +let -4667 +pra -4668 +tel -4669 +thé -4670 +uta -4671 +íya -4672 +āyō -4673 +ेर् -4674 +ประ -4675 +▁em -4676 +▁hó -4677 +▁lm -4678 +▁tp -4679 +chía -4680 +endź -4681 +enyo -4682 +eret -4683 +erta -4684 +esne -4685 +firs -4686 +garî -4687 +gres -4688 +gwer -4689 +heli -4690 +hemb -4691 +henk -4692 +hiya -4693 +ibíí -4694 +iliz -4695 +ilîz -4696 +inin -4697 +inli -4698 +ioma -4699 +bc -4700 +bh -4701 +fo -4702 +iy -4703 +iz -4704 +kė -4705 +lė -4706 +nb -4707 +oà -4708 +pm -4709 +pė -4710 +pǣ -4711 +rp -4712 +sm -4713 +ud -4714 +uq -4715 +ve -4716 +wá -4717 +zė -4718 +àn -4719 +éi -4720 +ör -4721 +üt -4722 +ğa -4723 +īt -4724 +ňl -4725 +ūr -4726 +ən -4727 +̀ẹ -4728 +έν -4729 +кі -4730 +хя -4731 +գա -4732 +ւր -4733 +נג -4734 +لص -4735 +ܘܓ -4736 +ܝܩ -4737 +ނގ -4738 +ަލ -4739 +इं -4740 +কি -4741 +তন -4742 +বে -4743 +রজ -4744 +রী -4745 +রে -4746 +়া -4747 +াই -4748 +িড -4749 +িয -4750 +্ত -4751 +ಆಂ -4752 +ೆರ -4753 +ാം -4754 +ຄໍ -4755 +ີເ -4756 +ឌា -4757 +理學 -4758 +维基 -4759 +baz -4760 +bie -4761 +frø -4762 +lšć -4763 +naa -4764 +qui -4765 +sae -4766 +tol -4767 +uan -4768 +ung -4769 +ঘান -4770 +ೂಗೋ -4771 +▁cb -4772 +▁ка -4773 +cipǣ -4774 +clas -4775 +jska -4776 +juha -4777 +kasy -4778 +kası -4779 +kiet -4780 +klan -4781 +kufu -4782 +kuya 
-4783 +ldra -4784 +lisi -4785 +lizi -4786 +lèis -4787 +lían -4788 +líya -4789 +lėjė -4790 +mane -4791 +mang -4792 +mljo -4793 +nayō -4794 +ngil -4795 +nglu -4796 +ngoa -4797 +ngvo -4798 +nika -4799 +ae -4800 +aí -4801 +aï -4802 +bn -4803 +cu -4804 +dj -4805 +eo -4806 +gy -4807 +hí -4808 +iq -4809 +oč -4810 +pè -4811 +rh -4812 +rj -4813 +rq -4814 +rá -4815 +sc -4816 +sz -4817 +tà -4818 +uí -4819 +uī -4820 +wy -4821 +xr -4822 +yi -4823 +zz -4824 +àl -4825 +áa -4826 +çi -4827 +èi -4828 +ír -4829 +ùg -4830 +če -4831 +ęz -4832 +ōt -4833 +ƿi -4834 +σί -4835 +τσ -4836 +մշ -4837 +רג -4838 +רס -4839 +قد -4840 +لم -4841 +ܘܬ -4842 +ܝܛ -4843 +ܩܝ -4844 +ܪܬ -4845 +ܫܬ -4846 +ते -4847 +ाष -4848 +िक -4849 +র্ -4850 +স্ -4851 +াষ -4852 +ડિ -4853 +િય -4854 +டு -4855 +າພ -4856 +སུ -4857 +ཨེ -4858 +ეთ -4859 +ውክ -4860 +ፔዲ -4861 +ᏂᎯ -4862 +ᐊᐃ -4863 +紙一 -4864 +耳其 -4865 +화민 -4866 +anz -4867 +aye -4868 +iae -4869 +idi -4870 +ist -4871 +nie -4872 +tii -4873 +tla -4874 +uha -4875 +ymo -4876 +უეტ -4877 +▁wu -4878 +dusa -4879 +jagh -4880 +nira -4881 +nîga -4882 +osuu -4883 +ozne -4884 +paip -4885 +paul -4886 +plas -4887 +quía -4888 +rafi -4889 +rajz -4890 +rase -4891 +riis -4892 +rkia -4893 +ronk -4894 +rszá -4895 +rung -4896 +rzen -4897 +síng -4898 +titu -4899 +am -4900 +bb -4901 +ck -4902 +cr -4903 +hs -4904 +hì -4905 +iu -4906 +iý -4907 +lı -4908 +oe -4909 +oâ -4910 +sı -4911 +ts -4912 +íd -4913 +íy -4914 +øð -4915 +ùl -4916 +ăt -4917 +īx -4918 +İn -4919 +γγ -4920 +ολ -4921 +ай -4922 +те -4923 +շո -4924 +גא -4925 +ܣܝ -4926 +ިލ -4927 +ती -4928 +টা -4929 +ত্ -4930 +শি -4931 +সং -4932 +ৃত -4933 +ের -4934 +ಡಿ -4935 +ನ್ -4936 +ศก -4937 +་ཡ -4938 +သမ -4939 +ტუ -4940 +እን -4941 +ᏇᏗ -4942 +ាស -4943 +ルコ -4944 +드부 -4945 +add -4946 +ass -4947 +jap -4948 +kom -4949 +nnà -4950 +osi -4951 +rak -4952 +una -4953 +yac -4954 +óle -4955 +σία -4956 +ылм -4957 +لیز -4958 +ހޫރ -4959 +बाइ -4960 +स्क -4961 +ဝီက -4962 +ိဒိ -4963 +▁ba -4964 +▁du -4965 +▁ენ -4966 +ꙑ́к -4967 +cult -4968 +irki -4969 +keya -4970 +remi 
-4971 +tske -4972 +têba -4973 +túrk -4974 +uacā -4975 +ubli -4976 +ugra -4977 +ulto -4978 +uqad -4979 +uuvi -4980 +uvip -4981 +viqù -4982 +voui -4983 +ybel -4984 +yele -4985 +yese -4986 +ykia -4987 +yklo -4988 +ylle -4989 +ywan -4990 +zano -4991 +zsia -4992 +ãngu -4993 +ädie -4994 +ímọ́ -4995 +íoch -4996 +ísto -4997 +òkùn -4998 +örki -4999 +ao -5000 +dh -5001 +dė -5002 +em -5003 +hr -5004 +hổ -5005 +if -5006 +iv -5007 +ič -5008 +me -5009 +op -5010 +so -5011 +tü -5012 +zt -5013 +ài -5014 +ãn -5015 +èp -5016 +èr -5017 +ëi -5018 +ër -5019 +ïw -5020 +ór -5021 +ўс -5022 +գր -5023 +իք -5024 +סן -5025 +ענ -5026 +ܕܠ -5027 +ލާ -5028 +न् -5029 +्स -5030 +াত -5031 +ુર -5032 +્ત -5033 +సం -5034 +ಭೂ -5035 +කි -5036 +รณ -5037 +รั -5038 +ໍາ -5039 +ར་ -5040 +ვა -5041 +យា -5042 +▁r -5043 +アチ -5044 +des -5045 +ger -5046 +ges -5047 +hen -5048 +ill -5049 +ism -5050 +kio -5051 +kiý -5052 +laí -5053 +lei -5054 +mbu -5055 +rom -5056 +rsz -5057 +tej -5058 +tli -5059 +usp -5060 +zan -5061 +пос -5062 +խար -5063 +געא -5064 +ाना -5065 +▁те -5066 +alis -5067 +daya -5068 +kiya -5069 +mhur -5070 +teer -5071 +xtli -5072 +ùgùk -5073 +ùipé -5074 +ùkòl -5075 +ćina -5076 +čija -5077 +ğraf -5078 +īnas -5079 +İngı -5080 +ırki -5081 +ŋlis -5082 +ōlli -5083 +škas -5084 +ɛkan -5085 +ίνασ -5086 +αίδε -5087 +γλικ -5088 +ικιπ -5089 +ισμό -5090 +κρατ -5091 +ολιτ -5092 +ουργ -5093 +ουρκ -5094 +τισμ -5095 +ύισμ -5096 +азик -5097 +аниц -5098 +блик -5099 +ac -5100 +aw -5101 +eō -5102 +gb -5103 +iŭ -5104 +lẹ -5105 +of -5106 +pn -5107 +ué -5108 +vä -5109 +às -5110 +éd -5111 +òk -5112 +ök -5113 +ēc -5114 +αφ -5115 +σέ -5116 +оф -5117 +նգ -5118 +די -5119 +וי -5120 +ܕܝ -5121 +ުކ -5122 +ޫރ -5123 +़ी -5124 +ाइ -5125 +ोल -5126 +ুর -5127 +ূগ -5128 +யா -5129 +భూ -5130 +്ല -5131 +งก -5132 +ตุ -5133 +วี -5134 +ງກ -5135 +ູມ -5136 +ᑐᑦ -5137 +ូម -5138 +▁τ -5139 +▁д -5140 +テル -5141 +azy -5142 +cha -5143 +ebo -5144 +kaw -5145 +kir -5146 +kye -5147 +lli -5148 +mas -5149 +nia -5150 +ntâ -5151 +tam -5152 +tir -5153 +urq -5154 +ush 
-5155 +zie -5156 +ûng -5157 +имо -5158 +рам -5159 +ски -5160 +քիփ -5161 +އިބ -5162 +टर् -5163 +ᐃᐊᐃ -5164 +▁vl -5165 +heum -5166 +ibul -5167 +turo -5168 +икас -5169 +мова -5170 +нжіл -5171 +публ -5172 +рале -5173 +ятад -5174 +ітай -5175 +անգլ -5176 +աշու -5177 +ետեր -5178 +երեն -5179 +իքիփ -5180 +խարհ -5181 +հագր -5182 +րքիա -5183 +אוגר -5184 +אוטר -5185 +אפיע -5186 +בליה -5187 +בליק -5188 +גלית -5189 +דיסב -5190 +הרפו -5191 +וויק -5192 +ותיו -5193 +לטור -5194 +מותי -5195 +ענגל -5196 +רבות -5197 +שונה -5198 +تاوس -5199 +cã -5200 +fj -5201 +fí -5202 +fø -5203 +gu -5204 +kk -5205 +kn -5206 +ou -5207 +rč -5208 +sj -5209 +sn -5210 +ss -5211 +zn -5212 +éa -5213 +îk -5214 +ót -5215 +ǣd -5216 +ɛk -5217 +αν -5218 +μό -5219 +во -5220 +ակ -5221 +թյ -5222 +אש -5223 +ܦܝ -5224 +ܫܢ -5225 +ުރ -5226 +ूग -5227 +ेज -5228 +બલ -5229 +ાઇ -5230 +டர -5231 +னா -5232 +ிவ -5233 +ಗೋ -5234 +ುಟ -5235 +ാസ -5236 +ཨུ -5237 +ქე -5238 +ቱር -5239 +ᐧᐣ -5240 +ីភ -5241 +▁у -5242 +ェン -5243 +eki -5244 +iaĉ -5245 +irs -5246 +kro -5247 +puv -5248 +rac -5249 +ran -5250 +rkė -5251 +áis -5252 +èpu -5253 +éng -5254 +ćin -5255 +ята -5256 +դիա -5257 +אנג -5258 +چین -5259 +்செ -5260 +უის -5261 +▁bu -5262 +anao -5263 +anza -5264 +iija -5265 +iste -5266 +kito -5267 +lska -5268 +àlia -5269 +γκάν -5270 +ντζα -5271 +եդիա -5272 +טורק -5273 +ثاوس -5274 +كرول -5275 +نگلی -5276 +ىرىن -5277 +يفسك -5278 +يېزى -5279 +یموت -5280 +ܐܝܛܙ -5281 +ܒܫܬܐ -5282 +ܓܠܝܐ -5283 +ܛܘܪܩ -5284 +ރާފީ -5285 +ކީވި -5286 +އިބަ -5287 +ވިލާ -5288 +މްހޫ -5289 +ގިރޭ -5290 +ސަގާ -5291 +ޖުޣު -5292 +उएते -5293 +ऊएते -5294 +किपी -5295 +कृति -5296 +कृती -5297 +बायब -5298 +भूगो -5299 +aa -5300 +ab -5301 +aq -5302 +hy -5303 +iè -5304 +kr -5305 +kỳ -5306 +lò -5307 +pc -5308 +ps -5309 +ræ -5310 +rǣ -5311 +vù -5312 +wî -5313 +yc -5314 +yw -5315 +yy -5316 +zë -5317 +úb -5318 +εω -5319 +μο -5320 +им -5321 +эл -5322 +چی -5323 +बि -5324 +या -5325 +ेर -5326 +কৃ -5327 +ান -5328 +કિ -5329 +શિ -5330 +ીડ -5331 +టర -5332 +ಬಲ -5333 +ിമ -5334 +ชี -5335 +อเ -5336 +ุร -5337 
+ང་ -5338 +်င -5339 +გრ -5340 +იპ -5341 +ውተ -5342 +ᐅᐃ -5343 +ᔭᓯ -5344 +ភី -5345 +▁भ -5346 +テへ -5347 +ディ -5348 +bja -5349 +eny -5350 +hin -5351 +nli -5352 +tör -5353 +uri -5354 +url -5355 +yop -5356 +́gr -5357 +дву -5358 +ված -5359 +زىل -5360 +ܬܐܘ -5361 +ੀਡਿ -5362 +વિક -5363 +ེ་ཡ -5364 +វិទ -5365 +▁os -5366 +▁tü -5367 +anda -5368 +aziý -5369 +cenz -5370 +cija -5371 +endo -5372 +ishe -5373 +kéya -5374 +roch -5375 +tură -5376 +лани -5377 +ܡܬܐܘ -5378 +रझेन -5379 +र्कस -5380 +स्कृ -5381 +ংস্ক -5382 +বাইব -5383 +ভূগো -5384 +্রজা -5385 +ਗ੍ਰੇ -5386 +ੇਜ਼ੀ -5387 +અંગ્ -5388 +બાઇબ -5389 +ભૂગો -5390 +ર્કસ -5391 +ર્સે -5392 +શિયા -5393 +િકિપ -5394 +ஆசிய -5395 +ங்கி -5396 +ர்செ -5397 +விக் -5398 +ியம் -5399 +dn -5400 +ds -5401 +ej -5402 +gl -5403 +gṳ -5404 +kò -5405 +lp -5406 +lɛ -5407 +mí -5408 +nz -5409 +nî -5410 +oþ -5411 +su -5412 +ub -5413 +up -5414 +yb -5415 +yr -5416 +ép -5417 +éy -5418 +ėj -5419 +šk -5420 +нҷ -5421 +ыв -5422 +ած -5423 +շխ -5424 +ւյ -5425 +וב -5426 +קה -5427 +أو -5428 +ܙܢ -5429 +ޣު -5430 +ता -5431 +ना -5432 +शि -5433 +ैव -5434 +ডি -5435 +প্ -5436 +તુ -5437 +રે -5438 +ૂગ -5439 +ண் -5440 +யர -5441 +ில -5442 +າດ -5443 +ད། -5444 +ီပ -5445 +იკ -5446 +ფი -5447 +ᏍᏗ -5448 +ᓯᒧ -5449 +បធ -5450 +ិទ -5451 +▁ঠ -5452 +ウィ -5453 +ブル -5454 +亞細 -5455 +민국 -5456 +아첸 -5457 +ası -5458 +bij -5459 +fræ -5460 +ile -5461 +iog -5462 +ngi -5463 +ngv -5464 +pis -5465 +sik -5466 +γία -5467 +γλώ -5468 +убл -5469 +յու -5470 +ซิน -5471 +▁mw -5472 +biqu -5473 +lich -5474 +lija -5475 +nale -5476 +tali -5477 +πουρ -5478 +σέντ -5479 +ωγρα -5480 +ியிய -5481 +ீடிய -5482 +ுடிய -5483 +ంగ్ల -5484 +త్రమ -5485 +ర్కీ -5486 +వికీ -5487 +సియా -5488 +ಂಗ್ಲ -5489 +ಏಷ್ಯ -5490 +ಿಕಿಪ -5491 +ೆರ್ಸ -5492 +ഗ്ലീ -5493 +തുര് -5494 +ത്തി -5495 +നെഴു -5496 +മോത് -5497 +ശാസ് -5498 +സ്കാ -5499 +gj -5500 +ht -5501 +lé -5502 +lî -5503 +ot -5504 +qi -5505 +sl -5506 +sw -5507 +zí -5508 +àk -5509 +ín -5510 +òg -5511 +ól -5512 +ĉe -5513 +ĭn -5514 +ил -5515 +лн -5516 +գլ -5517 +սե -5518 +אס -5519 +וט -5520 +ול -5521 +יד 
-5522 +ال -5523 +تى -5524 +आं -5525 +ইং -5526 +ંગ -5527 +ેર -5528 +ಾನ -5529 +་ན -5530 +ིན -5531 +ེར -5532 +რგ -5533 +ስያ -5534 +ᐃᑖ -5535 +ᓰᐊ -5536 +ម៌ -5537 +華民 -5538 +𐍂𐌰 -5539 +azë -5540 +bas -5541 +fur -5542 +hia -5543 +ibh -5544 +lag -5545 +lki -5546 +nks -5547 +qur -5548 +taa -5549 +uie -5550 +yte -5551 +ürg -5552 +ōle -5553 +ައި -5554 +ऊएत -5555 +ਾਈਬ -5556 +சிய -5557 +ఎతె -5558 +್ಕಿ -5559 +▁τη -5560 +▁ул -5561 +dəni -5562 +gele -5563 +geog -5564 +haib -5565 +kiye -5566 +leid -5567 +lsko -5568 +sina -5569 +usza -5570 +xina -5571 +ার্স -5572 +ెర్స -5573 +സ്ത് -5574 +ിക്ക -5575 +ുതിയ -5576 +ോസിന -5577 +ස්කෘ -5578 +ියාව -5579 +กิพี -5580 +ณรัฐ -5581 +ทร์เ -5582 +ทวีป -5583 +นธรร -5584 +ภีร์ -5585 +รณรั -5586 +สาธา -5587 +ะเทศ -5588 +ังกฤ -5589 +าอัง -5590 +ุรกี -5591 +เชีย -5592 +เดีย -5593 +แทร์ -5594 +ງກິດ -5595 +ພາສາ -5596 +ສາອັ -5597 +ະຄໍາ -5598 +ູມສາ -5599 +aj -5600 +ef -5601 +gd -5602 +ic -5603 +ij -5604 +iế -5605 +kw -5606 +lg -5607 +mh -5608 +my -5609 +ns -5610 +nç -5611 +on -5612 +pî -5613 +zs -5614 +ág -5615 +íà -5616 +ðv -5617 +ýw -5618 +ĉi -5619 +źe -5620 +άν -5621 +πα -5622 +ώσ -5623 +ки -5624 +ја -5625 +إن -5626 +تي -5627 +ܐܘ -5628 +ܟܬ -5629 +ައ -5630 +ީވ -5631 +गण -5632 +উই -5633 +সে -5634 +ન્ -5635 +િપ -5636 +னக -5637 +யல -5638 +್ಯ -5639 +แท -5640 +ბლ -5641 +გე -5642 +ივ -5643 +ლი -5644 +መጽ -5645 +ጽሐ -5646 +ᎦᏬ -5647 +ិគ -5648 +dor -5649 +edė -5650 +esn -5651 +hem -5652 +ijô -5653 +imi -5654 +inj -5655 +lef -5656 +lla -5657 +lum -5658 +mox -5659 +rle -5660 +teü -5661 +ure -5662 +âia -5663 +ŋli -5664 +́sa -5665 +որտ -5666 +يفس -5667 +ܩܕܝ -5668 +ܩܝܦ -5669 +ంగ్ -5670 +▁họ -5671 +▁sj -5672 +▁تي -5673 +abli -5674 +ange -5675 +ddes -5676 +engl -5677 +imin -5678 +insa -5679 +kina -5680 +liya -5681 +ters -5682 +ęzyk -5683 +οκρα -5684 +πιατ -5685 +євск -5686 +་སེན -5687 +་སྐད -5688 +ུ་ཏེ -5689 +ུར་ཀ -5690 +ེ་ཡ། -5691 +ပထဝီ -5692 +ဝီကီ -5693 +ိဒိယ -5694 +ုင်င -5695 +ბურგ -5696 +ეოგრ -5697 +ეტერ -5698 +ზენი -5699 +av -5700 +aĉ -5701 +cs -5702 +dw -5703 +gļ 
-5704 +ii -5705 +lý -5706 +mə -5707 +pt -5708 +rl -5709 +rí -5710 +sk -5711 +tq -5712 +uk -5713 +æð -5714 +ûn -5715 +πι -5716 +ρκ -5717 +φί -5718 +ωγ -5719 +իփ -5720 +վի -5721 +ւթ -5722 +תר -5723 +كر -5724 +ܠܫ -5725 +ܪܩ -5726 +ިބ -5727 +्र -5728 +ଯା -5729 +ுவ -5730 +భా -5731 +ಏಷ -5732 +ພະ -5733 +གས -5734 +ཏུ -5735 +ི་ -5736 +အာ -5737 +ာရ -5738 +ნი -5739 +ᐃᐧ -5740 +ᓇᐅ -5741 +ᓪᓗ -5742 +ᨋᨗ -5743 +시아 -5744 +𐌰𐌲 -5745 +𐌲𐌲 -5746 +chí -5747 +gal -5748 +han -5749 +hiè -5750 +ipe -5751 +jač -5752 +kun -5753 +lek -5754 +lex -5755 +uln -5756 +wel -5757 +àna -5758 +ère -5759 +ēce -5760 +גאנ -5761 +גלי -5762 +ܘܪܩ -5763 +ަތު -5764 +ಡಿಯ -5765 +ഏഷ് -5766 +්කෘ -5767 +ར་ཀ -5768 +▁ਭਾ -5769 +アチェ -5770 +스부르 -5771 +arko -5772 +ddia -5773 +elek -5774 +ibha -5775 +ilis -5776 +ingi -5777 +phie -5778 +plac -5779 +rist -5780 +ultu -5781 +wski -5782 +స్కృ -5783 +ისურ -5784 +ლტურ -5785 +ქეთი -5786 +ንግሊዝ -5787 +ኢውተዘ -5788 +ᏂᎯᏍᏗ -5789 +ᓯᒧᐃᐧ -5790 +ᖃᓪᓗᓈ -5791 +ប្បធ -5792 +មិវិ -5793 +ាស៊ី -5794 +ᨕᨗᨋᨗ -5795 +ẹ́ọ́ -5796 +ọ́gr -5797 +‍ക്ക -5798 +▁bik -5799 +au -5800 +ač -5801 +bm -5802 +dī -5803 +eu -5804 +gn -5805 +há -5806 +hû -5807 +ir -5808 +iw -5809 +ky -5810 +lv -5811 +no -5812 +ny -5813 +om -5814 +pj -5815 +pē -5816 +sp -5817 +sy -5818 +tr -5819 +uh -5820 +yt -5821 +zy -5822 +àṣ -5823 +āf -5824 +īb -5825 +ız -5826 +ɛ́ -5827 +κρ -5828 +вс -5829 +ит -5830 +ов -5831 +הס -5832 +ונ -5833 +פע -5834 +שו -5835 +سك -5836 +ܦܕ -5837 +ފަ -5838 +से -5839 +িপ -5840 +ੇਜ -5841 +ஆங -5842 +తె -5843 +సె -5844 +ിയ -5845 +്ത -5846 +്‍ -5847 +าอ -5848 +ຊີ -5849 +າຊ -5850 +ཤེ -5851 +ိဒ -5852 +ទ្ -5853 +ṳn -5854 +ụs -5855 +▁в -5856 +▁и -5857 +▁ভ -5858 +の手 -5859 +アジ -5860 +モテ -5861 +키백 -5862 +eel -5863 +esa -5864 +hlo -5865 +iai -5866 +ish -5867 +lta -5868 +nin -5869 +pig -5870 +pro -5871 +ttu -5872 +épu -5873 +úrk -5874 +есп -5875 +ރޭސ -5876 +ോസി -5877 +სურ -5878 +▁kr -5879 +への手 -5880 +chhi -5881 +chin -5882 +gaas -5883 +hebe -5884 +inio -5885 +lagá -5886 +mote -5887 +piya -5888 +tang -5889 +▁chi -5890 +▁dul -5891 
+▁eho -5892 +▁eks -5893 +▁epí -5894 +▁god -5895 +▁gẹ̀ -5896 +▁jes -5897 +▁jez -5898 +▁joh -5899 +bj -5900 +ec -5901 +lų -5902 +mt -5903 +nä -5904 +nī -5905 +ox -5906 +ķī -5907 +сӣ -5908 +ти -5909 +ял -5910 +կո -5911 +פד -5912 +רז -5913 +ܝܒ -5914 +ܝܫ -5915 +आश -5916 +कस -5917 +बल -5918 +यु -5919 +रझ -5920 +िश -5921 +ुट -5922 +्ल -5923 +ଏତ -5924 +பண -5925 +ംഗ -5926 +ംസ -5927 +ഏഷ -5928 +കി -5929 +ഡി -5930 +നെ -5931 +ബി -5932 +ബൈ -5933 +മോ -5934 +യോ -5935 +ലീ -5936 +സം -5937 +ാഷ -5938 +ിക -5939 +ിന -5940 +ിപ -5941 +ിശ -5942 +ീഡ -5943 +ീഷ -5944 +ുത -5945 +ുര -5946 +ൈബ -5947 +ോസ -5948 +്യ -5949 +്ര -5950 +มิ -5951 +ິດ -5952 +ထဝ -5953 +နိ -5954 +မတ -5955 +ရု -5956 +ီဝ -5957 +ုတ -5958 +္မ -5959 +်သ -5960 +ტე -5961 +‍ക -5962 +오에 -5963 +aed -5964 +aya -5965 +hua -5966 +imó -5967 +iqu -5968 +jak -5969 +man -5970 +nat -5971 +ngɛ -5972 +rog -5973 +spu -5974 +uir -5975 +íob -5976 +òkù -5977 +יגר -5978 +ܪܬܐ -5979 +ງກິ -5980 +▁bc -5981 +▁ex -5982 +into -5983 +tūra -5984 +ليزي -5985 +अंग् -5986 +टर्क -5987 +ກິພີ -5988 +ክፔዲያ -5989 +▁kal -5990 +▁kro -5991 +▁lis -5992 +▁mab -5993 +▁mep -5994 +▁naa -5995 +▁oma -5996 +▁omu -5997 +▁omw -5998 +▁ota -5999 +aŋ -6000 +cj -6001 +cê -6002 +cā -6003 +gg -6004 +gv -6005 +ih -6006 +jk -6007 +lq -6008 +lw -6009 +lë -6010 +ló -6011 +mọ -6012 +nr -6013 +ov -6014 +pd -6015 +pā -6016 +rt -6017 +ră -6018 +tâ -6019 +tō -6020 +uố -6021 +yu -6022 +yv -6023 +íí -6024 +îr -6025 +óm -6026 +āh -6027 +ām -6028 +ān -6029 +ŋg -6030 +γί -6031 +υρ -6032 +φή -6033 +ני -6034 +خە -6035 +لي -6036 +نچ -6037 +यः -6038 +ਕਿ -6039 +ਗ੍ -6040 +ਪੀ -6041 +ਬਾ -6042 +ਵਿ -6043 +ਾਈ -6044 +ਿਆ -6045 +ਿਪ -6046 +ੀਡ -6047 +੍ਰ -6048 +ଊଏ -6049 +ସି -6050 +ସେ -6051 +େନ -6052 +୍ସ -6053 +ආස -6054 +ිය -6055 +ฤษ -6056 +ငံ -6057 +ပိ -6058 +ိယ -6059 +ီက -6060 +ზე -6061 +ተዘ -6062 +ᎩᎵ -6063 +ᐅᔪ -6064 +▁q -6065 +idu -6066 +iik -6067 +lac -6068 +liq -6069 +oma -6070 +ors -6071 +tin -6072 +uur -6073 +yel -6074 +мот -6075 +ܐܓܪ -6076 +ూగో -6077 +ုင် -6078 +▁dâ -6079 +lani -6080 +ltur -6081 +piaĉ -6082 
+tudo -6083 +türg -6084 +zija -6085 +▁pet -6086 +▁pok -6087 +▁pos -6088 +▁say -6089 +▁sin -6090 +▁tav -6091 +▁thá -6092 +▁tus -6093 +▁vaa -6094 +▁ved -6095 +▁yaz -6096 +▁épî -6097 +▁γλώ -6098 +▁γρα -6099 +eò -6100 +kä -6101 +mp -6102 +pe -6103 +pí -6104 +rå -6105 +sq -6106 +sv -6107 +yj -6108 +yp -6109 +ím -6110 +či -6111 +ōā -6112 +šč -6113 +אפ -6114 +הא -6115 +ער -6116 +صي -6117 +ަތ -6118 +கா -6119 +சி -6120 +சீ -6121 +செ -6122 +டி -6123 +ட் -6124 +பா -6125 +பு -6126 +ப் -6127 +யம -6128 +ரு -6129 +லம -6130 +லி -6131 +ல் -6132 +ுக -6133 +ுர -6134 +ூட -6135 +ென -6136 +்க -6137 +்ட -6138 +ంస -6139 +ఊఎ -6140 +గో -6141 +గ్ -6142 +డి -6143 +నా -6144 +బి -6145 +ిల -6146 +ీడ -6147 +ీప -6148 +ృత -6149 +ెన -6150 +ెర -6151 +ైబ -6152 +్క -6153 +్స -6154 +කෘ -6155 +ගෝ -6156 +භූ -6157 +සං -6158 +ස් -6159 +ාව -6160 +ික -6161 +ීඩ -6162 +ෘත -6163 +ෝල -6164 +▁е -6165 +▁ਭ -6166 +ーナ -6167 +enç -6168 +heu -6169 +kii -6170 +oay -6171 +rmy -6172 +sim -6173 +uog -6174 +лан -6175 +ורג -6176 +ウィキ -6177 +eers -6178 +jako -6179 +lske -6180 +ltür -6181 +luna -6182 +oloč -6183 +rsta -6184 +sbur -6185 +toko -6186 +ание -6187 +▁enc -6188 +▁pav -6189 +▁дво -6190 +▁дву -6191 +▁инҷ -6192 +▁йыл -6193 +▁кин -6194 +▁пос -6195 +▁рес -6196 +▁тел -6197 +▁тил -6198 +▁тыл -6199 +aý -6200 +eü -6201 +hâ -6202 +hé -6203 +iə -6204 +kl -6205 +lä -6206 +nv -6207 +rv -6208 +tk -6209 +tn -6210 +uz -6211 +vh -6212 +wr -6213 +xi -6214 +ym -6215 +ül -6216 +ăn -6217 +το -6218 +ез -6219 +ис -6220 +ән -6221 +كي -6222 +की -6223 +ति -6224 +తి -6225 +กี -6226 +คั -6227 +ฐจ -6228 +ฒน -6229 +ดี -6230 +ตร -6231 +ธา -6232 +นธ -6233 +นา -6234 +บิ -6235 +บเ -6236 +ปเ -6237 +พี -6238 +ภา -6239 +มภ -6240 +รม -6241 +ระ -6242 +วั -6243 +สต -6244 +อั -6245 +ัฐ -6246 +ัม -6247 +าธ -6248 +าส -6249 +ิก -6250 +ิพ -6251 +ิศ -6252 +ีป -6253 +ือ -6254 +เช -6255 +เซ -6256 +เด -6257 +เท -6258 +เบ -6259 +์เ -6260 +์ไ -6261 +ပထ -6262 +სუ -6263 +ịa -6264 +▁드 -6265 +bae -6266 +fia -6267 +mua -6268 +psa -6269 +sie -6270 +கிப -6271 +ப்ப -6272 +ಟರ್ 
-6273 +นธร -6274 +ພະຄ -6275 +་ན། -6276 +cent -6277 +iais -6278 +rach -6279 +taly -6280 +▁ist -6281 +▁kii -6282 +▁liv -6283 +▁тіл -6284 +▁הרא -6285 +▁الأ -6286 +▁الر -6287 +▁الم -6288 +▁بول -6289 +▁زبو -6290 +▁ܕܠܘ -6291 +▁ܩܕܡ -6292 +▁गणर -6293 +▁ਭਾਸ -6294 +▁ભાષ -6295 +▁විද -6296 +への手紙 -6297 +アチェン -6298 +ウエテル -6299 +ap -6300 +a̍ -6301 +bë -6302 +cb -6303 +dü -6304 +ey -6305 +fe -6306 +jẹ -6307 +kì -6308 +lì -6309 +xh -6310 +ái -6311 +éh -6312 +él -6313 +íl -6314 +ît -6315 +ük -6316 +̀- -6317 +́g -6318 +ίν -6319 +ησ -6320 +σα -6321 +гл -6322 +жә -6323 +зи -6324 +нг -6325 +ҷи -6326 +יו -6327 +יט -6328 +יש -6329 +עד -6330 +רא -6331 +رو -6332 +زی -6333 +سی -6334 +ܢܓ -6335 +ंस -6336 +वा -6337 +તે -6338 +ିଯ -6339 +ోళ -6340 +ಟೆ -6341 +್ರ -6342 +ശാ -6343 +ཀི -6344 +▁î -6345 +▁γ -6346 +▁κ -6347 +▁а -6348 +▁ж -6349 +▁й -6350 +中华 -6351 +于特 -6352 +华民 -6353 +基大 -6354 +大典 -6355 +摩太 -6356 +斯堡 -6357 +杜伊 -6358 +民國 -6359 +細亞 -6360 +all -6361 +aro -6362 +iis -6363 +inz -6364 +jks -6365 +quí -6366 +rei -6367 +rje -6368 +rta -6369 +taý -6370 +ztu -6371 +çhe -6372 +ían -6373 +ísi -6374 +ürk -6375 +āke -6376 +త్ర -6377 +ພີເ -6378 +ᓯᒧᐃ -6379 +▁ai -6380 +igaz -6381 +immä -6382 +koma -6383 +piac -6384 +yang -6385 +änap -6386 +ńska -6387 +ඩියා -6388 +チェンツ -6389 +ꙁꙑ́к -6390 +뒤스부르 -6391 +angal -6392 +angan -6393 +angie -6394 +angil -6395 +angli -6396 +anglè -6397 +anglé -6398 +anglë -6399 +ex -6400 +hl -6401 +zg -6402 +īƿ -6403 +ад -6404 +ал -6405 +ам -6406 +ас -6407 +ат -6408 +вы -6409 +ев -6410 +ен -6411 +ею -6412 +еў -6413 +жі -6414 +зы -6415 +ие -6416 +иц -6417 +йр -6418 +йы -6419 +ка -6420 +ке -6421 +кр -6422 +кы -6423 +кя -6424 +лі -6425 +мд -6426 +ме -6427 +нж -6428 +ни -6429 +ол -6430 +от -6431 +по -6432 +ре -6433 +рз -6434 +ро -6435 +сп -6436 +тт -6437 +ты -6438 +хэ -6439 +ык -6440 +эр -6441 +ют -6442 +єв -6443 +ін -6444 +іт -6445 +је -6446 +ју -6447 +ѩꙁ -6448 +ימ -6449 +מו -6450 +תי -6451 +ئی -6452 +چى -6453 +डि -6454 +సి -6455 +ඩි -6456 +▁н -6457 +▁я -6458 +▁ѩ -6459 +▁א -6460 +▁ט 
-6461 +ꙁꙑ -6462 +ain -6463 +ark -6464 +gra -6465 +gul -6466 +ibl -6467 +ite -6468 +lik -6469 +loč -6470 +our -6471 +udo -6472 +ικι -6473 +يزي -6474 +กาน -6475 +ກິພ -6476 +acen -6477 +bbli -6478 +hina -6479 +inan -6480 +isin -6481 +jvan -6482 +lski -6483 +seki -6484 +ଏତେର -6485 +ಬೈಬಲ -6486 +မ္မတ -6487 +▁кял -6488 +angua -6489 +ardri -6490 +ariik -6491 +arran -6492 +asenç -6493 +atiid -6494 +atitu -6495 +baibu -6496 +bibul -6497 +blėjė -6498 +burgo -6499 +nk -6500 +nn -6501 +nw -6502 +րե -6503 +إل -6504 +بو -6505 +بى -6506 +تا -6507 +ثا -6508 +جل -6509 +زب -6510 +زي -6511 +سو -6512 +غا -6513 +لأ -6514 +لر -6515 +لس -6516 +نگ -6517 +وت -6518 +وث -6519 +ور -6520 +وس -6521 +ىر -6522 +ىل -6523 +ىن -6524 +يف -6525 +يم -6526 +گل -6527 +یس -6528 +یم -6529 +ەت -6530 +ܐܓ -6531 +ܕܡ -6532 +ܛܝ -6533 +ܝܡ -6534 +ܝܬ -6535 +ܡܝ -6536 +ܡܬ -6537 +ޗަ -6538 +ިނ -6539 +रा -6540 +ाज -6541 +ाय -6542 +्य -6543 +ষা -6544 +ীন -6545 +ਬਲ -6546 +ਸ਼ -6547 +਼ਾ -6548 +ਾਸ -6549 +ભા -6550 +વિ -6551 +ાષ -6552 +ரச -6553 +ுட -6554 +ియ -6555 +್ಸ -6556 +ිප -6557 +กา -6558 +จี -6559 +ུ་ -6560 +აფ -6561 +ედ -6562 +ᐊᑲ -6563 +ᓗᓈ -6564 +▁د -6565 +▁ي -6566 +▁ܕ -6567 +▁ग -6568 +▁চ -6569 +▁க -6570 +▁ව -6571 +セン -6572 +於特 -6573 +adu -6574 +bia -6575 +ghv -6576 +ipǣ -6577 +kki -6578 +rim -6579 +tuy -6580 +áan -6581 +ރާފ -6582 +ზია -6583 +ისბ -6584 +▁re -6585 +biib -6586 +elen -6587 +keye -6588 +tina -6589 +ürch -6590 +איגר -6591 +▁has -6592 +burgu -6593 +bìoba -6594 +cheas -6595 +culto -6596 +dafræ -6597 +dafrø -6598 +diiya -6599 +mw -6600 +mó -6601 +mā -6602 +nê -6603 +vl -6604 +áf -6605 +áí -6606 +çe -6607 +āy -6608 +λώ -6609 +րտ -6610 +ינ -6611 +ܬܒ -6612 +োল -6613 +ఆం -6614 +ము -6615 +్త -6616 +్ర -6617 +ಶಾ -6618 +ಸ್ -6619 +ೋಳ -6620 +್ತ -6621 +ഒന -6622 +േഖ -6623 +്ന -6624 +ද් -6625 +ูม -6626 +უე -6627 +ቅዱ -6628 +ዝኛ -6629 +ዱስ -6630 +ᨗᨋ -6631 +‍ය -6632 +▁భ -6633 +▁శ -6634 +▁ഒ -6635 +▁ე -6636 +▁ᨕ -6637 +▁ṣ -6638 +臺灣 -6639 +룰레 -6640 +보낸 -6641 +크룰 -6642 +aan -6643 +aip -6644 +ais -6645 +ale -6646 +anb -6647 +anc -6648 
+anh -6649 +ann -6650 +ant -6651 +dan -6652 +dun -6653 +gan -6654 +iac -6655 +ian -6656 +ias -6657 +iaç -6658 +kia -6659 +kip -6660 +koe -6661 +ngl -6662 +odu -6663 +oul -6664 +raj -6665 +rch -6666 +san -6667 +síà -6668 +tee -6669 +uia -6670 +ult -6671 +yia -6672 +zia -6673 +çan -6674 +ğan -6675 +дво -6676 +ާނާ -6677 +ার্ -6678 +ംഗ് -6679 +ဝီဝ -6680 +▁nr -6681 +bibe -6682 +bilo -6683 +reza -6684 +rine -6685 +rmas -6686 +turq -6687 +yaco -6688 +áínà -6689 +કસ્ત -6690 +ბლია -6691 +▁non -6692 +▁vab -6693 +domán -6694 +doryd -6695 +däniä -6696 +dəniy -6697 +eaght -6698 +earyd -6699 +ax -6700 +ay -6701 +fì -6702 +pé -6703 +ér -6704 +ûl -6705 +յթ -6706 +رس -6707 +పీ -6708 +ిక -6709 +ോത -6710 +ဝင -6711 +ᓯᓐ -6712 +anä -6713 +ate -6714 +ban -6715 +cch -6716 +chh -6717 +daf -6718 +ena -6719 +end -6720 +enn -6721 +enz -6722 +epa -6723 +ete -6724 +eur -6725 +fan -6726 +gen -6727 +int -6728 +ipè -6729 +jin -6730 +kie -6731 +kij -6732 +kik -6733 +kit -6734 +kiy -6735 +kur -6736 +lin -6737 +ote -6738 +pen -6739 +sal -6740 +tei -6741 +ter -6742 +tes -6743 +tet -6744 +teu -6745 +toh -6746 +tte -6747 +uen -6748 +urc -6749 +urk -6750 +urs -6751 +uru -6752 +urč -6753 +yan -6754 +zen -6755 +zki -6756 +çen -6757 +óte -6758 +örs -6759 +ørs -6760 +üte -6761 +īte -6762 +źel -6763 +ליק -6764 +פיה -6765 +ूगो -6766 +รณร -6767 +ສາດ -6768 +▁sr -6769 +▁ue -6770 +anen -6771 +ania -6772 +arle -6773 +blic -6774 +blie -6775 +diae -6776 +hasa -6777 +heri -6778 +kalu -6779 +taih -6780 +taiv -6781 +ttur -6782 +uete -6783 +urki -6784 +asije -6785 +eaŋga -6786 +edija -6787 +encia -6788 +endom -6789 +enkur -6790 +enské -6791 +eogra -6792 +epist -6793 +eyele -6794 +fiten -6795 +funga -6796 +galsg -6797 +geles -6798 +ghane -6799 +eg -6800 +nd -6801 +oo -6802 +zc -6803 +́к -6804 +լե -6805 +ܪܦ -6806 +ঠা -6807 +એત -6808 +ீன -6809 +ലേ -6810 +ංස -6811 +อแ -6812 +ერ -6813 +ンツ -6814 +aas -6815 +ali -6816 +ash -6817 +asj -6818 +asy -6819 +asụ -6820 +bik -6821 +cin -6822 +das -6823 +edi -6824 +edj -6825 +elš -6826 
+gas -6827 +has -6828 +ied -6829 +imb -6830 +ine -6831 +ini -6832 +las -6833 +led -6834 +lic -6835 +lie -6836 +lij -6837 +lio -6838 +liv -6839 +liy -6840 +lič -6841 +mbi -6842 +qan -6843 +ras -6844 +sas -6845 +yus -6846 +èis -6847 +îro -6848 +ňli -6849 +ντο -6850 +ியம -6851 +ლის -6852 +▁nh -6853 +▁ns -6854 +ディア -6855 +anas -6856 +ebel -6857 +enas -6858 +ensk -6859 +iina -6860 +ikii -6861 +inen -6862 +isan -6863 +kors -6864 +turs -6865 +turt -6866 +turu -6867 +uiki -6868 +ulun -6869 +vich -6870 +viki -6871 +viqu -6872 +ðvel -6873 +יקיפ -6874 +ुर्क -6875 +ડિયા -6876 +კიპე -6877 +▁acc -6878 +▁cin -6879 +▁tab -6880 +▁tas -6881 +▁язы -6882 +ペディア -6883 +提摩太前 -6884 +ebele -6885 +glish -6886 +gsasa -6887 +gàsàn -6888 +gùkòl -6889 +hayib -6890 +hicip -6891 +iacen -6892 +icipǣ -6893 +iella -6894 +ikang -6895 +ikipa -6896 +ikipe -6897 +ikipi -6898 +ikipè -6899 +cn -6900 +js -6901 +âi -6902 +сл -6903 +रे -6904 +જી -6905 +ഭൂ -6906 +ാന -6907 +รร -6908 +ອັ -6909 +ეო -6910 +რი -6911 +ᐱᑎ -6912 +age -6913 +ara -6914 +biw -6915 +ble -6916 +chr -6917 +din -6918 +dra -6919 +ein -6920 +era -6921 +gis -6922 +gle -6923 +hui -6924 +ibu -6925 +iin -6926 +ili -6927 +kra -6928 +lee -6929 +lel -6930 +ler -6931 +lez -6932 +leš -6933 +lle -6934 +ole -6935 +orí -6936 +pag -6937 +phy -6938 +pin -6939 +raa -6940 +rah -6941 +rap -6942 +raz -6943 +ree -6944 +ule -6945 +uuv -6946 +xin -6947 +ána -6948 +äin -6949 +éra -6950 +ëin -6951 +čin -6952 +ğra -6953 +րու -6954 +יפע -6955 +अंग -6956 +ဝင် -6957 +ិវិ -6958 +▁cd -6959 +▁cu -6960 +▁ds -6961 +▁ho -6962 +▁hs -6963 +▁pe -6964 +▁tu -6965 +▁ze -6966 +bang -6967 +gàsà -6968 +hana -6969 +iein -6970 +inez -6971 +inia -6972 +kera -6973 +laht -6974 +ngah -6975 +salo -6976 +türk -6977 +wana -6978 +áana -6979 +ভাষা -6980 +്നാം -6981 +▁dan -6982 +▁din -6983 +gelsk -6984 +ikipē -6985 +illqa -6986 +inghy -6987 +ingle -6988 +inglè -6989 +inglê -6990 +inglü -6991 +ingre -6992 +iogra -6993 +isbur -6994 +ishey -6995 +itaat -6996 +iterz -6997 +ińska -6998 +juqur -6999 
+dk -7000 +dv -7001 +gı -7002 +jv -7003 +lb -7004 +qù -7005 +äd -7006 +ла -7007 +קי -7008 +सि -7009 +யி -7010 +ೈಬ -7011 +්‍ -7012 +ිද -7013 +སྐ -7014 +ᓈᑎ -7015 +琴察 -7016 +브스 -7017 +aba -7018 +ach -7019 +agh -7020 +bii -7021 +bil -7022 +biv -7023 +bor -7024 +bul -7025 +cun -7026 +eog -7027 +gah -7028 +gam -7029 +gar -7030 +gge -7031 +ghe -7032 +ibi -7033 +iga -7034 +lim -7035 +nge -7036 +ngh -7037 +och -7038 +ogr -7039 +ria -7040 +sto -7041 +tab -7042 +the -7043 +wog -7044 +yog -7045 +zge -7046 +ãng -7047 +éog -7048 +ómì -7049 +ŋga -7050 +รกี -7051 +ཇིའ -7052 +ვიკ -7053 +▁bp -7054 +▁bí -7055 +▁bî -7056 +▁eh -7057 +▁ek -7058 +▁il -7059 +▁in -7060 +▁lu -7061 +▁me -7062 +▁mu -7063 +▁pd -7064 +▁pm -7065 +▁pn -7066 +▁pî -7067 +bibi -7068 +enge -7069 +engh -7070 +fiya -7071 +gaed -7072 +gaha -7073 +ghan -7074 +▁ -7075 +a -7076 +i -7077 +: -7078 +e -7079 +n -7080 +s -7081 +r -7082 +t -7083 +u -7084 +l -7085 +g -7086 +k -7087 +o -7088 +p -7089 +b -7090 +h -7091 +d -7092 +w -7093 +c -7094 +m -7095 +y -7096 +z -7097 +f -7098 +v -7099 +j -7100 +- -7101 +ó -7102 +q -7103 +í -7104 +é -7105 +י -7106 +ा -7107 +ि -7108 +α -7109 +а -7110 +. 
-7111 +к -7112 +á -7113 +л -7114 +् -7115 +è -7116 +ü -7117 +( -7118 +) -7119 +x -7120 +и -7121 +е -7122 +ܐ -7123 +т -7124 +ل -7125 +त -7126 +া -7127 +ი -7128 +à -7129 +î -7130 +ա -7131 +ו -7132 +य -7133 +न -7134 +े -7135 +ì -7136 +ה -7137 +क -7138 +र -7139 +स -7140 +ä -7141 +н -7142 +о -7143 +א -7144 +ר -7145 +ܝ -7146 +ি -7147 +ி -7148 +் -7149 +ა -7150 +ė -7151 +ί -7152 +γ -7153 +ι -7154 +в -7155 +р -7156 +с -7157 +و -7158 +ी -7159 +് -7160 +ā -7161 +σ -7162 +ا -7163 +ल -7164 +ി -7165 +ร -7166 +κ -7167 +ג -7168 +ע -7169 +س -7170 +ن -7171 +ए -7172 +ग -7173 +র -7174 +་ -7175 +1 -7176 +ë -7177 +ú -7178 +τ -7179 +м -7180 +ى -7181 +ی -7182 +भ -7183 +व -7184 +ા -7185 +க -7186 +్ -7187 +್ -7188 +' -7189 +ù -7190 +ý -7191 +ī -7192 +ρ -7193 +у -7194 +і -7195 +ո -7196 +ב -7197 +ל -7198 +ק -7199 +ي -7200 +ে -7201 +্ -7202 +ய -7203 +า -7204 +ี -7205 +เ -7206 +ე -7207 +უ -7208 +/ -7209 +â -7210 +ç -7211 +ô -7212 +ă -7213 +č -7214 +İ -7215 +́ -7216 +ν -7217 +ο -7218 +ы -7219 +ե -7220 +ն -7221 +ր -7222 +ւ -7223 +ט -7224 +פ -7225 +ת -7226 +ܬ -7227 +ާ -7228 +ި -7229 +ު -7230 +ं -7231 +ब -7232 +ই -7233 +ন -7234 +স -7235 +ு -7236 +ം -7237 +ത -7238 +ි -7239 +რ -7240 +英 -7241 +𐌰 -7242 +, -7243 +ê -7244 +ö -7245 +û -7246 +ı -7247 +ō -7248 +ə -7249 +π -7250 +д -7251 +з -7252 +й -7253 +п -7254 +ի -7255 +נ -7256 +ס -7257 +ت -7258 +م -7259 +ܘ -7260 +ަ -7261 +ड -7262 +प -7263 +श -7264 +ु -7265 +ू -7266 +ो -7267 +ক -7268 +ত -7269 +য -7270 +় -7271 +્ -7272 +ட -7273 +ப -7274 +ா -7275 +స -7276 +ా -7277 +ి -7278 +ക -7279 +ന -7280 +ാ -7281 +ิ -7282 +ພ -7283 +າ -7284 +། -7285 +გ -7286 +ნ -7287 +ᐊ -7288 +ẹ -7289 +ọ -7290 +中 -7291 +其 -7292 +土 -7293 +基 -7294 +民 -7295 +洲 -7296 +耳 -7297 +2 -7298 +ð -7299 +š -7300 +ū -7301 +ε -7302 +ד -7303 +ب -7304 +د -7305 +ر -7306 +ز -7307 +ܓ -7308 +ܕ -7309 +ܩ -7310 +ރ -7311 +ऊ -7312 +উ -7313 +ব -7314 +ল -7315 +િ -7316 +ச -7317 +வ -7318 +ಾ -7319 +ಿ -7320 +യ -7321 +സ -7322 +ය -7323 +ව -7324 +ก -7325 +ท -7326 +น -7327 +อ -7328 +ั -7329 +ི -7330 +ེ -7331 +ᐃ -7332 +– 
-7333 +ア -7334 +亞 -7335 +地 -7336 +文 -7337 +理 -7338 +百 -7339 +科 -7340 +聖 -7341 +華 -7342 +ã -7343 +ò -7344 +ē -7345 +ğ -7346 +ɛ -7347 +̀ -7348 +δ -7349 +λ -7350 +μ -7351 +υ -7352 +φ -7353 +б -7354 +х -7355 +я -7356 +ј -7357 +գ -7358 +թ -7359 +շ -7360 +ս -7361 +տ -7362 +ق -7363 +چ -7364 +ܛ -7365 +ܠ -7366 +ܢ -7367 +ܪ -7368 +ܫ -7369 +ނ -7370 +އ -7371 +ތ -7372 +ގ -7373 +ީ -7374 +ް -7375 +इ -7376 +घ -7377 +ष -7378 +ृ -7379 +ै -7380 +ং -7381 +এ -7382 +জ -7383 +প -7384 +ভ -7385 +ਾ -7386 +ਿ -7387 +ત -7388 +ન -7389 +ર -7390 +ે -7391 +ன -7392 +ர -7393 +ல -7394 +క -7395 +త -7396 +ర -7397 +ీ -7398 +ಯ -7399 +ರ -7400 +ര -7401 +ഷ -7402 +ස -7403 +ා -7404 +ป -7405 +ภ -7406 +ม -7407 +ว -7408 +ศ -7409 +์ -7410 +ດ -7411 +ິ -7412 +ີ -7413 +ན -7414 +ར -7415 +ས -7416 +ུ -7417 +င -7418 +တ -7419 +ဝ -7420 +ိ -7421 +ီ -7422 +် -7423 +ბ -7424 +ლ -7425 +ტ -7426 +វ -7427 +ា -7428 +ិ -7429 +ី -7430 +ᨗ -7431 +ụ -7432 +テ -7433 +ル -7434 +亚 -7435 +化 -7436 +國 -7437 +經 -7438 +維 -7439 +語 -7440 +아 -7441 +키 -7442 +· -7443 +ø -7444 +ć -7445 +ĉ -7446 +ę -7447 +ĭ -7448 +ń -7449 +ŋ -7450 +ƿ -7451 +ǣ -7452 +ή -7453 +β -7454 +η -7455 +ω -7456 +ж -7457 +ь -7458 +э -7459 +ю -7460 +լ -7461 +յ -7462 +վ -7463 +ք -7464 +ז -7465 +ן -7466 +ש -7467 +إ -7468 +ف -7469 +ك -7470 +گ -7471 +ܒ -7472 +ܡ -7473 +ܣ -7474 +ܦ -7475 +ބ -7476 +ފ -7477 +ލ -7478 +ސ -7479 +ޖ -7480 +आ -7481 +ज -7482 +ट -7483 +গ -7484 +ঘ -7485 +ট -7486 +ড -7487 +শ -7488 +ী -7489 +ু -7490 +ূ -7491 +ো -7492 +ਬ -7493 +਼ -7494 +ੀ -7495 +એ -7496 +ક -7497 +ગ -7498 +બ -7499 +ભ -7500 +ય -7501 +સ -7502 +ી -7503 +ଏ -7504 +ସ -7505 +େ -7506 +୍ -7507 +ஆ -7508 +ம -7509 +ீ -7510 +ం -7511 +ఆ -7512 +గ -7513 +న -7514 +బ -7515 +భ -7516 +య -7517 +ల -7518 +ె -7519 +ಕ -7520 +ಗ -7521 +ಟ -7522 +ನ -7523 +ಬ -7524 +ಲ -7525 +ಸ -7526 +ೆ -7527 +ബ -7528 +ഭ -7529 +മ -7530 +ല -7531 +ീ -7532 +ു -7533 +ോ -7534 +ක -7535 +් -7536 +ต -7537 +ธ -7538 +บ -7539 +ย -7540 +ษ -7541 +ส -7542 +ะ -7543 +ກ -7544 +ສ -7545 +ອ -7546 +ཀ -7547 +ཏ -7548 +ད -7549 +བ -7550 +ཨ -7551 +ပ -7552 +မ -7553 +ရ -7554 +ု 
-7555 +დ -7556 +ვ -7557 +ზ -7558 +თ -7559 +კ -7560 +ს -7561 +ስ -7562 +ን -7563 +እ -7564 +ክ -7565 +ው -7566 +ያ -7567 +Ꭰ -7568 +Ꭹ -7569 +Ꮟ -7570 +Ꮧ -7571 +ᐅ -7572 +ᑎ -7573 +ᓯ -7574 +ប -7575 +ភ -7576 +ម -7577 +្ -7578 +ᨔ -7579 +ṣ -7580 +ṳ -7581 +‍ -7582 +‘ -7583 +ィ -7584 +ウ -7585 +デ -7586 +ン -7587 +ー -7588 +加 -7589 +国 -7590 +学 -7591 +書 -7592 +森 -7593 +特 -7594 +納 -7595 +维 -7596 +语 -7597 +르 -7598 +부 -7599 +스 -7600 +위 -7601 +크 -7602 +터 -7603 +화 -7604 +𐌲 -7605 +𐌹 -7606 +𐍂 -7607 +å -7608 +æ -7609 +ï -7610 +þ -7611 +đ -7612 +ġ -7613 +ģ -7614 +ĩ -7615 +ķ -7616 +ļ -7617 +ł -7618 +ň -7619 +ŭ -7620 +ų -7621 +ź -7622 +ʻ -7623 +ʼ -7624 +̄ -7625 +̍ -7626 +ά -7627 +έ -7628 +ζ -7629 +ό -7630 +ύ -7631 +ώ -7632 +г -7633 +ф -7634 +ц -7635 +ъ -7636 +є -7637 +ў -7638 +ѩ -7639 +ҷ -7640 +ә -7641 +ӣ -7642 +դ -7643 +խ -7644 +ծ -7645 +կ -7646 +հ -7647 +մ -7648 +չ -7649 +պ -7650 +փ -7651 +מ -7652 +أ -7653 +ئ -7654 +ة -7655 +ث -7656 +ج -7657 +خ -7658 +ص -7659 +غ -7660 +ه -7661 +ې -7662 +ە -7663 +ܙ -7664 +ܟ -7665 +ހ -7666 +ކ -7667 +ވ -7668 +މ -7669 +ޗ -7670 +ޣ -7671 +ޫ -7672 +ޭ -7673 +ः -7674 +अ -7675 +उ -7676 +च -7677 +झ -7678 +ण -7679 +़ -7680 +চ -7681 +ছ -7682 +ঠ -7683 +ষ -7684 +ৃ -7685 +ਅ -7686 +ਆ -7687 +ਈ -7688 +ਕ -7689 +ਗ -7690 +ਜ -7691 +ਡ -7692 +ਪ -7693 +ਭ -7694 +ਰ -7695 +ਲ -7696 +ਵ -7697 +ਸ -7698 +ੇ -7699 +੍ -7700 +ੰ -7701 +ં -7702 +અ -7703 +ઇ -7704 +ઊ -7705 +ઘ -7706 +જ -7707 +ડ -7708 +પ -7709 +લ -7710 +ળ -7711 +વ -7712 +શ -7713 +ષ -7714 +ુ -7715 +ૂ -7716 +ો -7717 +ଊ -7718 +ତ -7719 +ନ -7720 +ଯ -7721 +ର -7722 +ା -7723 +ି -7724 +ங -7725 +ண -7726 +த -7727 +ூ -7728 +ெ -7729 +ఊ -7730 +ఎ -7731 +ఘ -7732 +ట -7733 +డ -7734 +ప -7735 +మ -7736 +ళ -7737 +వ -7738 +శ -7739 +ష -7740 +ు -7741 +ూ -7742 +ృ -7743 +ై -7744 +ో -7745 +ಂ -7746 +ಆ -7747 +ಏ -7748 +ಘ -7749 +ಡ -7750 +ತ -7751 +ಪ -7752 +ಭ -7753 +ಳ -7754 +ವ -7755 +ಶ -7756 +ಷ -7757 +ೀ -7758 +ು -7759 +ೂ -7760 +ೈ -7761 +ೋ -7762 +ഇ -7763 +ഏ -7764 +ഒ -7765 +ഖ -7766 +ഗ -7767 +ഘ -7768 +ഡ -7769 +പ -7770 +ഴ -7771 +വ -7772 +ശ -7773 +ൂ -7774 +െ -7775 +േ -7776 +ൈ 
-7777 +ൾ -7778 +ං -7779 +ආ -7780 +ග -7781 +ඩ -7782 +ත -7783 +ද -7784 +ප -7785 +භ -7786 +ල -7787 +ී -7788 +ූ -7789 +ෘ -7790 +ෝ -7791 +ค -7792 +ง -7793 +จ -7794 +ช -7795 +ซ -7796 +ฐ -7797 +ฒ -7798 +ณ -7799 +ด -7800 +พ -7801 +ฤ -7802 +ล -7803 +ื -7804 +ุ -7805 +ู -7806 +แ -7807 +ไ -7808 +ຄ -7809 +ງ -7810 +ຊ -7811 +ຍ -7812 +ມ -7813 +ວ -7814 +ະ -7815 +ັ -7816 +ູ -7817 +ເ -7818 +ໍ -7819 +ག -7820 +ང -7821 +ཇ -7822 +འ -7823 +ཡ -7824 +ཤ -7825 +ྐ -7826 +ྱ -7827 +က -7828 +ထ -7829 +ဒ -7830 +န -7831 +ယ -7832 +သ -7833 +အ -7834 +ာ -7835 +ံ -7836 +္ -7837 +ှ -7838 +ო -7839 +პ -7840 +ფ -7841 +ქ -7842 +ሊ -7843 +ሐ -7844 +መ -7845 +ር -7846 +ቅ -7847 +ተ -7848 +ቱ -7849 +ና -7850 +ኛ -7851 +ኢ -7852 +ዘ -7853 +ዝ -7854 +ዱ -7855 +ዲ -7856 +ጋ -7857 +ግ -7858 +ጽ -7859 +ፍ -7860 +ፔ -7861 +Ꭶ -7862 +Ꭿ -7863 +Ꮅ -7864 +Ꮒ -7865 +Ꮗ -7866 +Ꮝ -7867 +Ꮻ -7868 +Ꮼ -7869 +Ꮿ -7870 +ᐣ -7871 +ᐧ -7872 +ᐱ -7873 +ᑐ -7874 +ᑖ -7875 +ᑦ -7876 +ᑭ -7877 +ᑲ -7878 +ᒧ -7879 +ᓄ -7880 +ᓇ -7881 +ᓈ -7882 +ᓐ -7883 +ᓗ -7884 +ᓪ -7885 +ᓰ -7886 +ᔪ -7887 +ᔭ -7888 +ᖃ -7889 +ᖅ -7890 +គ -7891 +ឌ -7892 +ទ -7893 +ធ -7894 +យ -7895 +ស -7896 +អ -7897 +ូ -7898 +៊ -7899 +៌ -7900 +ᨅ -7901 +ᨋ -7902 +ᨕ -7903 +ế -7904 +ị -7905 +ố -7906 +ổ -7907 +ỳ -7908 +“ -7909 +” -7910 +の -7911 +へ -7912 +ァ -7913 +ェ -7914 +エ -7915 +ガ -7916 +キ -7917 +ク -7918 +コ -7919 +ジ -7920 +ス -7921 +セ -7922 +チ -7923 +ツ -7924 +ト -7925 +ナ -7926 +ピ -7927 +ブ -7928 +ペ -7929 +モ -7930 +ュ -7931 +一 -7932 +于 -7933 +伊 -7934 +典 -7935 +前 -7936 +华 -7937 +圣 -7938 +堡 -7939 +大 -7940 +太 -7941 +學 -7942 +察 -7943 +手 -7944 +提 -7945 +摩 -7946 +斯 -7947 +於 -7948 +杜 -7949 +灣 -7950 +琴 -7951 +皮 -7952 +紙 -7953 +細 -7954 +纳 -7955 +经 -7956 +臺 -7957 +迦 -7958 +ꙁ -7959 +ꙑ -7960 +가 -7961 +간 -7962 +게 -7963 +경 -7964 +과 -7965 +국 -7966 +나 -7967 +낸 -7968 +노 -7969 +뒤 -7970 +드 -7971 +레 -7972 +룰 -7973 +리 -7974 +모 -7975 +문 -7976 +민 -7977 +백 -7978 +보 -7979 +브 -7980 +비 -7981 +서 -7982 +성 -7983 +시 -7984 +어 -7985 +에 -7986 +영 -7987 +오 -7988 +젠 -7989 +중 -7990 +지 -7991 +째 -7992 +차 -7993 +첫 -7994 +첸 -7995 diff --git 
a/models/vocabulary/ng_vocabulary.parquet b/models/vocabulary/ng_vocabulary.parquet new file mode 100644 index 0000000000000000000000000000000000000000..5afd4d071cca358586ad125f6f1cb11c8240aa0a --- /dev/null +++ b/models/vocabulary/ng_vocabulary.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:583c51c99cbd4e09740abcdcc5df7576ce2cc8f7d16aea843b7775033cfdee22 +size 12196 diff --git a/models/vocabulary/ng_vocabulary_metadata.json b/models/vocabulary/ng_vocabulary_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..9749f934a3bc80b0fbc03ac7048dcdcb8c0ac959 --- /dev/null +++ b/models/vocabulary/ng_vocabulary_metadata.json @@ -0,0 +1,15 @@ +{ + "language": "ng", + "vocabulary_size": 648, + "variant": "full", + "statistics": { + "type_token_ratio": 0.39977816510854064, + "coverage": { + "top_100": 0.3463793376643955, + "top_1000": 0.7586753287909999 + }, + "hapax_count": 1875, + "hapax_ratio": 0.7431629013079667, + "total_documents": 17 + } +} \ No newline at end of file diff --git a/models/word_markov/ng_markov_ctx1_word.parquet b/models/word_markov/ng_markov_ctx1_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..3ce5a44b0c8e138f6e73330e4827a95471f7aa66 --- /dev/null +++ b/models/word_markov/ng_markov_ctx1_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a4976d02f4a3b7f9cc67d5838410cd03eccc8de754cbeed9f8e625b7ca718171 +size 72378 diff --git a/models/word_markov/ng_markov_ctx1_word_metadata.json b/models/word_markov/ng_markov_ctx1_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..67657540cdbd538522bd06f586228fcd706b4eb9 --- /dev/null +++ b/models/word_markov/ng_markov_ctx1_word_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 1, + "variant": "word", + "language": "ng", + "unique_contexts": 2515, + "total_transitions": 6294 +} \ No newline at end of file diff --git 
a/models/word_markov/ng_markov_ctx2_word.parquet b/models/word_markov/ng_markov_ctx2_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..95ef0c5062d9250fc93362e2dd5c49fcfd0aeb20 --- /dev/null +++ b/models/word_markov/ng_markov_ctx2_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e32567790fc33a99257fd8adb8e7ad0ce3344ca7d3a3eb520e45837f02b8aa0e +size 109970 diff --git a/models/word_markov/ng_markov_ctx2_word_metadata.json b/models/word_markov/ng_markov_ctx2_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..d27398eab0e6ecf81b8c6bf40ece05e940cc3110 --- /dev/null +++ b/models/word_markov/ng_markov_ctx2_word_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 2, + "variant": "word", + "language": "ng", + "unique_contexts": 5756, + "total_transitions": 6277 +} \ No newline at end of file diff --git a/models/word_markov/ng_markov_ctx3_word.parquet b/models/word_markov/ng_markov_ctx3_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..2445c7315339d85b8d4ba150eaa9056fc5c40269 --- /dev/null +++ b/models/word_markov/ng_markov_ctx3_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:baacbbfa77ed4700cf8770e496208dc10568de3ddfb7d70f04d448e9c80f629a +size 129320 diff --git a/models/word_markov/ng_markov_ctx3_word_metadata.json b/models/word_markov/ng_markov_ctx3_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..01ad7220ba72972f25e520ef52617dfc900119fe --- /dev/null +++ b/models/word_markov/ng_markov_ctx3_word_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 3, + "variant": "word", + "language": "ng", + "unique_contexts": 6060, + "total_transitions": 6260 +} \ No newline at end of file diff --git a/models/word_markov/ng_markov_ctx4_word.parquet b/models/word_markov/ng_markov_ctx4_word.parquet new file mode 100644 index 
0000000000000000000000000000000000000000..439a61a081e085b018963f4bc527d1c84b8c1a03 --- /dev/null +++ b/models/word_markov/ng_markov_ctx4_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:895d3a6d2039026439c7f454c74fe99ddf6e0ef21d4295442199e0aefa21c515 +size 141586 diff --git a/models/word_markov/ng_markov_ctx4_word_metadata.json b/models/word_markov/ng_markov_ctx4_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..05efb493a1f858884a124b8776cd88f857712989 --- /dev/null +++ b/models/word_markov/ng_markov_ctx4_word_metadata.json @@ -0,0 +1,7 @@ +{ + "context_size": 4, + "variant": "word", + "language": "ng", + "unique_contexts": 6160, + "total_transitions": 6243 +} \ No newline at end of file diff --git a/models/word_ngram/ng_2gram_word.parquet b/models/word_ngram/ng_2gram_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..20e0717f6e5e8a361cf91abd46658d005e29190c --- /dev/null +++ b/models/word_ngram/ng_2gram_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:555cdaf3192346943b356a08cda1779b0edf79ca8e0e8c7e36f0fd3390669d76 +size 2786 diff --git a/models/word_ngram/ng_2gram_word_metadata.json b/models/word_ngram/ng_2gram_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..79cbd33625b4c8f08ddf181531c792124815297c --- /dev/null +++ b/models/word_ngram/ng_2gram_word_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 2, + "variant": "word", + "language": "ng", + "unique_ngrams": 22, + "total_ngrams": 6294 +} \ No newline at end of file diff --git a/models/word_ngram/ng_3gram_word.parquet b/models/word_ngram/ng_3gram_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..cf9db9969335a16df457b660e23264a04f78f538 --- /dev/null +++ b/models/word_ngram/ng_3gram_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid 
sha256:9b2228b37f8efe206e9b4d11db4b4b0702352753679232ea5df5944c22841163 +size 2903 diff --git a/models/word_ngram/ng_3gram_word_metadata.json b/models/word_ngram/ng_3gram_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..ac5a0e6b0cbee295c17dc7c9ea116aaa37c48b0a --- /dev/null +++ b/models/word_ngram/ng_3gram_word_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 3, + "variant": "word", + "language": "ng", + "unique_ngrams": 23, + "total_ngrams": 6277 +} \ No newline at end of file diff --git a/models/word_ngram/ng_4gram_word.parquet b/models/word_ngram/ng_4gram_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..1034c9a47f38516ca33eca5868585693c73b872a --- /dev/null +++ b/models/word_ngram/ng_4gram_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1f1d6c8bb2e1c5623c56a17b5c0a7fdc99688659127f9d2d3801836cfa691002 +size 3222 diff --git a/models/word_ngram/ng_4gram_word_metadata.json b/models/word_ngram/ng_4gram_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..49698252ce53a6a7cbc331b34740fbb1bd31bd0d --- /dev/null +++ b/models/word_ngram/ng_4gram_word_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 4, + "variant": "word", + "language": "ng", + "unique_ngrams": 29, + "total_ngrams": 6260 +} \ No newline at end of file diff --git a/models/word_ngram/ng_5gram_word.parquet b/models/word_ngram/ng_5gram_word.parquet new file mode 100644 index 0000000000000000000000000000000000000000..3ae2c69d7d43b6ad806587ba370f281c861a1171 --- /dev/null +++ b/models/word_ngram/ng_5gram_word.parquet @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dddcc8b4ee50e6d38e4ec7e5afc3d46386c3917b2b4ad0fa518eb9cf81764ddc +size 2922 diff --git a/models/word_ngram/ng_5gram_word_metadata.json b/models/word_ngram/ng_5gram_word_metadata.json new file mode 100644 index 0000000000000000000000000000000000000000..cf00aea136fbddf9e4040a0cb61adf67a7ee0ff5 --- /dev/null 
+++ b/models/word_ngram/ng_5gram_word_metadata.json @@ -0,0 +1,7 @@ +{ + "n": 5, + "variant": "word", + "language": "ng", + "unique_ngrams": 15, + "total_ngrams": 6243 +} \ No newline at end of file diff --git a/ng_morph_tokenizer.json b/ng_morph_tokenizer.json new file mode 100644 index 0000000000000000000000000000000000000000..f521e73eb9d50cd786d3365cce70750114e7ffed --- /dev/null +++ b/ng_morph_tokenizer.json @@ -0,0 +1,5816 @@ +{ + "language": "ng", + "prefixes": {}, + "suffixes": { + "a": 506.9 + }, + "ngram_scores": { + "a": 1406, + "an": 583, + "e": 427, + "ng": 419, + "en": 411, + "n": 403, + "o": 401, + "na": 395, + "ia": 348, + "ki": 278, + "k": 277, + "ka": 277, + "er": 262, + "s": 262, + "wa": 261, + "tu": 253, + "te": 248, + "li": 246, + "et": 236, + "i": 232, + "no": 230, + "ik": 229, + "t": 224, + "ok": 222, + "in": 220, + "ga": 218, + "se": 215, + "ha": 211, + "la": 209, + "rs": 205, + "ur": 203, + "d": 202, + "ra": 198, + "ku": 198, + "ers": 197, + "m": 192, + "he": 190, + "di": 188, + "o:": 187, + ":u": 184, + "ue": 183, + "l": 181, + "on": 178, + "ter": 177, + "hi": 176, + "al": 176, + ":g": 175, + "rse": 175, + "erse": 175, + "u": 174, + "sen": 174, + "ete": 172, + "as": 171, + "un": 171, + "om": 171, + "th": 169, + "eter": 169, + "ters": 169, + "rsen": 169, + "uet": 168, + "uete": 168, + "p": 166, + ":ue": 165, + ":uet": 165, + ":t": 163, + "ul": 162, + "ta": 160, + "si": 160, + "lo": 160, + "ip": 159, + "el": 158, + "le": 157, + "ana": 157, + ":a": 155, + "pe": 154, + "ge": 147, + "h": 145, + "ne": 143, + "a:": 142, + "ed": 140, + "pi": 138, + "w": 137, + "l:": 137, + "wi": 137, + "iki": 137, + "ko": 135, + "tur": 134, + "kip": 134, + "ma": 133, + "ikip": 133, + "uu": 131, + "sh": 129, + "at": 128, + "is": 127, + "c": 126, + "mb": 125, + "ya": 125, + "ba": 124, + "ipe": 124, + "s:": 123, + "g": 123, + ":b": 123, + "wik": 123, + "t:": 122, + ":w": 121, + "ped": 121, + "il": 120, + "edi": 120, + "wiki": 120, + "han": 119, + "n:": 118, + ":wi": 
118, + "iped": 118, + ":wik": 117, + "ang": 116, + "kipe": 116, + "pedi": 116, + "pa": 115, + "dia": 113, + ":p": 110, + "nd": 109, + "bi": 109, + "oku": 108, + "aa": 107, + "iy": 106, + "ut": 106, + "de": 106, + "ar": 105, + "em": 105, + "y": 105, + "ke": 104, + "e:": 101, + "r:": 100, + "nt": 100, + "mo": 100, + "b": 99, + "ni": 98, + "bl": 98, + "ib": 96, + "am": 95, + "eo": 95, + "za": 95, + "edia": 95, + "az": 94, + "ol": 94, + "ch": 94, + "f": 94, + ".": 93, + "ti": 93, + "gr": 93, + "mu": 93, + "gu": 92, + "fi": 92, + "ano": 92, + "re": 91, + "og": 91, + "im": 90, + "ye": 89, + "uk": 89, + "gl": 88, + ":pi": 88, + "it": 87, + "af": 86, + "ig": 86, + "ing": 86, + "ong": 86, + "rk": 85, + "gh": 85, + "i:": 84, + "yo": 84, + ":tu": 84, + "ai": 83, + ":as": 83, + "sk": 82, + "ie": 82, + "nga": 82, + "gra": 82, + "u:": 81, + "wo": 81, + "po": 80, + "op": 80, + ":bi": 80, + "ogr": 80, + "asi": 80, + ":tur": 80, + "lt": 79, + "ho": 79, + "lu": 79, + "oka": 79, + "bli": 79, + "ntu": 79, + "ogra": 79, + "mba": 78, + "eh": 77, + "ao": 76, + "we": 75, + "aw": 74, + "ngl": 74, + "hana": 74, + "es": 73, + "me": 73, + "gha": 73, + "uth": 73, + "nge": 73, + "h:": 72, + "v:": 72, + "to": 72, + "sa": 72, + "omu": 72, + "ghan": 72, + "ap": 71, + "ek": 71, + "ae": 71, + ":gh": 71, + "bib": 70, + "ak": 69, + "ja": 69, + "lw": 69, + "eng": 69, + ":bib": 69, + ":asi": 69, + ":gha": 69, + "go": 68, + "v": 67, + "the": 67, + "ui": 66, + ":d": 66, + "ac": 66, + "eog": 66, + "raf": 66, + "eogr": 66, + "um": 65, + "os": 65, + "y:": 64, + "du": 64, + "uut": 64, + "graf": 64, + "bu": 63, + "ve": 63, + "ngu": 63, + "uuth": 63, + "sb": 62, + "ep": 62, + "emb": 62, + "sia": 62, + "man": 61, + "r": 60, + "hem": 60, + "kal": 60, + "iya": 60, + "afi": 60, + "unt": 60, + "them": 60, + "hemb": 60, + "emba": 60, + "z": 59, + ",": 59, + "ii": 59, + "wan": 59, + "uthe": 59, + "ce": 58, + "ij": 58, + "ult": 58, + "rafi": 58, + "untu": 58, + "m:": 57, + "kw": 57, + "ina": 57, + "ulu": 57, + "lwa": 
57, + "ri": 56, + "gw": 56, + "ala": 56, + "st": 55, + "ro": 55, + "mi": 54, + "au": 54, + "oi": 54, + "gan": 54, + "ehe": 54, + ":ge": 53, + "g:": 52, + "ltu": 52, + "lon": 52, + "asia": 52, + "ay": 51, + "geo": 51, + "mun": 51, + "iga": 51, + "urk": 51, + "ultu": 51, + "geog": 51, + "hu": 50, + "kul": 50, + ":geo": 50, + "turk": 50, + "b:": 49, + "j": 49, + "dh": 49, + "ot": 49, + "ilo": 49, + "gaz": 49, + "pig": 49, + "munt": 49, + ":k": 48, + "ad": 48, + "ew": 48, + "us": 48, + "nen": 48, + "zan": 48, + "long": 48, + "auu": 48, + "piga": 48, + "igaz": 48, + "zano": 48, + ":pig": 48, + "ah": 47, + "da": 47, + "be": 47, + "pu": 47, + "zz": 47, + "ibl": 47, + "azz": 47, + "zza": 47, + "ltur": 47, + "baw": 47, + "omun": 47, + "gazz": 47, + "azza": 47, + "zzan": 47, + "io": 46, + "c:": 46, + "zh": 45, + ":n": 45, + ".o": 45, + "bibl": 45, + "ec": 44, + "ow": 44, + "gwa": 44, + "shi": 44, + "rki": 44, + "mw": 43, + "w:": 43, + "rg": 43, + "ace": 43, + "opo": 43, + "awo": 43, + "vi": 42, + "ara": 42, + "ngo": 42, + "kat": 42, + "enge": 42, + "k:": 41, + "sc": 41, + ":i": 41, + "chi": 41, + "osh": 41, + "oo": 40, + "ub": 40, + ":r": 40, + "zi": 40, + "uni": 40, + "ski": 40, + "dui": 40, + "uis": 40, + "angu": 40, + "oiy": 40, + "kr": 39, + "ws": 39, + "yu": 39, + "nk": 39, + "isb": 39, + "bur": 39, + "urg": 39, + "okat": 39, + "duis": 39, + "uisb": 39, + "kl": 38, + "ija": 38, + "ika": 38, + "keh": 38, + "sbu": 38, + "pia": 38, + "sbur": 38, + "burg": 38, + "pl": 37, + "nz": 37, + ":e": 37, + "gul": 37, + ":du": 37, + "kehe": 37, + "isbu": 37, + "id": 36, + "ca": 36, + "wy": 36, + "bo": 36, + "ei": 36, + "do": 36, + "elo": 36, + "luk": 36, + "now": 36, + "tha": 36, + "lar": 36, + "pub": 36, + "wsk": 36, + "onga": 36, + ":dui": 36, + ":pia": 36, + "ee": 35, + "ic": 35, + "ag": 35, + "dw": 35, + "wó": 35, + "ór": 35, + "ró": 35, + "ól": 35, + ":ku": 35, + "nau": 35, + "gel": 35, + "ato": 35, + ":re": 35, + "ubl": 35, + "owy": 35, + "yd": 35, + "dwó": 35, + "wór": 35, + 
"kró": 35, + "ról": 35, + "óle": 35, + "lew": 35, + "ews": 35, + "publ": 35, + "ubli": 35, + "nowy": 35, + "wyd": 35, + "ydw": 35, + "dwór": 35, + "órk": 35, + "rkr": 35, + "król": 35, + "róle": 35, + "ólew": 35, + "lews": 35, + "ewsk": 35, + "wski": 35, + ":c": 34, + "d:": 34, + "qu": 34, + "mp": 34, + "yi": 34, + "ura": 34, + "pol": 34, + "enz": 34, + "una": 34, + "uka": 34, + "kwa": 34, + "olw": 34, + "nza": 34, + "rep": 34, + "cen": 34, + ":no": 34, + "ngel": 34, + "eok": 34, + "acen": 34, + ":now": 34, + "ias": 33, + "ane": 33, + ":ga": 33, + "iac": 33, + ":kul": 33, + "gulu": 33, + "uluk": 33, + "ongo": 33, + "kato": 33, + "piac": 33, + "iace": 33, + "cenz": 33, + "cl": 32, + "ls": 32, + "oe": 32, + "oma": 32, + "hil": 32, + "ngul": 32, + "kult": 32, + "kala": 32, + "ibli": 32, + "gana": 32, + "neng": 32, + ":rep": 32, + ":gan": 32, + "enza": 32, + "iw": 31, + "eu": 31, + "ns": 31, + "ci": 31, + "iv": 31, + "ele": 31, + "heo": 31, + ":ta": 31, + "uko": 31, + "lik": 31, + "epu": 31, + "tura": 31, + "tuk": 31, + "nao": 31, + "opol": 31, + "olwa": 31, + "q:": 30, + "ds": 30, + "va": 30, + "lia": 30, + "kun": 30, + "hin": 30, + "top": 30, + "ver": 30, + ".ok": 30, + "atop": 30, + "topo": 30, + "polw": 30, + "uke": 30, + "ingl": 30, + "repu": 30, + "cu": 29, + "fr": 29, + "p:": 29, + "h-": 29, + "ny": 29, + "zh-": 29, + "thi": 29, + "imo": 29, + "ath": 29, + "mang": 29, + "lara": 29, + "epub": 29, + "urki": 29, + "nn": 28, + "ey": 28, + "an:": 28, + "bel": 28, + "tim": 28, + "mot": 28, + "uun": 28, + "mwe": 28, + "dhi": 28, + "lis": 28, + "gli": 28, + "nl": 27, + "rc": 27, + "je": 27, + ":cu": 27, + "and": 27, + "nka": 27, + "niv": 27, + ":en": 27, + "ilon": 27, + "univ": 27, + "iu": 26, + ":l": 26, + "a.": 26, + "cul": 26, + "ro:": 26, + "men": 26, + "sim": 26, + ":in": 26, + ":an": 26, + "ema": 26, + "iwa": 26, + "ive": 26, + ":az": 26, + "gle": 26, + ":cul": 26, + "cult": 26, + "timo": 26, + "imot": 26, + "nive": 26, + "iver": 26, + "ngli": 26, + "ngle": 26, + 
":v": 25, + "ly": 25, + "а": 25, + "nl:": 25, + "tai": 25, + "blia": 25, + "eman": 25, + "than": 25, + "vers": 25, + "ms": 24, + "-n": 24, + "ab": 24, + "av": 24, + "ms:": 24, + "nan": 24, + "cla": 24, + "aan": 24, + "sha": 24, + "sho": 24, + "ena": 24, + "nas": 24, + "iai": 24, + "luko": 24, + "atha": 24, + ":eng": 24, + "angl": 24, + "ua": 23, + "ov": 23, + "ts": 23, + "f:": 23, + "o.": 23, + "oy": 23, + "ens": 23, + "pam": 23, + "ndo": 23, + "kan": 23, + "azi": 23, + "wana": 23, + "gwan": 23, + "chin": 23, + "cs": 22, + "(": 22, + ")": 22, + "a,": 22, + "vo": 22, + "ir": 22, + "eg": 22, + "et:": 22, + "ili": 22, + "ith": 22, + "onk": 22, + "pan": 22, + "aiw": 22, + "hano": 22, + "taiw": 22, + "blik": 22, + "gi": 21, + "fu": 21, + "n-": 21, + "ht": 21, + "rm": 21, + "dj": 21, + "e.": 21, + "pr": 21, + "sb:": 21, + "olo": 21, + "ali": 21, + "lin": 21, + "min": 21, + "ita": 21, + "fia": 21, + "o:t": 21, + "ene": 21, + "sal": 21, + "alo": 21, + "wok": 21, + "kla": 21, + "afia": 21, + "woku": 21, + "hina": 21, + "iwan": 21, + ":ang": 21, + "z:": 20, + "sm": 20, + "hy": 20, + "up": 20, + "ar:": 20, + "nds": 20, + "o:u": 20, + "ako": 20, + "lat": 20, + "waa": 20, + "hik": 20, + "opa": 20, + "nok": 20, + "ash": 20, + "les": 20, + "ial": 20, + "ade": 20, + "osho": 20, + "nkal": 20, + "oshi": 20, + "aiwa": 20, + ":azi": 20, + "co": 19, + "ian": 19, + "eko": 19, + "ung": 19, + "gam": 19, + ":vi": 19, + "ica": 19, + "aka": 19, + "ibe": 19, + "kuk": 19, + "oom": 19, + "ame": 19, + "hum": 19, + "rsa": 19, + "sd": 19, + "owo": 19, + "klar": 19, + "ersa": 19, + "gles": 19, + ":tai": 19, + "mg": 18, + "tr": 18, + "ef": 18, + "ug": 18, + "ru": 18, + "jo": 18, + "wu": 18, + "o,": 18, + "ра": 18, + "ati": 18, + "ant": 18, + "nin": 18, + "mon": 18, + "imp": 18, + "non": 18, + "o:g": 18, + "aye": 18, + "umb": 18, + "ndj": 18, + "iye": 18, + "apa": 18, + "iap": 18, + "thik": 18, + "nae": 18, + "amo": 18, + "ain": 18, + "engl": 18, + "glis": 18, + ":ing": 18, + "eb": 17, + "tt": 17, + 
"iq": 17, + "hr": 17, + "ud": 17, + "pt": 17, + "sv": 17, + "tl": 17, + "ld": 17, + ":s": 17, + "st:": 17, + "mg:": 17, + "en:": 17, + "fr:": 17, + "it:": 17, + "nah": 17, + "ani": 17, + "rke": 17, + "am:": 17, + "lan": 17, + "iil": 17, + "ote": 17, + "yom": 17, + "ike": 17, + "epa": 17, + "dec": 17, + "sie": 17, + "mote": 17, + "ibel": 17, + "hang": 17, + "wak": 17, + "how": 17, + "onka": 17, + "noku": 17, + "okuk": 17, + "noi": 17, + "one": 17, + "clar": 17, + "rsal": 17, + "ll": 16, + "nu": 16, + "oc": 16, + "q": 16, + "oa": 16, + "ih": 16, + "ast": 16, + ".a": 16, + "cs:": 16, + "eo:": 16, + "es:": 16, + "ak:": 16, + "hr:": 16, + "enn": 16, + "ap:": 16, + "pl:": 16, + "pt:": 16, + "sv:": 16, + "s:g": 16, + "rap": 16, + ":ba": 16, + ":li": 16, + "yuu": 16, + "ond": 16, + ",o": 16, + "iyo": 16, + "uma": 16, + ":de": 16, + "ecl": 16, + "dek": 16, + "ekl": 16, + "els": 16, + "gua": 16, + "lic": 16, + "ning": 16, + "grap": 16, + "aom": 16, + "shil": 16, + "lath": 16, + "hilo": 16, + "decl": 16, + "ecla": 16, + "dekl": 16, + "ekla": 16, + "ngua": 16, + "o:ue": 16, + "or": 15, + "jv": 15, + "lv": 15, + "sp": 15, + "nc": 15, + "α": 15, + "so": 15, + "ez": 15, + "1": 15, + "2": 15, + "ty": 15, + "ca:": 15, + "kut": 15, + "de:": 15, + "fi:": 15, + "id:": 15, + "ko:": 15, + "no:": 15, + "sh:": 15, + "mpl": 15, + "ple": 15, + "le:": 15, + "zh:": 15, + "nek": 15, + "ndu": 15, + "lij": 15, + "ist": 15, + "ere": 15, + "yeh": 15, + "ima": 15, + "pau": 15, + "wen": 15, + "ilw": 15, + "oko": 15, + "igw": 15, + "rac": 15, + "urc": 15, + "kiy": 15, + "simp": 15, + "impl": 15, + "mple": 15, + "ple:": 15, + "anga": 15, + "iae": 15, + "ayeh": 15, + "yehe": 15, + "uuna": 15, + "mwen": 15, + "pang": 15, + "aok": 15, + "apo": 15, + "igwa": 15, + "nai": 15, + "o.o": 15, + "a.o": 15, + "uman": 15, + "ling": 15, + "rkiy": 15, + ":m": 14, + "cy": 14, + "vr": 14, + "ml": 14, + "dy": 14, + "ss": 14, + "nh": 14, + "su": 14, + "tü": 14, + "ür": 14, + "-m": 14, + "ks": 14, + "br": 14, + "ze": 
14, + "e,": 14, + "sy": 14, + "aha": 14, + "da:": 14, + "hak": 14, + "hu:": 14, + "ja:": 14, + "bo:": 14, + "la:": 14, + "lad": 14, + "lt:": 14, + "nn:": 14, + "co:": 14, + "tür": 14, + "ls:": 14, + "kia": 14, + "lo:": 14, + "ola": 14, + "und": 14, + "ulo": 14, + "ini": 14, + "wom": 14, + "amw": 14, + "ome": 14, + "ono": 14, + "ata": 14, + "omo": 14, + "alu": 14, + "adh": 14, + "van": 14, + "okw": 14, + "ion": 14, + "o:a": 14, + "asc": 14, + "iaa": 14, + "alat": 14, + "yen": 14, + "amwe": 14, + "ukal": 14, + "amen": 14, + "koi": 14, + "lika": 14, + "ashi": 14, + "gelo": 14, + "asie": 14, + "blic": 14, + "turc": 14, + "fy": 13, + "fa": 13, + "ln": 13, + "ph": 13, + "cn": 13, + "sw": 13, + "cr": 13, + "lb": 13, + "tw": 13, + "ea": 13, + ":j": 13, + "ла": 13, + "к": 13, + "ngi": 13, + "amb": 13, + "he:": 13, + "is:": 13, + "ml:": 13, + "oc:": 13, + "sk:": 13, + "sw:": 13, + "tr:": 13, + "vi:": 13, + "war": 13, + "n-n": 13, + "o:w": 13, + "rm:": 13, + "wuu": 13, + "fie": 13, + "aro": 13, + "t:g": 13, + "n:t": 13, + "ine": 13, + "a1": 13, + "eho": 13, + "taa": 13, + "ngw": 13, + "lyo": 13, + "mbo": 13, + "mok": 13, + "yok": 13, + "ila": 13, + "awa": 13, + "lun": 13, + "kug": 13, + ":un": 13, + "rec": 13, + "s:u": 13, + "l:u": 13, + "don": 13, + "oes": 13, + "iaf": 13, + "afie": 13, + "aant": 13, + "iilo": 13, + "bibe": 13, + "ngwa": 13, + "hike": 13, + "antu": 13, + "ilwa": 13, + "kuka": 13, + "adhi": 13, + "dhil": 13, + ":dec": 13, + "arac": 13, + "nad": 13, + "ndon": 13, + "dong": 13, + "gels": 13, + "u-": 12, + "sl": 12, + "sq": 12, + "ob": 12, + "pé": 12, + "bm": 12, + "bh": 12, + "j:": 12, + "lm": 12, + "sz": 12, + "ía": 12, + "pw": 12, + "uy": 12, + "nw": 12, + "ек": 12, + "bat": 12, + "cy:": 12, + "el:": 12, + "eu:": 12, + "gl:": 12, + "ia:": 12, + "ka:": 12, + "lv:": 12, + "s:b": 12, + "pap": 12, + "qu:": 12, + "sq:": 12, + "ama": 12, + "yo:": 12, + "h-m": 12, + "-mi": 12, + "in-": 12, + "-na": 12, + "af:": 12, + "qui": 12, + "vik": 12, + "che": 12, + "nb": 12, 
+ "l:a": 12, + "a2": 12, + "o:b": 12, + "t:p": 12, + "s:a": 12, + "ris": 12, + "uo": 12, + "uki": 12, + "him": 12, + "yop": 12, + "ota": 12, + "ula": 12, + "ela": 12, + "key": 12, + "rat": 12, + "asa": 12, + "asy": 12, + ":tü": 12, + "ast:": 12, + "war:": 12, + "zh-m": 12, + "h-mi": 12, + "-min": 12, + "min-": 12, + "in-n": 12, + "n-na": 12, + "-nan": 12, + "nan:": 12, + ":vik": 12, + "viki": 12, + "o:wi": 12, + "pamw": 12, + "game": 12, + ":uni": 12, + "nun": 12, + "lica": 12, + "urke": 12, + ":tür": 12, + "s:ue": 12, + "-s": 11, + "bn": 11, + "bs": 11, + "-v": 11, + "mr": 11, + "-y": 11, + "éd": 11, + "nr": 11, + "bí": 11, + "a-": 11, + "-c": 11, + "uv": 11, + "ا": 11, + "ال": 11, + "uh": 11, + "д": 11, + "rq": 11, + "pok": 11, + "bn:": 11, + "bs:": 11, + "fiu": 11, + "iu-": 11, + "u-v": 11, + "-vr": 11, + "vro": 11, + "hi:": 11, + "io:": 11, + "jv:": 11, + "mr:": 11, + "ds:": 11, + "aph": 11, + "scn": 11, + "cn:": 11, + "sl:": 11, + "ta:": 11, + "th:": 11, + "tl:": 11, + "vec": 11, + "ec:": 11, + "h-y": 11, + "-yu": 11, + "yue": 11, + "ue:": 11, + "ipé": 11, + "péd": 11, + "édi": 11, + "als": 11, + "s:w": 11, + "t:u": 11, + "lb:": 11, + "roa": 11, + "a:g": 11, + "b:t": 11, + "ale": 11, + "l:g": 11, + "ach": 11, + "nf": 11, + "a:a": 11, + "nom": 11, + "omb": 11, + "naa": 11, + "kom": 11, + "wat": 11, + "iin": 11, + "nem": 11, + "oon": 11, + "oye": 11, + "nat": 11, + "ish": 11, + "aku": 11, + "eta": 11, + "ien": 11, + "sya": 11, + "enk": 11, + "s:t": 11, + "l:t": 11, + "urq": 11, + "fiu-": 11, + "iu-v": 11, + "u-vr": 11, + "-vro": 11, + "vro:": 11, + "nds:": 11, + "scn:": 11, + "vec:": 11, + "zh-y": 11, + "h-yu": 11, + "-yue": 11, + "yue:": 11, + "unga": 11, + "pédi": 11, + "s:wi": 11, + "ael": 11, + "ait": 11, + "iat": 11, + "agw": 11, + "uthi": 11, + "ondj": 11, + "kwat": 11, + "wap": 11, + "huma": 11, + "wa1": 11, + "wa2": 11, + "elon": 11, + "e.o": 11, + "okug": 11, + "arat": 11, + ":lin": 11, + "rkey": 11, + ":asy": 11, + "asya": 11, + "enl": 11, + "turq": 
11, + "rkia": 11, + "l:ue": 11, + "t-": 10, + "xt": 10, + "gv": 10, + "of": 10, + "ܐ": 10, + "ה": 10, + "hs": 10, + "my": 10, + ":h": 10, + "rn": 10, + "tk": 10, + "aq": 10, + "dr": 10, + "ар": 10, + "tan": 10, + "az:": 10, + "bar": 10, + "at-": 10, + "t-s": 10, + "-sm": 10, + "smg": 10, + "ceb": 10, + "eb:": 10, + "ure": 10, + "fy:": 10, + "ga:": 10, + "kil": 10, + "hy:": 10, + "ku:": 10, + "ah:": 10, + "nap": 10, + "ent": 10, + "nov": 10, + "ov:": 10, + "sco": 10, + "ona": 10, + "bil": 10, + "enc": 10, + "diy": 10, + "br:": 10, + "kaa": 10, + "li:": 10, + "lmo": 10, + "mo:": 10, + "ln:": 10, + "my:": 10, + "oa-": 10, + "a:u": 10, + "tet": 10, + "lok": 10, + "hia": 10, + "nal": 10, + "rin": 10, + "nde": 10, + "vo:": 10, + "tal": 10, + "lol": 10, + "akw": 10, + "vet": 10, + "tum": 10, + "ush": 10, + "ekw": 10, + "yon": 10, + "aal": 10, + "enw": 10, + "nwa": 10, + "meh": 10, + "ndi": 10, + "yaa": 10, + "iyu": 10, + "dho": 10, + "lwe": 10, + "she": 10, + "dul": 10, + "sch": 10, + "ech": 10, + "aci": 10, + "n:u": 10, + "tio": 10, + "t:t": 10, + "ürk": 10, + "rqu": 10, + "bat-": 10, + "at-s": 10, + "t-sm": 10, + "-smg": 10, + "smg:": 10, + "ceb:": 10, + "ture": 10, + "aet": 10, + "nah:": 10, + "nov:": 10, + "aqu": 10, + "ngan": 10, + "ipéd": 10, + "édia": 10, + "als:": 10, + "diya": 10, + "iak": 10, + "lmo:": 10, + "roa-": 10, + "raph": 10, + "olol": 10, + "poka": 10, + "kalo": 10, + "akan": 10, + "wao": 10, + "nema": 10, + "okul": 10, + "ange": 10, + "ald": 10, + "raci": 10, + "enf": 10, + "ingu": 10, + "türk": 10, + "urqu": 10, + "t:ue": 10, + "ex": 9, + "jb": 9, + "ა": 9, + "wl": 9, + "s-": 9, + "tq": 9, + "uz": 9, + "py": 9, + "fo": 9, + "-k": 9, + "या": 9, + "zl": 9, + "ल": 9, + "aj": 9, + "кл": 9, + "ац": 9, + "п": 9, + "ав": 9, + "mbu": 9, + "yat": 9, + "kol": 9, + "diq": 9, + "iq:": 9, + "ht:": 9, + "ad:": 9, + "ds-": 9, + "s-n": 9, + "-nl": 9, + "stq": 9, + "tq:": 9, + "te:": 9, + "uz:": 9, + "ada": 9, + "yi:": 9, + "n:b": 9, + "mbi": 9, + "era": 9, + "die": 
9, + "dij": 9, + "l:w": 9, + "ij:": 9, + "l:b": 9, + "kii": 9, + "szl": 9, + "zl:": 9, + "tk:": 9, + "pi:": 9, + "r:g": 9, + "e:g": 9, + "n:g": 9, + "teu": 9, + "aak": 9, + "ahe": 9, + "t:b": 9, + "o:p": 9, + "g:a": 9, + "oly": 9, + "end": 9, + "imi": 9, + "ayi": 9, + "eli": 9, + "ezi": 9, + "ank": 9, + "eit": 9, + "lak": 9, + "kon": 9, + "naw": 9, + "eka": 9, + "hok": 9, + "imb": 9, + "eno": 9, + "lel": 9, + "hen": 9, + "ара": 9, + "рац": 9, + "erk": 9, + "rkl": 9, + "der": 9, + "avi": 9, + "i:u": 9, + "l:p": 9, + "i:t": 9, + "t:a": 9, + "np": 9, + "rch": 9, + "aca": 9, + "diq:": 9, + "hak:": 9, + "aid": 9, + "aio": 9, + "lad:": 9, + "alv": 9, + "nds-": 9, + "ds-n": 9, + "s-nl": 9, + "-nl:": 9, + "pap:": 9, + "sco:": 9, + "stq:": 9, + "laa": 9, + "l:wi": 9, + "jah": 9, + "lij:": 9, + "iam": 9, + "iar": 9, + "szl:": 9, + "land": 9, + "o:tu": 9, + "o:bi": 9, + "womu": 9, + "wene": 9, + "anka": 9, + "itha": 9, + "meho": 9, + "yopa": 9, + "nawa": 9, + "gano": 9, + "aiy": 9, + "eom": 9, + "wam": 9, + "undu": 9, + "awe": 9, + "rech": 9, + "sde": 9, + "atio": 9, + "sien": 9, + "lish": 9, + "cad": 9, + "kiya": 9, + "kis": 9, + "n:ue": 9, + "eni": 9, + ":y": 8, + "dv": 8, + "yk": 8, + "ė": 8, + "gd": 8, + "kn": 8, + "pm": 8, + "tp": 8, + "ία": 8, + "if": 8, + ":f": 8, + "ju": 8, + "ji": 8, + "íb": 8, + "sg": 8, + "i.": 8, + "iz": 8, + "де": 8, + "ка": 8, + "ов": 8, + ":á": 8, + "dv:": 8, + "uro": 8, + "ext": 8, + "xt:": 8, + "fur": 8, + "ur:": 8, + "gv:": 8, + "jbo": 8, + "v:b": 8, + "აk": 8, + "kl:": 8, + "mwl": 8, + "wl:": 8, + "a(": 8, + "oni": 8, + "uta": 8, + "mad": 8, + "lam": 8, + "ker": 8, + "ce:": 8, + "arc": 8, + "rc:": 8, + "idi": 8, + "bm:": 8, + "αe": 8, + "fo:": 8, + "gd:": 8, + "gu:": 8, + "hsb": 8, + "kn:": 8, + "kw:": 8, + "na:": 8, + "pms": 8, + "sc:": 8, + "so:": 8, + "tpi": 8, + "wo:": 8, + "las": 8, + "y:u": 8, + "api": 8, + "fij": 8, + "eyo": 8, + "ede": 8, + "a:t": 8, + "cht": 8, + "v:g": 8, + "eri": 8, + "m:g": 8, + "u:a": 8, + "pis": 8, + "a:d": 8, 
+ "ate": 8, + ":bí": 8, + "bíb": 8, + "ble": 8, + "kit": 8, + "uli": 8, + "mus": 8, + "pwa": 8, + "kep": 8, + ".u": 8, + "kwe": 8, + "oth": 8, + "mit": 8, + "kok": 8, + "iig": 8, + "eny": 8, + "nos": 8, + "udh": 8, + "ukw": 8, + "mii": 8, + "alw": 8, + "djo": 8, + "amu": 8, + "ove": 8, + "uga": 8, + ".e": 8, + "nam": 8, + "кла": 8, + "лар": 8, + "nm": 8, + "t:d": 8, + "isi": 8, + "sin": 8, + "s:p": 8, + "spr": 8, + "lsk": 8, + "r:a": 8, + "ska": 8, + "n:a": 8, + "zij": 8, + "e:t": 8, + "kc": 8, + "a:p": 8, + "ase": 8, + "r:u": 8, + "bar:": 8, + "acs": 8, + "ext:": 8, + "fur:": 8, + "aht": 8, + "aja": 8, + "jbo:": 8, + "yak": 8, + "yam": 8, + "mwl:": 8, + "apt": 8, + "ief": 8, + "kipé": 8, + "ace:": 8, + "arc:": 8, + "yac": 8, + "edij": 8, + "dija": 8, + "iag": 8, + "iah": 8, + "hsb:": 8, + "ilo:": 8, + "jam": 8, + "l:bi": 8, + "pms:": 8, + "tet:": 8, + "tpi:": 8, + "loka": 8, + "oteu": 8, + ":bíb": 8, + "ible": 8, + "t:pi": 8, + "loi": 8, + "itaa": 8, + "omus": 8, + "mush": 8, + "usha": 8, + "shan": 8, + "iman": 8, + "yuun": 8, + "uunt": 8, + "woo": 8, + ".uu": 8, + "monk": 8, + "laka": 8, + "enwa": 8, + "inge": 8, + "omeh": 8, + "iyop": 8, + "anaw": 8, + "kao": 8, + "ukwa": 8, + "kulo": 8, + "yoka": 8, + "kai": 8, + "ndjo": 8, + "onge": 8, + "alun": 8, + "lung": 8, + "ayo": 8, + "veta": 8, + "kuga": 8, + "inga": 8, + "клар": 8, + "лара": 8, + "арац": 8, + "erkl": 8, + "osd": 8, + "lde": 8, + "rati": 8, + "tion": 8, + "chia": 8, + "elsk": 8, + "s:as": 8, + "zija": 8, + "o:as": 8, + "l:tu": 8, + "urch": 8, + "r:ue": 8, + "i:ue": 8, + "a:ue": 8, + "enh": 8, + "l:pi": 8, + "rr": 7, + "mt": 7, + "än": 7, + "à": 7, + "zu": 7, + "èd": 7, + "bp": 7, + "cd": 7, + "rh": 7, + "rp": 7, + "ou": 7, + "gn": 7, + "nv": 7, + "án": 7, + "oh": 7, + "g-": 7, + ":i̇": 7, + "lg": 7, + "qa": 7, + "vu": 7, + "ن": 7, + "пр": 7, + "та": 7, + "न": 7, + "т": 7, + "ay:": 7, + ":ki": 7, + "aya": 7, + "mt:": 7, + "ys": 7, + "phi": 7, + "su:": 7, + "w:u": 7, + "iqu": 7, + "ipi": 7, + "bpy": 7, + 
"py:": 7, + "crh": 7, + "rh:": 7, + "a:w": 7, + "dsb": 7, + "n:w": 7, + "t:w": 7, + "frp": 7, + "rp:": 7, + "haw": 7, + "iti": 7, + "kab": 7, + "ab:": 7, + "nrm": 7, + "se:": 7, + "rn:": 7, + "tw:": 7, + "uu:": 7, + "ahu": 7, + "r:d": 7, + "b:g": 7, + "u:g": 7, + "aat": 7, + "nda": 7, + "ie:": 7, + "ske": 7, + ":aa": 7, + "c:g": 7, + ":al": 7, + "all": 7, + "sm:": 7, + "ban": 7, + "esi": 7, + "muu": 7, + "ese": 7, + "eus": 7, + "kas": 7, + "ann": 7, + "nik": 7, + "g:b": 7, + "bul": 7, + "íbl": 7, + "bia": 7, + "len": 7, + "ee:": 7, + "aib": 7, + "aba": 7, + "sg:": 7, + "eto": 7, + "usi": 7, + "ibh": 7, + "sto": 7, + "iri": 7, + "kin": 7, + "ken": 7, + "aay": 7, + "mpo": 7, + "umw": 7, + "til": 7, + "lal": 7, + "ogo": 7, + ",m": 7, + "utu": 7, + "mut": 7, + "mwa": 7, + "dha": 7, + "nit": 7, + "he,": 7, + "hig": 7, + "hom": 7, + "lek": 7, + "ulw": 7, + "lil": 7, + "ndy": 7, + "dyo": 7, + "no.": 7, + "tel": 7, + "kee": 7, + "ont": 7, + "edh": 7, + "gum": 7, + "egu": 7, + "we.": 7, + "uuk": 7, + "tin": 7, + "yel": 7, + "fut": 7, + "aas": 7, + "дек": 7, + "екл": 7, + "sel": 7, + "пра": 7, + "рав": 7, + "del": 7, + "pra": 7, + "esk": 7, + "e:a": 7, + "sn": 7, + "b:u": 7, + "e:u": 7, + ":nd": 7, + "isc": 7, + "m:a": 7, + "zia": 7, + "r:t": 7, + "epi": 7, + ":pr": 7, + "teo": 7, + "r:p": 7, + "g:u": 7, + "v:u": 7, + "m:u": 7, + "u:u": 7, + "efy": 7, + "nap:": 7, + "aphi": 7, + "bele": 7, + "neko": 7, + "ngam": 7, + "bili": 7, + "ediy": 7, + "yab": 7, + "iab": 7, + "bpy:": 7, + "crh:": 7, + "a:wi": 7, + "dsb:": 7, + "n:wi": 7, + "t:wi": 7, + "afr": 7, + "frp:": 7, + "kab:": 7, + "enr": 7, + "nrm:": 7, + "pam:": 7, + "arm": 7, + "jas": 7, + "iav": 7, + "awu": 7, + "wuu:": 7, + "s:ge": 7, + "afij": 7, + "fija": 7, + "alb": 7, + "s:bi": 7, + "bíbl": 7, + "íbli": 7, + "o:pi": 7, + "angw": 7, + "heg": 7, + "tuo": 7, + "mpok": 7, + "sima": 7, + "weo": 7, + "kae": 7, + "ukil": 7, + "oos": 7, + "lala": 7, + "alak": 7, + "anen": 7, + "unin": 7, + "o,o": 7, + "menw": 7, + "mith": 7, 
+ "kome": 7, + "iigw": 7, + "anah": 7, + "naha": 7, + "omut": 7, + "andj": 7, + "aii": 7, + "iyaa": 7, + "iyuu": 7, + "anit": 7, + "nith": 7, + "ulon": 7, + "okut": 7, + "log": 7, + "ndyo": 7, + "dyok": 7, + "pwaa": 7, + "kalu": 7, + "tali": 7, + "gep": 7, + "okun": 7, + "ook": 7, + ".om": 7, + "gos": 7, + "okwa": 7, + "kah": 7, + "декл": 7, + "екла": 7, + "sche": 7, + "прав": 7, + "enb": 7, + "sai": 7, + ":ndo": 7, + "azij": 7, + "kch": 7, + "nav": 7, + "moth": 7, + "g:ue": 7, + "v:ue": 7, + "m:ue": 7, + "u:ue": 7, + "enp": 7, + "s:gh": 7, + "o:gh": 7, + "ու": 6, + "pè": 6, + "ėj": 6, + "jė": 6, + "ικ": 6, + "î": 6, + "יה": 6, + "ია": 6, + "-r": 6, + "bc": 6, + "lé": 6, + "ì": 6, + "od": 6, + "س": 6, + "།": 6, + "bb": 6, + "i̇n": 6, + "în": 6, + "е": 6, + "pd": 6, + "dc": 6, + "ва": 6, + "x": 6, + "bw": 6, + "hw": 6, + "ци": 6, + "ان": 6, + "ки": 6, + ":א": 6, + "gg": 6, + "мо": 6, + "ик": 6, + "lè": 6, + ":英": 6, + "arr": 6, + "csb": 6, + "uri": 6, + "d:b": 6, + "day": 6, + "new": 6, + "ew:": 6, + "nsk": 6, + "edy": 6, + "ss:": 6, + "tam": 6, + "ts:": 6, + ":ma": 6, + "dan": 6, + ",a": 6, + "tar": 6, + "uw": 6, + "rei": 6, + "ipè": 6, + "uip": 6, + "ng:": 6, + "ėjė": 6, + "i:w": 6, + "b:w": 6, + "ich": 6, + "gn:": 6, + "pet": 6, + "aw:": 6, + "הh": 6, + "ig:": 6, + "iu:": 6, + "map": 6, + "nv:": 6, + "pag": 6, + "a-r": 6, + "-ru": 6, + "rup": 6, + "up:": 6, + "sa:": 6, + "wa:": 6, + "h-c": 6, + "-cl": 6, + "ass": 6, + "ssi": 6, + "sic": 6, + "cal": 6, + "al:": 6, + "k:g": 6, + "bcl": 6, + "cl:": 6, + ":he": 6, + "eth": 6, + "hif": 6, + "if:": 6, + "aar": 6, + "nes": 6, + "esa": 6, + "hes": 6, + "eki": 6, + "kie": 6, + "aga": 6, + "gal": 6, + "liy": 6, + "།b": 6, + "a:b": 6, + "ng-": 6, + ":be": 6, + "e:b": 6, + ":i̇n": 6, + "ile": 6, + "tab": 6, + "bai": 6, + "pai": 6, + "v:a": 6, + "ibi": 6, + "h:t": 6, + "pdc": 6, + "dc:": 6, + "muk": 6, + "ito": 6, + "mbw": 6, + "val": 6, + "nuu": 6, + "gwe": 6, + "ko,": 6, + "idh": 6, + "yan": 6, + "omp": 6, + "he.": 6, + 
"gon": 6, + "iko": 6, + "dil": 6, + "omw": 6, + "tya": 6, + "tho": 6, + "hop": 6, + "tul": 6, + "mbe": 6, + "ga,": 6, + "oti": 6, + "eha": 6, + ",n": 6, + "neg": 6, + "kap": 6, + "ami": 6, + "eme": 6, + "wa.": 6, + "thw": 6, + "age": 6, + "lul": 6, + "omi": 6, + "iik": 6, + "tun": 6, + "tol": 6, + "аци": 6, + "hte": 6, + "dre": 6, + "tos": 6, + "ans": 6, + "y:a": 6, + "dam": 6, + "cij": 6, + "ret": 6, + "tti": 6, + "nsa": 6, + "o:i": 6, + "dir": 6, + "gs": 6, + ":ti": 6, + "u:p": 6, + "аs": 6, + "cia": 6, + "ras": 6, + "ngg": 6, + "res": 6, + "gri": 6, + "siy": 6, + "b:a": 6, + "nku": 6, + "kur": 6, + "u:t": 6, + "k:t": 6, + "v:t": 6, + "h:u": 6, + "n:p": 6, + "csb:": 6, + "aex": 6, + "ois": 6, + "aml": 6, + "new:": 6, + "apl": 6, + "asv": 6, + "aen": 6, + "n:bi": 6, + "iqui": 6, + "quip": 6, + "ang:": 6, + "kipi": 6, + "aaz": 6, + "i:wi": 6, + "b:wi": 6, + "αen": 6, + "ioe": 6, + "aeu": 6, + "gan:": 6, + "haw:": 6, + "הhi": 6, + "ahs": 6, + "alm": 6, + "aln": 6, + "jal": 6, + "env": 6, + "apm": 6, + "iaq": 6, + "yar": 6, + "oa-r": 6, + "a-ru": 6, + "-rup": 6, + "rup:": 6, + "asl": 6, + "zh-c": 6, + "h-cl": 6, + "-cla": 6, + "clas": 6, + "lass": 6, + "assi": 6, + "ssic": 6, + "sica": 6, + "ical": 6, + "cal:": 6, + "bcl:": 6, + "o:ge": 6, + "hif:": 6, + "teus": 6, + "liya": 6, + "blij": 6, + ":bai": 6, + "baib": 6, + "nha": 6, + "kita": 6, + "pdc:": 6, + "imin": 6, + "hili": 6, + "aaye": 6, + "mano": 6, + "onu": 6, + "eos": 6, + "ouu": 6, + "anek": 6, + "ekwa": 6, + "emo": 6, + "yono": 6, + "moka": 6, + "tut": 6, + "hila": 6, + "aop": 6, + "thim": 6, + "moku": 6, + "opan": 6, + "ahan": 6, + "omon": 6, + "muth": 6, + "hika": 6, + "aos": 6, + "wani": 6, + "umba": 6, + "kale": 6, + "way": 6, + "aon": 6, + "elok": 6, + "okal": 6, + "nga,": 6, + "a,o": 6, + "ego": 6, + "epan": 6, + "gele": 6, + "yem": 6, + "gee": 6, + "elel": 6, + "ndul": 6, + "ovet": 6, + "edhi": 6, + "koo": 6, + "yomo": 6, + "ulul": 6, + "kana": 6, + "nag": 6, + "uok": 6, + "раци": 6, + "ring": 6, + 
"chte": 6, + "sdr": 6, + "dere": 6, + "shu": 6, + "adam": 6, + "cija": 6, + ".ac": 6, + "ingg": 6, + "uai": 6, + "nki": 6, + "gris": 6, + "nab": 6, + "n:as": 6, + "ahen": 6, + "henk": 6, + "enku": 6, + "nkur": 6, + "kuro": 6, + "roy": 6, + "naf": 6, + "ikc": 6, + "nac": 6, + "nene": 6, + "oteo": 6, + "epis": 6, + "pist": 6, + "t:tu": 6, + "ürki": 6, + "rqui": 6, + "rchi": 6, + "e:ue": 6, + "y:ue": 6, + "b:ue": 6, + "h:ue": 6, + "e:gh": 6, + "a:pi": 6, + "n:pi": 6, + "ת": 5, + "ă": 5, + "ܝܐ": 5, + "য়া": 5, + "ô": 5, + "ια": 5, + "ff": 5, + "קי": 5, + ":वि": 5, + "विकि": 5, + "किपी": 5, + "पीडि": 5, + "kk": 5, + "-t": 5, + "sr": 5, + "vl": 5, + "ע": 5, + "u.": 5, + ":भू": 5, + "भूगो": 5, + "गोल": 5, + "dd": 5, + "éo": 5, + "uf": 5, + "lí": 5, + "'o": 5, + "ин": 5, + "ev": 5, + "-": 5, + "ió": 5, + "ى": 5, + "я": 5, + "ч": 5, + "ве": 5, + "rz": 5, + "ê": 5, + "rl": 5, + ":o": 5, + "ej": 5, + "ел": 5, + "ín": 5, + "ი": 5, + "ли": 5, + "ás": 5, + "洲": 5, + ":中": 5, + "ول": 5, + "yr": 5, + ":土": 5, + "土耳": 5, + "耳其": 5, + "एते": 5, + "तेर्से": 5, + "र्सेन्": 5, + "न्": 5, + "has": 5, + "har": 5, + "ran": 5, + "r:k": 5, + "a:c": 5, + "o:k": 5, + "tuu": 5, + "uur": 5, + "u:k": 5, + "ume": 5, + "si:": 5, + "tt:": 5, + "àz": 5, + "ham": 5, + "e:w": 5, + "pèd": 5, + "èdi": 5, + "r:w": 5, + "ėb": 5, + "g:w": 5, + "cdo": 5, + "do:": 5, + "dio": 5, + "eed": 5, + "u:w": 5, + "ff:": 5, + "hip": 5, + "v:w": 5, + "w:w": 5, + "pik": 5, + ":विकि": 5, + "विकिपी": 5, + "किपीडि": 5, + "pit": 5, + "ksh": 5, + "ne:": 5, + "iib": 5, + "rmy": 5, + "ipa": 5, + "srn": 5, + ":भूगो": 5, + "भूगोल": 5, + "hie": 5, + "nti": 5, + "sf": 5, + "éog": 5, + "yh": 5, + "hl": 5, + "y:p": 5, + "eml": 5, + "q:g": 5, + "iel": 5, + "ebe": 5, + "hol": 5, + "r:b": 5, + "ibb": 5, + "lie": 5, + "lg:": 5, + "ste": 5, + "ten": 5, + "ny:": 5, + "nq": 5, + "tus": 5, + "bha": 5, + "hel": 5, + "zu:": 5, + "u:i": 5, + "ava": 5, + "aum": 5, + "zim": 5, + "uuy": 5, + "uyu": 5, + "kup": 5, + "uti": 5, + "ikw": 5, + "gol": 
5, + "gun": 5, + "amp": 5, + "kuh": 5, + "umi": 5, + "ewa": 5, + "na.": 5, + "yii": 5, + "mek": 5, + "ein": 5, + "oya": 5, + "nyo": 5, + "mem": 5, + "uty": 5, + "buk": 5, + "guk": 5, + "mom": 5, + "nop": 5, + "uho": 5, + ",e": 5, + "tik": 5, + "gom": 5, + "tak": 5, + "oli": 5, + "nak": 5, + "ton": 5, + "elw": 5, + "ape": 5, + "upi": 5, + "eel": 5, + "kov": 5, + "ugw": 5, + "hwa": 5, + "gil": 5, + "ave": 5, + "mak": 5, + "na,": 5, + "vul": 5, + "kwi": 5, + "wii": 5, + "nim": 5, + "ka.": 5, + "was": 5, + "mal": 5, + "tay": 5, + "ino": 5, + "yal": 5, + "ail": 5, + "ihu": 5, + "kei": 5, + "ari": 5, + "век": 5, + "erz": 5, + "hp": 5, + "igh": 5, + "ghe": 5, + "s:d": 5, + "tsi": 5, + "l:d": 5, + "isa": 5, + "o:d": 5, + "aad": 5, + "lim": 5, + "bah": 5, + "o:n": 5, + "t:l": 5, + "ggr": 5, + "glé": 5, + "glè": 5, + "gla": 5, + "ais": 5, + "p:a": 5, + "ger": 5, + "pih": 5, + "ih:": 5, + "h:a": 5, + ":ás": 5, + "yn": 5, + "ije": 5, + "esp": 5, + "lus": 5, + "aul": 5, + "rst": 5, + "uia": 5, + "rci": 5, + ":ty": 5, + "tyr": 5, + "yrk": 5, + "kij": 5, + ":土耳": 5, + "土耳其": 5, + "k:u": 5, + "एतेर्से": 5, + "तेर्सेन्": 5, + "e:p": 5, + ":pl": 5, + "pla": 5, + "i:p": 5, + "h:p": 5, + "abn": 5, + "a:cu": 5, + "o:ku": 5, + "rae": 5, + "tuur": 5, + "ahy": 5, + "ms:b": 5, + "asq": 5, + "nona": 5, + "a,a": 5, + "andu": 5, + "kera": 5, + "efr": 5, + "iaw": 5, + "e:wi": 5, + "ipèd": 5, + "pèdi": 5, + "r:wi": 5, + "jėb": 5, + "g:wi": 5, + "cdo:": 5, + "acr": 5, + "ads": 5, + "aeo": 5, + "v:wi": 5, + "יהh": 5, + ":विकिपी": 5, + "विकिपीडि": 5, + "იაk": 5, + "აka": 5, + "ksh:": 5, + "yao": 5, + "aoc": 5, + "rmy:": 5, + "tara": 5, + "aso": 5, + "srn:": 5, + "asz": 5, + "atp": 5, + "yav": 5, + "amba": 5, + "heog": 5, + "rapi": 5, + ":भूगोल": 5, + "anb": 5, + "t:ge": 5, + "éogr": 5, + "agu": 5, + "luka": 5, + "t:bi": 5, + "།bp": 5, + "r:bi": 5, + "lija": 5, + "bibb": 5, + "e:bi": 5, + "led": 5, + "a:an": 5, + "itab": 5, + "bibi": 5, + "loo": 5, + "muka": 5, + "kepa": 5, + "opau": 5, + "paum": 
5, + "umwe": 5, + "tua": 5, + "uay": 5, + "mane": 5, + "oyu": 5, + "okup": 5, + "woe": 5, + "oem": 5, + "poo": 5, + "leo": 5, + "tuy": 5, + "kuko": 5, + "tuw": 5, + "okuh": 5, + "humi": 5, + "umit": 5, + "pank": 5, + "okol": 5, + "noo": 5, + "aana": 5, + "weny": 5, + "enyo": 5, + "kak": 5, + "kuty": 5, + "utya": 5, + "aoy": 5, + "hoko": 5, + "bam": 5, + "mbuk": 5, + "buka": 5, + "nena": 5, + "tumb": 5, + "nguk": 5, + "guka": 5, + "e,m": 5, + "himb": 5, + "euu": 5, + "ulwa": 5, + "lola": 5, + "omad": 5, + "madh": 5, + "ilad": 5, + "ladh": 5, + "alwe": 5, + "ano.": 5, + "ingw": 5, + "emw": 5, + "atel": 5, + "elwa": 5, + "meno": 5, + "nand": 5, + "ando": 5, + "kuni": 5, + "ake": 5, + "oke": 5, + "ugam": 5, + "hit": 5, + "kugw": 5, + "ilil": 5, + "yaan": 5, + "iina": 5, + "thwa": 5, + "ngil": 5, + "eyon": 5, + "ana,": 5, + "bon": 5, + "gand": 5, + "kwii": 5, + "inim": 5, + "egum": 5, + "gumb": 5, + "umbo": 5, + "lae": 5, + "gap": 5, + "ehe,": 5, + "wash": 5, + "kulu": 5, + "dulu": 5, + "gai": 5, + ",ok": 5, + "luo": 5, + "gay": 5, + "leka": 5, + "rsel": 5, + "echt": 5, + "reit": 5, + "erec": 5, + "anos": 5, + "eret": 5, + "o:in": 5, + "anl": 5, + "nggr": 5, + "lisc": 5, + "isch": 5, + "nglé": 5, + "nglè": 5, + "ngla": 5, + "pih:": 5, + "zas": 5, + "r:as": 5, + "azia": 5, + "m:as": 5, + "a:as": 5, + ":aas": 5, + "aasi": 5, + "l:az": 5, + "kac": 5, + "enen": 5, + "aep": 5, + "stol": 5, + "paul": 5, + "n:tu": 5, + "s:tu": 5, + "a:tu": 5, + "quia": 5, + "kiye": 5, + ":tyr": 5, + "tyrk": 5, + "rkij": 5, + ":土耳其": 5, + "k:ue": 5, + "w:ue": 5, + "एतेर्सेन्": 5, + "enm": 5, + "n:gh": 5, + "t:ga": 5, + "e:pi": 5, + ":pla": 5, + "u:pi": 5, + "i:pi": 5, + "h:pi": 5, + "sta": 4, + "rra": 4, + "n:c": 4, + "t:c": 4, + "ch:": 4, + "ttu": 4, + ":di": 4, + "lli": 4, + "t:k": 4, + "túr": 4, + ":bu": 4, + "bud": 4, + "uda": 4, + ":me": 4, + "l:k": 4, + "km": 4, + "km:": 4, + "u:b": 4, + "nar": 4, + "rko": 4, + "obe": 4, + "fun": 4, + "dus": 4, + "raz": 4, + "a–": 4, + "–": 4, + "lop": 4, 
+ "ope": 4, + "nci": 4, + "k:w": 4, + "ici": 4, + "cip": 4, + "ܐa": 4, + "pid": 4, + "bi:": 4, + "য়াb": 4, + "chr": 4, + "ôc": 4, + "vic": 4, + "基百科": 4, + "pak": 4, + "पीडिया": 4, + "डिया": 4, + "աi": 4, + "tia": 4, + "aa:": 4, + "kg": 4, + "kg:": 4, + "ks:": 4, + "ikk": 4, + "ap-": 4, + "p-b": 4, + "-bm": 4, + "bms": 4, + "mi:": 4, + "ag:": 4, + "pn": 4, + "pnt": 4, + "nt:": 4, + "a-t": 4, + "-ta": 4, + "ra:": 4, + "ve:": 4, + "vls": 4, + "יע": 4, + "zea": 4, + "ea:": 4, + "hun": 4, + "f:g": 4, + ":ch": 4, + "afí": 4, + "fía": 4, + "g:g": 4, + "piy": 4, + "ear": 4, + "phy": 4, + "raa": 4, + "maa": 4, + "tie": 4, + "o:m": 4, + "o:l": 4, + ":gé": 4, + "géo": 4, + "jeo": 4, + "y:g": 4, + ":地理": 4, + ":te": 4, + ":je": 4, + "y:ա": 4, + "s:l": 4, + "l:n": 4, + "yet": 4, + "i:g": 4, + "iog": 4, + "s:n": 4, + "dje": 4, + "los": 4, + "jes": 4, + "mat": 4, + "nne": 4, + "kor": 4, + "int": 4, + "ber": 4, + "الم": 4, + "y:b": 4, + "i:b": 4, + "bol": 4, + "ibu": 4, + "gc": 4, + "bbi": 4, + "n-k": 4, + "w:p": 4, + ":pa": 4, + "d:a": 4, + "alk": 4, + "lki": 4, + "ki:": 4, + "c:b": 4, + "om:": 4, + "des": 4, + "um:": 4, + "w:t": 4, + "ria": 4, + "a)": 4, + "xh": 4, + "xh:": 4, + "h:i": 4, + "tek": 4, + "eke": 4, + "bwa": 4, + "pav": 4, + "gwo": 4, + "wem": 4, + ",u": 4, + "owa": 4, + "tau": 4, + "uup": 4, + "pwi": 4, + "uvo": 4, + "nya": 4, + "any": 4, + "pop": 4, + "kuu": 4, + "pum": 4, + "ole": 4, + "atu": 4, + "hep": 4, + "epe": 4, + "pek": 4, + "mpa": 4, + "tat": 4, + "aki": 4, + "noy": 4, + "tok": 4, + "won": 4, + "dja": 4, + "ily": 4, + "nii": 4, + "ko.": 4, + "tag": 4, + "opw": 4, + "gas": 4, + "hal": 4, + "ega": 4, + "nel": 4, + "esh": 4, + "gok": 4, + "twa": 4, + "nta": 4, + "neh": 4, + "mol": 4, + "mos": 4, + ",p": 4, + "kus": 4, + "ind": 4, + "hos": 4, + "li.": 4, + "iku": 4, + "oga": 4, + "may": 4, + "igu": 4, + "ufu": 4, + "wa,": 4, + "sil": 4, + "aun": 4, + "uno": 4, + "ets": 4, + "eya": 4, + "itu": 4, + "f:u": 4, + "mei": 4, + "run": 4, + "n:d": 4, + "ión": 4, + 
"ón": 4, + "ció": 4, + "cho": 4, + "eis": 4, + "яп": 4, + "ека": 4, + "nj": 4, + "tig": 4, + "ett": 4, + "nna": 4, + "ins": 4, + "san": 4, + "nye": 4, + "ise": 4, + "gf": 4, + "zio": 4, + "fan": 4, + "-ch": 4, + "ite": 4, + "per": 4, + "c:d": 4, + "iek": 4, + "iд": 4, + "ime": 4, + "nia": 4, + "ân": 4, + "n:n": 4, + "iom": 4, + "s:e": 4, + "lés": 4, + "és": 4, + "i̇ng": 4, + "zik": 4, + "lès": 4, + "ès": 4, + "liz": 4, + "e:e": 4, + "ngr": 4, + "lai": 4, + "ngh": 4, + "c:a": 4, + "ási": 4, + "शिया": 4, + "s:r": 4, + "taj": 4, + "ajv": 4, + "jva": 4, + "a:r": 4, + ":中華": 4, + "中華民": 4, + "იk": 4, + "l:r": 4, + "tav": 4, + "pri": 4, + "rim": 4, + ":th": 4, + "quí": 4, + "uía": 4, + "c:t": 4, + "nyk": 4, + "yke": 4, + "se.": 4, + "rze": 4, + "zen": 4, + "p:u": 4, + ":ऊए": 4, + "ऊएते": 4, + "c:u": 4, + "m:p": 4, + "t:n": 4, + "angi": 4, + "hah": 4, + "harr": 4, + "arra": 4, + "rran": 4, + "rang": 4, + "t:cu": 4, + "eeo": 4, + "ltuu": 4, + "efu": 4, + ":bud": 4, + "buda": 4, + "uday": 4, + "daya": 4, + "menn": 4, + "amt": 4, + "urn": 4, + "mbil": 4, + "lit": 4, + "ia–": 4, + "edie": 4, + "èdia": 4, + "k:wi": 4, + "icip": 4, + "ܝܐa": 4, + "ܐas": 4, + "ipid": 4, + "pidi": 4, + "abs": 4, + "acd": 4, + "pedy": 4, + "ecs": 4, + "ôcy": 4, + "iad": 4, + "adi": 4, + "u:wi": 4, + "afu": 4, + "ichi": 4, + "agl": 4, + "w:wi": 4, + "किपीडिया": 4, + "աia": 4, + "aiu": 4, + "iaj": 4, + "kaa:": 4, + "map-": 4, + "ap-b": 4, + "p-bm": 4, + "-bms": 4, + "bms:": 4, + "pag:": 4, + "apn": 4, + "pnt:": 4, + "oa-t": 4, + "a-ta": 4, + "-tar": 4, + "ara:": 4, + "ies": 4, + "ask": 4, + "asw": 4, + "jot": 4, + "atl": 4, + "atr": 4, + "vls:": 4, + "zea:": 4, + "azh": 4, + "rafí": 4, + "afía": 4, + "íaa": 4, + ":heo": 4, + "apiy": 4, + "piya": 4, + "nbn": 4, + "a:ge": 4, + "sb:g": 4, + "ide": 4, + "phie": 4, + "aphy": 4, + ":géo": 4, + "géog": 4, + "jeog": 4, + "ahr": 4, + "hy:ա": 4, + "ab:t": 4, + "b:ta": 4, + "akal": 4, + "hla": 4, + "ms:g": 4, + "l:ge": 4, + "pen": 4, + "nro": 4, + "iogr": 4, + 
"hys": 4, + "isq": 4, + "o:ta": 4, + "yaw": 4, + "an:t": 4, + "uso": 4, + "ithi": 4, + "eld": 4, + "nnes": 4, + "aaa": 4, + "alon": 4, + "nika": 4, + "lolo": 4, + "loa": 4, + "g:bi": 4, + "bliy": 4, + "ibbi": 4, + "bbia": 4, + "a:bi": 4, + "elen": 4, + "blie": 4, + "ga:a": 4, + "lag": 4, + ":alk": 4, + "alki": 4, + "lkit": 4, + "ebel": 4, + "lem": 4, + "rist": 4, + "nno": 4, + "iao": 4, + "let": 4, + "nsw": 4, + "isto": 4, + "tum:": 4, + "y:pi": 4, + "wag": 4, + "gwaa": 4, + "waay": 4, + "mbwa": 4, + "lyom": 4, + "ikep": 4, + "epam": 4, + "aumw": 4, + "ezim": 4, + "kank": 4, + "kag": 4, + ",uu": 4, + "uuyu": 4, + "ombi": 4, + "imu": 4, + "nekw": 4, + "dhin": 4, + "inik": 4, + "loy": 4, + "nonk": 4, + "lay": 4, + "yoku": 4, + "util": 4, + "pumb": 4, + "himi": 4, + "golo": 4, + "poa": 4, + "kond": 4, + "hau": 4, + "hepe": 4, + "epek": 4, + "ango": 4, + "hoe": 4, + "noe": 4, + "kati": 4, + "ana.": 4, + "meko": 4, + "goy": 4, + "ulik": 4, + "ndil": 4, + "kiin": 4, + "lao": 4, + "ndja": 4, + "itho": 4, + "udhi": 4, + "ekwe": 4, + "dhop": 4, + "hopa": 4, + "shig": 4, + "higw": 4, + "miil": 4, + "wek": 4, + "apan": 4, + "nopw": 4, + "opwa": 4, + "asha": 4, + "hio": 4, + "liko": 4, + "naku": 4, + "gamb": 4, + "kape": 4, + "tue": 4, + "pika": 4, + "elan": 4, + "hamu": 4, + "amun": 4, + "ontu": 4, + "ntum": 4, + "tong": 4, + "kove": 4, + "hoka": 4, + "ugwa": 4, + "wau": 4, + "aow": 4, + "ntan": 4, + "ithw": 4, + "ingi": 4, + "omak": 4, + "mosh": 4, + "waw": 4, + "ikwa": 4, + "okwi": 4, + "nima": 4, + "okus": 4, + "oong": 4, + "wate": 4, + "tele": 4, + "luki": 4, + "kila": 4, + "elal": 4, + "kano": 4, + "noma": 4, + "uukw": 4, + "ulun": 4, + "gao": 4, + "okan": 4, + "gae": 4, + ",os": 4, + "gund": 4, + "tula": 4, + "yele": 4, + "yei": 4, + "ufut": 4, + "elek": 4, + "unon": 4, + ".el": 4, + "yap": 4, + "wane": 4, + "sele": 4, + "rkla": 4, + "mens": 4, + "ónu": 4, + "eito": 4, + "itos": 4, + "ació": 4, + "япр": 4, + "века": 4, + "acij": 4, + "a:de": 4, + "eyh": 4, + "sala": 4, 
+ "zae": 4, + "list": 4, + "sing": 4, + "diri": 4, + "dem": 4, + "aada": 4, + "vi:t": 4, + "nqu": 4, + "nzh": 4, + "uan": 4, + ":bah": 4, + "hasa": 4, + "s:en": 4, + "glés": 4, + ":i̇ng": 4, + "ezik": 4, + "glès": 4, + "lese": 4, + "eml:": 4, + "lesa": 4, + "ggri": 4, + "g:as": 4, + "imba": 4, + "e:as": 4, + "asiy": 4, + "siya": 4, + "l:as": 4, + "u:as": 4, + ":ási": 4, + "ásia": 4, + "p:as": 4, + "nli": 4, + "nnd": 4, + "s:re": 4, + ":taj": 4, + "tajv": 4, + "ajva": 4, + "jvan": 4, + "a:re": 4, + "kina": 4, + ":中華民": 4, + "wal": 4, + "adu": 4, + ":pri": 4, + "prim": 4, + "aulu": 4, + "ulus": 4, + "othe": 4, + "e:tu": 4, + "rkei": 4, + "rquí": 4, + "quía": 4, + "r:tu": 4, + "yrki": 4, + "rkie": 4, + "kije": 4, + "urci": 4, + "v:tu": 4, + "c:tu": 4, + "yny": 4, + "nyke": 4, + "yker": 4, + "eras": 4, + "rase": 4, + "ase.": 4, + "e.a": 4, + "terz": 4, + "erze": 4, + "rzen": 4, + "p:ue": 4, + ":ऊएते": 4, + "ऊएतेर्से": 4, + "c:ue": 4, + "a:gh": 4, + "b:gh": 4, + "o:ga": 4, + "l:gh": 4, + "c:gh": 4, + "s:du": 4, + "o:du": 4, + "t:du": 4, + "rgn": 4, + "ital": 4, + "s:pi": 4, + "asen": 4, + "r:pi": 4, + "m:pi": 4, + "noc": 4, + "o:no": 4, + "t:no": 4, + "ule": 3, + "ram": 3, + "z:m": 3, + "niy": 3, + "ltū": 3, + "tūr": 3, + "ūra": 3, + "s:k": 3, + "y:d": 3, + "td": 3, + "i:k": 3, + "r:c": 3, + "ltú": 3, + "l:c": 3, + "תh": 3, + ":संस्कृ": 3, + "nni": 3, + ":文化": 3, + "v:k": 3, + "(s": 3, + "y)": 3, + "s:i": 3, + "ark": 3, + "kob": 3, + "kof": 3, + "ofu": 3, + "sef": 3, + "efa": 3, + "fal": 3, + "gub": 3, + "ubo": 3, + "usa": 3, + "sat": 3, + "at.": 3, + "t.": 3, + "zyk": 3, + ":ui": 3, + "uiq": 3, + "y:w": 3, + "ug:": 3, + "dya": 3, + "h:w": 3, + "jô": 3, + "q:w": 3, + "t:v": 3, + "pee": 3, + ":維基": 3, + "百科": 3, + "ויק": 3, + "יקי": 3, + "קיפ": 3, + "իա": 3, + "ik:": 3, + "sj": 3, + "p:w": 3, + "याn": 3, + "dii": 3, + "c:w": 3, + "pa:": 3, + "yjo": 3, + "iýa": 3, + "ýa": 3, + "tn": 3, + "tn:": 3, + "עy": 3, + "kar": 3, + "fiy": 3, + "লb": 3, + "ron": 3, + "hb": 3, + 
"hey": 3, + "yog": 3, + "γρα": 3, + "ραφ": 3, + "p:g": 3, + ":gj": 3, + "och": 3, + "eer": 3, + "lle": 3, + "ght": 3, + "jag": 3, + "նi": 3, + "d:g": 3, + "alt": 3, + "erd": 3, + "लn": 3, + "ayw": 3, + "ywa": 3, + ":gi": 3, + "opi": 3, + "eye": 3, + "ìz": 3, + "1a": 3, + "ori": 3, + "nto": 3, + "tru": 3, + "مقد": 3, + "قدس": 3, + "دس": 3, + "سa": 3, + "m:b": 3, + "b:b": 3, + "l:α": 3, + "i:r": 3, + "aam": 3, + "fj": 3, + "fj:": 3, + "bla": 3, + ":聖經": 3, + "聖經": 3, + "aip": 3, + "w:b": 3, + "bie": 3, + "ieb": 3, + "nay": 3, + ")n": 3, + "biw": 3, + "wel": 3, + "rw": 3, + "rw:": 3, + "bhe": 3, + "q:b": 3, + "lib": 3, + "hay": 3, + "yib": 3, + ":to": 3, + "dde": 3, + "ty:": 3, + "hán": 3, + "n:s": 3, + "guu": 3, + "put": 3, + "kel": 3, + "wop": 3, + "lyu": 3, + "yuk": 3, + "yun": 3, + "auk": 3, + "bok": 3, + "eiu": 3, + "iuv": 3, + "nei": 3, + "hog": 3, + "kot": 3, + "lum": 3, + "euv": 3, + "moo": 3, + "koh": 3, + "wo.": 3, + "joo": 3, + "ool": 3, + "la,": 3, + "gop": 3, + ",y": 3, + "aig": 3, + "kuy": 3, + "met": 3, + "3o": 3, + ";u": 3, + "mah": 3, + "aho": 3, + "8o": 3, + "tom": 3, + "pel": 3, + "lo.": 3, + "pew": 3, + "een": 3, + "wee": 3, + "meg": 3, + "gaa": 3, + "kam": 3, + "jok": 3, + "nez": 3, + "nol": 3, + "ga.": 3, + "tad": 3, + "upa": 3, + "kuf": 3, + "utw": 3, + "iza": 3, + "mpw": 3, + "wiy": 3, + "tas": 3, + "nse": 3, + "ser": 3, + "nsc": 3, + "حقو": 3, + "قوق": 3, + "وق": 3, + "aru": 3, + "tei": 3, + "ция": 3, + "на": 3, + "ове": 3, + "den": 3, + "ira": 3, + "dsk": 3, + "rav": 3, + "ern": 3, + "anê": 3, + "nê": 3, + "hts": 3, + "cio": 3, + "t:i": 3, + "rri": 3, + "euk": 3, + "lei": 3, + "ayv": 3, + "ell": 3, + "rit": 3, + "e:ה": 3, + "d:p": 3, + "sas": 3, + "anu": 3, + "nus": 3, + "fir": 3, + ":世界": 3, + "世界人": 3, + "rum": 3, + "n:l": 3, + "ows": 3, + "h:ป": 3, + "นt": 3, + "jin": 3, + ":sa": 3, + "i:א": 3, + "în-": 3, + "-ko": 3, + ".c": 3, + "uag": 3, + ":id": 3, + "r:n": 3, + "íng": 3, + "g:n": 3, + "pro": 3, + "мов": 3, + "ова": 3, + "зик": 3, + 
"jez": 3, + "kb": 3, + "-ng": 3, + "cr:": 3, + "a:e": 3, + "n:e": 3, + "i:e": 3, + "o:e": 3, + ":le": 3, + ":英語": 3, + "英語": 3, + "got": 3, + "ot:": 3, + "e:א": 3, + "भाषा": 3, + "bas": 3, + "ти": 3, + ":fi": 3, + "l:e": 3, + "lez": 3, + "rii": 3, + "لى": 3, + "a:i": 3, + "za:": 3, + "hyi": 3, + "ia)": 3, + "ië": 3, + "c:ܐ": 3, + "q:a": 3, + "t:á": 3, + "亞洲": 3, + "v:y": 3, + ":yn": 3, + ":एशि": 3, + "एशिया": 3, + ":áz": 3, + "洲z": 3, + "n:r": 3, + "spu": 3, + "r:r": 3, + "тай": 3, + "ай": 3, + "নb": 3, + "epú": 3, + "púb": 3, + "úbl": 3, + "kke": 3, + "fc": 3, + "o:r": 3, + "e(": 3, + "(t": 3, + "(ta": 3, + "an)": 3, + "n)": 3, + "lac": 3, + "cin": 3, + "vel": 3, + ":तैवा": 3, + "तैवान": 3, + "वान": 3, + "g:t": 3, + "नm": 3, + "ре": 3, + "рес": 3, + "есп": 3, + "спу": 3, + "пуб": 3, + "убл": 3, + "бли": 3, + "лик": 3, + "ика": 3, + "華民國": 3, + "esu": 3, + "sus": 3, + "epí": 3, + "pís": 3, + "íst": 3, + "gp": 3, + "rma": 3, + "urs": 3, + "rsk": 3, + "eck": 3, + "y:t": 3, + "q:t": 3, + "ουρ": 3, + "p:t": 3, + "e:d": 3, + "q:u": 3, + "d:u": 3, + ":घाना": 3, + "घाना": 3, + "w:g": 3, + "thu": 3, + "rgo": 3, + "(i": 3, + "(it": 3, + "p:p": 3, + "g:p": 3, + "дв": 3, + "рк": 3, + "tang": 3, + "ramb": 3, + "n:cu": 3, + "r:ku": 3, + "ltūr": 3, + "tūra": 3, + "s:ku": 3, + "acy": 3, + "cy:d": 3, + "lian": 3, + "t:ku": 3, + "ifi": 3, + "ultú": 3, + "ltúr": 3, + "l:cu": 3, + "תhi": 3, + ":men": 3, + "l:ku": 3, + "ral": 3, + "v:ku": 3, + "umen": 3, + "ment": 3, + "ensk": 3, + "loni": 3, + "ost": 3, + "yas": 3, + "sw:u": 3, + "àzh": 3, + "sth": 3, + "ekof": 3, + "kofu": 3, + "ofun": 3, + "fung": 3, + "gama": 3, + "ars": 3, + "sefa": 3, + "efal": 3, + "fala": 3, + "ngub": 3, + "gubo": 3, + "boa": 3, + "oan": 3, + "ndus": 3, + "dusa": 3, + "usat": 3, + "sat.": 3, + "yee": 3, + "aaf": 3, + "uipe": 3, + "st:u": 3, + "uiqu": 3, + "y:wi": 3, + "idiy": 3, + "edya": 3, + "h:wi": 3, + "chr:": 3, + "iec": 3, + "sb:w": 3, + "jôc": 3, + "q:wi": 3, + "yad": 3, + "t:vi": 3, + "ipee": 3, + 
"peed": 3, + "eedi": 3, + ":vic": 3, + "chip": 3, + "gd:b": 3, + "agn": 3, + "ויקי": 3, + "יקיפ": 3, + "yah": 3, + "իաi": 3, + "ms:w": 3, + "amg": 3, + "amy": 3, + "p:wi": 3, + "c:wi": 3, + "asr": 3, + "ýat": 3, + "iau": 3, + "auz": 3, + "יעy": 3, + "עyo": 3, + "iaz": 3, + "fim": 3, + "mani": 3, + "nil": 3, + "afiy": 3, + "fiya": 3, + "ėbc": 3, + "br:d": 3, + "jac": 3, + "eyog": 3, + "yogr": 3, + "b:ge": 3, + "e:ge": 3, + "edv": 3, + "γραφ": 3, + "ίαe": 3, + "yeo": 3, + "et:g": 3, + "eu:g": 3, + "anti": 3, + "anda": 3, + "gn:t": 3, + "նia": 3, + "nl:a": 3, + "anr": 3, + "rm:g": 3, + "m:ge": 3, + "iep": 3, + ":all": 3, + "pas": 3, + "aywa": 3, + "ro:g": 3, + "esc": 3, + "iss": 3, + "asm": 3, + "fis": 3, + "sq:g": 3, + "isv": 3, + "itr": 3, + "ats": 3, + "vo:t": 3, + "omuu": 3, + "lda": 3, + "1aa": 3, + "orin": 3, + "onik": 3, + "holo": 3, + "مقدس": 3, + "ibul": 3, + "bulu": 3, + "b:bi": 3, + "el:α": 3, + "et:p": 3, + "t:bí": 3, + "lif": 3, + "nfr": 3, + "lef": 3, + "ig:a": 3, + "ibil": 3, + ":kit": 3, + "bap": 3, + "w:bi": 3, + ":bie": 3, + "bieb": 3, + "iebe": 3, + "ibia": 3, + "ms:a": 3, + "eln": 3, + "ah:t": 3, + "v:bi": 3, + "c:bi": 3, + "ibhe": 3, + "ibha": 3, + "bhay": 3, + "hayi": 3, + "ayib": 3, + "yibh": 3, + "gey": 3, + "liv": 3, + "zu:i": 3, + "guut": 3, + "ekel": 3, + "apw": 3, + "valo": 3, + "wou": 3, + "wopa": 3, + "waan": 3, + "yome": 3, + "zimo": 3, + "lyuu": 3, + "ogw": 3, + "omuk": 3, + "wema": 3, + "uko,": 3, + "yuki": 3, + "muuy": 3, + "uyun": 3, + "yuni": 3, + "kau": 3, + "idhi": 3, + "mbok": 3, + "boka": 3, + "uyo": 3, + "eiuv": 3, + "iuvo": 3, + "lom": 3, + "eita": 3, + "taal": 3, + "aalo": 3, + "kane": 3, + "mana": 3, + "mini": 3, + "hogo": 3, + "ogol": 3, + "ngon": 3, + "katu": 3, + "ukon": 3, + "ampa": 3, + "mpan": 3, + "oek": 3, + "ekok": 3, + "koko": 3, + "wata": 3, + "atat": 3, + "tath": 3, + "ekot": 3, + "kota": 3, + "eine": 3, + ",mo": 3, + "ondi": 3, + "dilo": 3, + "wuut": 3, + "aalu": 3, + "toko": 3, + "omwa": 3, + "wonk": 3, + 
"alam": 3, + "mema": 3, + "ilyo": 3, + "oyi": 3, + "hoi": 3, + "yuut": 3, + "nosh": 3, + ".on": 3, + "neo": 3, + "kaan": 3, + "noa": 3, + "dhim": 3, + "mbul": 3, + "lukw": 3, + "udha": 3, + "hao": 3, + "weu": 3, + "moon": 3, + "emi": 3, + "lilo": 3, + "kay": 3, + "yoma": 3, + "awo.": 3, + "eoy": 3, + "alwa": 3, + "yit": 3, + "momu": 3, + "djoo": 3, + "jool": 3, + "oolo": 3, + "ngas": 3, + "gash": 3, + "uhok": 3, + "waal": 3, + "aala": 3, + ",el": 3, + "tika": 3, + "weg": 3, + "gopa": 3, + "elik": 3, + "gomu": 3, + "opap": 3, + "itik": 3, + "paig": 3, + "aigw": 3, + "ehal": 3, + "hala": 3, + "3om": 3, + "omwe": 3, + "egam": 3, + "doo": 3, + "upik": 3, + ";uu": 3, + "aol": 3, + "lelw": 3, + "omah": 3, + "kog": 3, + "i.o": 3, + "ngom": 3, + "aton": 3, + "idho": 3, + "8om": 3, + "yin": 3, + "lilw": 3, + "tomp": 3, + "mont": 3, + "onta": 3, + "tane": 3, + "aneh": 3, + "neho": 3, + "inak": 3, + "pewa": 3, + "omol": 3, + "molw": 3, + "pave": 3, + "wath": 3, + "okwe": 3, + "kwee": 3, + "imbo": 3, + "ege": 3, + "miin": 3, + "iini": 3, + "o,n": 3, + "dula": 3, + "koma": 3, + "iyok": 3, + "thig": 3, + ",no": 3, + "she.": 3, + "kwas": 3, + "tind": 3, + "lund": 3, + "yomi": 3, + "enga": 3, + "woma": 3, + "omal": 3, + "kuho": 3, + "djok": 3, + "joka": 3, + "aot": 3, + "noly": 3, + "olyo": 3, + "yalw": 3, + "ngun": 3, + "akul": 3, + "nga.": 3, + "kani": 3, + "jau": 3, + "ehan": 3, + "uka.": 3, + "e,o": 3, + "futi": 3, + "zad": 3, + "lulw": 3, + "vulu": 3, + "okuf": 3, + "kufu": 3, + "futw": 3, + "utwa": 3, + "neu": 3, + "kole": 3, + "shim": 3, + "himp": 3, + "impw": 3, + "mpwi": 3, + "pwiy": 3, + "wiyu": 3, + "paun": 3, + "silw": 3, + "oyo": 3, + "uuno": 3, + "kusi": 3, + "nali": 3, + "gak": 3, + "woi": 3, + "homa": 3, + "nong": 3, + "humb": 3, + "ulat": 3, + "verk": 3, + "ngv": 3, + "gva": 3, + "anm": 3, + "ense": 3, + "mein": 3, + "rung": 3, + "gde": 3, + "nsch": 3, + "araz": 3, + "razi": 3, + "sum": 3, + "mans": 3, + "حقوق": 3, + "t:de": 3, + "ción": 3, + "echo": 3, + "ация": 
3, + "овек": 3, + "enj": 3, + "prav": 3, + "ravi": 3, + "lsd": 3, + "chp": 3, + "hpr": 3, + "enne": 3, + "nesk": 3, + "eske": 3, + "sker": 3, + "kere": 3, + "rett": 3, + "ttig": 3, + "tigh": 3, + "ighe": 3, + "anna": 3, + "anan": 3, + "nof": 3, + "nri": 3, + "o:un": 3, + "t:in": 3, + ":ini": 3, + "inen": 3, + "tsd": 3, + "azio": 3, + "zion": 3, + "irit": 3, + "vad": 3, + "he:ה": 3, + "manu": 3, + "anus": 3, + "ngs": 3, + ":世界人": 3, + "n:li": 3, + "anh": 3, + "nl:u": 3, + "eeh": 3, + "jap": 3, + "sdi": 3, + "аsc": 3, + "co:u": 3, + "th:ป": 3, + "i:tu": 3, + "yi:א": 3, + "n-ko": 3, + "lang": 3, + "guag": 3, + "uage": 3, + ":idi": 3, + "idio": 3, + "diom": 3, + "ioma": 3, + "s:ba": 3, + "baha": 3, + "ahas": 3, + "spra": 3, + "spro": 3, + "мова": 3, + "lesk": 3, + "eski": 3, + "jezi": 3, + "o:li": 3, + "gliz": 3, + "kid": 3, + "adv": 3, + "n:en": 3, + ":ens": 3, + "lais": 3, + ":len": 3, + "leng": 3, + "got:": 3, + "he:א": 3, + "uaa": 3, + ":bas": 3, + "basa": 3, + "nl:e": 3, + "glez": 3, + ":lim": 3, + "limb": 3, + "rc:ܐ": 3, + "t:as": 3, + "nba": 3, + "g:az": 3, + "b:as": 3, + "h:as": 3, + "sb:a": 3, + "b:az": 3, + "gv:y": 3, + "v:yn": 3, + ":एशिया": 3, + "t:az": 3, + "v:as": 3, + "apd": 3, + "e:ta": 3, + ":tay": 3, + "liek": 3, + "n:re": 3, + "resp": 3, + "espu": 3, + "spub": 3, + "anc": 3, + "repú": 3, + "epúb": 3, + "públ": 3, + "úbli": 3, + "ncs": 3, + "likk": 3, + "ikke": 3, + "kken": 3, + "ico": 3, + "cof": 3, + "ofc": 3, + "fch": 3, + "o:re": 3, + "u:ta": 3, + "wan)": 3, + "cina": 3, + "na(": 3, + ":तैवान": 3, + "nla": 3, + "sina": 3, + "uml": 3, + "slv": 3, + "नms": 3, + "l:re": 3, + "респ": 3, + "еспу": 3, + "спуб": 3, + "публ": 3, + "убли": 3, + "блик": 3, + "лика": 3, + "nsi": 3, + "tsin": 3, + "änw": 3, + "中華民國": 3, + "jesu": 3, + "esus": 3, + "tava": 3, + "rime": 3, + "epís": 3, + "píst": 3, + "ísto": 3, + "tola": 3, + "fr:p": 3, + "moti": 3, + "it:p": 3, + "npa": 3, + "oro": 3, + "ürke": 3, + "turs": 3, + "ursk": 3, + "rska": 3, + "b:tu": 3, + "reck": 
3, + "sb:t": 3, + "kif": 3, + "o:tü": 3, + "yeg": 3, + "rcia": 3, + "h:tu": 3, + "p:tu": 3, + "e:du": 3, + ".ah": 3, + "f:ue": 3, + "sb:u": 3, + "q:ue": 3, + "d:ue": 3, + "ms:u": 3, + "t:gh": 3, + "l:ga": 3, + "r:gh": 3, + "i:gh": 3, + "w:gh": 3, + "m:gh": 3, + "r:du": 3, + "rgc": 3, + "a:du": 3, + "rge": 3, + "n:du": 3, + "urgo": 3, + "l:du": 3, + "rgi": 3, + "rgs": 3, + "pias": 3, + "ensa": 3, + "p:pi": 3, + "nl:p": 3, + "g:pi": 3, + "nor": 3, + "w:pi": 3, + "not": 3, + "n:no": 3, + "stan": 2, + "ngah": 2, + "gaha": 2, + "epo": 2, + "an:c": 2, + "ar:k": 2, + "h:ku": 2, + "ttur": 2, + "urd": 2, + "el:π": 2, + "ςen": 2, + "turo": 2, + "s:cu": 2, + "u:ku": 2, + "fi:k": 2, + "r:cu": 2, + "ref": 2, + "rag": 2, + ":संस्कृति": 2, + "संस्कृति": 2, + "rah": 2, + "túra": 2, + "úra": 2, + "rai": 2, + "id:b": 2, + "is:m": 2, + "nnin": 2, + "git": 2, + "ujv": 2, + "jv:b": 2, + "turi": 2, + "ultū": 2, + "ah:c": 2, + "rno": 2, + "eoc": 2, + "c:cu": 2, + "hir": 2, + "iro": 2, + "o:cu": 2, + "a(s": 2, + "(sp": 2, + "polo": 2, + "edy)": 2, + "dy)": 2, + "q:ku": 2, + "su:b": 2, + "w:ut": 2, + "tama": 2, + "amad": 2, + "kali": 2, + "alin": 2, + "inan": 2, + "nang": 2, + "ntr": 2, + "tr:k": 2, + "uz:m": 2, + "adan": 2, + "dani": 2, + "niya": 2, + "an:b": 2, + "onar": 2, + "nark": 2, + "arko": 2, + "rkob": 2, + "kobe": 2, + "obel": 2, + "lama": 2, + "awi": 2, + ",as": 2, + ".li": 2, + "link": 2, + "azu": 2, + "–d": 2, + "–di": 2, + "ensi": 2, + "klop": 2, + "lope": 2, + "oped": 2, + "iew": 2, + "ewi": 2, + "fre": 2, + "clop": 2, + "enci": 2, + "eac": 2, + "kipè": 2, + "f:wi": 2, + "ls:w": 2, + "ያan": 2, + ":biq": 2, + "biqu": 2, + "z:vi": 2, + "ar:w": 2, + "mg:v": 2, + "ėbi": 2, + "abm": 2, + "m:wi": 2, + "bn:উ": 2, + ":উইকি": 2, + "উইকিপি": 2, + "ইকিপিডি": 2, + "কিপিডিয়া": 2, + "পিডিয়া": 2, + "ডিয়াb": 2, + "py:উ": 2, + "য়াbr": 2, + "bug": 2, + "bug:": 2, + "a:vi": 2, + ":viq": 2, + "uipè": 2, + "co:w": 2, + "ijô": 2, + "jae": 2, + ":βικ": 2, + "βικι": 2, + "ικιπ": 2, + "κιπα": 2, 
+ "ιπαί": 2, + "παίδ": 2, + "αίδε": 2, + "ίδει": 2, + "δεια": 2, + "εια": 2, + "o:vi": 2, + "edio": 2, + "aff": 2, + "diä": 2, + "r:vi": 2, + "vich": 2, + "hipe": 2, + ":維基百": 2, + "維基百科": 2, + "idia": 2, + "યાgv": 2, + ":ויק": 2, + "דיה": 2, + "ioi": 2, + "uiki": 2, + "piti": 2, + "ja:ウ": 2, + "アjb": 2, + "bo:u": 2, + "asj": 2, + "akg": 2, + "akl": 2, + "akm": 2, + "ko:위": 2, + "sh:w": 2, + "lt:v": 2, + "lv:v": 2, + "v:vi": 2, + "याms": 2, + "wl:b": 2, + "ap:w": 2, + "nl:w": 2, + "याnl": 2, + "rm:v": 2, + "m:vi": 2, + "αpt": 2, + "ra:u": 2, + "asu": 2, + "sw:w": 2, + "ta:வி": 2, + "யாte": 2, + "యాte": 2, + "ยtk": 2, + "tl:w": 2, + "dit": 2, + "att": 2, + "atw": 2, + "wo:w": 2, + ":维基百": 2, + "维基百科": 2, + "okar": 2, + "kara": 2, + "aram": 2, + "rahu": 2, + "ahun": 2, + "hunu": 2, + "unu.": 2, + "nu.": 2, + "u.a": 2, + ".af": 2, + "af:g": 2, + "iea": 2, + "ak:g": 2, + "ls:g": 2, + "ean": 2, + "as:": 2, + ":ভূগোল": 2, + "ভূগোল": 2, + ":xeo": 2, + "xeog": 2, + "ay:u": 2, + ":coğ": 2, + "coğr": 2, + "oğra": 2, + "ğraf": 2, + "ar:g": 2, + "r:ge": 2, + "mg:g": 2, + ":geu": 2, + "l:he": 2, + "alan": 2, + "লbr": 2, + "bs:g": 2, + "ca:g": 2, + ":hey": 2, + "heyo": 2, + "cs:g": 2, + "iaet": 2, + "aeth": 2, + "da:g": 2, + "de:g": 2, + "dv:ޖު": 2, + "el:γ": 2, + ":γεω": 2, + "γεωγ": 2, + "εωγρ": 2, + "ωγρα": 2, + "ραφί": 2, + "αφία": 2, + "en:g": 2, + "n:ge": 2, + "hye": 2, + "eo:g": 2, + "afio": 2, + "fio": 2, + "es:g": 2, + "íae": 2, + "raaf": 2, + "aafi": 2, + ":maa": 2, + "maan": 2, + "o:ma": 2, + "aati": 2, + "sfo": 2, + ":lan": 2, + "ndaf": 2, + "dafr": 2, + "ifr": 2, + "fr:g": 2, + "rp:g": 2, + ":gje": 2, + "gjeo": 2, + "fy:g": 2, + "rafy": 2, + "ocht": 2, + "n:te": 2, + "he:ג": 2, + "e:גא": 2, + "hr:g": 2, + "ia:g": 2, + "phia": 2, + "id:g": 2, + "iie": 2, + "ie:g": 2, + "io:g": 2, + "is:l": 2, + "iit": 2, + "it:g": 2, + "nuna": 2, + ":地理学": 2, + "地理学": 2, + "bo:t": 2, + "jv:g": 2, + "ka:გ": 2, + "arak": 2, + "raka": 2, + "erin": 2, + "la:g": 2, + ":jeo": 2, + "lb:g": 
2, + "li:g": 2, + "ij:g": 2, + "mo:g": 2, + "mamb": 2, + "mab": 2, + "ດlt": 2, + "lt:g": 2, + "लms": 2, + "yōtl": 2, + "lnd": 2, + "ds:g": 2, + "nl:g": 2, + "गोलn": 2, + "लnl": 2, + "kund": 2, + "unde": 2, + "nn:g": 2, + "fin": 2, + "no:g": 2, + "ov:g": 2, + "v:ge": 2, + "oc:g": 2, + "c:ge": 2, + "am:g": 2, + "pcd": 2, + "pcd:": 2, + "pl:g": 2, + "ίαν": 2, + "ανp": 2, + "νpt": 2, + "pt:g": 2, + "qu:a": 2, + "u:al": 2, + "achi": 2, + "irm": 2, + "sc:g": 2, + "cn:g": 2, + ":gio": 2, + "giog": 2, + "ìas": 2, + "යාවs": 2, + "වsi": 2, + "le:g": 2, + "ysk": 2, + "sk:g": 2, + "k:ge": 2, + "sl:g": 2, + "q:gj": 2, + "sv:g": 2, + "sw:j": 2, + ":jio": 2, + "jiog": 2, + "tk:g": 2, + "mban": 2, + "uz:g": 2, + "ec:g": 2, + "cvo": 2, + "wa:d": 2, + "yew": 2, + "o:me": 2, + "ezh": 2, + "kzh": 2, + "nesi": 2, + "sis": 2, + "ijo": 2, + "kiel": 2, + "aako": 2, + "akor": 2, + "kori": 2, + "rint": 2, + "into": 2, + "2a": 2, + "2aa": 2, + "aate": 2, + "ates": 2, + "tess": 2, + "essa": 2, + "ssal": 2, + "salo": 2, + "1ti": 2, + "us2": 2, + "s2": 2, + "saa": 2, + "beri": 2, + "rij": 2, + "petr": 2, + "etru": 2, + "trus": 2, + "rus": 2, + "use": 2, + "ehol": 2, + ":byb": 2, + "bybe": 2, + "ybel": 2, + "ls:b": 2, + "blio": 2, + "المق": 2, + "لمقد": 2, + "دسa": 2, + "سar": 2, + "ܐܩܕ": 2, + "st:b": 2, + "y:bi": 2, + "mg:b": 2, + "i:ba": 2, + "m:bi": 2, + ":বাইবে": 2, + "বাইবেল": 2, + "ইবেল": 2, + "বেলb": 2, + "ng-g": 2, + "co:b": 2, + ":bei": 2, + "beib": 2, + "ldi": 2, + "iq:i̇": 2, + "q:i̇n": 2, + "l:αγ": 2, + "lee": 2, + ":pii": 2, + "piib": 2, + "leu": 2, + "u:bi": 2, + "سfi": 2, + "ro:p": 2, + "ivo": 2, + "o:bí": 2, + "elg": 2, + "obla": 2, + "ha:": 2, + "ipal": 2, + "lah": 2, + ":ביב": 2, + "ביבל": 2, + "y:աս": 2, + "id:a": 2, + "iblí": 2, + "jv:a": 2, + "i:bi": 2, + ":bii": 2, + "biib": 2, + "sh:b": 2, + "h:bi": 2, + "ku:k": 2, + "g:ba": 2, + "aibu": 2, + "lo:ພ": 2, + "aibo": 2, + "ibol": 2, + "s:al": 2, + "tian": 2, + "iana": 2, + "nl:b": 2, + "iste": 2, + ")nn": 2, + "elp": 2, + 
"simi": 2, + "rn:b": 2, + "iliy": 2, + "sl:s": 2, + "osm": 2, + "etu": 2, + "tusi": 2, + "sn:": 2, + "q:bi": 2, + "bhel": 2, + "heli": 2, + "est": 2, + "su:a": 2, + "ம்te": 2, + "tk:i": 2, + "ilt": 2, + "lot": 2, + "otw": 2, + "kron": 2, + "nty": 2, + "isti": 2, + "i:ki": 2, + "nhv": 2, + "xh:i": 2, + ":ibh": 2, + "聖經z": 2, + "an:s": 2, + "n:sè": 2, + "gzh": 2, + "liu": 2, + "taam": 2, + "aamb": 2, + "ambw": 2, + "koe": 2, + "esim": 2, + "yomu": 2, + "aval": 2, + "nuut": 2, + "omez": 2, + "mezi": 2, + "ukan": 2, + "o,u": 2, + "i.u": 2, + "hini": 2, + "aut": 2, + "auki": 2, + "auy": 2, + "nyan": 2, + "yany": 2, + "anyu": 2, + "nyuk": 2, + "kilw": 2, + "aem": 2, + "popy": 2, + "opya": 2, + "pya": 2, + "kuut": 2, + "oomp": 2, + "ompu": 2, + "mpum": 2, + "umbw": 2, + "mbwe": 2, + "eon": 2, + "nenw": 2, + "ehe.": 2, + "e.u": 2, + "hap": 2, + "apu": 2, + "umbi": 2, + "mbiw": 2, + "biwa": 2, + "nikw": 2, + "nkat": 2, + "uya": 2, + "ship": 2, + "hipo": 2, + "jith": 2, + "ethi": 2, + "iko,": 2, + ",op": 2, + "pou": 2, + "wuun": 2, + "uwu": 2, + "enwe": 2, + "nwe": 2, + "wep": 2, + "ngo.": 2, + "go.": 2, + "o.u": 2, + "kuhu": 2, + "uhum": 2, + "iyek": 2, + "ewan": 2, + "kiig": 2, + "a.u": 2, + "uyi": 2, + "yiig": 2, + "otam": 2, + "tamp": 2, + "aei": 2, + "yawo": 2, + "mond": 2, + "wew": 2, + "ewu": 2, + "waak": 2, + "aaki": 2, + "akii": 2, + "iint": 2, + "intu": 2, + "alum": 2, + "lume": 2, + "entu": 2, + "noya": 2, + "kola": 2, + "putu": 2, + "mwaa": 2, + "gwon": 2, + "lamw": 2, + "jaga": 2, + "iily": 2, + "yiil": 2, + "yaay": 2, + "hei": 2, + "eiy": 2, + "uko.": 2, + "noon": 2, + "oonk": 2, + "oeg": 2, + "egwa": 2, + "euva": 2, + "eko.": 2, + "onke": 2, + "nken": 2, + "kene": 2, + "bag": 2, + "egw": 2, + "okud": 2, + "imbu": 2, + "kupu": 2, + "ho,": 2, + "opas": 2, + "pash": 2, + "dhom": 2, + "iop": 2, + "alek": 2, + "kutu": 2, + "utul": 2, + "tulw": 2, + "ulwe": 2, + "yoo": 2, + "ilwe": 2, + "omap": 2, + "mapa": 2, + "gaw": 2, + "gawo": 2, + "valw": 2, + "noye": 2, + 
"mba.": 2, + "ba.": 2, + ".oy": 2, + "omai": 2, + "oond": 2, + "unge": 2, + "gen": 2, + "neiu": 2, + "voo": 2, + "hane": 2, + "momb": 2, + "ombe": 2, + "umwa": 2, + "aina": 2, + "2om": 2, + "muho": 2, + "lwaa": 2, + "ala,": 2, + "a,u": 2, + "e,e": 2, + "ka,": 2, + "a,e": 2, + "galw": 2, + "geg": 2, + "ano,": 2, + "no,": 2, + "o,g": 2, + ",ge": 2, + "ishe": 2, + "wei": 2, + "taku": 2, + "papo": 2, + "apol": 2, + "poli": 2, + "olit": 2, + "liti": 2, + "yepa": 2, + "opai": 2, + "peh": 2, + "peha": 2, + "iyan": 2, + "ene,": 2, + "ne,": 2, + ",me": 2, + "meto": 2, + "telo": 2, + "lip": 2, + "ambe": 2, + "mbek": 2, + "po.": 2, + "wa3": 2, + "nega": 2, + "ntu.": 2, + "tu.": 2, + "gea": 2, + "emu": 2, + "muup": 2, + "uupi": 2, + "a;u": 2, + "nela": 2, + "andi": 2, + "keel": 2, + "pekw": 2, + "gont": 2, + "shun": 2, + "hund": 2, + "6om": 2, + "kuta": 2, + "gov": 2, + "gove": 2, + "eta.": 2, + "ta.": 2, + "a.a": 2, + "koka": 2, + "taka": 2, + "aidh": 2, + "goka": 2, + "hilw": 2, + "koho": 2, + "ohof": 2, + "hofa": 2, + "ofa": 2, + "fay": 2, + "aed": 2, + "kaw": 2, + "gem": 2, + "get": 2, + "yim": 2, + "pong": 2, + "omat": 2, + "ompe": 2, + "mpel": 2, + "pelo": 2, + "elo.": 2, + "0om": 2, + "ewo": 2, + "akug": 2, + "gulo": 2, + "oag": 2, + "ageh": 2, + "gehe": 2, + "het": 2, + "wet": 2, + "tap": 2, + "bao": 2, + "moni": 2, + "avet": 2, + "aee": 2, + "thel": 2, + "makw": 2, + "akwa": 2, + "atho": 2, + "a.k": 2, + ".ka": 2, + "weed": 2, + "eedh": 2, + "etom": 2, + "omos": 2, + "a,p": 2, + "gwa.": 2, + "oit": 2, + "aaku": 2, + "ndjw": 2, + "egee": 2, + "geel": 2, + "eelo": 2, + "vule": 2, + "nash": 2, + ",yo": 2, + "omeg": 2, + ",ne": 2, + "iye.": 2, + "ye.": 2, + "ahin": 2, + "ngaa": 2, + "aaka": 2, + "dhos": 2, + "hosh": 2, + "shok": 2, + "kuth": 2, + ",mw": 2, + "aluk": 2, + "kos": 2, + "kosh": 2, + "4om": 2, + "kong": 2, + "inau": 2, + "kiwa": 2, + "wae": 2, + "aeg": 2, + "gek": 2, + "maka": 2, + "nkam": 2, + "kame": 2, + "nog": 2, + "ogi": 2, + "gii": 2, + "giig": 2, + 
"5om": 2, + "gow": 2, + "lula": 2, + "olwo": 2, + "awo,": 2, + "wo,": 2, + "nuun": 2, + "tayi": 2, + "yuud": 2, + "oli.": 2, + "7om": 2, + "ayal": 2, + ".ha": 2, + "uthw": 2, + "iyom": 2, + "lela": 2, + "moe": 2, + "yep": 2, + "paku": 2, + ",pa": 2, + "9om": 2, + "ugan": 2, + "lele": 2, + "tadh": 2, + "iiku": 2, + "olud": 2, + "ludh": 2, + "ongi": 2, + "naan": 2, + "mep": 2, + "yosh": 2, + "kili": 2, + "e.e": 2, + "ndik": 2, + "dika": 2, + "iul": 2, + "likw": 2, + "kath": 2, + "athi": 2, + "aath": 2, + "omaf": 2, + "mafu": 2, + "ulup": 2, + "lupa": 2, + "mona": 2, + "nomi": 2, + "omit": 2, + "higu": 2, + "igul": 2, + "lwak": 2, + "waka": 2, + "avu": 2, + "uwe": 2, + "ombw": 2, + "bwan": 2, + "niil": 2, + "pail": 2, + "ailo": 2, + "mwe.": 2, + "goo": 2, + "kuna": 2, + "wiih": 2, + "iihu": 2, + "ihup": 2, + "hupi": 2, + "upit": 2, + "pith": 2, + "negu": 2, + "boi": 2, + "uuwa": 2, + "uwan": 2, + "kwa,": 2, + "aoo": 2, + "taf": 2, + "ukol": 2, + "olel": 2, + "wai": 2, + "auna": 2, + "unam": 2, + "ios": 2, + "tilw": 2, + "meme": 2, + "usil": 2, + ".aa": 2, + "aano": 2, + "anon": 2, + "ndje": 2, + "jey": 2, + "a.e": 2, + "ongu": 2, + "dud": 2, + "opet": 2, + "peta": 2, + "etam": 2, + "tame": 2, + "amek": 2, + "o.e": 2, + "goi": 2, + "oot": 2, + "ungo": 2, + "omba": 2, + "eeg": 2, + "egul": 2, + "aes": 2, + ".ot": 2, + "oet": 2, + "etse": 2, + "ilat": 2, + ",om": 2, + "eoo": 2, + "i.a": 2, + "wiit": 2, + "iitu": 2, + "itul": 2, + "hing": 2, + "ngol": 2, + "olok": 2, + "we,": 2, + "ayu": 2, + "gono": 2, + "onon": 2, + "noni": 2, + "too": 2, + "aash": 2, + "eku": 2, + "shop": 2, + "kaha": 2, + "popi": 2, + "opiw": 2, + "piwa": 2, + "nome": 2, + "hiu": 2, + "tao": 2, + "koli": 2, + "yela": 2, + "tolo": 2, + "tash": 2, + "af:u": 2, + "lev": 2, + "eve": 2, + "lari": 2, + "arin": 2, + "nser": 2, + "tea": 2, + "eal": 2, + "ls:a": 2, + "allg": 2, + "llge": 2, + "lgem": 2, + "geme": 2, + "emei": 2, + "nee": 2, + "rklä": 2, + "klär": 2, + "läru": 2, + "ärun": 2, + "ngd": 2, + 
"erm": 2, + "rme": 2, + "ensc": 2, + "chen": 2, + "henr": 2, + "enre": 2, + "nrec": 2, + "eam": 2, + "an:d": 2, + "unib": 2, + "nibe": 2, + "iber": 2, + "drei": 2, + "الع": 2, + "العا": 2, + "لعال": 2, + "عالم": 2, + "لحق": 2, + "لحقو": 2, + "وقا": 2, + "قال": 2, + "نسان": 2, + "سان": 2, + "نas": 2, + "chos": 2, + "say": 2, + "pach": 2, + "nėž": 2, + "ėžm": 2, + "ogau": 2, + "gaus": 2, + "aus": 2, + "ust": 2, + "teis": 2, + "eisi": 2, + "ude": 2, + "дэк": 2, + "дэкл": 2, + "рацы": 2, + "ацыя": 2, + "цыя": 2, + "ыяп": 2, + "раво": 2, + "авоў": 2, + "воў": 2, + "оўч": 2, + "ўча": 2, + "чал": 2, + "чала": 2, + "алав": 2, + "лаве": 2, + "авек": 2, + "кад": 2, + "за": 2, + "рава": 2, + "ата": 2, + "ана": 2, + "ачо": 2, + "чов": 2, + "чове": 2, + "raw": 2, + "ɛkan": 2, + "nbs": 2, + "bs:u": 2, + "jao": 2, + "udsk": 2, + "avim": 2, + "vima": 2, + "ca:d": 2, + "dels": 2, + ":vše": 2, + "všeo": 2, + "šeob": 2, + "eobe": 2, + "obec": 2, + "becn": 2, + "ecná": 2, + "cná": 2, + "nád": 2, + "áde": 2, + "dský": 2, + "skýc": 2, + "kých": 2, + "ých": 2, + "prá": 2, + "práv": 2, + "ráv": 2, + "serk": 2, + "rklæ": 2, + "klær": 2, + "læri": 2, + "ærin": 2, + "etti": 2, + "de:a": 2, + "yhe": 2, + "insa": 2, + "nsan": 2, + "enye": 2, + "ική": 2, + "en:u": 2, + "ofh": 2, + "fhu": 2, + "rig": 2, + "righ": 2, + "ight": 2, + "ghts": 2, + "eo:u": 2, + "acio": 2, + "es:d": 2, + "set": 2, + "et:i": 2, + "õigu": 2, + "igus": 2, + "atsi": 2, + "arri": 2, + "enu": 2, + "euks": 2, + "leis": 2, + "alli": 2, + "llin": 2, + "line": 2, + "istu": 2, + "usf": 2, + "ro:i": 2, + ":man": 2, + "mann": 2, + "ttin": 2, + "inda": 2, + "nday": 2, + "irlý": 2, + "rlýs": 2, + "lýsi": 2, + "ýsin": 2, + "ngf": 2, + "gfr": 2, + "fr:d": 2, + "sell": 2, + "elle": 2, + "esd": 2, + "dro": 2, + "its": 2, + "el'": 2, + "fy:u": 2, + "hten": 2, + "insk": 2, + "nske": 2, + "nce": 2, + "egl": 2, + "gl:d": 2, + "l:de": 2, + "ldo": 2, + "dos": 2, + ":tek": 2, + "ekov": 2, + "घोषणा": 2, + "hu:a": 2, + "embe": 2, + "nid": 2, + 
"id:p": 2, + ":per": 2, + "asas": 2, + "sasi": 2, + "nusi": 2, + "usia": 2, + "io:u": 2, + "gsa": 2, + "it:d": 2, + ":dic": 2, + "dich": 2, + "hiar": 2, + "iara": 2, + "ione": 2, + "eun": 2, + "sale": 2, + "eid": 2, + "ka:ა": 2, + "b:ti": 2, + "mde": 2, + "lb:u": 2, + "bunt": 2, + "li:u": 2, + "ln:l": 2, + ":vis": 2, + "ациј": 2, + "ција": 2, + "ија": 2, + "ms:p": 2, + "eris": 2, + "araç": 2, + "nça": 2, + "my:အ": 2, + "l:un": 2, + "snn": 2, + "ghet": 2, + "nnv": 2, + "nv:b": 2, + "hoc": 2, + "oc:d": 2, + "c:de": 2, + "cion": 2, + "spl": 2, + "pl:p": 2, + "cja": 2, + "pt:d": 2, + "dire": 2, + "irei": 2, + "qu:p": 2, + "unap": 2, + "ink": 2, + "atun": 2, + "ro:d": 2, + "lăa": 2, + "ăa": 2, + "lui": 2, + "iде": 2, + "каs": 2, + "tss": 2, + "tî": 2, + "tï": 2, + "le:u": 2, + ":baa": 2, + ":dek": 2, + "ss:s": 2, + "sv:f": 2, + "oki": 2, + "engu": 2, + "kiz": 2, + "abi": 2, + "et:d": 2, + "aras": 2, + "nth": 2, + "tr:i̇": 2, + "r:i̇n": 2, + ":tuy": 2, + "nng": 2, + "quố": 2, + "quốc": 2, + "uốc": 2, + "ânq": 2, + "r:sa": 2, + "net": 2, + "în-k": 2, + "nso": 2, + "cs:n": 2, + "en:n": 2, + "eo:n": 2, + "ingv": 2, + "ngvo": 2, + "gvo": 2, + "voe": 2, + "es:i": 2, + "s:id": 2, + "gaf": 2, + "fr:n": 2, + "it:l": 2, + "t:li": 2, + "pl:j": 2, + "l:ję": 2, + ":jęz": 2, + "języ": 2, + "ęzyk": 2, + "knd": 2, + "pt:l": 2, + "t:lí": 2, + ":lín": 2, + "líng": 2, + "íngu": 2, + "sv:n": 2, + "f:en": 2, + "ak:e": 2, + "prac": 2, + "rach": 2, + "ache": 2, + "am:እ": 2, + "ésa": 2, + "y:in": 2, + ":inl": 2, + "inli": 2, + "nlis": 2, + "i̇ngi": 2, + "gili": 2, + "isd": 2, + "dili": 2, + "тел": 2, + "теле": 2, + "еле": 2, + "r:en": 2, + "nglu": 2, + "kalb": 2, + "alba": 2, + "lba": 2, + "abc": 2, + "l:in": 2, + "език": 2, + "m:an": 2, + ":ইংরেজি": 2, + "ইংরেজি": 2, + "ikb": 2, + "inin": 2, + "نگلی": 2, + "izt": 2, + "zti": 2, + "tili": 2, + "ics": 2, + "cs:a": 2, + "glič": 2, + "ličt": 2, + "ičti": 2, + "čtin": 2, + "tina": 2, + "gda": 2, + "e:en": 2, + "b:en": 2, + "elšć": 2, + "lšći": 2, 
+ "šćin": 2, + "ćina": 2, + "αem": 2, + "shl": 2, + "eo:a": 2, + "o:an": 2, + "sek": 2, + "u:in": 2, + "ingr": 2, + "ngre": 2, + "gres": 2, + "fi:e": 2, + "i:en": 2, + "o:en": 2, + "fr:a": 2, + "glai": 2, + "rp:a": 2, + "nghe": 2, + "rla": 2, + "ago": 2, + "ot:𐌰": 2, + "𐌰gu": 2, + "v:ba": 2, + "leh": 2, + "aw:‘": 2, + "lelo": 2, + "ania": 2, + "נגלי": 2, + "יתh": 2, + "ht:a": 2, + "t:an": 2, + "ենi": 2, + "a:li": 2, + "sei": 2, + "ie:a": 2, + "o:pa": 2, + "io:a": 2, + "glia": 2, + "ნაk": 2, + "glic": 2, + "chl": 2, + "lt:a": 2, + "v:an": 2, + "isк": 2, + "кел": 2, + "h:in": 2, + "l:en": 2, + "lsn": 2, + "kno": 2, + "ov:a": 2, + "rm:a": 2, + "gáan": 2, + "áana": 2, + "nger": 2, + "gere": 2, + "erez": 2, + "oc:a": 2, + "spa": 2, + "giel": 2, + "sap": 2, + "aisa": 2, + "y:an": 2, + "ikan": 2, + "eză": 2, + "baa": 2, + "ngri": 2, + "isl": 2, + "asg": 2, + "sl:a": 2, + "l:an": 2, + "eze": 2, + "oss": 2, + "kes": 2, + ":kii": 2, + "zl:a": 2, + "sko": 2, + "ta:ஆ": 2, + "pi:t": 2, + ":英语": 2, + ":isi": 2, + "isin": 2, + "èdè": 2, + "za:y": 2, + "ia(": 2, + "ehyi": 2, + "hyia": 2, + "naj": 2, + "f:as": 2, + "an:a": 2, + "ar:a": 2, + "mg:a": 2, + ":এশিয়া": 2, + "এশিয়া": 2, + "শিয়াb": 2, + "r:az": 2, + "cbk": 2, + "cbk-": 2, + "bk-z": 2, + "k-za": 2, + "-zam": 2, + "zam:": 2, + "am:a": 2, + "co:a": 2, + "y:as": 2, + "q:as": 2, + ":ασί": 2, + "ασία": 2, + "o:az": 2, + "t:ás": 2, + "ro:a": 2, + "o:aa": 2, + "jeg": 2, + ":亞洲": 2, + "d:as": 2, + "aie": 2, + "sh:a": 2, + "w:as": 2, + "azië": 2, + "zië": 2, + "ap:a": 2, + "c:as": 2, + "or:": 2, + ":ázi": 2, + "ázia": 2, + "nsu": 2, + "siyo": 2, + "iwu": 2, + ":亚洲": 2, + "àza": 2, + "洲zh": 2, + "亞洲z": 2, + "uzh": 2, + "ce:t": 2, + "tayw": 2, + "ywan": 2, + "sjin": 2, + "jina": 2, + "st:t": 2, + "t:ta": 2, + "aiwá": 2, + "iwán": 2, + "wán": 2, + "naz": 2, + ":çin": 2, + "çin": 2, + "ikas": 2, + "ar:r": 2, + "r:re": 2, + "naк": 2, + "кит": 2, + "кита": 2, + "итай": 2, + "bs:t": 2, + "s:ta": 2, + "nca": 2, + "eb:t": 2, + "cs:t": 2, 
+ "ncy": 2, + "cy:g": 2, + "e:re": 2, + "ނާee": 2, + "ee:t": 2, + ":res": 2, + ":hii": 2, + "hiin": 2, + "vab": 2, + "vaba": 2, + "abar": 2, + "bari": 2, + "arii": 2, + "riik": 2, + "eu:t": 2, + "چین": 2, + "hine": 2, + "agd": 2, + "gl:t": 2, + "l:ta": 2, + "hr:t": 2, + "nhu": 2, + "hu:t": 2, + "d:re": 2, + "io:t": 2, + "nis": 2, + "t:re": 2, + "a(t": 2, + "(tai": 2, + "v:re": 2, + "ნიk": 2, + "იko": 2, + "kad": 2, + "lb:t": 2, + "ij:t": 2, + "nlm": 2, + "mo:t": 2, + "nln": 2, + "ln:t": 2, + "n:ta": 2, + "nlt": 2, + "lt:t": 2, + "vana": 2, + "mg:t": 2, + "nmi": 2, + "кин": 2, + "кина": 2, + "ина": 2, + "ekc": 2, + "c:re": 2, + "chiń": 2, + "hińs": 2, + "ińsk": 2, + "ms:t": 2, + "anp": 2, + "npt": 2, + "naq": 2, + "inez": 2, + "cn:t": 2, + "sk:t": 2, + "k:ta": 2, + "nsl": 2, + "sl:t": 2, + "nsq": 2, + "sv:t": 2, + "mhur": 2, + "huri": 2, + "นtk": 2, + "tk:t": 2, + "ntl": 2, + "kbi": 2, + "nts": 2, + "nàz": 2, + "民國z": 2, + "國zh": 2, + "sok": 2, + "dam.": 2, + "am.": 2, + "kuli": 2, + "yow": 2, + "okum": 2, + "avel": 2, + "اوس": 2, + "ca:p": 2, + "imer": 2, + "mera": 2, + "cs:p": 2, + "s:pr": 2, + ":prv": 2, + ":1.": 2, + "bri": 2, + "brie": 2, + "rief": 2, + "theu": 2, + ":fir": 2, + "firs": 2, + "irst": 2, + "tep": 2, + "istl": 2, + "stle": 2, + "tle": 2, + "tot": 2, + "othy": 2, + "thy": 2, + "lti": 2, + "es:p": 2, + "eti": 2, + "r:pr": 2, + "tre": 2, + "ak:t": 2, + "k:th": 2, + "hr:p": 2, + "otiu": 2, + "tius": 2, + "ius": 2, + "t:pr": 2, + ":epi": 2, + "lt:p": 2, + "rste": 2, + "teb": 2, + "ebr": 2, + "no:p": 2, + "brev": 2, + "zap": 2, + "pt:p": 2, + "пос": 2, + "посл": 2, + "осла": 2, + "слан": 2, + "лани": 2, + "тим": 2, + "тимо": 2, + "oia": 2, + "theo": 2, + "tsw": 2, + "zak": 2, + "tl:u": 2, + "f:tu": 2, + "yea": 2, + "ls:t": 2, + "ar:t": 2, + "r:tü": 2, + ":তুরস্ক": 2, + "তুরস্ক": 2, + "রস্কb": 2, + "urec": 2, + "ecko": 2, + "cko": 2, + "ida": 2, + "kiet": 2, + "iet": 2, + "urko": 2, + ":του": 2, + "τουρ": 2, + "ουρκ": 2, + "υρκί": 2, + "ρκία": 2, + 
"κία": 2, + "et:t": 2, + "türg": 2, + "u:tu": 2, + "ro:t": 2, + "urka": 2, + "y:tu": 2, + "tui": 2, + "tuir": 2, + "uirc": 2, + "irc": 2, + "rcg": 2, + "w:tu": 2, + "kih": 2, + ":tör": 2, + "d:tu": 2, + "kik": 2, + "იka": 2, + ":तुर्किये": 2, + "तुर्किये": 2, + "tirk": 2, + "irki": 2, + "la:t": 2, + "ap:t": 2, + "nl:t": 2, + "jen": 2, + "m:tu": 2, + "ipl": 2, + "eys": 2, + "q:tu": 2, + "isz": 2, + "h:ปร": 2, + ":ประ": 2, + "ประเ": 2, + "ระเท": 2, + "ะเทศ": 2, + "yeu": 2, + "耳其z": 2, + "其zh": 2, + "germ": 2, + "erma": 2, + "rman": 2, + "many": 2, + "nyn": 2, + "ak:u": 2, + "ls:u": 2, + "an:u": 2, + "z:ue": 2, + "ar:u": 2, + "am:u": 2, + "ncr": 2, + "et:u": 2, + "nfi": 2, + "ro:u": 2, + "j:ue": 2, + "lo:u": 2, + "sh:u": 2, + ":ite": 2, + "iter": 2, + "ap:u": 2, + "र्सेन्n": 2, + "rm:u": 2, + "nst": 2, + "nve": 2, + "特森z": 2, + "f:gh": 2, + "k:gh": 2, + "an:g": 2, + "g:ga": 2, + ":ঘানা": 2, + "ঘানাb": 2, + "s:ga": 2, + "q:ga": 2, + "u:gh": 2, + "ána": 2, + "r:ga": 2, + ":ghá": 2, + "ghán": 2, + "v:gh": 2, + "u:ga": 2, + "a:ga": 2, + "v:ga": 2, + "avo": 2, + "thum": 2, + "rgd": 2, + "rgf": 2, + "isbo": 2, + "urga": 2, + "rgp": 2, + "taly": 2, + "aly": 2, + "lyn": 2, + "zac": 2, + "ml:p": 2, + "en:p": 2, + "plas": 2, + "lase": 2, + "ncia": 2, + "a(i": 2, + "(ita": 2, + "lia)": 2, + "zaf": 2, + "zai": 2, + "ki:p": 2, + "plac": 2, + "j:pi": 2, + "iase": 2, + "sens": 2, + "ap:p": 2, + "zar": 2, + "sg:p": 2, + "zav": 2, + "k:pi": 2, + "st:p": 2, + "nob": 2, + "ocr": 2, + "f:no": 2, + "дво": 2, + "двор": 2, + "вор": 2, + "орк": 2, + "скі": 2, + "a:no": 2, + "s:no": 2, + "e:no": 2, + "r:no": 2, + "g:no": 2, + "l:no": 2, + "kiд": 2, + "iдв": 2, + "ркр": 2 + } +} \ No newline at end of file diff --git a/visualizations/embedding_isotropy.png b/visualizations/embedding_isotropy.png new file mode 100644 index 0000000000000000000000000000000000000000..a91a2d44adb11390d31f16614130152d646eb980 Binary files /dev/null and b/visualizations/embedding_isotropy.png differ diff --git 
a/visualizations/embedding_norms.png b/visualizations/embedding_norms.png new file mode 100644 index 0000000000000000000000000000000000000000..ef49fa3ecc1a8755236759518877dbd8fd71f241 Binary files /dev/null and b/visualizations/embedding_norms.png differ diff --git a/visualizations/embedding_similarity.png b/visualizations/embedding_similarity.png new file mode 100644 index 0000000000000000000000000000000000000000..a5cc39226f2df9957571de9623fc8cf7f7f0ab15 --- /dev/null +++ b/visualizations/embedding_similarity.png @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:29d209c876afeff9aea88b6984a3e221213fa2e49bcbdf8b91d87ea579c7f946 +size 136422 diff --git a/visualizations/embedding_tsne_multilingual.png b/visualizations/embedding_tsne_multilingual.png new file mode 100644 index 0000000000000000000000000000000000000000..d05137655039f3a27997cd17a1ba277f4abeef96 --- /dev/null +++ b/visualizations/embedding_tsne_multilingual.png @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a0774d1e70ffe3d18844cccd243621fe102760812ab9c668542778c203f8512e +size 197865 diff --git a/visualizations/markov_branching.png b/visualizations/markov_branching.png new file mode 100644 index 0000000000000000000000000000000000000000..31b080bb17252b682975a2cce5de83adcb97f28d Binary files /dev/null and b/visualizations/markov_branching.png differ diff --git a/visualizations/markov_contexts.png b/visualizations/markov_contexts.png new file mode 100644 index 0000000000000000000000000000000000000000..afbaa836417bb5bf3baa8f7ca41d3809f734290b Binary files /dev/null and b/visualizations/markov_contexts.png differ diff --git a/visualizations/markov_entropy.png b/visualizations/markov_entropy.png new file mode 100644 index 0000000000000000000000000000000000000000..0c0a5e37f2de914f7dc4e74218186c86312498a0 Binary files /dev/null and b/visualizations/markov_entropy.png differ diff --git a/visualizations/model_sizes.png b/visualizations/model_sizes.png new file mode 
100644 index 0000000000000000000000000000000000000000..3fb87aa46a7f0f3313a8fc7c4633d9990da31288 Binary files /dev/null and b/visualizations/model_sizes.png differ diff --git a/visualizations/nearest_neighbors.png b/visualizations/nearest_neighbors.png new file mode 100644 index 0000000000000000000000000000000000000000..a67b525191e3e6f303e74abb4b1e7f407502bec3 Binary files /dev/null and b/visualizations/nearest_neighbors.png differ diff --git a/visualizations/ngram_coverage.png b/visualizations/ngram_coverage.png new file mode 100644 index 0000000000000000000000000000000000000000..04f43aa204d3e55cf47c540d586c439ba8761400 Binary files /dev/null and b/visualizations/ngram_coverage.png differ diff --git a/visualizations/ngram_entropy.png b/visualizations/ngram_entropy.png new file mode 100644 index 0000000000000000000000000000000000000000..343d862d667ded7799fb0f8cfa1bedc1646ae924 Binary files /dev/null and b/visualizations/ngram_entropy.png differ diff --git a/visualizations/ngram_perplexity.png b/visualizations/ngram_perplexity.png new file mode 100644 index 0000000000000000000000000000000000000000..11e80b658f29f76ca3969b517aade9e9de94e110 Binary files /dev/null and b/visualizations/ngram_perplexity.png differ diff --git a/visualizations/ngram_unique.png b/visualizations/ngram_unique.png new file mode 100644 index 0000000000000000000000000000000000000000..d553e5383b4dd91f5eb50119c11bcfc3a7549891 Binary files /dev/null and b/visualizations/ngram_unique.png differ diff --git a/visualizations/performance_dashboard.png b/visualizations/performance_dashboard.png new file mode 100644 index 0000000000000000000000000000000000000000..10ad43e786afd522a5141e594a34199e2dd0d0e9 --- /dev/null +++ b/visualizations/performance_dashboard.png @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9eb28f5eb94f8721c9170ea6ed0e57fa25f358a41fad40db233dc727e3b780f4 +size 376265 diff --git a/visualizations/position_encoding_comparison.png 
b/visualizations/position_encoding_comparison.png new file mode 100644 index 0000000000000000000000000000000000000000..19e6658a4c80c6204496af729332b229598b6642 --- /dev/null +++ b/visualizations/position_encoding_comparison.png @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8223ab5e9e3842c58c22c1b12fc9d4c473cd295609024a334fa1257cb73de868 +size 101459 diff --git a/visualizations/tokenizer_compression.png b/visualizations/tokenizer_compression.png new file mode 100644 index 0000000000000000000000000000000000000000..0e8c2d12ee388e7ee4c30b85f02ed30ea42457f0 Binary files /dev/null and b/visualizations/tokenizer_compression.png differ diff --git a/visualizations/tokenizer_fertility.png b/visualizations/tokenizer_fertility.png new file mode 100644 index 0000000000000000000000000000000000000000..ff7f9aa19cd0bc67cb553e2717d253feb8496c88 Binary files /dev/null and b/visualizations/tokenizer_fertility.png differ diff --git a/visualizations/tokenizer_oov.png b/visualizations/tokenizer_oov.png new file mode 100644 index 0000000000000000000000000000000000000000..abea23a062c027bbe784a06cdc26d8b36de4a79d Binary files /dev/null and b/visualizations/tokenizer_oov.png differ diff --git a/visualizations/tokenizer_total_tokens.png b/visualizations/tokenizer_total_tokens.png new file mode 100644 index 0000000000000000000000000000000000000000..075a103119fe215c6aadc86d51fee72bfb9a7da1 Binary files /dev/null and b/visualizations/tokenizer_total_tokens.png differ diff --git a/visualizations/top20_words.png b/visualizations/top20_words.png new file mode 100644 index 0000000000000000000000000000000000000000..7d6b4c7936620d7643db6ccfe74237fa729f0349 Binary files /dev/null and b/visualizations/top20_words.png differ diff --git a/visualizations/tsne_sentences.png b/visualizations/tsne_sentences.png new file mode 100644 index 0000000000000000000000000000000000000000..03c2e6e1772b96dff1acf3dcf89eb45718722f36 Binary files /dev/null and b/visualizations/tsne_sentences.png 
differ diff --git a/visualizations/tsne_words.png b/visualizations/tsne_words.png new file mode 100644 index 0000000000000000000000000000000000000000..6d76c7c03bb5f8eb9441af523d544202ad381598 Binary files /dev/null and b/visualizations/tsne_words.png differ diff --git a/visualizations/vocab_coverage.png b/visualizations/vocab_coverage.png new file mode 100644 index 0000000000000000000000000000000000000000..f48dca31c5f32adbe0acd808eeaf8502686ba4ab Binary files /dev/null and b/visualizations/vocab_coverage.png differ diff --git a/visualizations/vocab_freq_dist.png b/visualizations/vocab_freq_dist.png new file mode 100644 index 0000000000000000000000000000000000000000..286ba2bdc24e57b513b745b80dbe12600d47c705 Binary files /dev/null and b/visualizations/vocab_freq_dist.png differ diff --git a/visualizations/zipf_law.png b/visualizations/zipf_law.png new file mode 100644 index 0000000000000000000000000000000000000000..210494785b7c14d1220addce48c5e6b5d4e85361 Binary files /dev/null and b/visualizations/zipf_law.png differ