| ================================================================================ |
| LANGUAGE IDENTIFICATION MODEL EVALUATION SUMMARY |
| ================================================================================ |
|
|
| Evaluation Date: 2026-01-31 15:03:05 |
| Model: Hybrid TF-IDF + BiLSTM Language Identifier |
| Dataset: WiLI-2018 |
| Number of languages: 235 |
| Vocabulary size: 20,002 |
| Total test samples: 117,500 |
|
|
| PERFORMANCE METRICS: |
| Test Accuracy: 93.7481% |
| Test F1 Score: 93.7531% |
|
|
| TOP 5 BEST PERFORMING LANGUAGES: |
| 1. ckb - 100.00% accuracy (500 samples) |
| 2. kbd - 100.00% accuracy (500 samples) |
| 3. min - 100.00% accuracy (500 samples) |
| 4. mlg - 100.00% accuracy (500 samples) |
| 5. bod - 99.80% accuracy (500 samples) |
|
|
| MOST CHALLENGING 5 LANGUAGES: |
| 1. wuu - 15.60% accuracy (500 samples) |
| 2. zh-yue - 22.80% accuracy (500 samples) |
| 3. zho - 37.00% accuracy (500 samples) |
| 4. hrv - 46.80% accuracy (500 samples) |
| 5. hbs - 53.80% accuracy (500 samples) |
|
|
| PERFORMANCE DISTRIBUTION: |
| Excellent (≥99%): 25 languages |
| Good (95-99%): 131 languages |
| Average (80-95%): 68 languages |
| Poor (<80%): 11 languages |
|
|
| INTERESTING FINDINGS: |
| 1. Several languages achieve 100% accuracy (ckb, kbd, min, mlg) |
| 2. Chinese variants are the most challenging (wuu: 15.6%, zh-yue: 22.8%) |
| 3. Japanese is surprisingly challenging (56.0% accuracy) |
| 4. 93.75% overall accuracy is excellent for 235 languages |
|
|
| RECOMMENDATIONS FOR IMPROVEMENT: |
| 1. Add data augmentation for low-accuracy languages |
| 2. Consider language family-based transfer learning |
| 3. Ensemble methods could boost performance |
|
|
| ================================================================================ |
|
|