williamsassa commited on
Commit ·
cf495ab
1
Parent(s): b17e6a6
Add model
Browse files- best_model.pth +3 -0
- evaluation_summary.txt +47 -0
best_model.pth
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:537695304e0fa7d7b0518f96a6d1107dd0488de6cd1458c5fc0e0101c8db83cc
|
| 3 |
+
size 21540213
|
evaluation_summary.txt
ADDED
|
@@ -0,0 +1,47 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
================================================================================
|
| 2 |
+
LANGUAGE IDENTIFICATION MODEL EVALUATION SUMMARY
|
| 3 |
+
================================================================================
|
| 4 |
+
|
| 5 |
+
Evaluation Date: 2026-01-31 15:03:05
|
| 6 |
+
Model: Hybrid TF-IDF + BiLSTM Language Identifier
|
| 7 |
+
Dataset: WiLI-2018
|
| 8 |
+
Number of languages: 235
|
| 9 |
+
Vocabulary size: 20,002
|
| 10 |
+
Total test samples: 117,500
|
| 11 |
+
|
| 12 |
+
PERFORMANCE METRICS:
|
| 13 |
+
Test Accuracy: 93.7481%
|
| 14 |
+
Test F1 Score: 93.7531%
|
| 15 |
+
|
| 16 |
+
TOP 5 BEST PERFORMING LANGUAGES:
|
| 17 |
+
1. ckb - 100.00% accuracy (500 samples)
|
| 18 |
+
2. kbd - 100.00% accuracy (500 samples)
|
| 19 |
+
3. min - 100.00% accuracy (500 samples)
|
| 20 |
+
4. mlg - 100.00% accuracy (500 samples)
|
| 21 |
+
5. bod - 99.80% accuracy (500 samples)
|
| 22 |
+
|
| 23 |
+
MOST CHALLENGING 5 LANGUAGES:
|
| 24 |
+
1. wuu - 15.60% accuracy (500 samples)
|
| 25 |
+
2. zh-yue - 22.80% accuracy (500 samples)
|
| 26 |
+
3. zho - 37.00% accuracy (500 samples)
|
| 27 |
+
4. hrv - 46.80% accuracy (500 samples)
|
| 28 |
+
5. hbs - 53.80% accuracy (500 samples)
|
| 29 |
+
|
| 30 |
+
PERFORMANCE DISTRIBUTION:
|
| 31 |
+
Excellent (≥99%): 25 languages
|
| 32 |
+
Good (95-99%): 131 languages
|
| 33 |
+
Average (80-95%): 68 languages
|
| 34 |
+
Poor (<80%): 11 languages
|
| 35 |
+
|
| 36 |
+
INTERESTING FINDINGS:
|
| 37 |
+
1. Several languages achieve 100% accuracy (ckb, kbd, min, mlg)
|
| 38 |
+
2. Chinese variants are the most challenging (wuu: 15.6%, zh-yue: 22.8%)
|
| 39 |
+
3. Japanese is surprisingly challenging (56.0% accuracy)
|
| 40 |
+
4. 93.75% overall accuracy is excellent for 235 languages
|
| 41 |
+
|
| 42 |
+
RECOMMENDATIONS FOR IMPROVEMENT:
|
| 43 |
+
1. Add data augmentation for low-accuracy languages
|
| 44 |
+
2. Consider language family-based transfer learning
|
| 45 |
+
3. Ensemble methods could boost performance
|
| 46 |
+
|
| 47 |
+
================================================================================
|