williamsassa commited on
Commit
cf495ab
·
1 Parent(s): b17e6a6

Add model

Browse files
Files changed (2) hide show
  1. best_model.pth +3 -0
  2. evaluation_summary.txt +47 -0
best_model.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:537695304e0fa7d7b0518f96a6d1107dd0488de6cd1458c5fc0e0101c8db83cc
3
+ size 21540213
evaluation_summary.txt ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ================================================================================
2
+ LANGUAGE IDENTIFICATION MODEL EVALUATION SUMMARY
3
+ ================================================================================
4
+
5
+ Evaluation Date: 2026-01-31 15:03:05
6
+ Model: Hybrid TF-IDF + BiLSTM Language Identifier
7
+ Dataset: WiLI-2018
8
+ Number of languages: 235
9
+ Vocabulary size: 20,002
10
+ Total test samples: 117,500
11
+
12
+ PERFORMANCE METRICS:
13
+ Test Accuracy: 93.7481%
14
+ Test F1 Score: 93.7531%
15
+
16
+ TOP 5 BEST PERFORMING LANGUAGES:
17
+ 1. ckb - 100.00% accuracy (500 samples)
18
+ 2. kbd - 100.00% accuracy (500 samples)
19
+ 3. min - 100.00% accuracy (500 samples)
20
+ 4. mlg - 100.00% accuracy (500 samples)
21
+ 5. bod - 99.80% accuracy (500 samples)
22
+
23
+ MOST CHALLENGING 5 LANGUAGES:
24
+ 1. wuu - 15.60% accuracy (500 samples)
25
+ 2. zh-yue - 22.80% accuracy (500 samples)
26
+ 3. zho - 37.00% accuracy (500 samples)
27
+ 4. hrv - 46.80% accuracy (500 samples)
28
+ 5. hbs - 53.80% accuracy (500 samples)
29
+
30
+ PERFORMANCE DISTRIBUTION:
31
+ Excellent (≥99%): 25 languages
32
+ Good (95-99%): 131 languages
33
+ Average (80-95%): 68 languages
34
+ Poor (<80%): 11 languages
35
+
36
+ INTERESTING FINDINGS:
37
+ 1. Several languages achieve 100% accuracy (ckb, kbd, min, mlg)
38
+ 2. Chinese variants are the most challenging (wuu: 15.6%, zh-yue: 22.8%)
39
+ 3. Japanese is surprisingly challenging (56.0% accuracy)
40
+ 4. 93.75% overall accuracy is excellent for 235 languages
41
+
42
+ RECOMMENDATIONS FOR IMPROVEMENT:
43
+ 1. Add data augmentation for low-accuracy languages
44
+ 2. Consider language family-based transfer learning
45
+ 3. Ensemble methods could boost performance
46
+
47
+ ================================================================================