Commit 2e4a1f6 (verified) by Sandroeth · 1 parent: b74c3b1

Update README.md

Files changed (1)
  1. README.md +17 -0
README.md CHANGED
@@ -62,6 +62,23 @@ Dataset meliputi:
 
  ---
 
+ Below is the performance comparison of CALI-0.1B against other prominent Small Language Models (SLMs) in the 100M+ parameter tier.
+
+ | Model Name | PIQA | MMLU Math | ARC-Challenge | HellaSwag |
+ | :--- | :---: | :---: | :---: | :---: |
+ | **CALI-0.1B** | 54.19% | 28.04% | 24.66% | 27.00% |
+ | **SmolLM2-135M** | **58.50%** | **29.90%** | **31.10%** | **43.20%** |
+ | **GPT-X2-125M** | 51.60% | 27.80% | 27.80% | 40.50% |
+ | **SmolLM-135M** | 56.30% | 28.80% | 28.80% | 42.70% |
+ | **MobileLLM-R1-140M-base** | 49.90% | 24.70% | 24.70% | 33.90% |
+ | **GPT-X-125M** | 50.80% | 26.70% | 26.70% | 36.50% |
+ | **GPT-2 (124M)** | 39.50% | 22.60% | 22.60% | 31.50% |
+ | **GPT-Neo-125M** | 39.40% | 22.90% | 22.90% | 30.40% |
+ | **OPT-125M** | 40.20% | 22.90% | 22.90% | 31.40% |
+
+ *Note: For CALI-0.1B, the scores represent strict raw accuracies (`acc` / `acc_norm`) extracted directly from the evaluation tracker logs.*
+
+
  ## Progress Training
 
  | Tokens | Step | Final Loss |