Update README.md
Browse files
README.md
CHANGED
|
@@ -49,6 +49,10 @@ library_name: transformers
|
|
| 49 |
<a href="https://huggingface.co/spaces/FINAL-Bench/all-bench-leaderboard"><img src="https://img.shields.io/badge/📊_ALL_Bench-Leaderboard-orange?style=for-the-badge" alt="ALL Bench"></a>
|
| 50 |
</p>
|
| 51 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 52 |
> Gemma 4 Expert 4B (MoE) | Thinking Mode | 128K Context | 140+ Languages | BF16 | Apache 2.0
|
| 53 |
|
| 54 |
---
|
|
@@ -70,6 +74,15 @@ As the smallest member of the Darwin Opus family, Darwin-4B-Opus delivers Claude
|
|
| 70 |
| Father | google/gemma-4-E4B-it | Gemma 4 Expert 4B (MoE), multimodal, 128K context, efficient inference |
|
| 71 |
| Mother | arsovskidev/Gemma-4-E4B-Claude-4.6-Opus-Reasoning-Distilled | Claude 4.6 Opus high-effort reasoning distillation, enhanced code/science/analysis |
|
| 72 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 73 |
---
|
| 74 |
|
| 75 |
## Benchmarks
|
|
@@ -116,6 +129,12 @@ final_ratio = mri_ratio × mri_trust + genome_ratio × (1 - mri_trust)
|
|
| 116 |
|
| 117 |
The mri_trust parameter itself is optimized by the CMA-ES evolutionary algorithm, allowing the system to automatically determine the optimal balance between diagnostic prescription and evolutionary search for each model pair.
|
| 118 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 119 |
---
|
| 120 |
|
| 121 |
## Evolution Result
|
|
|
|
| 49 |
<a href="https://huggingface.co/spaces/FINAL-Bench/all-bench-leaderboard"><img src="https://img.shields.io/badge/📊_ALL_Bench-Leaderboard-orange?style=for-the-badge" alt="ALL Bench"></a>
|
| 50 |
</p>
|
| 51 |
|
| 52 |
+
<p align="center">
|
| 53 |
+
<img src="info.png" alt="Darwin-4B-Opus" width="100%">
|
| 54 |
+
</p>
|
| 55 |
+
|
| 56 |
> Gemma 4 Expert 4B (MoE) | Thinking Mode | 128K Context | 140+ Languages | BF16 | Apache 2.0
|
| 57 |
|
| 58 |
---
|
|
|
|
| 74 |
| Father | google/gemma-4-E4B-it | Gemma 4 Expert 4B (MoE), multimodal, 128K context, efficient inference |
|
| 75 |
| Mother | arsovskidev/Gemma-4-E4B-Claude-4.6-Opus-Reasoning-Distilled | Claude 4.6 Opus high-effort reasoning distillation, enhanced code/science/analysis |
|
| 76 |
|
| 77 |
+
### Model Diagnostic Scan (MDS)
|
| 78 |
+
|
| 79 |
+
<p align="center">
|
| 80 |
+
<img src="s1.png" alt="Father (gemma-4-E4B-it) MDS Scan" width="48%">
|
| 81 |
+
<img src="s2.png" alt="Mother (Claude-Opus-Distill) MDS Scan" width="48%">
|
| 82 |
+
</p>
|
| 83 |
+
|
| 84 |
+
Left: Father (gemma-4-E4B-it) — balanced generalist with low activation across most probes. Right: Mother (Claude-Opus-Distill) — strong REASONING concentration in later layers, CODE activation in late layers. The Mother shows significantly more specialized layer patterns from Claude Opus distillation.
|
| 85 |
+
|
| 86 |
---
|
| 87 |
|
| 88 |
## Benchmarks
|
|
|
|
| 129 |
|
| 130 |
The mri_trust parameter itself is optimized by the CMA-ES evolutionary algorithm, allowing the system to automatically determine the optimal balance between diagnostic prescription and evolutionary search for each model pair.
|
| 131 |
|
| 132 |
+
### Parent Comparison (MDS Result)
|
| 133 |
+
|
| 134 |
+
<p align="center">
|
| 135 |
+
<img src="parent_comparison.png" alt="Parent Comparison — Layer-wise Importance" width="100%">
|
| 136 |
+
</p>
|
| 137 |
+
|
| 138 |
---
|
| 139 |
|
| 140 |
## Evolution Result
|