SeaWolf-AI commited on
Commit
c47eb81
·
verified ·
1 Parent(s): fc603d7

Add Family Tree (5-member lineage) + datasets metadata

Browse files
Files changed (1) hide show
  1. README.md +44 -7
README.md CHANGED
@@ -15,9 +15,13 @@ tags:
15
  - mixture-of-experts
16
  - cohere2_moe
17
  - 218b
 
18
  base_model:
19
- - CohereLabs/command-a-plus-05-2026-bf16
20
  - FINAL-Bench/Darwin-218B-kr
 
 
 
 
21
  ---
22
 
23
  # Darwin-218B-Delphi
@@ -55,16 +59,49 @@ DELPHI improvement | +4.04pp (+8 questions)
55
 
56
  ---
57
 
58
- ## Lineage
59
 
60
  ```
61
- CohereLabs/command-a-plus-05-2026-bf16 (Apache-2.0 base, 218B MoE, ~25B active, 128 expert)
62
- ↓ Korean LoRA merge
63
- Darwin-218B-kr (Korean-aligned base)
64
- ↓ Chemistry SFT LoRA merge (Opus-distilled, anti-contamination)
65
- Darwin-218B-Delphi ← THIS MODEL
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
66
  ```
67
 
 
 
 
 
 
68
  **Distillation**:
69
  - Teacher: large frontier model (proprietary API; no logits exposure → SFT-on-outputs pattern)
70
  - 993 high-quality chemistry CoT examples across 6 sub-domains:
 
15
  - mixture-of-experts
16
  - cohere2_moe
17
  - 218b
18
+ - gpqa-90
19
  base_model:
 
20
  - FINAL-Bench/Darwin-218B-kr
21
+ - CohereLabs/command-a-plus-05-2026-bf16
22
+ base_model_relation: merge
23
+ datasets:
24
+ - FINAL-Bench/darwin-chem-data-v1
25
  ---
26
 
27
  # Darwin-218B-Delphi
 
59
 
60
  ---
61
 
62
+ ## 🌳 Family Tree (족보)
63
 
64
  ```
65
+ 🧓 GRANDFATHER (조부) 🧓 GRANDMOTHER (조모)
66
+ ─────────────────── ───────────────────
67
+ CohereLabs/ Anthropic Claude
68
+ command-a-plus-05-2026-bf16 Opus 4.5
69
+ (Apache-2.0) (chemistry knowledge donor)
70
+ 218B MoE / ~25B active via SFT distillation
71
+ 128 experts, BF16 (no logits, output-only)
72
+ │ │
73
+ │ │
74
+ └────────────────┬──────────────────────┘
75
+
76
+
77
+ 👨 FATHER (부친) 👩 MOTHER (모친)
78
+ ─────────────────── ───────────────────
79
+ FINAL-Bench/ FINAL-Bench/
80
+ Darwin-218B-kr darwin-chem-data-v1
81
+ (Korean LoRA merged) (993 chemistry CoT samples,
82
+ Korean fluency layer 6 sub-domains,
83
+ anti-contamination guaranteed)
84
+ │ │
85
+ │ │
86
+ └────────────────┬──────────────────────┘
87
+
88
+
89
+ 👦 CHILD (자식 / THIS MODEL)
90
+ ──────────────────────────────
91
+ FINAL-Bench/Darwin-218B-Delphi
92
+ ──────────────────────────────
93
+ • Korean + Chemistry specialist
94
+ • 218B MoE, ~25B active
95
+ • Apache-2.0
96
+ • GPQA Diamond 90.91% (DELPHI cascade)
97
+ • Served via DELPHI 5-Phase inference
98
  ```
99
 
100
+ ### Lineage notes
101
+ - **Paternal line (모델 골격)**: Cohere Command A+ → Korean LoRA → Chemistry LoRA merge → Delphi
102
+ - **Maternal line (지식 source)**: Claude Opus 4.5 → 993 distilled chemistry CoT samples → Delphi's chemistry reasoning
103
+ - **Apache-2.0 compatibility**: All ancestors (paternal line) are Apache-2.0 licensed; maternal line is data-only output (Anthropic ToS compliant for derivative model training)
104
+
105
  **Distillation**:
106
  - Teacher: large frontier model (proprietary API; no logits exposure → SFT-on-outputs pattern)
107
  - 993 high-quality chemistry CoT examples across 6 sub-domains: