devwoo
/

Kybalion-1B

continued-pretraining

Model card Files Files and versions

devwoo commited on Apr 13

Commit

7ce0d37

·

verified ·

1 Parent(s): cbf9c41

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -40,6 +40,14 @@ datasets:
 ---
 ## 📊 Benchmark Results
 All scores measured with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) under **identical conditions** (same prompts, same few-shot settings, same hardware).

 ---
+## 🔬 Key Contributions
+- Demonstrates that domain-balanced continued pretraining on curated multi-domain data (education, math, code, science) yields consistent improvements across commonsense reasoning benchmarks in 1B-scale models
+- Suggests that multi-step mathematical reasoning remains a fundamental bottleneck for 1B-scale models, even when combining math-focused pretraining (OpenWebMath) with instruction tuning (MetaMathQA)
+- Provides a fully reproducible, compute-efficient training recipe (CPT → LoRA SFT) built and executed **by a single undergraduate student in under one week**, demonstrating that meaningful LLM research is achievable without institutional resources or large teams
+---
 ## 📊 Benchmark Results
 All scores measured with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) under **identical conditions** (same prompts, same few-shot settings, same hardware).