Add base model to the model card
#2
by
qingy2024 - opened
README.md
CHANGED
|
@@ -9,7 +9,7 @@ license_link: >-
|
|
| 9 |
|
| 10 |
## Overview
|
| 11 |
|
| 12 |
-
Chirp-3b is a high-performing 3B parameter language model crafted by the Ozone Research team. Fine-tuned from a robust base model, it was trained on 50 million tokens of distilled data from GPT-4o. This compact yet powerful model delivers exceptional results, outperforming expectations on benchmarks like MMLU Pro and IFEval.
|
| 13 |
|
| 14 |
Chirp-3b is an open-source effort to push the limits of what small-scale LLMs can achieve, making it a valuable tool for researchers and enthusiasts alike.
|
| 15 |
|
|
|
|
| 9 |
|
| 10 |
## Overview
|
| 11 |
|
| 12 |
+
Chirp-3b is a high-performing 3B parameter language model crafted by the Ozone Research team. Fine-tuned from a robust base model (Qwen2.5 3B Instruct), it was trained on 50 million tokens of distilled data from GPT-4o. This compact yet powerful model delivers exceptional results, outperforming expectations on benchmarks like MMLU Pro and IFEval.
|
| 13 |
|
| 14 |
Chirp-3b is an open-source effort to push the limits of what small-scale LLMs can achieve, making it a valuable tool for researchers and enthusiasts alike.
|
| 15 |
|