Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ base_model:
|
|
| 8 |
|
| 9 |
# Gladiator-Mini-exp-1211: A Compact and Powerful Reasoning Engine
|
| 10 |
|
| 11 |
-
**Gladiator-Mini-exp-1211** is a 3-billion parameter language model designed for **complex reasoning tasks**. This experimental model, based on [meta-llama/Llama-3.2-3B-Instruct]
|
| 12 |
|
| 13 |
**What Makes it Stand Out?**
|
| 14 |
|
|
@@ -38,10 +38,14 @@ We encourage you to experiment with Gladiator-Mini-exp-1211, test its limits, an
|
|
| 38 |
|
| 39 |
**Limitations:**
|
| 40 |
|
| 41 |
-
* Requires system prompts for optimal performance. An incorrect system prompt can lead to an incorrect answer, while a correct prompt usually leads to a correct one.
|
| 42 |
* Reasoning abilities are still being refined.
|
| 43 |
* May exhibit biases or inaccuracies.
|
| 44 |
|
| 45 |
**Disclaimer:**
|
| 46 |
|
| 47 |
-
Gladiator-Mini-exp-1211 is an experimental model and should be used with caution. Please critically evaluate its outputs and do not take them as absolute truth.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
|
| 9 |
# Gladiator-Mini-exp-1211: A Compact and Powerful Reasoning Engine
|
| 10 |
|
| 11 |
+
**Gladiator-Mini-exp-1211** is a 3-billion parameter language model designed for **complex reasoning tasks**. This experimental model, based on [meta-llama/Llama-3.2-3B-Instruct] [https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct], offers surprisingly strong analytical capabilities for its size. It demonstrates the potential of smaller models to achieve impressive performance in analytical thinking. We chose to finetune on a Llama model due to finetuning difficulties with the Qwen 2.5 4B model.
|
| 12 |
|
| 13 |
**What Makes it Stand Out?**
|
| 14 |
|
|
|
|
| 38 |
|
| 39 |
**Limitations:**
|
| 40 |
|
| 41 |
+
* Requires system prompts for optimal performance. An incorrect system prompt can lead to an incorrect answer, while a correct prompt usually leads to a correct one. An incorrect system prompt example, is using a math reasoning system prompt for a task requiring chain of thought on a tricky text equation.
|
| 42 |
* Reasoning abilities are still being refined.
|
| 43 |
* May exhibit biases or inaccuracies.
|
| 44 |
|
| 45 |
**Disclaimer:**
|
| 46 |
|
| 47 |
+
Gladiator-Mini-exp-1211 is an experimental model and should be used with caution. Please critically evaluate its outputs and do not take them as absolute truth.
|
| 48 |
+
|
| 49 |
+
Base model: https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct
|
| 50 |
+
|
| 51 |
+
Thanks to Meta for the fantastic Llama-3.2-3B model!
|