MultivexAI commited on
Commit
e0976c3
·
verified ·
1 Parent(s): 78a47c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -8,7 +8,7 @@ base_model:
8
 
9
  # Gladiator-Mini-exp-1211: A Compact and Powerful Reasoning Engine
10
 
11
- **Gladiator-Mini-exp-1211** is a 3-billion parameter language model designed for **complex reasoning tasks**. This experimental model, based on [meta-llama/Llama-3.2-3B-Instruct](link to the base model), offers surprisingly strong analytical capabilities for its size. It demonstrates the potential of smaller models to achieve impressive performance in analytical thinking. We chose to finetune on a Llama model due to finetuning difficulties with the Qwen 2.5 4B model.
12
 
13
  **What Makes it Stand Out?**
14
 
@@ -38,10 +38,14 @@ We encourage you to experiment with Gladiator-Mini-exp-1211, test its limits, an
38
 
39
  **Limitations:**
40
 
41
- * Requires system prompts for optimal performance. An incorrect system prompt can lead to an incorrect answer, while a correct prompt usually leads to a correct one.
42
  * Reasoning abilities are still being refined.
43
  * May exhibit biases or inaccuracies.
44
 
45
  **Disclaimer:**
46
 
47
- Gladiator-Mini-exp-1211 is an experimental model and should be used with caution. Please critically evaluate its outputs and do not take them as absolute truth.
 
 
 
 
 
8
 
9
  # Gladiator-Mini-exp-1211: A Compact and Powerful Reasoning Engine
10
 
11
+ **Gladiator-Mini-exp-1211** is a 3-billion parameter language model designed for **complex reasoning tasks**. This experimental model, based on [meta-llama/Llama-3.2-3B-Instruct] [https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct], offers surprisingly strong analytical capabilities for its size. It demonstrates the potential of smaller models to achieve impressive performance in analytical thinking. We chose to finetune on a Llama model due to finetuning difficulties with the Qwen 2.5 4B model.
12
 
13
  **What Makes it Stand Out?**
14
 
 
38
 
39
  **Limitations:**
40
 
41
+ * Requires system prompts for optimal performance. An incorrect system prompt can lead to an incorrect answer, while a correct prompt usually leads to a correct one. An incorrect system prompt example, is using a math reasoning system prompt for a task requiring chain of thought on a tricky text equation.
42
  * Reasoning abilities are still being refined.
43
  * May exhibit biases or inaccuracies.
44
 
45
  **Disclaimer:**
46
 
47
+ Gladiator-Mini-exp-1211 is an experimental model and should be used with caution. Please critically evaluate its outputs and do not take them as absolute truth.
48
+
49
+ Base model: https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct
50
+
51
+ Thanks to Meta for the fantastic Llama-3.2-3B model!