olumideola commited on
Commit
7f806db
·
verified ·
1 Parent(s): 83aabe7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -0
README.md CHANGED
@@ -129,6 +129,38 @@ model = AutoModelForCausalLM.from_pretrained(
129
  | bfloat16 | 16GB | 15GB |
130
  | 4-bit (NF4) | 6GB | ~4GB |
131
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
132
  ## License
133
 
134
  [Llama 3.1 Community License](https://llama.meta.com/llama3/license/)
 
129
  | bfloat16 | 16GB | 15GB |
130
  | 4-bit (NF4) | 6GB | ~4GB |
131
 
132
+ ## Recommended Generation Settings
133
+
134
+ ```python
135
+ outputs = model.generate(
136
+ **inputs,
137
+ max_new_tokens=1024,
138
+ do_sample=True,
139
+ temperature=0.6,
140
+ top_p=0.95,
141
+ min_p=0.05,
142
+ repetition_penalty=1.5,
143
+ eos_token_id=[128040, 128009, 128001],
144
+ pad_token_id=128001,
145
+ )
146
+ ```
147
+
148
+ ### Notes
149
+ - Temperature 0.6 (lower than base model) gives more consistent reasoning
150
+ - `<think>` and `</think>` are plain text tokens, not special tokens —
151
+ the model learned them through GRPO training
152
+ - Always include the system prompt instruction to use `<think>` tags
153
+ for reliable reasoning behaviour
154
+
155
+ ### Stop Tokens
156
+ Same as MIST-1-8B — ChatML tokens survived the merge:
157
+
158
+ | Token | ID |
159
+ |---|---|
160
+ | `<\|im_end\|>` | 128040 |
161
+ | `<\|eot_id\|>` | 128009 |
162
+ | `<\|end_of_text\|>` | 128001 |
163
+
164
  ## License
165
 
166
  [Llama 3.1 Community License](https://llama.meta.com/llama3/license/)