Update README.md
Browse files
README.md
CHANGED
|
@@ -127,6 +127,22 @@ curl http://localhost:30000/v1/chat/completions \
|
|
| 127 |
|
| 128 |
```
|
| 129 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 130 |
## Citation
|
| 131 |
```bibtex
|
| 132 |
@article{GroveMoE,
|
|
|
|
| 127 |
|
| 128 |
```
|
| 129 |
|
| 130 |
+
## Best Practices for Model Configuration
|
| 131 |
+
To achieve optimal performance, we recommend the following settings:
|
| 132 |
+
|
| 133 |
+
1. **Sampling Parameters**:
|
| 134 |
+
- We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`.
|
| 135 |
+
⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.
|
| 136 |
+
|
| 137 |
+
2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
|
| 138 |
+
|
| 139 |
+
3. **Standardize Output Format**: We recommend using prompts to standardize model outputs when benchmarking.
|
| 140 |
+
- **Math Problems**: Include "Please reason step by step, and put your final answer within \boxed{}." in the prompt.
|
| 141 |
+
- **Multiple-Choice Questions**: Add the following JSON structure to the prompt to standardize responses: "Please show your choice in the `answer` field with only the choice letter, e.g., `"answer": "C"`."
|
| 142 |
+
|
| 143 |
+
|
| 144 |
+
|
| 145 |
+
|
| 146 |
## Citation
|
| 147 |
```bibtex
|
| 148 |
@article{GroveMoE,
|