Update README.md
Browse files
README.md
CHANGED
|
@@ -18,3 +18,28 @@ Recent advances in large language models (LLMs) have highlighted the potential o
|
|
| 18 |
<div align="center">
|
| 19 |
<img src="home.jpg" width="80%" />
|
| 20 |
</div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
<div align="center">
|
| 19 |
<img src="home.jpg" width="80%" />
|
| 20 |
</div>
|
| 21 |
+
|
| 22 |
+
## Inference Parameters
|
| 23 |
+
### 128k setting:
|
| 24 |
+
```
|
| 25 |
+
temperature=0.85
|
| 26 |
+
top_p=0.95
|
| 27 |
+
top_k=20
|
| 28 |
+
max_tokens=131072
|
| 29 |
+
```
|
| 30 |
+
|
| 31 |
+
### 140k setting (with Yarn)
|
| 32 |
+
```
|
| 33 |
+
temperature=0.85
|
| 34 |
+
top_p=0.95
|
| 35 |
+
top_k=20
|
| 36 |
+
max_tokens=143360
|
| 37 |
+
rope_scaling: {
|
| 38 |
+
"rope_type": "yarn",
|
| 39 |
+
"factor": 1.5,
|
| 40 |
+
"original_max_position_embeddings": 95232
|
| 41 |
+
}
|
| 42 |
+
```
|
| 43 |
+
### Results
|
| 44 |
+
<img src="overl_results.jpg" width="80%" />
|
| 45 |
+
|