Update README.md
README.md CHANGED
@@ -11,7 +11,7 @@ license: apache-2.0
 </p>
 
 ## Introduction
-We performed **Reinforcement Learning (RL)**
+We performed **Reinforcement Learning (RL)** on the **InfiR2-7B-Instruct-FP8** model using the **dapo-math-17k** and the **FP8 format**, with hyperparameters shown below.
 
 <div align="center">
 