Update README.md
Browse files
README.md
CHANGED
|
@@ -10,7 +10,6 @@ license: apache-2.0
|
|
| 10 |
<a href="https://infix-ai.com/research/infir2/">🌐 Project Website</a>
|
| 11 |
</p>
|
| 12 |
|
| 13 |
-
## Introduction
|
| 14 |
We performed **Reinforcement Learning (RL)** on the **InfiR2-7B-Instruct-FP8** model using the **dapo-math-17k** and the **FP8 format**, with hyperparameters shown below.
|
| 15 |
|
| 16 |
<div align="center">
|
|
|
|
| 10 |
<a href="https://infix-ai.com/research/infir2/">🌐 Project Website</a>
|
| 11 |
</p>
|
| 12 |
|
|
|
|
| 13 |
We performed **Reinforcement Learning (RL)** on the **InfiR2-7B-Instruct-FP8** model using the **dapo-math-17k** and the **FP8 format**, with hyperparameters shown below.
|
| 14 |
|
| 15 |
<div align="center">
|