Safetensors
qwen2
fp8
juezhi committed on
Commit 2af409f · verified · 1 Parent(s): 244150d

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ license: apache-2.0
   </p>
 
   ## Introduction
- We performed **Reinforcement Learning (RL)** fine-tuning on the **InfiR2-7B-Instruct-FP8** model in two stages using the **dapo-math-17k** and the **FP8 format**, with hyperparameters shown below.
+ We performed **Reinforcement Learning (RL)** on the **InfiR2-7B-Instruct-FP8** model using the **dapo-math-17k** and the **FP8 format**, with hyperparameters shown below.
 
   <div align="center">
 