InfiX-ai
/

InfiR2-R1-7B-FP8-Preview

Model card Files Files and versions

juezhi commited on Oct 14, 2025

Commit

5d1a0c6

·

verified ·

1 Parent(s): 5f3cb04

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -17,12 +17,12 @@ We performed **Reinforcement Learning (RL)** on the **InfiR2-7B-Instruct-FP8** m
 | Parameter | Value |
 | :---: | :---: |
-| **Batch Size (train\_prompt\_bsz)** | 128 |
 | **N Samples Per Prompt** | 16 |
 | **Global Batch Size** | 2048 |
 | **Maximum Response Length** | 16384 |
 | **Rollout Temperature** | 1.1 |
-| **Learning Rate (LR)** | 1e-6 |
 | **Weight Decay** | 0.1 |
 | **Eps Clip** | 0.2 |
 | **KL Loss Coefficient** | 0.00 |

 | Parameter | Value |
 | :---: | :---: |
+| **Batch Size** | 128 |
 | **N Samples Per Prompt** | 16 |
 | **Global Batch Size** | 2048 |
 | **Maximum Response Length** | 16384 |
 | **Rollout Temperature** | 1.1 |
+| **Learning Rate** | 1e-6 |
 | **Weight Decay** | 0.1 |
 | **Eps Clip** | 0.2 |
 | **KL Loss Coefficient** | 0.00 |