InfiX-ai/InfiR2-7B-Instruct-FP8

baicaihaochi121 committed on Oct 15, 2025

Commit 4e4e430 · verified · 1 Parent(s): 9055bc2

Update README.md

Files changed (1)

README.md +14 -13

@@ -10,19 +10,7 @@ license: apache-2.0
   <a href="https://infix-ai.com/research/infir2/">🌐 Project Website</a> &nbsp;
 </p>
-We performed supervised fine-tuning on the **InfiR2-7B-base-FP8** with FP8 format in two stages using the InfiAlign-SFT-72k and InfiAlign-SFT-165k datasets, with hyperparameters shown in below.
-<div align="center">
-| Parameter | Value |
-| :---: | :---: |
-| **Batch Size** | 64 |
-| **Learning Rate** | 1e-5 |
-| **Minimum Learning Rate** | 1e-6 |
-| **Weight Decay** | 0.05 |
-| **Context Length** | 32k |
-</div>
 The resulting model is the **InfiR2-7B-Instruct-FP8**.
@@ -35,6 +23,19 @@ The resulting model is the **InfiR2-7B-Instruct-FP8**.
 - Stable and Reproducible Performance
 - Efficient and Low memory Training
 ## 🚀 InfiR2 Model Series

   <a href="https://infix-ai.com/research/infir2/">🌐 Project Website</a> &nbsp;
 </p>
+We performed supervised fine-tuning on the **InfiR2-7B-base-FP8** with FP8 format in two stages using the InfiAlign-SFT-72k and InfiAlign-SFT-165k datasets.
 The resulting model is the **InfiR2-7B-Instruct-FP8**.
 - Stable and Reproducible Performance
 - Efficient and Low memory Training
+**Hyperparameters**:
+<div align="center">
+| Parameter | Value |
+| :---: | :---: |
+| **Batch Size** | 64 |
+| **Learning Rate** | 1e-5 |
+| **Minimum Learning Rate** | 1e-6 |
+| **Weight Decay** | 0.05 |
+| **Context Length** | 32k |
+</div>
 ## 🚀 InfiR2 Model Series
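
For readers who want the relocated hyperparameter table in machine-readable form, here is a minimal sketch. The `SFTHyperparameters` class and the `cosine_lr` helper are hypothetical names, not part of the InfiR2 codebase. The README lists a learning rate and a minimum learning rate but does not name the decay schedule, so the cosine decay below is an assumption, as is reading "32k" as 32,768 tokens.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SFTHyperparameters:
    """Values copied from the README table; the field names are illustrative."""
    batch_size: int = 64
    learning_rate: float = 1e-5
    min_learning_rate: float = 1e-6
    weight_decay: float = 0.05
    context_length: int = 32 * 1024  # assumption: "32k" means 32,768 tokens


def cosine_lr(step: int, total_steps: int, hp: SFTHyperparameters) -> float:
    """Cosine decay from hp.learning_rate down to hp.min_learning_rate.

    Assumption: the README does not state which schedule was used.
    """
    # Clamp progress to [0, 1] so out-of-range steps stay within the LR bounds.
    progress = min(max(step / total_steps, 0.0), 1.0)
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return hp.min_learning_rate + (hp.learning_rate - hp.min_learning_rate) * cosine


hp = SFTHyperparameters()
start_lr = cosine_lr(0, 1000, hp)    # begins at the peak learning rate
end_lr = cosine_lr(1000, 1000, hp)   # ends at the minimum learning rate
```

Under this assumed schedule, the learning rate starts at the table's `1e-5` and settles at its `1e-6` floor by the final step.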