Update README.md
Browse files
README.md
CHANGED
|
@@ -21,7 +21,7 @@ license: mit
|
|
| 21 |
|
| 22 |
## Overview
|
| 23 |
|
| 24 |
-
**Omni-R1-Zero** is trained **without multimodal annotations**. It bootstraps **step-wise visualizations** from **text-only CoT seeds** (e.g., M3CoT), and then follows the same
|
| 25 |
|
| 26 |
## Usage
|
| 27 |
|
|
|
|
| 21 |
|
| 22 |
## Overview
|
| 23 |
|
| 24 |
+
**Omni-R1-Zero** is trained **without multimodal annotations**. It bootstraps **step-wise visualizations** from **text-only CoT seeds** (e.g., M3CoT), and then follows the same PeSFT+PeRPO recipe as Omni-R1 to learn interleaved multimodal reasoning.
|
| 25 |
|
| 26 |
## Usage
|
| 27 |
|