Update README.md
Browse files
README.md
CHANGED
|
@@ -53,7 +53,7 @@ Skywork-R1V3 is an advanced, open-source Vision-Language Model (VLM) built on se
|
|
| 53 |
|
| 54 |
- **Entropy of Critical Reasoning Tokens**: This unique indicator effectively gauges reasoning capability, guiding checkpoint selection during RL training.
|
| 55 |
|
| 56 |
-
These innovations lead to Broad Reasoning Generalization, allowing our RL-powered approach to successfully extend mathematical reasoning to diverse subject areas. Additionally, our work delves into RL-specific explorations like curriculum learning and learning rate strategies, alongside a broader discussion on multimodal reasoning. For more details, refer to our [๐ R1V3 Report].
|
| 57 |
## 3. Evaluation
|
| 58 |
|
| 59 |
### ๐ Key Results
|
|
|
|
| 53 |
|
| 54 |
- **Entropy of Critical Reasoning Tokens**: This unique indicator effectively gauges reasoning capability, guiding checkpoint selection during RL training.
|
| 55 |
|
| 56 |
+
These innovations lead to Broad Reasoning Generalization, allowing our RL-powered approach to successfully extend mathematical reasoning to diverse subject areas. Additionally, our work delves into RL-specific explorations like curriculum learning and learning rate strategies, alongside a broader discussion on multimodal reasoning. For more details, refer to our [[๐ R1V3 Report](https://github.com/SkyworkAI/Skywork-R1V/blob/main/report/Skywork_R1V3.pdf)]ย .
|
| 57 |
## 3. Evaluation
|
| 58 |
|
| 59 |
### ๐ Key Results
|