Update README.md
Browse files
README.md
CHANGED
|
@@ -14,4 +14,4 @@ base_model:
|
|
| 14 |
|
| 15 |
S1-M-7B-Beta used for developing the algorithm "Simple Test-time Scaling in Multimodal Reasoning". By fine-tuning the base model `Qwen/Qwen2-VL-7B-Instruct` on data with thinking tags `<think>` and `</think>`, the model acquired the `think first, then response` paradigm, allowing for experiments on "Test-time Scaling".
|
| 16 |
|
| 17 |
-
Note: The current model is a development version, not the final official version.
|
|
|
|
| 14 |
|
| 15 |
S1-M-7B-Beta used for developing the algorithm "Simple Test-time Scaling in Multimodal Reasoning". By fine-tuning the base model `Qwen/Qwen2-VL-7B-Instruct` on data with thinking tags `<think>` and `</think>`, the model acquired the `think first, then response` paradigm, allowing for experiments on "Test-time Scaling".
|
| 16 |
|
| 17 |
+
**Note: The current model is a development version, not the final official version.**
|