Update README.md
Browse files
README.md
CHANGED
|
@@ -32,7 +32,7 @@ In terms of performance, the hybrid linear model is comparable in overall perfor
|
|
| 32 |
|
| 33 |
## Evaluation
|
| 34 |
|
| 35 |
-
To better demonstrate our model's reasoning capabilities, we compared it with three other models—Ring-mini-2.0, Qwen3-8B-thinking, and GPT-OSS-20B-Medium—on 5 challenging reasoning benchmarks across mathematics, code, and science. We observe that the
|
| 36 |
|
| 37 |
<div style="display: flex; justify-content: center;">
|
| 38 |
<div style="text-align: center;">
|
|
@@ -68,7 +68,7 @@ The results are remarkable. In the prefill stage, Ring-mini-linear-2.0's perform
|
|
| 68 |
|
| 69 |
| **Model** | **#Total Params** | **#Activated Params** | **Context Length** | **Download** |
|
| 70 |
| :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
|
| 71 |
-
| Ring-mini-linear-2.0 |
|
| 72 |
</div>
|
| 73 |
|
| 74 |
## Quickstart
|
|
|
|
| 32 |
|
| 33 |
## Evaluation
|
| 34 |
|
| 35 |
+
To better demonstrate our model's reasoning capabilities, we compared it with three other models—Ring-mini-2.0, Qwen3-8B-thinking, and GPT-OSS-20B-Medium—on 5 challenging reasoning benchmarks across mathematics, code, and science. We observe that the hybrid-linear architecture achieves performance comparable to that of softmax attention.
|
| 36 |
|
| 37 |
<div style="display: flex; justify-content: center;">
|
| 38 |
<div style="text-align: center;">
|
|
|
|
| 68 |
|
| 69 |
| **Model** | **#Total Params** | **#Activated Params** | **Context Length** | **Download** |
|
| 70 |
| :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
|
| 71 |
+
| Ring-mini-linear-2.0 | 16B | 1.4B | 128K | [🤗 HuggingFace](https://huggingface.co/inclusionAI/Ring-mini-linear-2.0) <br>[🤖 Modelscope](https://modelscope.cn/models/inclusionAI/Ring-mini-linear-2.0)|
|
| 72 |
</div>
|
| 73 |
|
| 74 |
## Quickstart
|