inclusionAI
/

Ring-lite

@@ -2,7 +2,7 @@
 # Ring-lite
 <p align="center">
-    <img src="https://huggingface.co/inclusionAI/Ring-lite-distill-preview/resolve/main/ant-bailing.png" width="100"/>
 <p>
 <p align="center">
@@ -11,7 +11,7 @@
 ## Introduction
-Ring-lite is an fully open-source MoE LLM provided by InclusionAI, which has 16.8B parameters with 2.75B activated parameters. It was derived from [Ling-lite-1.5](https://huggingface.co/inclusionAI/Ling-lite-1.5) through a training process involving reasoning SFT, reasoning RL and general SFT. This model delivers performance comparable to [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) on reasoning benchmarks, while activating only one-third of their parameter. . This demonstrates that Ring-lite-distill is a more balanced and versatile model. Additionaly, it maintains competitive latency and throughput compared to other reasoning LLMs of similar size.
 ## Model Downloads
@@ -19,34 +19,15 @@ Ring-lite is an fully open-source MoE LLM provided by InclusionAI, which has 16.
 |     **Model**      | **#Total Params** | **#Activated Params** | **Context Length** | **Download** |
 | :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
-| Ring-lite-distill-preview |       16.8B       |         2.75B         |        64K         |      [🤗 HuggingFace](https://huggingface.co/inclusionAI/Ring-lite-distill) |
 </div>
 ## Evaluation
-In order to fully evaluate the model's performance, we examined Ring-lite-distill-preview in terms of both reasoning ability and general ability.
 ### Reasoning ability
-<div align="center">
-|     **Model**      | **AIME24** | **MATH-500** | **GPQA-diamond** | **LiveCodeBench** |
-| :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
-| DeepSeek-R1-Distill-Qwen-7B (reported) |       55.5       |         92.8         |        49.1           |          37.6       |
-| DeepSeek-R1-Distill-Qwen-7B (reproduce)  |       53.2       |         93.7         |        50.4         |         36.5       |
-| Ring-lite-distill-preview |       56.3       |         93.7         |        46.2        |        31.9       |
-</div>
-### General ability
-<div align="center">
-|     **Model**      | **IFEval**  | **T-eval** | **BFCL_v2** | **MMLU** |
-| :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
-| DeepSeek-R1-Distill-Qwen-7B (reproduce)  |       39.3       |         26.9          | 38.9 | 44.1 |
-| Ring-lite-distill-preview |      75.3 | 81.3 | 63.0 | 63.3 |
-</div>
 More details will be reported in our technical report. [TBD]
 ## Quickstart

 # Ring-lite
 <p align="center">
+    <img src="https://huggingface.co/inclusionAI/Ring-lite/blob/main/ant-bailing.png" width="100"/>
 <p>
 <p align="center">
 ## Introduction
+Ring-lite is an fully open-source MoE LLM provided by InclusionAI, which has 16.8B parameters with 2.75B activated parameters. It was derived from [Ling-lite-1.5](https://huggingface.co/inclusionAI/Ling-lite-1.5) through a training process involving reasoning SFT, reasoning RL and general SFT. This model delivers performance comparable to [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) on reasoning benchmarks, while activating only one-third of their parameters.
 ## Model Downloads
 |     **Model**      | **#Total Params** | **#Activated Params** | **Context Length** | **Download** |
 | :----------------: | :---------------: | :-------------------: | :----------------: | :----------: |
+| Ring-lite-distill-preview |       16.8B       |         2.75B         |        64K         |      [🤗 HuggingFace](https://huggingface.co/inclusionAI/Ring-lite) |
 </div>
 ## Evaluation
+In order to fully evaluate the model's reasoning performance, we examined Ring-lite on several reasoning benchmarks, including MATH-500, AIME-24, AIME-24, Livecodebench, Codeforces and GPQA.
 ### Reasoning ability
 More details will be reported in our technical report. [TBD]
 ## Quickstart