Update README.md
README.md
CHANGED
@@ -12,8 +12,8 @@ license: apache-2.0
     <a href="https://huggingface.co/JT-LM/JT-Math-8B-Thinking" target="_blank">
         <img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Models-blue">
     </a>
-
-    <img
+    <a href="https://www.modelscope.cn/models/JiuTian-AI/JT-Math-8B-Thinking" target="_blank">
+        <img src="https://img.shields.io/badge/%F0%9F%A4%96%20ModelScope-Models-blue">
     </a>
 </p>

@@ -28,27 +28,22 @@ For full transparency and reproducibility, please refer to our technical report



+## Model Details

+The performance of **JT-Math-8B-Thinking** stems from a meticulous, multi-stage training approach aimed at tackling complex mathematical challenges with state-of-the-art accuracy. Building on the **JT-Math-8B-Base** model, its training pipeline involved **Supervised Fine-Tuning (SFT)** using a high-quality, bilingual dataset of intricate math problems. This SFT phase leveraged the model's native **32,768-token context window**, enabling it to comprehend lengthy premises, multi-step instructions, and problems with extensive background information right from the start. Following SFT, an advanced **Reinforcement Learning (RL)** phase further refined its reasoning capabilities. This RL process employed a multi-stage curriculum, gradually introducing problems of increasing difficulty, and was specifically engineered to boost the model's focus and accuracy across the entire 32K context window, ensuring the coherence and precision of even the longest reasoning chains.


-**Figure 1: Performance of JT-Math-8B-Thinking on math reasoning benchmarks.**
-
-
-
-## Model Highlights
-
-JT-Math-8B-Thinking achieves its cutting-edge performance on complex mathematical challenges through a rigorous, multi-stage training methodology. Starting with the robust JT-Math-8B-Base model, our pipeline first implemented Supervised Fine-Tuning (SFT). This involved training on a high-quality, bilingual dataset of intricate math problems with 32,768-token context window. Subsequently, an advanced Reinforcement Learning (RL) phase, incorporating a multi-stage curriculum of progressively harder problems, further honed its reasoning abilities.
-



 ## Model Downloads

-We release the following
+We release the following models to support a wide range of applications.

-| Model Name
-| ------------------- |
-| JT-Math-8B-Thinking | 32K
+| Model Name | Context Length | Hugging Face Link | ModelScope Link | Notes |
+| ------------------- | -------------- | ----------------- | --------------- | ----- |
+| JT-Math-8B-Thinking | 32K | [Link](https://huggingface.co/JT-LM/JT-Math-8B-Thinking) | [Link](https://www.modelscope.cn/models/JiuTian-AI/JT-Math-8B-Thinking) | The premier model for complex, long-context reasoning. |
+------


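The new Model Details paragraph leans heavily on the 32,768-token context window. As a quick sanity check, sketched here as an assumption rather than anything from the README itself, the advertised limit can be read out of the released config; `max_position_embeddings` is the conventional field for decoder-only models in `transformers`, though this checkpoint's config may name it differently.

```python
from transformers import AutoConfig

# Hypothetical sanity check (not part of the model card): read the advertised
# context limit from the released config instead of trusting the prose.
config = AutoConfig.from_pretrained("JT-LM/JT-Math-8B-Thinking", trust_remote_code=True)

# max_position_embeddings is the usual field for decoder-only models; this
# architecture may expose a different name, hence the guarded lookup.
print(getattr(config, "max_position_embeddings", "field not present"))  # expected: 32768
```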
@@ -57,7 +52,7 @@ We release the following model to support a wide range of applications.
 ## Evaluation Results

 JT-Math-8B-Thinking achieves competitive performance among open-source models in the ~8B class on mathematical reasoning benchmarks.
-
+



@@ -71,7 +66,7 @@ This example shows how to use the `JT-Math-8B-Thinking` model to solve math problems.
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer

-model_name = "
+model_name = "JT-LM/JT-Math-8B-Thinking"

 tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
 model = AutoModelForCausalLM.from_pretrained(
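The quickstart hunk cuts off mid-call at `from_pretrained(`, so the rest of the example sits outside this diff. Below is a minimal, self-contained sketch of how the snippet plausibly continues, assuming the standard `transformers` chat-template workflow; the prompt and the `max_new_tokens` budget are illustrative choices, not the authors' recommended settings.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "JT-LM/JT-Math-8B-Thinking"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",   # take the dtype recorded in the checkpoint
    device_map="auto",    # place weights on the available GPU(s)
    trust_remote_code=True,
)

# Illustrative prompt; any math problem works here.
messages = [{"role": "user", "content": "What is 17 * 24?"}]
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = tokenizer(text, return_tensors="pt").to(model.device)

# Long-form reasoning can run long; budget generously within the 32K window.
outputs = model.generate(**inputs, max_new_tokens=4096)

# Decode only the newly generated tokens, dropping the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

On a thinking model the decoded text will typically contain the chain of thought before the final answer, which is why the token budget is generous relative to the short question.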