SUSTech
/

SUS-Chat-34B

@@ -87,20 +87,20 @@ pipeline_tag: text-generation
 # Inrtoduction
 <img src="https://hackmd.io/_uploads/HJlDtzhBa.png" id="fig-sus"
-alt="Figure 1: DALL·E 2023-12-01 11.03.28 - An imposing, majestic wild boar combined with elements of a futuristic transformer robot. The boar itself should be intricately blended with these tra" />
 **SUS-Chat** is a 34B bilingual Chinese-English dialogue model, jointly
 released by the **Southern University of Science and Technology** and
 **Cognitive Computing and Natural Language Center of International
-Digital Economy Academy (IDEA-CCNL)**. The SUS-Chat-34B model has been
-fine-tuned on millions of high-quality, multilingual instruction data.
-While maintaining the strong language capabilities of the base model,
-the SUS-Chat-34B model has improved the model’s response to human
-instructions through high-quality instruction fine-tuning and excels at
-imitating human thought processes through chains of thought. It
-introduces inter-instruction attention sharing in long texts, expanding
-the window size from 4K to 8K, significantly enhancing the usability of
-multi-round dialogues.
 It has surpassed all models of the same size in almost all benchmark
 tests and is better suited to meet the practical needs of complex
@@ -147,7 +147,7 @@ similar scale and achieved the most advanced comprehensive performance.
 <img
 src="https://github.com/SUSTech-IDEA/SUS-Chat/raw/main/assets/radar.png"
-id="fig-bench" alt="Figure 2: Benchmark" />
 # Usage
@@ -231,5 +231,4 @@ model.
 This model is developed entirely for academic research and free
 commercial use, but it must adhere to the
 [license](https://github.com/SUSTech-IDEA/SUS-Chat/blob/main/MODEL_LICENSE_AGREEMENT.txt)
-from 01-ai.

 # Inrtoduction
 <img src="https://hackmd.io/_uploads/HJlDtzhBa.png" id="fig-sus"
+alt="Figure 1: DALL·E 2023-12-01 11.03.28 - An imposing, majestic wild boar combined with elements of a futuristic transformer robot. The boar itself should be intricately blended with these tra" />
 **SUS-Chat** is a 34B bilingual Chinese-English dialogue model, jointly
 released by the **Southern University of Science and Technology** and
 **Cognitive Computing and Natural Language Center of International
+Digital Economy Academy (IDEA-CCNL)**. This model is based on
+`01-ai/Yi-34B` and has been fine-tuned on millions of high-quality,
+multilingual instruction data. While maintaining the strong language
+capabilities of the base model, the SUS-Chat-34B model has improved the
+model’s response to human instructions through high-quality instruction
+fine-tuning and excels at imitating human thought processes through
+chains of thought. It introduces inter-instruction attention sharing in
+long texts, expanding the window size from 4K to 8K, significantly
+enhancing the usability of multi-round dialogues.
 It has surpassed all models of the same size in almost all benchmark
 tests and is better suited to meet the practical needs of complex
 <img
 src="https://github.com/SUSTech-IDEA/SUS-Chat/raw/main/assets/radar.png"
+id="fig-bench" alt="Figure 2: Benchmark" />
 # Usage
 This model is developed entirely for academic research and free
 commercial use, but it must adhere to the
 [license](https://github.com/SUSTech-IDEA/SUS-Chat/blob/main/MODEL_LICENSE_AGREEMENT.txt)
+from 01-ai.