tencent
/

Youtu-LLM-2B

@@ -1,6 +1,7 @@
 ---
 library_name: transformers
 license: other
 license_link: https://huggingface.co/tencent/Youtu-LLM-2B/LICENSE.txt
 pipeline_tag: text-generation
 base_model:
@@ -29,6 +30,13 @@ base_model:
 - Context Length: 131,072
 - Vocabulary Size: 128,256
 <a id="benchmarks"></a>
 ## 📊 Performance Comparisons
@@ -77,8 +85,6 @@ base_model:
 ## 🚀 Quick Start
 This guide will help you quickly deploy and invoke the **Youtu-LLM-2B** model. This model supports "Reasoning Mode", enabling it to generate higher-quality responses through Chain of Thought (CoT).
----
 ### 1. Environment Preparation
 Ensure your Python environment has the `transformers` library installed and that the version meets the requirements.
@@ -88,8 +94,6 @@ pip install "transformers>=4.56" torch accelerate
 ```
----
 ### 2. Core Code Example
 The following example demonstrates how to load the model, enable Reasoning Mode, and use the `re` module to parse the "Thought Process" and the "Final Answer" from the output.
@@ -156,8 +160,6 @@ print(f"\n{'='*20} Final Answer {'='*20}\n{final_answer}")
 ```
----
 ### 3. Key Configuration Details
 #### Reasoning Mode Toggle
@@ -181,8 +183,6 @@ Depending on your use case, we suggest adjusting the following hyperparameters f
 > **Tip:** When using Reasoning Mode, a higher `temperature` helps the model perform deeper, more divergent thinking.
----
 ### 4. vLLM Deployment
 We provide support for deploying the model using **vLLM 0.10.2**. The recommended Docker image is `vllm/vllm-openai:v0.10.2`.
@@ -211,7 +211,6 @@ To enable tool calling capabilities, please append the following arguments to th
 ```bash
 --enable-auto-tool-choice --tool-call-parser hermes
 ```
----
 <a id="highlights"></a>

 ---
 library_name: transformers
 license: other
+license_name: youtu-llm
 license_link: https://huggingface.co/tencent/Youtu-LLM-2B/LICENSE.txt
 pipeline_tag: text-generation
 base_model:
 - Context Length: 131,072
 - Vocabulary Size: 128,256
+## 🤗 Model Download
+| Model Name  | Description | Download |
+| ----------- | ----------- |-----------
+| Youtu-LLM-2B-Base  | Base model of Youtu-LLM-2B |🤗 [Model](https://huggingface.co/tencent/Youtu-LLM-2B-Base)|
+| Youtu-LLM-2B | Instruct model of Youtu-LLM-2B | 🤗 [Model](https://huggingface.co/tencent/Youtu-LLM-2B)|
+| Youtu-LLM-2B-GGUF | Instruct model of Youtu-LLM-2B, in GGUF format | 🤗 [Model](https://huggingface.co/tencent/Youtu-LLM-2B-GGUF)|
 <a id="benchmarks"></a>
 ## 📊 Performance Comparisons
 ## 🚀 Quick Start
 This guide will help you quickly deploy and invoke the **Youtu-LLM-2B** model. This model supports "Reasoning Mode", enabling it to generate higher-quality responses through Chain of Thought (CoT).
 ### 1. Environment Preparation
 Ensure your Python environment has the `transformers` library installed and that the version meets the requirements.
 ```
 ### 2. Core Code Example
 The following example demonstrates how to load the model, enable Reasoning Mode, and use the `re` module to parse the "Thought Process" and the "Final Answer" from the output.
 ```
 ### 3. Key Configuration Details
 #### Reasoning Mode Toggle
 > **Tip:** When using Reasoning Mode, a higher `temperature` helps the model perform deeper, more divergent thinking.
 ### 4. vLLM Deployment
 We provide support for deploying the model using **vLLM 0.10.2**. The recommended Docker image is `vllm/vllm-openai:v0.10.2`.
 ```bash
 --enable-auto-tool-choice --tool-call-parser hermes
 ```
 <a id="highlights"></a>