Update README.md
---
license: apache-2.0
language:
- en
- zh
base_model:
- Qwen/Qwen2.5-14B-Instruct
pipeline_tag: text-generation
tags:
- Education
- K12
---

<div align="center">

<h1 style="font-size: 2.8em; margin-bottom: 0.5em;">师承万象教育大模型(MuduoLLM)</h1>
<h2 style="font-size: 1.8em; color: #666; margin-top: 0;">传承木铎金声,智启教育未来<br>Inheriting Wisdom, Inspiring Future Education</h2>

[License: Apache 2.0](https://opensource.org/licenses/Apache-2.0) · [GitHub: ERC-ITEA/MuduoLLM](https://github.com/ERC-ITEA/MuduoLLM)

</div>

# 简介 | Introduction

师承万象大模型(MuduoLLM)是北京师范大学和北京世纪好未来教育科技有限公司共同研发的首个紧扣新课标知识体系的基础教育大模型,确保所学知识内容与基础教育课程标准高度契合,精准对接学生核心素养培育与教师专业成长需求。在应用层面,基础教育大模型深度融合新课标理念,实现探究启发式智能答疑、素养导向型智能出题、情境沉浸式教案生成,从知识传授转向核心素养培育,助力培养全面发展时代新人。同时,师承万象大模型是当前性能表现较为突出的开源基础教育大模型之一,为开发者提供了可进一步优化的空间。

MuduoLLM is an educational large language model jointly developed by Beijing Normal University and TAL Education Group, and the first basic-education model built tightly around the new curriculum standards knowledge system. It ensures that its knowledge content aligns closely with basic education curriculum standards, precisely meeting the needs of student core competency cultivation and teacher professional development. At the application level, the model deeply integrates new curriculum concepts, enabling inquiry-based intelligent Q&A, competency-oriented question generation, and immersive lesson plan creation, shifting the focus from knowledge transmission to core competency cultivation and helping to nurture well-rounded individuals for the new era. MuduoLLM is also among the stronger open-source educational large language models currently available, leaving developers room for further optimization.

## 模型概述 | Model Overview

- **Base Architecture**: [Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
- **Parameters**: 14 billion (14B)
- **Training Data**: Approximately 400GB of educational-domain text, covering question generation, Q&A, and lesson plans
- **Training Methods**:
  - Domain-specific pretraining: injecting educational domain corpora to enhance semantic understanding
  - Supervised fine-tuning (SFT): targeted optimization for educational scenarios (question generation / Q&A / lesson plan generation)
  - Direct Preference Optimization (DPO): improving generation accuracy and educational-ethics compliance through expert-annotated preference data

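The DPO stage listed above optimizes a contrastive objective over expert-annotated preference pairs. The sketch below only illustrates that per-example objective; it is not the project's training code, and `beta` and the log-probability inputs are placeholders:

```python
import math

def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Per-example Direct Preference Optimization loss.

    Each argument is the total log-probability of the chosen/rejected
    response under the trainable policy or the frozen reference model.
    """
    # Implicit rewards: how much more the policy favors each response
    # than the reference model does.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    # -log(sigmoid(margin)): small when the policy clearly prefers `chosen`.
    margin = chosen_reward - rejected_reward
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# With no preference signal the loss is exactly log(2); it drops below
# log(2) once the policy sides with the annotators.
loss_neutral = dpo_loss(-10.0, -10.0, -10.0, -10.0)
loss_agree = dpo_loss(-10.0, -20.0, -12.0, -18.0)
```

Minimizing this loss pushes the policy to widen the gap between chosen and rejected responses relative to the reference model, which is how the preference data shapes generation quality without a separate reward model.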
## 训练环境 | Training Environment

- **Hardware Configuration**:
  - Number of servers: 4
  - GPU configuration: 8 NVIDIA A800-SXM4-80GB per server (32 total)
  - Single-GPU memory: 80GB
  - Interconnection: NVLink 4.0 (9.6TB/s bandwidth)
  - Parallel strategy: data parallelism + tensor parallelism
- **Software**:
  - Base framework:
    - CUDA: 12.4
    - PyTorch: 2.5.1+cu124
  - Optimization tools:
    - DeepSpeed: 0.15.4 (ZeRO-3 optimizer)
    - FlashAttention
  - Training precision: bfloat16 mixed precision
  - Runtime environment: Conda virtual environment + Weights & Biases monitoring
- **Training Duration**: 10 days
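A DeepSpeed ZeRO-3 + bfloat16 run like the one described above is typically driven by a JSON configuration file. The fragment below is only an illustrative sketch of such a setup, not the project's actual configuration:

```json
{
  "bf16": { "enabled": true },
  "zero_optimization": {
    "stage": 3,
    "overlap_comm": true,
    "stage3_gather_16bit_weights_on_model_save": true
  },
  "gradient_accumulation_steps": "auto",
  "train_micro_batch_size_per_gpu": "auto"
}
```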

# 快速开始 | Quick Start

## 环境要求 | Requirements

- Python 3.10
- PyTorch
- transformers >= 4.37.0

## 安装 | Installation

```bash
# 下载模型 | Download model
huggingface-cli download --resume-download ERC-ITEA/MuduoLLM --local-dir ./muduo-llm/

# 创建环境 | Create environment
conda create --name muduo python=3.10
conda activate muduo

# 安装依赖 | Install dependencies
pip install transformers
```
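Before loading the model, it can help to confirm the versions above are satisfied. A small illustrative helper (not part of the project) that compares dotted version strings, e.g. against `transformers.__version__`:

```python
def meets_min_version(version: str, minimum: str) -> bool:
    """Return True if `version` >= `minimum`, comparing dotted components numerically."""
    to_tuple = lambda v: tuple(int(p) for p in v.split(".")[:3])
    return to_tuple(version) >= to_tuple(minimum)

# e.g. transformers >= 4.37.0, as required above
ok = meets_min_version("4.40.1", "4.37.0")
```

Numeric comparison matters here: a plain string comparison would wrongly rank "4.9.0" above "4.37.0".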

## 使用示例 | Usage Example

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# 加载模型和分词器 | Load model and tokenizer
model_name = "./muduo-llm"  # 本地下载路径,也可用 Hub ID "ERC-ITEA/MuduoLLM" | local download path, or the hub id
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# 准备输入 | Prepare input
prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "你是北京师范大学和好未来开发的人工智能语言模型,名为师承万象。可以回答问题、提供信息、进行对话并帮助解决问题。"},
    {"role": "user", "content": prompt}
]

# 生成回复 | Generate response
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
```
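Since MuduoLLM builds on Qwen2.5, `apply_chat_template` renders the message list in a ChatML-style format before tokenization. The sketch below only illustrates the general shape of that rendering; the authoritative template ships with the tokenizer, and the `<|im_start|>`/`<|im_end|>` token names are the standard Qwen ones:

```python
def render_chatml(messages, add_generation_prompt=True):
    """Roughly how a ChatML-style template flattens messages into one prompt string."""
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages]
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)

text = render_chatml([
    {"role": "system", "content": "You are a helpful tutor."},
    {"role": "user", "content": "What is photosynthesis?"},
])
```

Seeing the flattened prompt makes it clear why `add_generation_prompt=True` matters: it leaves the prompt ending in an open assistant turn for the model to complete.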

# 许可证 | License

This project is licensed under the [Apache 2.0](https://opensource.org/licenses/Apache-2.0) License.

This project is for research purposes only. The project developers are not responsible for any harm or loss caused by using this project (including but not limited to data, models, code, etc.).

# 引用 | Citation

```bibtex
@misc{muduollm2025,
    title={MuduoLLM: A High-Performance LLM for Intelligent Education Solutions},
    author={MuduoLLM Contributors from BNU and TAL},
    year={2025},
    howpublished={\url{https://huggingface.co/ERC-ITEA/MuduoLLM}},
}
```