EdwinUstb committed on
Commit df413c4 · verified · 1 Parent(s): ca7b1d8

Update README.md

Files changed (1)
  1. README.md +164 -38
README.md CHANGED
@@ -1,52 +1,178 @@
  ---
- frameworks:
- - Pytorch
  license: apache-2.0
- tags: []
- tasks:
- - text-generation

- #model-type:
- ## e.g. gpt, phi, llama, chatglm, baichuan, etc.
- #- gpt

- #domain:
- ## e.g. nlp, cv, audio, multi-modal
- #- nlp

- #language:
- ## list of language codes: https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
- #- cn

- #metrics:
- ## e.g. CIDEr, BLEU, ROUGE, etc.
- #- CIDEr

- #tags:
- ## any custom tags, including training methods such as pretrained, fine-tuned, instruction-tuned, RL-tuned, and others
- #- pretrained

- #tools:
- ## e.g. vllm, fastchat, llamacpp, AdaSeq, etc.
- #- vllm
- ---
- ### The contributors of this model have not provided a more detailed model description. Model files and weights are available on the "Model Files" page.
- #### You can download the model with the following git clone command or with the ModelScope SDK

- SDK download
- ```bash
- # Install ModelScope
- pip install modelscope
  ```
  ```python
- # Download the model with the SDK
- from modelscope import snapshot_download
- model_dir = snapshot_download('gouchaogui/CPCD-Chat')
- ```
- Git download
  ```
- # Download the model with Git
- git clone https://www.modelscope.cn/gouchaogui/CPCD-Chat.git
  ```

- <p style="color: lightgrey;">If you are a contributor to this model, we invite you to complete the model card promptly, following the <a href="https://modelscope.cn/docs/ModelScope%E6%A8%A1%E5%9E%8B%E6%8E%A5%E5%85%A5%E6%B5%81%E7%A8%8B%E6%A6%82%E8%A7%88" style="color: lightgrey; text-decoration: underline;">model contribution documentation</a>.</p>

  ---
  license: apache-2.0
+ language:
+ - zh
+ - en
+ library_name: transformers
+ pipeline_tag: text-generation
+ base_model: Qwen/Qwen3-8B-Base
+ tags:
+ - qwen3
+ - mental-health
+ - psychological-counseling
+ - chinese
+ - long-context
+ - dialogue
+ - campus-counseling
+ ---

+ # CPCD-Chat-8B

+ CPCD-Chat-8B is a Chinese long-horizon campus psychological counseling dialogue model developed as part of the **Psy-Chronicle** project.

+ - **Model page**: https://huggingface.co/EdwinUstb/CPCD-Chat-8B
+ - **Project GitHub**: https://github.com/EdwinUSTB/Psy-Chronicle

+ ## Model Description

+ CPCD-Chat-8B is fine-tuned from **Qwen3-8B-Base** on CPCD, a synthetic Chinese long-horizon campus psychological counseling dialogue dataset.

+ The model is designed for research on:
+
+ - long-horizon psychological counseling dialogue generation;
+ - campus mental-health support scenarios;
+ - cross-session counseling memory;
+ - student stress-event evolution;
+ - temporal-causal reasoning in counseling conversations.
+
+ ## Dataset
+
+ The model is trained on **CPCD**, a Chinese long-horizon dialogue dataset for college psychological counseling scenarios.
+
+ CPCD is generated by the Psy-Chronicle framework, which constructs:
+
+ 1. structured student profiles;
+ 2. semester-level temporal stress event graphs;
+ 3. cross-session counseling dialogues;
+ 4. structured memory summaries.
+
+ Dataset statistics:
+
+ | Component | Value |
+ |---|---:|
+ | Student profiles | 100 |
+ | Counseling dialogue units | 90,000 |
+ | Chinese characters | ~11.45M |
+ | Scenario | Chinese campus psychological counseling |
+
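For a rough sense of scale, the figures in the statistics table imply the averages computed below. This is only a back-of-the-envelope sketch: it assumes the dialogue units are spread evenly across the student profiles and reads "~11.45M" as 11,450,000 characters. The README states neither of these.

```python
# Figures copied from the dataset statistics table.
student_profiles = 100
dialogue_units = 90_000
chinese_characters = 11_450_000  # assumption: "~11.45M" read as 11,450,000

# Averages under the (unstated) assumption of an even spread across profiles.
units_per_profile = dialogue_units / student_profiles
chars_per_unit = chinese_characters / dialogue_units
chars_per_profile = chinese_characters / student_profiles

print(f"dialogue units per profile: {units_per_profile:.0f}")
print(f"characters per dialogue unit: {chars_per_unit:.1f}")
print(f"characters per profile: {chars_per_profile:.0f}")
```

On these assumptions each profile carries on the order of a hundred thousand characters of dialogue, which is what makes cross-session memory relevant for this dataset.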
+ ## Psy-Chronicle Framework
+
+ Psy-Chronicle synthesizes long-horizon counseling trajectories through a structured pipeline:

+ ```text
+ Student Profile
+
+ Temporal Stress Event Graph
+
+ Cross-session Counseling Simulation
+
+ Structured Memory Update
+
+ CPCD Dataset / CPCD-Bench
  ```
+
+ Unlike single-turn or short multi-turn counseling datasets, Psy-Chronicle focuses on how college students' psychological distress accumulates, interacts, and evolves across a semester.
+
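One way to picture the pipeline stages is as plain data structures that are walked in chronological order. The sketch below is purely illustrative: the class names, fields, and the string-based memory update are hypothetical and are not taken from the Psy-Chronicle code base.

```python
from dataclasses import dataclass, field


@dataclass
class StressEvent:
    """One node in a hypothetical temporal stress event graph."""
    event_id: str
    week: int  # week of the semester in which the event occurs
    description: str
    caused_by: list[str] = field(default_factory=list)  # ids of earlier events


@dataclass
class CounselingTrajectory:
    """Profile, event graph, and running memory for one synthetic student."""
    profile: dict[str, str]
    events: list[StressEvent]
    memory: list[str] = field(default_factory=list)

    def sessions(self):
        """Yield (event, memory snapshot) pairs in chronological order."""
        for event in sorted(self.events, key=lambda e: e.week):
            yield event, list(self.memory)
            # Placeholder for the structured memory update after a session.
            self.memory.append(f"week {event.week}: {event.description}")


# Made-up example data, listed out of order on purpose.
trajectory = CounselingTrajectory(
    profile={"year": "sophomore", "major": "engineering"},
    events=[
        StressEvent("e2", week=6, description="failed a midterm", caused_by=["e1"]),
        StressEvent("e1", week=3, description="conflict with roommate"),
    ],
)

for event, memory_before in trajectory.sessions():
    print(event.event_id, memory_before)
```

Each simulated session sees only the memory accumulated from earlier weeks, which mirrors the profile, event graph, session, memory-update ordering in the diagram.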
+ ## CPCD-Bench
+
+ CPCD-Bench evaluates long-horizon campus counseling capabilities along three dimensions:
+
+ | Task | Description |
+ |---|---|
+ | Session-level Response | Generate appropriate counselor responses using current context and historical memory |
+ | Memory Recall | Recall factual information from long counseling histories |
+ | Temporal-Causal Reasoning | Analyze chronological event development and causal relationships |
+
+ ## Usage
+
  ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+ import torch
+
+ model_name = "EdwinUstb/CPCD-Chat-8B"
+
+ tokenizer = AutoTokenizer.from_pretrained(
+     model_name,
+     trust_remote_code=True
+ )
+
+ model = AutoModelForCausalLM.from_pretrained(
+     model_name,
+     torch_dtype=torch.bfloat16,
+     device_map="auto",
+     trust_remote_code=True
+ )
+
+ messages = [
+     {
+         "role": "user",
+         "content": "我最近因为学业和家庭压力感到很焦虑,不知道该怎么办。"  # "Lately I have been feeling very anxious because of academic and family pressure, and I don't know what to do."
+     }
+ ]
+
+ text = tokenizer.apply_chat_template(
+     messages,
+     tokenize=False,
+     add_generation_prompt=True
+ )
+
+ inputs = tokenizer([text], return_tensors="pt").to(model.device)
+
+ outputs = model.generate(
+     **inputs,
+     max_new_tokens=512,
+     temperature=0.7,
+     top_p=0.9
+ )
+
+ response = tokenizer.decode(
+     outputs[0][inputs.input_ids.shape[-1]:],
+     skip_special_tokens=True
+ )
+
+ print(response)
  ```
+
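The usage snippet sends a single user turn. For cross-session experiments, earlier-session memory has to reach the model somehow. The sketch below assumes it is passed as plain text in a system message. The helper name `build_messages` and the memory format are hypothetical, and the format used during CPCD training may differ.

```python
def build_messages(memory_summaries, current_turns, user_message):
    """Assemble a chat message list that carries earlier-session memory.

    Assumption: memory is passed as plain text in a system message. The
    format used when training CPCD-Chat-8B may differ.
    """
    messages = []
    if memory_summaries:
        memory_text = "\n".join(f"- {summary}" for summary in memory_summaries)
        messages.append({
            "role": "system",
            "content": "Summaries of previous counseling sessions:\n" + memory_text,
        })
    messages.extend(current_turns)  # earlier turns of the current session
    messages.append({"role": "user", "content": user_message})
    return messages


messages = build_messages(
    memory_summaries=["Session 1: student reported exam anxiety and poor sleep."],
    current_turns=[],
    user_message="我最近因为学业和家庭压力感到很焦虑,不知道该怎么办。",
)
print([m["role"] for m in messages])
```

The resulting list can be passed to `tokenizer.apply_chat_template` in place of `messages` in the usage snippet. Note that in `transformers`, `temperature` and `top_p` only take effect when sampling is enabled, so if the checkpoint's generation config does not enable it, `do_sample=True` would need to be passed to `generate`.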
+ ## Intended Use
+
+ This model is intended for research on:
+
+ - psychological counseling dialogue modeling;
+ - long-horizon dialogue generation;
+ - cross-session memory modeling;
+ - campus mental-health support datasets;
+ - temporal-causal reasoning in counseling scenarios.
+
+ ## Limitations
+
+ CPCD-Chat-8B is trained on synthetic counseling data. It may generate responses that are incomplete, overly generic, or inappropriate in high-risk mental-health situations.
+
+ The model should **not** be used as a substitute for professional psychological counseling, clinical diagnosis, or treatment.
+
+ ## Ethical Considerations
+
+ This model is released for research and evaluation purposes only.
+
+ Users should be aware that:
+
+ - the training data are synthetic and do not represent real counseling records;
+ - the model may fail to detect or properly handle crisis situations;
+ - any deployment-oriented use should include professional review, safety monitoring, and clear user-facing disclaimers.
+
+ ## Citation
+
+ If you find this model or project useful, please cite:
+
+ ```bibtex
+ @misc{gou2026psychronicle,
+   title = {Psy-Chronicle: A Structured Pipeline for Synthesizing Long-Horizon Campus Psychological Counseling Dialogues},
+   author = {Chaogui Gou and Jiarui Liang},
+   year = {2026},
+   note = {Preprint},
+   url = {https://github.com/EdwinUSTB/Psy-Chronicle}
+ }
  ```

+ ## License
+
+ This model is released under the Apache License 2.0.