---
license: apache-2.0
language:
- zh
- en
library_name: transformers
pipeline_tag: text-generation
base_model: Qwen/Qwen3-4B-Base
tags:
- qwen3
- mental-health
- psychological-counseling
- chinese
- long-context
- dialogue
- campus-counseling
---

# CPCD-Chat-4B

CPCD-Chat-4B is a Chinese long-horizon campus psychological counseling dialogue model developed as part of the **Psy-Chronicle** project.

- **Model page**: https://huggingface.co/EdwinUstb/CPCD-Chat-4B
- **Project GitHub**: https://github.com/EdwinUSTB/Psy-Chronicle
- **Paper (Hugging Face)**: https://huggingface.co/papers/2605.22140
- **Paper (arXiv)**: https://arxiv.org/abs/2605.22140

## Model Description

CPCD-Chat-4B is fine-tuned from **Qwen3-4B-Base** on CPCD, a synthetic Chinese long-horizon campus psychological counseling dialogue dataset.

The model is designed for research on:

- long-horizon psychological counseling dialogue generation;
- campus mental-health support scenarios;
- cross-session counseling memory;
- student stress-event evolution;
- temporal-causal reasoning in counseling conversations.

## Dataset

The model is trained on **CPCD**, a Chinese long-horizon dialogue dataset for college psychological counseling scenarios.

CPCD is generated by the Psy-Chronicle framework, which constructs:

1. structured student profiles;
2. semester-level temporal stress event graphs;
3. cross-session counseling dialogues;
4. structured memory summaries.

Dataset statistics:

| Component | Value |
|---|---:|
| Student profiles | 100 |
| Counseling dialogue units | 90,000 |
| Chinese characters | ~11.45M |
| Scenario | Chinese campus psychological counseling |
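
As a rough sanity check, the per-profile and per-unit averages implied by the table can be derived with simple arithmetic. This is only a sketch: it treats "~11.45M" as exactly 11,450,000 characters and assumes dialogue units are spread evenly across profiles, which the table does not state.

```python
# Figures copied from the statistics table above.
profiles = 100
dialogue_units = 90_000
chinese_chars = 11_450_000  # "~11.45M" in the table, so this is approximate

# Averages under the assumption of an even spread across profiles.
units_per_profile = dialogue_units / profiles
chars_per_unit = chinese_chars / dialogue_units
chars_per_profile = chinese_chars / profiles

print(f"dialogue units per profile: {units_per_profile:.0f}")
print(f"characters per dialogue unit: {chars_per_unit:.1f}")
print(f"characters per profile: {chars_per_profile:.0f}")
```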

## Psy-Chronicle Framework

Psy-Chronicle synthesizes long-horizon counseling trajectories through a structured pipeline:

```text
Student Profile
      ↓
Temporal Stress Event Graph
      ↓
Cross-session Counseling Simulation
      ↓
Structured Memory Update
      ↓
CPCD Dataset / CPCD-Bench
```
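
To make the stages above more concrete, here is a minimal sketch of how the intermediate artifacts might be represented. All class, field, and function names (`StressEvent`, `StudentTrajectory`, `ordered_events`, `update_memory`) are hypothetical and are not taken from the Psy-Chronicle codebase; the sample events are illustrative, not drawn from CPCD.

```python
from dataclasses import dataclass, field


@dataclass
class StressEvent:
    """One node of a (hypothetical) temporal stress event graph."""
    event_id: str
    week: int                      # semester week in which the event occurs
    description: str
    caused_by: list[str] = field(default_factory=list)  # ids of contributing events


@dataclass
class StudentTrajectory:
    """A student profile with its events and accumulated memory summaries."""
    profile: dict[str, str]
    events: list[StressEvent] = field(default_factory=list)
    memory: list[str] = field(default_factory=list)


def ordered_events(events: list[StressEvent]) -> list[StressEvent]:
    """Return events sorted by week, checking that no cause follows its effect."""
    weeks = {e.event_id: e.week for e in events}
    for event in events:
        for cause_id in event.caused_by:
            if cause_id not in weeks:
                raise ValueError(f"unknown cause {cause_id!r}")
            if weeks[cause_id] > event.week:
                raise ValueError(f"{cause_id!r} occurs after {event.event_id!r}")
    return sorted(events, key=lambda e: e.week)


def update_memory(memory: list[str], summary: str, max_items: int = 20) -> list[str]:
    """Append a session summary and keep only the most recent max_items entries."""
    return (memory + [summary])[-max_items:]


# Illustrative data only.
events = [
    StressEvent("e2", week=6, description="Failed a midterm exam", caused_by=["e1"]),
    StressEvent("e1", week=3, description="Conflict with a roommate"),
]
print([e.event_id for e in ordered_events(events)])
```

Keeping causes as explicit event ids makes the temporal-causal structure checkable before any dialogue is simulated; a real pipeline may well use a richer graph representation.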

Unlike single-turn or short multi-turn counseling datasets, Psy-Chronicle focuses on how college students' psychological distress accumulates, interacts, and evolves across a semester.

## CPCD-Bench

CPCD-Bench evaluates long-horizon campus counseling capabilities along three dimensions:

| Task | Description |
|---|---|
| Session-level Response | Generate appropriate counselor responses using current context and historical memory |
| Memory Recall | Recall factual information from long counseling histories |
| Temporal-Causal Reasoning | Analyze chronological event development and causal relationships |

## Usage

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_name = "EdwinUstb/CPCD-Chat-4B"

tokenizer = AutoTokenizer.from_pretrained(
    model_name,
    trust_remote_code=True
)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True
)

messages = [
    {
        "role": "user",
        "content": "我最近因为学业和家庭压力感到很焦虑,不知道该怎么办。"
    }
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

inputs = tokenizer([text], return_tensors="pt").to(model.device)

# temperature and top_p only take effect when sampling is enabled.
outputs = model.generate(
    **inputs,
    max_new_tokens=512,
    do_sample=True,
    temperature=0.7,
    top_p=0.9
)

response = tokenizer.decode(
    outputs[0][inputs.input_ids.shape[-1]:],
    skip_special_tokens=True
)

print(response)
```
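
For cross-session use, one option is to pass summaries of earlier sessions alongside the current conversation. The helper below is a sketch under assumptions: `build_messages` and its parameters are hypothetical, and this card does not state whether the model was trained with memory supplied in a system message, so the format may need adjusting to match the training data. For a Chinese-language model, a Chinese `memory_header` is probably more appropriate than the English default.

```python
def build_messages(
    memory_summaries,
    history,
    user_message,
    memory_header="Summaries of earlier counseling sessions:",
):
    """Assemble a chat message list with optional cross-session memory.

    memory_summaries: list of strings, one per earlier session.
    history: list of {"role", "content"} dicts from the current session.
    user_message: the student's latest message.
    """
    messages = []
    if memory_summaries:
        bullet_list = "\n".join(f"- {summary}" for summary in memory_summaries)
        messages.append(
            {"role": "system", "content": f"{memory_header}\n{bullet_list}"}
        )
    messages.extend(history)
    messages.append({"role": "user", "content": user_message})
    return messages


# Illustrative call; the summary text is made up for this example.
messages = build_messages(
    memory_summaries=["Session 1: the student reported exam anxiety."],
    history=[],
    user_message="我最近因为学业和家庭压力感到很焦虑,不知道该怎么办。",
)
print(messages)
```

The resulting list can be passed to `tokenizer.apply_chat_template` in place of the single-turn `messages` shown earlier.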

## Intended Use

This model is intended for research on:

- psychological counseling dialogue modeling;
- long-horizon dialogue generation;
- cross-session memory modeling;
- campus mental-health support datasets;
- temporal-causal reasoning in counseling scenarios.

## Limitations

CPCD-Chat-4B is trained on synthetic counseling data. It may generate responses that are incomplete, overly generic, or inappropriate in high-risk mental-health situations.

The model should **not** be used as a substitute for professional psychological counseling, clinical diagnosis, or treatment.

## Ethical Considerations

This model is released for research and evaluation purposes only.

Users should be aware that:

- the training data are synthetic and do not represent real counseling records;
- the model may fail to detect or properly handle crisis situations;
- any deployment-oriented use should include professional review, safety monitoring, and clear user-facing disclaimers.

## Citation

If you find this model or project useful, please cite:

```bibtex
@misc{gou2026psychronicle,
  title  = {Psy-Chronicle: A Structured Pipeline for Synthesizing Long-Horizon Campus Psychological Counseling Dialogues},
  author = {Chaogui Gou and Jiarui Liang},
  year   = {2026},
  note   = {Preprint},
  url    = {https://github.com/EdwinUSTB/Psy-Chronicle}
}
```

## License

This model is released under the Apache License 2.0.