CPCD-Chat-4B / README.md
EdwinUstb's picture
Update README.md
4ca15a0 verified
---
license: apache-2.0
language:
- zh
- en
library_name: transformers
pipeline_tag: text-generation
base_model: Qwen/Qwen3-4B-Base
tags:
- qwen3
- mental-health
- psychological-counseling
- chinese
- long-context
- dialogue
- campus-counseling
---
# CPCD-Chat-8B
CPCD-Chat-8B is a Chinese long-horizon campus psychological counseling dialogue model developed as part of the **Psy-Chronicle** project.
- **Model page**: https://huggingface.co/EdwinUstb/CPCD-Chat-4B
- **Project GitHub**: https://github.com/EdwinUSTB/Psy-Chronicle
- **The Hugging Face paper URL**: https://huggingface.co/papers/2605.22140
- **The arXiv URL**: https://arxiv.org/abs/2605.22140
## Model Description
CPCD-Chat-8B is fine-tuned from **Qwen3-4B-Base** on CPCD, a synthetic Chinese long-horizon campus psychological counseling dialogue dataset.
The model is designed for research on:
- long-horizon psychological counseling dialogue generation;
- campus mental-health support scenarios;
- cross-session counseling memory;
- student stress-event evolution;
- temporal-causal reasoning in counseling conversations.
## Dataset
The model is trained on **CPCD**, a Chinese long-horizon dialogue dataset for college psychological counseling scenarios.
CPCD is generated by the Psy-Chronicle framework, which constructs:
1. structured student profiles;
2. semester-level temporal stress event graphs;
3. cross-session counseling dialogues;
4. structured memory summaries.
Dataset statistics:
| Component | Value |
|---|---:|
| Student profiles | 100 |
| Counseling dialogue units | 90,000 |
| Chinese characters | ~11.45M |
| Scenario | Chinese campus psychological counseling |
## Psy-Chronicle Framework
Psy-Chronicle synthesizes long-horizon counseling trajectories through a structured pipeline:
```text
Student Profile
Temporal Stress Event Graph
Cross-session Counseling Simulation
Structured Memory Update
CPCD Dataset / CPCD-Bench
```
Unlike single-turn or short multi-turn counseling datasets, Psy-Chronicle focuses on how college students' psychological distress accumulates, interacts, and evolves across a semester.
## CPCD-Bench
CPCD-Bench evaluates long-horizon campus counseling capabilities from three dimensions:
| Task | Description |
|---|---|
| Session-level Response | Generate appropriate counselor responses using current context and historical memory |
| Memory Recall | Recall factual information from long counseling histories |
| Temporal-Causal Reasoning | Analyze chronological event development and causal relationships |
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
model_name = "EdwinUstb/CPCD-Chat-4B"
tokenizer = AutoTokenizer.from_pretrained(
model_name,
trust_remote_code=True
)
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True
)
messages = [
{
"role": "user",
"content": "我最近因为学业和家庭压力感到很焦虑,不知道该怎么办。"
}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
outputs = model.generate(
**inputs,
max_new_tokens=512,
temperature=0.7,
top_p=0.9
)
response = tokenizer.decode(
outputs[0][inputs.input_ids.shape[-1]:],
skip_special_tokens=True
)
print(response)
```
## Intended Use
This model is intended for research on:
- psychological counseling dialogue modeling;
- long-horizon dialogue generation;
- cross-session memory modeling;
- campus mental-health support datasets;
- temporal-causal reasoning in counseling scenarios.
## Limitations
CPCD-Chat-8B is trained on synthetic counseling data. It may generate responses that are incomplete, overly generic, or inappropriate in high-risk mental-health situations.
The model should **not** be used as a substitute for professional psychological counseling, clinical diagnosis, or treatment.
## Ethical Considerations
This model is released for research and evaluation purposes only.
Users should be aware that:
- the training data are synthetic and do not represent real counseling records;
- the model may fail to detect or properly handle crisis situations;
- any deployment-oriented use should include professional review, safety monitoring, and clear user-facing disclaimers.
## Citation
If you find this model or project useful, please cite:
```bibtex
@misc{gou2026psychronicle,
title = {Psy-Chronicle: A Structured Pipeline for Synthesizing Long-Horizon Campus Psychological Counseling Dialogues},
author = {Chaogui Gou and Jiarui Liang},
year = {2026},
note = {Preprint},
url = {https://github.com/EdwinUSTB/Psy-Chronicle}
}
```
## License
This model is released under the Apache License 2.0.