scdong
/

distilbert-response-type

Model card Files Files and versions

distilbert-response-type / README.md

scdong's picture

Update README.md

289325a verified 9 months ago

|

history blame contribute delete

2.52 kB

	---
	license: mit
	language:
	- en
	---
	# 🧠 DistilBERT Response Type Classifier

	This is a fine-tuned [DistilBERT](https://huggingface.co/distilbert-base-uncased) model designed to classify patient messages into one of four mental health support categories:

	- advice
	- information
	- question
	- validation

	It is used as part of the [Mental Health Counselor Assistant](https://huggingface.co/spaces/scdong/counselor-assistant) app to help generate helpful, therapeutic responses.

	## 💼 Use Case

	Given a short text input from a patient, this model predicts the most appropriate type of response a mental health counselor might provide.

	### Example:
	```python
	from transformers import DistilBertForSequenceClassification, DistilBertTokenizerFast
	import torch

	model = DistilBertForSequenceClassification.from_pretrained("scdong/distilbert-response-type")
	tokenizer = DistilBertTokenizerFast.from_pretrained("scdong/distilbert-response-type")

	text = "I just feel so overwhelmed lately"
	inputs = tokenizer(text, return_tensors="pt")
	with torch.no_grad():
	logits = model(**inputs).logits

	predicted_label = torch.argmax(logits, dim=1).item()
	print(predicted_label) # Maps to: 0=advice, 1=information, 2=question, 3=validation
	```

	The model is used to route text to custom prompt templates like:
	- Advice prompt: “You are a licensed counselor. What supportive advice would you give to someone who said: {msg}?”
	- Validation prompt: “You are an empathetic therapist. Validate the client’s emotions in response to: {msg}”

	## 📁 Files

	This repo includes:
	- `config.json` — model architecture config
	- `model.safetensors` — trained model weights
	- `tokenizer_config.json`, `tokenizer.json`, `vocab.txt` — tokenizer files
	- `special_tokens_map.json` — optional token mappings
	- `training_args.bin` — training metadata (optional)

	## 🧪 Training Details

	The model was fine-tuned using a balanced dataset labeled with response types based on:
	- [Kaggle Mental Health Conversations](https://www.kaggle.com/datasets/ayaanalahmed/mental-health-conversations)
	- [CounselChat dataset](https://github.com/nbertagnolli/counsel-chat)
	- [PAIR dataset](https://lit.eecs.umich.edu/downloads.html#PAIR)

	The final model was validated on a held-out test set and integrated into the counselor assistant tool.

	## 📜 License

	This model is released under an open license for research and educational purposes. Please use responsibly and do not deploy for unsupervised clinical use.