|
|
--- |
|
|
license: other |
|
|
datasets: |
|
|
- thu-coai/cdconv |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
metrics: |
|
|
- accuracy |
|
|
- precision |
|
|
- recall |
|
|
- f1 |
|
|
base_model: |
|
|
- google-bert/bert-base-multilingual-cased |
|
|
pipeline_tag: text-classification |
|
|
library_name: transformers |
|
|
tags: |
|
|
- manipulative-language |
|
|
- social-psychology |
|
|
--- |
|
|
|
|
|
# Model Card for mBERT Manipulative Language Detector |
|
|
|
|
|
This model detects **manipulative language** in Chinese and English text, such as covert control, emotional blackmail, and verbal manipulation. It is applicable to scenarios such as social psychology research, text screening, and content moderation.
|
|
|
|
|
## 🧠 Model Details |
|
|
|
|
|
* **Developed by:** LilithHu |
|
|
* **Finetuned from:** google-bert/bert-base-multilingual-cased |
|
|
* **Languages:** Chinese, English
|
|
* **License:** other |
|
|
* **Model type:** text classification (binary classifier: manipulative / non-manipulative)
|
|
|
|
|
## 🔧 Uses |
|
|
|
|
|
### ✅ Direct Use |
|
|
|
|
|
* Given a piece of text, the model returns whether it contains manipulative language.
|
|
* It can be called directly via the Hugging Face Inference API or a web UI (Streamlit).
|
|
|
|
|
### 👥 Intended Users |
|
|
|
|
|
* NLP researchers
|
|
* Content moderation practitioners
|
|
* Psychology researchers
|
|
* Developers of social platforms or dialogue systems
|
|
|
|
|
### 🚫 Out-of-Scope Use |
|
|
|
|
|
* This model is **not suitable** for:
|
|
|
|
|
* Legal adjudication
|
|
* Medical diagnosis
|
|
* High-risk commercial uses such as targeted marketing
|
|
* Judging another person's motives, personality, or emotions
|
|
|
|
|
## ⚠️ Bias, Risks and Limitations |
|
|
|
|
|
Please note:
|
|
|
|
|
* Model outputs are not facts; they are classifications based on patterns in the training data.
|
|
* Judgments of what counts as manipulative language carry inherent subjectivity and cultural bias.
|
|
* The model should not be used to judge specific individuals, their emotions, or the legitimacy of their behavior.
|
|
|
|
|
### ✅ Recommendations
|
|
|
|
|
Users should combine the model with human judgment and interpret text through multiple modalities and channels. Do not trust predictions blindly; treat the model as an assistive tool.
|
|
|
|
|
## 🚀 How to Use |
|
|
|
|
|
```python
from transformers import pipeline

# Load the fine-tuned binary classifier from the Hugging Face Hub
classifier = pipeline("text-classification", model="LilithHu/mbert-manipulative-detector")

# Classify a piece of text; returns a list of {"label": ..., "score": ...} dicts
result = classifier("我爱你")
print(result)
```
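The pipeline returns a list of label/score dictionaries. A minimal sketch of turning that output into a verdict string (the `LABEL_0`/`LABEL_1` names and the 0.5 threshold are assumptions; check the model's `id2label` mapping in `config.json` for the actual label scheme):

```python
# Hypothetical pipeline output; actual label names depend on the model's id2label mapping.
raw = [{"label": "LABEL_1", "score": 0.93}]

def interpret(prediction, threshold=0.5):
    """Map a single pipeline result to a verdict string (assumed label scheme)."""
    label, score = prediction["label"], prediction["score"]
    is_manipulative = label == "LABEL_1" and score >= threshold
    return "manipulative" if is_manipulative else "non-manipulative"

print(interpret(raw[0]))  # "manipulative" under the assumed scheme
```

Raising the threshold trades recall for precision, which may be preferable in moderation settings where false accusations are costly.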
|
|
|
|
|
It can also be called from the terminal:
|
|
|
|
|
```bash |
|
|
curl -X POST https://api-inference.huggingface.co/models/LilithHu/mbert-manipulative-detector \ |
|
|
-H "Authorization: Bearer <your_hf_token>" \ |
|
|
-H "Content-Type: application/json" \ |
|
|
-d '{"inputs": "我爱你"}' |
|
|
``` |
|
|
|
|
|
## 🏋️ Training Details |
|
|
|
|
|
### 📚 Training Data |
|
|
|
|
|
* [CDial-GPT/toy\_valid](https://github.com/thu-coai/CDial-GPT/blob/master/data/toy_valid.txt) |
|
|
* thu-coai/esconv and thu-coai/cdconv datasets
|
|
* A self-built Chinese manipulative-language corpus (not publicly released)
|
|
|
|
|
### ⚙️ Training Procedure |
|
|
|
|
|
* Training platform: Google Colab, GPU: T4
|
|
* Epochs: 3 |
|
|
* Batch size: 32 |
|
|
* Optimizer: AdamW |
|
|
* LR: 2e-5 |
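AdamW at a base LR of 2e-5 is commonly paired with a linear decay schedule, sometimes with warmup. The card does not state the scheduler actually used, so the following is only a sketch of the common default, with hypothetical step counts:

```python
def linear_schedule_lr(step, total_steps, base_lr=2e-5, warmup_steps=0):
    """Linear warmup followed by linear decay to zero (a common fine-tuning default)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)

# Hypothetical: 3 epochs over a 3200-example set at batch size 32 -> 300 optimizer steps
total_steps = 3 * (3200 // 32)
print(linear_schedule_lr(0, total_steps))    # 2e-05 at the start (no warmup)
print(linear_schedule_lr(150, total_steps))  # 1e-05 at the halfway point
```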
|
|
|
|
|
## 📊 Evaluation |
|
|
|
|
|
| Metric | Score | |
|
|
| --------- | ----- | |
|
|
| Accuracy | 0.** | |
|
|
| Precision | 0.** | |
|
|
| Recall | 0.** | |
|
|
| F1-score | 0.** | |
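For reference, the four reported metrics all derive from the binary confusion-matrix counts. A minimal sketch with hypothetical counts (not the model's actual results):

```python
def binary_metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall, and F1 from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

# Hypothetical counts for illustration only
print(binary_metrics(tp=80, fp=10, fn=20, tn=90))
```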
|
|
|
|
|
## 🌍 Environmental Impact |
|
|
|
|
|
* Training took about 3 hours on a Google Colab GPU (T4)
|
|
* Estimated carbon emissions: < 2 kg CO2eq
|
|
|
|
|
## 🔒 Disclaimer |
|
|
|
|
|
* This model is intended for **research and educational use** and must not serve as the basis for legal, ethical, medical, or commercial decisions.
|
|
* Predictions are for reference only; users assume all associated risk.
|
|
* Do not use it for malicious attacks, public-opinion manipulation, or misleading others.
|
|
|
|
|
## 📌 Model Card Authors |
|
|
|
|
|
LilithHu |
|
|
|
|
|
## 📬 Contact |
|
|
|
|
|
For feedback or suggestions, please contact the author via Hugging Face discussions.
|
|
|
|
|
## 📚 Citation |
|
|
|
|
|
```bibtex |
|
|
@misc{LilithHu2025, |
|
|
title={mBERT Manipulative Language Detector}, |
|
|
author={LilithHu}, |
|
|
year={2025}, |
|
|
url={https://huggingface.co/LilithHu/mbert-manipulative-detector} |
|
|
} |
|
|
``` |