---
language: zh
license: apache-2.0
library_name: transformers
tags:
- text-classification
- sentiment-analysis
- chinese
- movie-review
datasets:
- utmhikari/doubanmovieshortcomments
base_model: hfl/chinese-roberta-wwm-ext
pipeline_tag: text-classification
---

**Chinese Movie Review Sentiment Classification Model (5-Star Rating)**

---

## 1. Model Overview

`H-Z-Ning/Senti-RoBERTa-Mini` is a lightweight Chinese RoBERTa model fine-tuned to assign 1-to-5-star sentiment ratings to short Chinese movie reviews. Built on the `hfl/chinese-roberta-wwm-ext` checkpoint, it keeps a small footprint and fast inference, making it well suited to resource-constrained deployments.

---

## 2. Model Facts

| Item | Details |
|---|---|
| Task | Chinese text classification (sentiment / star rating) |
| Labels | 5 classes (1 star – 5 stars) |
| Base model | [hfl/chinese-roberta-wwm-ext](https://huggingface.co/hfl/chinese-roberta-wwm-ext) |
| Dataset | [Kaggle: Douban Movie Short Comments (≈2 M comments)](https://www.kaggle.com/datasets/utmhikari/doubanmovieshortcomments) |
| Training framework | 🤗 transformers + Trainer |
| Language | Simplified Chinese |
| Parameters | ≈ 102 M (same as base model) |

---

## 3. Quick Start

### 3.1 Install Dependencies

```bash
pip install transformers torch
```

### 3.2 Minimal Inference Example

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

repo = "H-Z-Ning/Senti-RoBERTa-Mini"
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSequenceClassification.from_pretrained(repo)

text = "这个导演真厉害。"  # "This director is really talented."
inputs = tok(text, return_tensors="pt", truncation=True, max_length=256)
with torch.no_grad():
    logits = model(**inputs).logits
pred = int(torch.argmax(logits, dim=-1).item()) + 1  # class index 0..4 -> rating 1..5
print("predicted rating:", pred)
```
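The `argmax` above returns only the single most likely star. Because the labels are ordinal, it can also be useful to convert the logits into class probabilities and a probability-weighted expected rating. A minimal sketch with hypothetical logits (no model download required; the numbers are illustrative, not real model output):

```python
import torch

# Hypothetical logits for one review over the 5 star classes (1..5).
logits = torch.tensor([[-1.2, 0.3, 2.1, 0.8, -0.5]])

probs = torch.softmax(logits, dim=-1)          # class probabilities, sum to 1
stars = torch.arange(1, 6, dtype=probs.dtype)  # ratings 1..5
expected = (probs * stars).sum(dim=-1)         # probability-weighted rating
print(round(expected.item(), 2))               # → 3.12
```

The expected rating (here ≈3.12) is a fractional score that reflects uncertainty between neighboring star classes, which `argmax` discards.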

---

## 4. Training Source Code

**[senti-roberta-mini training source code](https://www.kaggle.com/code/hzning/senti-roberta-mini)**

## 5. Training Details

| Hyper-parameter | Value |
|---|---|
| Base model | hfl/chinese-roberta-wwm-ext |
| Training framework | 🤗 transformers `Trainer` |
| Training set | 150 000 samples (randomly drawn from the ≈2 M comments) |
| Validation set | 15 000 samples (held out from the same random draw) |
| Test set | full original test set |
| Max sequence length | 256 |
| Training epochs | 3 |
| Batch size | 32 (train) / 64 (eval) |
| Learning rate | 2e-5 |
| Optimizer | AdamW |
| Weight decay | 0.01 |
| Scheduler | linear with warmup (warmup_ratio=0.1) |
| Precision | FP16 |
| Best-model criterion | **QWK (↑)**, quadratic weighted kappa |
| Training time | ≈ 120 min on a single P100 (FP16) |
| Logging interval | every 10 steps |
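Quadratic weighted kappa (QWK) penalizes a prediction by the squared distance between predicted and true stars, so confusing 4 with 5 costs far less than confusing 1 with 5 — a natural fit for ordinal ratings. A minimal pure-Python sketch of the metric (the function and sample ratings below are illustrative, not taken from the training code):

```python
from collections import Counter

def quadratic_weighted_kappa(y_true, y_pred, n_classes=5):
    """QWK for ordinal labels 1..n_classes: 1 = perfect, 0 = chance, <0 = worse."""
    n = len(y_true)
    obs = Counter(zip(y_true, y_pred))   # observed confusion counts
    hist_t = Counter(y_true)             # marginal of true labels
    hist_p = Counter(y_pred)             # marginal of predicted labels
    num = 0.0  # weighted observed disagreement
    den = 0.0  # weighted disagreement expected by chance
    for i in range(1, n_classes + 1):
        for j in range(1, n_classes + 1):
            w = (i - j) ** 2 / (n_classes - 1) ** 2  # quadratic penalty
            num += w * obs.get((i, j), 0) / n
            den += w * hist_t.get(i, 0) * hist_p.get(j, 0) / (n * n)
    return 1.0 - num / den

# Hypothetical ratings: perfect agreement gives QWK = 1.0.
print(quadratic_weighted_kappa([1, 2, 3, 4, 5], [1, 2, 3, 4, 5]))  # → 1.0
```

This matches the definition used by `sklearn.metrics.cohen_kappa_score(..., weights="quadratic")`, which can serve as a drop-in alternative.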

---

## 6. Citation

```bibtex
@misc{senti-roberta-mini-2025,
  title={Senti-RoBERTa-Mini: A Mini Chinese RoBERTa for Movie Review Rating},
  author={H-Z-Ning},
  year={2025},
  howpublished={\url{https://huggingface.co/H-Z-Ning/Senti-RoBERTa-Mini}}
}
```

---

## 7. License

This model is released under [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0). The base model `hfl/chinese-roberta-wwm-ext` is also licensed under Apache-2.0.

---

Community contributions and feedback are welcome! If you encounter any issues, please open an [Issue](https://huggingface.co/H-Z-Ning/Senti-RoBERTa-Mini/discussions) or email the author.