---
license: apache-2.0
language:
  - en
pipeline_tag: text-generation
tags:
  - qwen
  - qwen2.5
  - sentiment-analysis
  - chinese
  - freeze-tuning
  - llama-factory
metrics:
  - accuracy
base_model:
  - Qwen/Qwen2.5-Coder-1.5B-Instruct
---

Qwen2.5-Coder-Sentiment-Freeze

This is a fine-tuned version of Qwen/Qwen2.5-Coder-1.5B-Instruct specialized for Chinese sentiment analysis.

The model was trained with the parameter-efficient freeze method on the ChnSentiCorp dataset: only the last 6 transformer layers were updated, which raised accuracy on the evaluation set from 91.6% to 97.8%.
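Freeze tuning leaves most of the network untouched and updates only a small tail of layers. A minimal PyTorch sketch of the idea (toy linear blocks stand in for the real transformer layers, and the layer count here is illustrative, not the model's actual depth):

```python
import torch.nn as nn

# Toy sketch of freeze tuning: a stack of blocks stands in for the
# transformer layers; everything except the last 6 blocks is frozen.
NUM_LAYERS, NUM_TRAINABLE = 28, 6  # illustrative depth, not the real config

blocks = nn.ModuleList([nn.Linear(16, 16) for _ in range(NUM_LAYERS)])

for i, block in enumerate(blocks):
    trainable = i >= NUM_LAYERS - NUM_TRAINABLE  # keep only the tail trainable
    for p in block.parameters():
        p.requires_grad = trainable

n_trainable = sum(p.numel() for p in blocks.parameters() if p.requires_grad)
n_total = sum(p.numel() for p in blocks.parameters())
print(f"trainable: {n_trainable}/{n_total} parameters")
```

Because gradients are only computed and stored for the trainable tail, this fits in far less GPU memory than full fine-tuning.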

This model is designed to classify Chinese text into positive (1) or negative (0) sentiment and output the result in a clean JSON format.

🚀 Full Tutorial & Repository

This model was trained as part of a comprehensive, beginner-friendly tutorial that walks through every step of the process, from data preparation to evaluation and deployment.

👉 GitHub Repository: IIIIQIIII/MSJ-Factory

The repository includes:

  • 💻 A step-by-step Google Colab notebook.
  • ⚙️ All training and evaluation scripts.
  • 📊 Detailed explanations of the training method and results.

If you find this model or the tutorial helpful, please give the repository a ⭐️ star! It helps support the author's work.


💡 How to Use

This model follows a specific instruction format to ensure reliable JSON output. Use the prompt template below for the best results.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model repository on Hugging Face Hub
model_name = "FutureMa/Qwen2.5-Coder-Sentiment-Freeze"

# Load tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

# --- Define the text for sentiment analysis ---
text = "这个酒店的服务态度非常好,房间也很干净!" # Example: "The service at this hotel was excellent, and the room was very clean!"

# --- Create the prompt using the required template ---
prompt = f"""请对以下中文文本进行情感分析,判断其情感倾向。

任务说明:
- 分析文本表达的整体情感态度
- 判断是正面(1)还是负面(0)

文本内容:
```sentence
{text}
```

输出格式:
```json
{{
  "sentiment": 0 or 1
}}
```"""

messages = [{"role": "user", "content": prompt}]
text_input = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
model_inputs = tokenizer([text_input], return_tensors="pt").to(model.device)

# --- Generate the response ---
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=256,
    do_sample=True,   # sampling must be enabled for temperature to take effect
    temperature=0.1
)
response = tokenizer.batch_decode(generated_ids[:, model_inputs.input_ids.shape[1]:], skip_special_tokens=True)[0]

print(f"Input Text: {text}")
print(f"Model Output:\n{response}")

# Expected output for the example text:
# Input Text: 这个酒店的服务态度非常好,房间也很干净!
# Model Output:
# ```json
# {
#   "sentiment": 1
# }
# ```
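Since the model wraps its answer in a fenced JSON block, a small helper can extract the label for downstream use. This helper is illustrative, not part of any official API:

```python
import json
import re

def parse_sentiment(response: str) -> int:
    """Pull the sentiment label (0 or 1) out of the model's fenced JSON reply."""
    match = re.search(r"\{.*?\}", response, re.DOTALL)
    if match is None:
        raise ValueError("no JSON object found in model output")
    return int(json.loads(match.group(0))["sentiment"])

example = '```json\n{\n  "sentiment": 1\n}\n```'
print(parse_sentiment(example))  # 1
```

Raising on a missing JSON object (rather than returning a default) makes malformed generations easy to catch in a batch pipeline.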

📊 Evaluation Results

The fine-tuned model shows significant improvements across all key metrics compared to the base model. It demonstrates perfect precision, meaning it makes no false positive predictions on the test set.

| Model | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|
| Base Model (Qwen2.5-Coder-1.5B) | 91.62% | 98.57% | 83.13% | 90.20% |
| This Model (Fine-tuned) | 97.77% | 100.00% | 95.18% | 97.53% |
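The four columns follow the standard binary-classification definitions. As a sanity check, the confusion-matrix counts below (illustrative values chosen to be consistent with the reported percentages, not the actual test-set counts) reproduce the fine-tuned model's row:

```python
def binary_metrics(tp: int, fp: int, fn: int, tn: int):
    """Standard binary-classification metrics from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, precision, recall, f1

# Perfect precision means zero false positives (fp == 0).
acc, prec, rec, f1 = binary_metrics(tp=158, fp=0, fn=8, tn=193)
print(f"{acc:.2%} {prec:.2%} {rec:.2%} {f1:.2%}")  # 97.77% 100.00% 95.18% 97.53%
```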

⚙️ Training Details

  • Base model: Qwen/Qwen2.5-Coder-1.5B-Instruct
  • Method: freeze tuning (only the last 6 transformer layers were updated)
  • Dataset: ChnSentiCorp (Chinese sentiment reviews)
  • Framework: LLaMA-Factory

⚠️ Limitations and Bias

  • Task-Specific: This model is highly specialized for Chinese sentiment analysis. It may not perform well on other NLP tasks or languages without further fine-tuning.
  • Binary Classification: The model is designed for binary (positive/negative) classification and does not capture neutral or mixed sentiment.
  • Data Bias: The model inherits any biases present in the ChnSentiCorp dataset, which primarily consists of hotel and product reviews. Its performance may vary on text from different domains.

Citation

If you use this model or the associated tutorial in your work, please cite the repository:

@misc{msj-factory-2025,
  title={Qwen2.5-Coder Sentiment Analysis Fine-tuning Tutorial},
  author={MASHIJIAN},
  year={2025},
  howpublished={\url{https://github.com/IIIIQIIII/MSJ-Factory}}
}

Acknowledgments

This work would not be possible without the incredible open-source tools and models from the community:

  • Qwen Team for the powerful base model.
  • Hugging Face for the transformers library and model hosting.
  • hiyouga for the excellent LLaMA-Factory framework.
  • Google Colab for providing accessible GPU resources.