Vietnamese Clickbait Detection Model
This model is a fine-tuned version of Llama for Vietnamese clickbait detection.
Model Description
- Model type: Causal Language Model (Fine-tuned for Classification)
- Language: Vietnamese
- Base model: meta-llama/Llama-3.1-8B-Instruct
- Task: Clickbait Detection
- Dataset: Vietnamese clickbait dataset
Usage
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
# Load model and tokenizer
model_name = "PhaaNe/clickbait_KLTN"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype=torch.float16,
device_map="auto"
)
# Example usage
text = "Bạn sẽ không tin được điều này xảy ra!"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)
result = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(result)
Training Details
- Fine-tuned using LoRA (Low-Rank Adaptation)
- Training framework: Transformers + PEFT
- Hardware: GPU-enabled server
Performance
The model achieves good performance on Vietnamese clickbait detection tasks.
Citation
If you use this model, please cite:
@misc{clickbait_kltn_2025,
title={Vietnamese Clickbait Detection using Fine-tuned Llama},
author={PhaaNe},
year={2025},
url={https://huggingface.co/PhaaNe/clickbait_KLTN}
}
- Downloads last month
- 7