AventIQ-AI
/

topic-classification-for-news-title

	# RoBERTa-Base Quantized Model for Topic Classification

	This repository hosts a RoBERTa-Base model fine-tuned for topic classification of news titles on the AG News dataset. After training, the weights were converted to FP16 (half precision) to reduce model size and inference time without significant accuracy loss.

	## Model Details

	- Model Architecture: RoBERTa Base
	- Task: Multi-class Topic Classification (4 classes)
	- Dataset: AG News (Hugging Face Datasets)
	- Quantization: Float16
	- Fine-tuning Framework: Hugging Face Transformers

	---

	## Installation

	```bash
	pip install transformers torch datasets
	```

	---

	## Loading the Model

	```python
	import torch
	from transformers import RobertaForSequenceClassification, RobertaTokenizer, pipeline

	# Use the GPU when one is available, otherwise fall back to CPU
	device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

	# Load the tokenizer and the fine-tuned model from this repository
	model_id = "AventIQ-AI/topic-classification-for-news-title"
	tokenizer = RobertaTokenizer.from_pretrained(model_id)
	model = RobertaForSequenceClassification.from_pretrained(model_id).to(device)

	# Define test sentences
	samples = [
	    "Tensions rise in the Middle East as diplomats gather for emergency talks to prevent further escalation.",
	    "Tesla reports a 25% increase in quarterly revenue, driven by strong demand for its Model Y vehicles in Asia.",
	    "Researchers develop a new quantum computing chip that significantly reduces energy consumption.",
	    "Argentina defeats Brazil 2-1 in the Copa América final, securing their 16th continental title.",
	    "Meta unveils its latest AI model capable of generating 3D virtual environments from text prompts.",
	]

	# Build an inference pipeline around the loaded model
	classifier = pipeline("text-classification", model=model, tokenizer=tokenizer, device=device)

	predictions = classifier(samples)

	# Print results
	for text, pred in zip(samples, predictions):
	    print(f"\nText: {text}\nPredicted Topic: {pred['label']} (Score: {pred['score']:.4f})")
	```
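
Depending on how `id2label` is set in `config.json`, the pipeline may return generic labels such as `LABEL_0` instead of topic names. The sketch below maps such labels to names, assuming the standard AG News class order (0 World, 1 Sports, 2 Business, 3 Sci/Tech). `readable_label` is a hypothetical helper, not part of this repository.

```python
# Assumption: class indices follow the standard AG News order.
AG_NEWS_LABELS = ["World", "Sports", "Business", "Sci/Tech"]


def readable_label(label: str) -> str:
    """Map a generic 'LABEL_<i>' string to a topic name; pass other labels through."""
    prefix = "LABEL_"
    suffix = label[len(prefix):]
    if label.startswith(prefix) and suffix.isdigit():
        index = int(suffix)
        if index < len(AG_NEWS_LABELS):
            return AG_NEWS_LABELS[index]
    return label


# Example with a hand-written prediction in the pipeline's output format
example = {"label": "LABEL_1", "score": 0.99}
print(readable_label(example["label"]))
```

If the model already returns topic names, the helper leaves them unchanged.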

	---

	## Performance Metrics

	- Accuracy: 0.9471
	- Precision: 0.9471
	- Recall: 0.9471
	- F1 Score: 0.9471
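
This card does not state how the per-class scores were averaged. As a rough illustration only, the sketch below computes accuracy and support-weighted precision, recall, and F1 in plain Python; the weighted averaging and the function name `weighted_metrics` are assumptions. Under this averaging, weighted recall equals accuracy by construction.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Return accuracy and support-weighted precision, recall, and F1."""
    n = len(y_true)
    support = Counter(y_true)      # true examples per class
    predicted = Counter(y_pred)    # predictions per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    accuracy = sum(correct.values()) / n
    precision = recall = f1 = 0.0
    for label in sorted(set(y_true) | set(y_pred)):
        p = correct[label] / predicted[label] if predicted[label] else 0.0
        r = correct[label] / support[label] if support[label] else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = support[label] / n  # class share of the evaluation set
        precision += weight * p
        recall += weight * r
        f1 += weight * f
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy labels (not real model output) over the four class indices
toy_true = [0, 0, 1, 1, 2, 3]
toy_pred = [0, 1, 1, 1, 2, 3]
print(weighted_metrics(toy_true, toy_pred))
```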

	---

	## Fine-Tuning Details

	### Dataset

	The dataset is Hugging Face’s `ag_news`. It contains 120,000 training samples and 7,600 test samples, each labeled with one of four categories: World, Sports, Business, or Sci/Tech. The dataset was used as provided; input texts were tokenized with the RoBERTa tokenizer and truncated or padded to a maximum length of 128 tokens.
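
The preprocessing described above could look roughly like the following. This is a sketch, not the original training script: `tokenize_batch` is a hypothetical helper, and the `"text"` column name assumes the standard `ag_news` schema. The download-dependent usage is left as comments.

```python
MAX_LENGTH = 128  # maximum sequence length stated in this section


def tokenize_batch(batch, tokenizer, max_length=MAX_LENGTH):
    """Tokenize a batch of rows, truncating or padding each text to max_length."""
    return tokenizer(
        batch["text"],            # assumption: the text column is named "text"
        truncation=True,          # cut sequences longer than max_length
        padding="max_length",     # pad shorter sequences up to max_length
        max_length=max_length,
    )


# Typical usage (downloads the dataset and tokenizer, so it is commented out):
# from datasets import load_dataset
# from transformers import RobertaTokenizer
# dataset = load_dataset("ag_news")
# tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
# encoded = dataset.map(lambda b: tokenize_batch(b, tokenizer), batched=True)
```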

	### Training

	- Epochs: 3
	- Batch size: 8
	- Learning rate: 2e-5
	- Evaluation strategy: `epoch`
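
For a sense of scale, the hyperparameters above imply the following number of optimizer steps. The calculation assumes a single device and no gradient accumulation, neither of which this card specifies.

```python
import math

# Values taken from this model card
TRAIN_SAMPLES = 120_000
EPOCHS = 3
BATCH_SIZE = 8
LEARNING_RATE = 2e-5


def optimizer_steps(num_samples, batch_size, epochs):
    """Return (steps per epoch, total steps) for one device, no accumulation."""
    per_epoch = math.ceil(num_samples / batch_size)
    return per_epoch, per_epoch * epochs


steps_per_epoch, total_steps = optimizer_steps(TRAIN_SAMPLES, BATCH_SIZE, EPOCHS)
print(f"{steps_per_epoch} steps per epoch, {total_steps} steps in total")
```

With evaluation strategy `epoch`, the model would be evaluated once after each of the three epochs.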

	---

	## Quantization

	After training, the model weights were cast from FP32 to FP16 with PyTorch’s `half()` to reduce model size and inference time.
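
Casting to FP16 halves the storage per weight from 4 bytes to 2. The back-of-the-envelope estimate below assumes roughly 125 million parameters for a RoBERTa-Base model; that figure is an approximation, not a measurement of this repository's `model.safetensors`.

```python
# Assumption: approximate parameter count of a RoBERTa-Base model
APPROX_PARAMS = 125_000_000

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2}


def weights_size_mb(num_params, dtype):
    """Estimate the raw weight size in megabytes for the given dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e6


fp32_mb = weights_size_mb(APPROX_PARAMS, "fp32")
fp16_mb = weights_size_mb(APPROX_PARAMS, "fp16")
print(f"FP32: ~{fp32_mb:.0f} MB, FP16: ~{fp16_mb:.0f} MB")
```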

	---

	## Repository Structure

	```text
	.
	├── config.json # Model configuration
	├── merges.txt # Byte Pair Encoding (BPE) merge rules for tokenizer
	├── model.safetensors # Quantized model weights
	├── README.md # Model documentation
	├── special_tokens_map.json # Tokenizer special tokens
	├── tokenizer_config.json # Tokenizer configuration
	└── vocab.json # Tokenizer vocabulary
	```

	---

	## Limitations

	- The model is trained specifically for four-class topic classification on the AG News dataset and only predicts World, Sports, Business, or Sci/Tech.
	- FP16 quantization may result in slight numerical instability in edge cases.
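
To make the FP16 point concrete, the sketch below uses Python's standard `struct` module, whose `e` format is IEEE 754 half precision, to show rounding, underflow, and overflow. It illustrates the number format only and does not run the model; `to_fp16` is a hypothetical helper.

```python
import struct


def to_fp16(value: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


print(to_fp16(0.1))     # close to, but not exactly, 0.1
print(to_fp16(1e-8))    # too small for FP16, so it rounds to 0.0

try:
    to_fp16(70000.0)    # above the largest finite FP16 value, 65504
except OverflowError as exc:
    # struct raises here; tensor libraries typically produce inf instead
    print("overflow:", exc)
```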


	---

	## Contributing

	Feel free to open issues or submit pull requests to improve the model or documentation.