FarhanAK128's picture
Update README.md
f1b8e3e verified
---
language: en
license: mit
tags:
- text-classification
- gpt2
- transformers
- pytorch
- custom-architecture
- tiktoken
library_name: transformers
---
# TicketClassificationGPT
## Model Summary
**TicketClassificationGPT** is a GPT-2–based text classification model designed entirely from scratct to classify IT support tickets into 8 predefined categories.
The model uses the original OpenAI GPT-2 architecture and weights, with the language modeling head replaced by a custom classification head. Only the final layers were fine-tuned for the ticket classification task.
This model is fully compatible with the Hugging Face `transformers` ecosystem and can be loaded using `AutoModel.from_pretrained`.
---
## How to Get Started with the Model
### Inference Example (Transformers + tiktoken)
```python
from transformers import AutoModel
import tiktoken
# Load tokenizer
tokenizer = tiktoken.get_encoding("gpt2")
# Load model
model_id = "FarhanAK128/TicketClassificationGPT"
model = AutoModel.from_pretrained(
model_id,
trust_remote_code=True
)
# Example prediction
text = "Need extra space on Google Drive."
prediction = model.predict(text, tokenizer)
print("Predicted class:", prediction) # Predicted class: Storage
```
**Note:** This model uses a custom `.predict()` method defined in the repository and requires `trust_remote_code=True` to function.
---
## Model Details
### πŸ“ Model Description
- **Developed by:** Farhan Ali Khan
- **Model type:** GPT-2–based text classification model
- **Base architecture:** GPT-2 (OpenAI)
- **Framework:** PyTorch
- **Task:** Text Classification
- **Number of classes:** 8
- **Language:** English
- **License:** MIT
- **Finetuned from model:** OpenAI GPT-2
### πŸ“‹ Classification Labels
| Class ID | Category |
|----------|--------------------------|
| 0 | Hardware |
| 1 | HR Support |
| 2 | Access |
| 3 | Miscellaneous |
| 4 | Storage |
| 5 | Purchase |
| 6 | Internal Project |
| 7 | Administrative Rights |
### Model Sources
- **Repository:** https://huggingface.co/FarhanAK128/TicketClassificationGPT
- **Base model:** OpenAI GPT-2 like architecture from scratch
- **Paper:** https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
---
## Training Details
### Training Data
The model was trained on the **IT Service Ticket Classification Dataset** available on Kaggle.
- **Dataset name:** IT Service Ticket Classification Dataset
- **Source:** Kaggle
- **Link:** https://www.kaggle.com/datasets/adisongoh/it-service-ticket-classification-dataset
- **Content:** Labeled IT support ticket text data
- **Language:** English
The dataset was used for supervised multi-class classification after standard text preprocessing and tokenization.
### Training Procedure
- **Base weights:** OpenAI GPT-2
- **Fine-tuning strategy:** Partial fine-tuning (classification head + final transformer layers)
- **Optimizer:** AdamW
- **Learning rate:** 1e-4
- **Weight decay:** 0.1
- **Epochs:** 5
- **Random seed:** 123
- **Loss function:** Cross-Entropy Loss
- **Training regime:** FP32
- **Evaluation frequency:** Every 30 steps
- **Total training time:** ~140 minutes
- **Final training loss:** ~0.61
- **Final validation loss:** ~0.86
### πŸ“ˆ Training Progress
#### Training and Validation Loss
![Training and Validation Loss](https://cdn-uploads.huggingface.co/production/uploads/65bc1af7ce846f8aa908a978/IDm_z1ud6mP35T_8eMHNl.png)
#### Training and Validation Accuracy
![Training and Validation Accuracy](https://cdn-uploads.huggingface.co/production/uploads/65bc1af7ce846f8aa908a978/WwprMh8Ohj3fy0tjnb0Sd.png)
### πŸ“Š Model Performance
| Dataset Split | Accuracy |
|--------------|----------|
| πŸ‹οΈ Training | **76.54%** |
| πŸ§ͺ Validation | **75.67%** |
| 🧠 Test | **73.83%** |
---
## Uses
### Direct Use
This model can be used directly to classify short IT support ticket texts into predefined categories.
Example use cases:
- Automated ticket routing
- Helpdesk prioritization
- Internal IT workflow automation
### Downstream Use
The model may be further fine-tuned on:
- Organization-specific ticket data
- Expanded label sets
- Domain-specific terminology
### Out-of-Scope Use
- Multilingual text classification
- Open-domain topic classification
- Legal, medical, or safety-critical decision-making
---
## Bias, Risks, and Limitations
- Trained on a limited-domain dataset (IT support tickets)
- Not evaluated for demographic or social bias
- Predictions may be unreliable for unseen ticket categories
- Performance depends on input text quality and length
### Recommendations
Human validation is recommended before using predictions in production systems.
For best results, further fine-tuning on in-domain data is advised.
---
## Model Card Authors
**Farhan Ali Khan**
## Model Card Contact
For questions or feedback, please reach out via my Hugging Face profile:
[FarhanAK128](https://huggingface.co/FarhanAK128)