Grinding's picture
Upload README.md with huggingface_hub
8392978 verified
metadata
language: en
license: apache-2.0
library_name: transformers
pipeline_tag: text-classification
base_model: distilroberta-base
tags:
  - text-classification
  - customer-support
  - sentiment-analysis
  - lora
  - distillation

TicketSense — Sentiment Classifier

Fine-tuned distilroberta-base (82M params) for sentiment classification on real customer-support tweets, with labels distilled from Claude Sonnet 4.6.

Part of the TicketSense project: a tiny specialized classifier that approximates frontier-LLM judgement at a fraction of the cost and latency.

Labels

  • positive
  • neutral
  • negative

Eval

  • Macro F1 (validation): 0.7634

Training

  • Base model: distilroberta-base
  • Method: LoRA (PEFT), then merged
  • LoRA rank: 16, alpha: 32
  • LR: 0.0002, epochs: 4
  • Train data: ~1400 real customer-support tweets, Claude-labeled

Use

from transformers import pipeline
clf = pipeline("text-classification", model="Grinding/ticketsense-sentiment")
clf("my card was charged twice and nobody has responded")

Limitations

  • Trained on noisy real-world Twitter data — picks up typos and slang but may also reflect their biases.
  • Labels are themselves model-generated (Claude Sonnet 4.6) — treat F1 vs Claude as agreement, not gold-standard truth.