News Topic Classifier (Asymmetric Loss)

Fine-tuned bert-base-uncased for multi-label news topic classification using Asymmetric Loss (ASL, ICCV 2021). ASL suppresses easy-negative gradients (γ⁻=4, γ⁺=1, margin=0.15), which helps on the long tail of rare topic codes.

Model details


Base model	`bert-base-uncased`
Task	Multi-label classification
Loss	Asymmetric Loss (γ⁺=1, γ⁻=4, margin=0.15)
Number of labels	126 topic codes
Max input length	256 tokens (title + body)
Best val micro-F1	~0.844

Usage

from transformers import BertForSequenceClassification, AutoTokenizer
import torch

tokenizer = AutoTokenizer.from_pretrained("chiunhau/news-classifier-2")
model     = BertForSequenceClassification.from_pretrained("chiunhau/news-classifier-2")
model.eval()

title = "Finland wins gold in ice hockey."
text  = "The Finnish national team claimed victory in the final."
inputs = tokenizer(title, text, return_tensors="pt",
                   truncation=True, max_length=256)
with torch.no_grad():
    logits = model(**inputs).logits

threshold = 0.5
probs     = logits.sigmoid().squeeze()
predicted = [model.config.id2label[i]
             for i, p in enumerate(probs) if p > threshold]
print("Topics:", predicted)

Notes

Input: concatenated title + text fields, separated by the tokenizer's [SEP] token.
Output: raw logits — apply sigmoid() and threshold at 0.5 for binary predictions.
The companion BCE model (chiunhau/news-classifier-1) uses standard Binary Cross-Entropy and achieves slightly higher overall micro-F1.

Downloads last month: 1

Safetensors

Model size

0.1B params

Tensor type

F32