Automated Humanizer β€” academic AI-text detector

RoBERTa sequence classifier (human vs. AI-generated) fine-tuned on academic prose. It is the default hf detector backend of the automated_humanizer pipeline, where it gates a humanize β†’ detect β†’ validate rewrite loop.

  • Architecture: RobertaForSequenceClassification, 2 labels
  • Training data: synthetic (AI-draft, human-original) paragraph pairs built from award-winning pre-2020 arXiv papers (see the project's training/ subsystem)

Honest limitation

This detector was trained on the same data family as the project's humanizer, so its confidence is an internal, biased estimate β€” useful as an optimization signal inside the loop, not as independent evidence of authorship. The project README recommends an external detector (e.g. GPTZero) for final sign-off, and test_detectors.py in the repo reproduces why.

Usage

from transformers import pipeline

clf = pipeline("text-classification", model="avasil02/automated-humanizer-detector")
print(clf("Paragraph of academic text..."))

Check the human-class label index once per deployment with the project's calibrate_detector.py.

Downloads last month
15
Safetensors
Model size
0.1B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support