Paleography Web Text Triage Classifier

This repository contains the final Logistic Regression classifier for the Paleography Web Text Triage project.

The model classifies short Chinese paleography-related snippets into four labels:

Label Meaning
ksd Scholarly discussion
kpt Primary transcription
kde Dictionary/reference entry
noise Noise or irrelevant text

Method

The system uses intfloat/multilingual-e5-small to encode each text snippet with the prefix:

passage: {text}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support