Paleography Web Text Triage Classifier
This repository contains the final Logistic Regression classifier for the Paleography Web Text Triage project.
The model classifies short Chinese paleography-related snippets into four labels:
| Label | Meaning |
|---|---|
| `ksd` | Scholarly discussion |
| `kpt` | Primary transcription |
| `kde` | Dictionary/reference entry |
| `noise` | Noise or irrelevant text |
Method
The system uses intfloat/multilingual-e5-small to encode each text snippet with the prefix:
passage: {text}
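The prefixing step can be sketched as follows. This is a minimal illustration, not the project's actual code: the helper names `add_passage_prefix` and `encode_snippets` are hypothetical, and loading the encoder through `sentence-transformers` with normalized embeddings is an assumption that the description above does not confirm.

```python
from typing import List

# Encoder named in the Method description.
E5_MODEL_NAME = "intfloat/multilingual-e5-small"
PASSAGE_PREFIX = "passage: "


def add_passage_prefix(texts: List[str]) -> List[str]:
    """Format each snippet as 'passage: {text}' before encoding."""
    return [f"{PASSAGE_PREFIX}{text}" for text in texts]


def encode_snippets(texts: List[str]):
    """Encode prefixed snippets with the E5 encoder.

    Assumption: the model is loaded via sentence-transformers and the
    embeddings are L2-normalized; the source states neither detail.
    """
    # Imported lazily so the prefix helper works without the dependency.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(E5_MODEL_NAME)
    return model.encode(add_passage_prefix(texts), normalize_embeddings=True)
```

Calling `encode_snippets(["..."])` downloads the encoder on first use and returns one embedding row per snippet; those rows would then be the input features for the Logistic Regression classifier.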