Multi-Label Classification Inference Model for the Ming and Qing Veritable Records

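This model performs multi-label classification inference on texts from the Ming Veritable Records (《明实录》) and the Qing Veritable Records (《清实录》). It is fine-tuned for this task from Jihuai/bert-ancient-chinese, with additional pretraining on public corpora to obtain shiluBERT, a pretrained model suited to Veritable Records (shilu) texts.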

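Chinese Version (translated into English)

Model and Data Sources

  • Training data source: Veritable Records of the Joseon Dynasty (《朝鲜王朝实录》);
  • Task type: multi-label text classification;
  • Number of training samples: approximately 270,000.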

Evaluation Metrics

Metric Value
Sample F1 0.7227
Sample Precision 0.7554
Sample Recall 0.7281
LRAP 0.8070
Hamming Loss 0.0069
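
The metrics above are standard multi-label scores. The following is a minimal pure-Python sketch of how such scores are commonly defined; it is not the author's evaluation script, and the function names (`sample_prf`, `hamming_loss`, `lrap`) and the zero-denominator convention are assumptions made for illustration. Because the sample-averaged scores are computed per sample and then averaged, the averaged F1 can fall below both the averaged precision and the averaged recall, which is one plausible reason the reported Sample F1 is lower than both.

```python
from typing import List, Sequence, Set, Tuple


def to_label_sets(rows: Sequence[Sequence[int]]) -> List[Set[int]]:
    """Convert 0/1 indicator rows into sets of active label indices."""
    return [{j for j, v in enumerate(row) if v} for row in rows]


def sample_prf(y_true: Sequence[Sequence[int]],
               y_pred: Sequence[Sequence[int]]) -> Tuple[float, float, float]:
    """Sample-averaged precision, recall and F1.

    Each score is computed per sample, then averaged over samples.
    Assumed convention: an empty denominator contributes 0.
    """
    p_sum = r_sum = f_sum = 0.0
    for truth, pred in zip(to_label_sets(y_true), to_label_sets(y_pred)):
        hits = len(truth & pred)
        p_sum += hits / len(pred) if pred else 0.0
        r_sum += hits / len(truth) if truth else 0.0
        f_sum += 2 * hits / (len(truth) + len(pred)) if (truth or pred) else 0.0
    n = len(y_true)
    return p_sum / n, r_sum / n, f_sum / n


def hamming_loss(y_true: Sequence[Sequence[int]],
                 y_pred: Sequence[Sequence[int]]) -> float:
    """Fraction of individual label decisions that are wrong."""
    total = sum(len(row) for row in y_true)
    wrong = sum(a != b
                for row_t, row_p in zip(y_true, y_pred)
                for a, b in zip(row_t, row_p))
    return wrong / total


def lrap(y_true: Sequence[Sequence[int]],
         y_score: Sequence[Sequence[float]]) -> float:
    """Label ranking average precision over raw per-label scores."""
    total = 0.0
    for truth, scores in zip(y_true, y_score):
        relevant = [j for j, v in enumerate(truth) if v]
        # Assumed convention: samples with no or all labels relevant score 1.
        if not relevant or len(relevant) == len(truth):
            total += 1.0
            continue
        per_label = 0.0
        for j in relevant:
            rank = sum(s >= scores[j] for s in scores)
            relevant_rank = sum(scores[k] >= scores[j] for k in relevant)
            per_label += relevant_rank / rank
        total += per_label / len(relevant)
    return total / len(y_true)
```

For example, with true labels `[[1, 1, 0, 0], [1, 0, 0, 0]]` and predictions `[[1, 0, 0, 0], [1, 1, 0, 0]]`, `sample_prf` yields a precision and recall of 0.75 each but an F1 of about 0.667, and `hamming_loss` yields 0.25.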

Example Usage

English Version

This model performs multi-label classification inference on texts from the Veritable Records of the Ming and Qing dynasties. It is fine-tuned from Jihuai/bert-ancient-chinese, with further pretraining on public corpora to obtain shiluBERT, a pretrained model oriented toward Veritable Records (shilu) texts.

Model and Data Sources

  • Training data source: Veritable Records of the Joseon Dynasty (《朝鲜王朝实录》);
  • Task type: multi-label text classification;
  • Number of training samples: approximately 270,000.

Evaluation Metrics

Metric Value
Sample F1 0.7227
Sample Precision 0.7554
Sample Recall 0.7281
LRAP 0.8070
Hamming Loss 0.0069

Example Usage
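
A minimal inference sketch follows, assuming the checkpoint loads through the Hugging Face `transformers` sequence-classification classes and stores its label names in `config.id2label`. Neither point is confirmed by this card. The repository id `bztxb/shiluBERT` is taken from the model page; the 0.5 threshold, the 512-token limit, and the helper names `decode_logits` and `predict` are illustrative assumptions.

```python
import math
from typing import Dict, List, Sequence, Tuple

MODEL_ID = "bztxb/shiluBERT"  # repository id as shown on the model page
THRESHOLD = 0.5               # assumed cut-off; not stated in the model card


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def decode_logits(logits: Sequence[float],
                  id2label: Dict[int, str],
                  threshold: float = THRESHOLD) -> List[Tuple[str, float]]:
    """Apply an independent sigmoid to each logit and keep the labels whose
    probability reaches the threshold, sorted by descending probability."""
    picked = []
    for idx, logit in enumerate(logits):
        prob = sigmoid(logit)
        if prob >= threshold:
            picked.append((id2label[idx], prob))
    return sorted(picked, key=lambda item: item[1], reverse=True)


def predict(text: str,
            model_id: str = MODEL_ID,
            threshold: float = THRESHOLD) -> List[Tuple[str, float]]:
    """Run one passage through the model and return (label, probability) pairs."""
    # Imported lazily so decode_logits can be used without the heavy dependencies.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Assumption: inputs longer than 512 tokens are simply truncated.
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return decode_logits(logits, model.config.id2label, threshold)
```

Calling `predict(passage)` on a passage from the Veritable Records would return the labels whose probability is at least the threshold. Multi-label models are scored with a per-label sigmoid rather than a softmax, since several labels may apply to one passage; the threshold may need tuning on held-out data.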

Model details: Safetensors format, 0.1B parameters, F32 tensors.