---
license: cc-by-nc-sa-4.0
---

# Multi-Label Classification Model for the Veritable Records of the Ming and Qing Dynasties

This model performs multi-label classification inference on texts from the *Veritable Records of the Ming Dynasty* (《明实录》) and the *Veritable Records of the Qing Dynasty* (《清实录》). It is built on [Jihuai/bert-ancient-chinese](https://huggingface.co/Jihuai/bert-ancient-chinese), which was further pretrained on public corpora to obtain the Shilu-oriented model [shiluBERT](https://huggingface.co/bztxb/shiluBERT), and then fine-tuned for this classification task.

### Model and Data Sources

- Training data source: the [*Veritable Records of the Joseon Dynasty*](https://sillok.history.go.kr).
- Task type: multi-label text classification.
- Number of training samples: approximately 270,000.

### Evaluation Metrics

| Metric | Value |
|---|---|
| Sample F1 | 0.7209 |
| Sample Precision | 0.7527 |
| Sample Recall | 0.7306 |
| LRAP | 0.8048 |
| Hamming Loss | 0.0070 |
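In the table above, the sample-averaged metrics are computed per passage and then averaged over the evaluation set, while Hamming loss is the fraction of individual label slots predicted incorrectly. A minimal sketch of these definitions, assuming gold and predicted labels are given as 0/1 indicator vectors (the evaluation code itself is not part of this card):

```python
def sample_metrics(y_true, y_pred):
    """Sample-averaged precision/recall/F1 and Hamming loss for
    multi-label predictions given as per-sample 0/1 indicator lists."""
    n = len(y_true)
    prec = rec = f1 = hamming = 0.0
    for t, p in zip(y_true, y_pred):
        tp = sum(1 for a, b in zip(t, p) if a == 1 and b == 1)
        pred_pos, true_pos = sum(p), sum(t)
        sp = tp / pred_pos if pred_pos else 0.0  # sample precision
        sr = tp / true_pos if true_pos else 0.0  # sample recall
        prec += sp
        rec += sr
        f1 += (2 * sp * sr / (sp + sr)) if (sp + sr) else 0.0
        # Hamming loss: share of label slots that disagree with the gold set
        hamming += sum(1 for a, b in zip(t, p) if a != b) / len(t)
    return {"precision": prec / n, "recall": rec / n,
            "f1": f1 / n, "hamming_loss": hamming / n}
```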

### Example Usage

- Try the online Space: [bztxb/shiluInfer](https://huggingface.co/spaces/bztxb/shiluInfer)

![Space usage example](用法示例.jpg)
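For local use, a minimal inference sketch with the `transformers` library is shown below. The checkpoint name reuses the shiluBERT base from this card as a placeholder; the actual fine-tuned classifier checkpoint, the 0.5 decision threshold, and the 512-token limit are all assumptions, so substitute the real checkpoint and settings where they differ.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def decode_multilabel(logits, threshold: float = 0.5):
    """Multi-label decoding: an independent sigmoid per label (unlike
    softmax, several labels may fire at once); keep indices over threshold."""
    return [i for i, z in enumerate(logits) if sigmoid(z) >= threshold]


def classify(text: str,
             model_id: str = "bztxb/shiluBERT",  # placeholder; use the fine-tuned checkpoint
             threshold: float = 0.5):
    """Predict label names for one Shilu passage.

    Requires `torch` and `transformers`; they are imported lazily so the
    decoding helper above stays usable offline.
    """
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_id, problem_type="multi_label_classification"
    )
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        logits = model(**inputs).logits.squeeze(0)
    ids = decode_multilabel(logits.tolist(), threshold)
    return [model.config.id2label[i] for i in ids]  # map indices to label names
```

For batch inference, tokenize a list of passages with `padding=True` and decode each row of logits the same way.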