qgyd2021/language_identification

qgyd2021 committed on Apr 29, 2024

Commit 0ab3f49 · verified · 1 Parent(s): 0c71652

Update README.md

Files changed (1)

README.md +67 -1

@@ -11,4 +11,70 @@ metrics:
 ---
 ## Language Identification
-This is a language identification model trained with AllenNLP on the [qgyd2021/language_identification](https://huggingface.co/datasets/qgyd2021/language_identification) dataset.
+This is a language identification model trained with AllenNLP on the [qgyd2021/language_identification](https://huggingface.co/datasets/qgyd2021/language_identification) dataset.
+Test code:
+```python
+#!/usr/bin/python3
+# -*- coding: utf-8 -*-
+import argparse
+import time
+from allennlp.models.archival import load_archive
+from allennlp.predictors.text_classifier import TextClassifierPredictor
+from project_settings import project_path
+
+
+def get_args():
+    """
+    python3 step_5_predict_by_archive.py
+    :return:
+    """
+    parser = argparse.ArgumentParser()
+    parser.add_argument(
+        "--text",
+        default="hello guy.",
+        type=str
+    )
+    parser.add_argument(
+        "--archive_file",
+        default=(project_path / "trained_models/language_identification").as_posix(),
+        type=str
+    )
+    args = parser.parse_args()
+    return args
+
+
+def main():
+    args = get_args()
+    archive = load_archive(archive_file=args.archive_file)
+    predictor = TextClassifierPredictor(
+        model=archive.model,
+        dataset_reader=archive.dataset_reader,
+    )
+    json_dict = {
+        "sentence": args.text
+    }
+    begin_time = time.time()
+    outputs = predictor.predict_json(
+        json_dict
+    )
+    label = outputs["label"]
+    prob = round(max(outputs["probs"]), 4)
+    print(label)
+    print(prob)
+    print('time cost: {}'.format(time.time() - begin_time))
+    return
+
+
+if __name__ == '__main__':
+    main()
+```
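
The last step of the test script reads `label` and `probs` from the predictor output and reports the highest probability rounded to four places. A minimal sketch of that post-processing in isolation is shown below. The helper name `summarize_prediction` and the sample dictionary are assumptions made for illustration only. The dictionary is hand-written, not real model output, and no AllenNLP call is made here.

```python
from typing import Any, Dict, Tuple


def summarize_prediction(outputs: Dict[str, Any], ndigits: int = 4) -> Tuple[str, float]:
    """Return the predicted label and its top probability, rounded.

    `outputs` is assumed to have the same shape the test script reads:
    a "label" string and a "probs" list of floats.
    """
    label = outputs["label"]
    probs = outputs["probs"]
    if not probs:
        raise ValueError("'probs' is empty")
    return label, round(max(probs), ndigits)


# Hand-written stand-in for the dict returned by predictor.predict_json(...).
# These values are made up for illustration; they are not real model output.
fake_outputs = {"label": "en", "probs": [0.02, 0.95, 0.03]}

label, prob = summarize_prediction(fake_outputs)
print(label)
print(prob)
```

Keeping this step in a small function makes it possible to check the reporting logic without loading the model archive.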