Chinese_Grammarly / README.md
CodeTed's picture
Update README.md
826116f
|
raw
history blame
875 Bytes
metadata
license: apache-2.0
datasets:
  - CodeTed/CGEDit_dataset
language:
  - zh
metrics:
  - accuracy
library_name: transformers
tags:
  - CGED
  - CSC
pipeline_tag: text2text-generation

CGEDit - Chinese Grammatical Error Diagnosis by Task-Specific Instruction Tuning

CGEDit_model.png

Usage

from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("CodeTed/CGEDit")
model = T5ForConditionalGeneration.from_pretrained("CodeTed/CGEDit")
input_text = '糾正句子裡的錯字: 看完那段文張,我是反對的!'
input_ids = tokenizer(input_text, return_tensors="pt").input_ids
outputs = model.generate(input_ids, max_length=256)
edited_text = tokenizer.decode(outputs[0], skip_special_tokens=True)