cbdb commited on
Commit
0366aed
·
verified ·
1 Parent(s): ca2f00d

Update sample txt file

Browse files
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -16,6 +16,10 @@ license: cc-by-nc-sa-4.0
16
 
17
  Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
18
 
 
 
 
 
19
  ### <font color="IndianRed"> How to use </font>
20
 
21
  Here is how to use this model to get the features of a given text in PyTorch:
 
16
 
17
  Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
18
 
19
+ ### <font color="IndianRed"> Sample input txt file </font>
20
+ The sample input txt file can be downloaded here:
21
+ https://huggingface.co/cbdb/OfficeTitleAddressSplitter/blob/main/vocab.txt
22
+
23
  ### <font color="IndianRed"> How to use </font>
24
 
25
  Here is how to use this model to get the features of a given text in PyTorch: