Translation
Safetensors
French
Latin
t5
ponteineptique commited on
Commit
96e7367
·
verified ·
1 Parent(s): cc65a2d

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ datasets:
4
+ - comma-project/alignement-pairs
5
+ language:
6
+ - fr
7
+ - la
8
+ base_model:
9
+ - google/byt5-small
10
+ pipeline_tag: translation
11
+ examples:
12
+ - text: "⁊ non facimus ĩitatem."
13
+ - text: "Car toutes les lois sont fõ dees cor recon droitu riele pour quoi se les lois ne tont droiture "
14
+ ---
15
+
16
+ # ByT5-Small for Normalization
17
+
18
+ This models allows for normalization of ATR output using CATMuS guidelines, for both Latin and Old French. It fixes spacing, it has tendencies to
19
+ overnormalize and add punctuation.
20
+
21
+ ```py
22
+ from transformers import pipeline
23
+
24
+ pipe = pipeline(
25
+ task="text2text-generation", # change if needed
26
+ model=".", # local directory
27
+ tokenizer="."
28
+ )
29
+ pipe("⁊ non facimus ĩitatem.")
30
+ # [{'generated_text': ' non facimus veritatem.'}]
31
+ ```