comma-project
/

normalization-byt5-small

Model card Files Files and versions

ponteineptique commited on Dec 22, 2025

Commit

5acb971

·

verified ·

1 Parent(s): 7ffdcfb

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ base_model:
 - google/byt5-small
 pipeline_tag: translation
 examples:
-- text: "⁊ non facimus ĩitatem."
 - text: "Car toutes les lois sont fõ dees cor recon droitu riele pour quoi se les lois ne tont droiture "
 ---
@@ -20,12 +20,13 @@ overnormalize and add punctuation.
 ```py
 from transformers import pipeline
 pipe = pipeline(
     task="text2text-generation",  # change if needed
     model="comma-project/normalization-byt5-small",                  # local directory
     tokenizer="comma-project/normalization-byt5-small"
 )
-pipe("⁊ non facimus ĩitatem.")
-# [{'generated_text': ' non facimus veritatem.'}]
 ```

 - google/byt5-small
 pipeline_tag: translation
 examples:
+- text: "Scͥbo uobiᷤᷤ ñ pauli ł donati."
 - text: "Car toutes les lois sont fõ dees cor recon droitu riele pour quoi se les lois ne tont droiture "
 ---
 ```py
 from transformers import pipeline
+import unicodedata
 pipe = pipeline(
     task="text2text-generation",  # change if needed
     model="comma-project/normalization-byt5-small",                  # local directory
     tokenizer="comma-project/normalization-byt5-small"
 )
+pipe(unicodedata.normalize("NFD", "Scͥbo uobiᷤᷤ ñ pauli ł donati. "))
+# [{'generated_text': 'scribo uobis, non Pauli uel Donati''}]
 ```