Zual commited on
Commit
428c2d7
·
verified ·
1 Parent(s): 0dad7b9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -11
README.md CHANGED
@@ -15,7 +15,7 @@ metrics:
15
 
16
  # THIVLVC: Latin ByT5 Lemmatizer
17
 
18
- **THIVLVC** is a state-of-the-art Latin lemmatizer based on the ByT5 (base) architecture. It was developed by **Luc Pommeret** at **LISN (CNRS)** to provide a high-performance, unified model for diverse Latin corpora.
19
 
20
  ## Performance Analysis
21
 
@@ -53,14 +53,4 @@ print(lemmatize("Amorem canat"))
53
  # Expected Output: "amor cano"
54
  ```
55
 
56
- ## Dataset and Training
57
-
58
- - **Model Architecture**: ByT5-base
59
- - **Author**: Luc Pommeret
60
- - **Institution**: LISN (CNRS, Université Paris-Saclay)
61
- - **Training Data**: Unified corpus including Universal Dependencies gold standard, massive silver data from the Latin Library, and targeted distillation from Gemini.
62
- - **Scope**: Unified lemmatization across multiple historical periods and genres of Latin.
63
-
64
- ## Acknowledgments
65
-
66
  This model was produced by **Luc Pommeret** at LISN (CNRS, Université Paris-Saclay).
 
15
 
16
  # THIVLVC: Latin ByT5 Lemmatizer
17
 
18
+ **THIVLVC** is a state-of-the-art Latin lemmatizer based on the ByT5 (base) architecture. It was developed at **LISN (CNRS)** to provide a high-performance, unified model for diverse Latin corpora.
19
 
20
  ## Performance Analysis
21
 
 
53
  # Expected Output: "amor cano"
54
  ```
55
 
 
 
 
 
 
 
 
 
 
 
56
  This model was produced by **Luc Pommeret** at LISN (CNRS, Université Paris-Saclay).