UGARIT commited on
Commit
38675d2
·
verified ·
1 Parent(s): a7e9de8

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -0
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - pt
4
+ - grc
5
+ base_model:
6
+ - UGARIT/grc-alignment
7
+ ---
8
+
9
+ # Automatic word-level translation alignment of Ancient Greek and Portuguese parallel texts
10
+ Translation alignment is the process of mapping translation equivalents between two texts in different languages. It can be performed on different granularity levels, form document, paragraph,
11
+ sentence, and word/phrase levels.
12
+
13
+ This model has been developed in the context of a project called **Linking Ancient Languages to Portuguese and Improving an Automatic Translation Alignment Model**
14
+ (**Letras clássicas digitais: interligando línguas antigas ao português e aprimorando um modelo automático de alinhamento de tradução**).
15
+
16
+ **Funding agency**: FAPESP proc.# 2022/09490-0 <br>
17
+ **Host Institution**: Faculty of Sciences and Letters, UNESP/Araraquara <br>
18
+ Grant within the eScience Technological Innovation Programs - Research Program in eScience and Data Science / Research Project - Regular<br>
19
+
20
+ The main objective of the project was to develop an automatic translation aligner specifically for Ancient Greek
21
+ and Brazilian Portuguese. This implied the following tasks: a) to generate manual alignments based on the gold
22
+ standard guidelines, Ancient Greek texts with parallel Brazilian Portuguese translations, creating pairs of
23
+ alignments aimed at b) the supervised training of automatic alignment models for the two languages, so that
24
+ they can be used to create automatic alignments. The motivation for having an automatic aligner for
25
+ Greek-Portuguese translations includes the possibility of generating a larger quantity of aligned texts for
26
+ reading Ancient Greek by Portuguese speakers, expanding the dynamic lexicon relating to the two languages,
27
+ as well as composing a digital corpus that can be used in other services.
28
+
29
+ # Usage
30
+ This [colab notebook](https://colab.research.google.com/drive/1jmMDvaTQcfUBL8YcOUAWv1A-NRuPEdf9?usp=sharing) contains the necessary code to use the model.
31
+
32
+ # Citation:
33
+ ```
34
+ @inproceedings{youseftranslation,
35
+ title={Translation Alignment for Ancient Greek and Portuguese},
36
+ author={Yousef, Tariq and Ferreira, Anise D’Orange and dos Reis, Michel F and Palladino, Chiara}
37
+ journal={DH2024 Book of Abstracts},
38
+ year={2024}
39
+ pages={754--756},
40
+ }