---
tags:
#- translation
- text2text-generation
- Guyanese Creole
- Caribbean dialect
license: apache-2.0
---
# Guyanese English Creole to English Translator
This model builds on the pre-trained T5-base model and was fine-tuned on a custom dataset to translate Guyanese English Creole to English. It will be updated periodically as more data is compiled. For more on Caribbean English Creoles, check out the [Caribe](https://pypi.org/project/Caribe/) library.
___
# Usage with Transformers
```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Load the fine-tuned tokenizer and model from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained("KES/GEC-English")
model = AutoModelForSeq2SeqLM.from_pretrained("KES/GEC-English")

text = "Ah waan ah phone"
# Prepend the "guy:" prefix to the input text before tokenizing
inputs = tokenizer("guy:" + text, truncation=True, return_tensors="pt")
# Decode with beam search (4 beams)
output = model.generate(inputs["input_ids"], num_beams=4, max_length=512, early_stopping=True)
translation = tokenizer.batch_decode(output, skip_special_tokens=True)
print("".join(translation))  # translation: I want a phone.
```
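To translate several sentences in one call, the same steps can be wrapped in a small helper. The following is a minimal sketch, not part of the model's own API: the names `with_prefix`, `translate_batch`, and `load_translator` are hypothetical, and it assumes that padding a batch works with this checkpoint the same way it does for other T5 models.

```python
PREFIX = "guy:"  # prefix taken from the usage example above


def with_prefix(sentences):
    """Prepend the prefix to every input sentence (hypothetical helper)."""
    return [PREFIX + s for s in sentences]


def translate_batch(sentences, tokenizer, model, num_beams=4, max_length=512):
    """Translate a list of sentences in a single generate() call.

    `tokenizer` and `model` are expected to be the objects returned by
    AutoTokenizer / AutoModelForSeq2SeqLM, as in the usage example above.
    """
    # padding=True pads shorter sentences so the batch forms one tensor
    inputs = tokenizer(
        with_prefix(sentences),
        padding=True,
        truncation=True,
        return_tensors="pt",
    )
    # Passing **inputs forwards the attention mask along with the input ids,
    # which matters once the batch contains padded positions
    outputs = model.generate(
        **inputs,
        num_beams=num_beams,
        max_length=max_length,
        early_stopping=True,
    )
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)


def load_translator(name="KES/GEC-English"):
    """Load the tokenizer and model; the import is local so the helpers
    above can be used and tested without transformers installed."""
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    return (
        AutoTokenizer.from_pretrained(name),
        AutoModelForSeq2SeqLM.from_pretrained(name),
    )
```

With `tokenizer, model = load_translator()`, calling `translate_batch(["Ah waan ah phone"], tokenizer, model)` returns a list with one translated string per input sentence.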
___