Create README.md
Browse files
README.md
CHANGED
|
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Telugu Question-Answering model trained on Tydiqa dataset from Google
|
| 2 |
+
|
| 3 |
+
#### How to use
|
| 4 |
+
Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu
|
| 5 |
+
```python
|
| 6 |
+
from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
|
| 7 |
+
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
|
| 8 |
+
tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
|
| 9 |
+
clean_text=False,
|
| 10 |
+
handle_chinese_chars=False,
|
| 11 |
+
strip_accents=False,
|
| 12 |
+
wordpieces_prefix='##')
|
| 13 |
+
nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
|
| 14 |
+
result = nlp({'question': question, 'context': context})
|
| 15 |
+
```
|
| 16 |
+
|
| 17 |
+
## Training data
|
| 18 |
+
I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa
|
| 19 |
+
|
| 20 |
+
PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.
|
| 21 |
+
|