tiya1012 commited on
Commit
acc1f90
·
1 Parent(s): a835aa8

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md CHANGED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Telugu Question-Answering model trained on Tydiqa dataset from Google
2
+
3
+ #### How to use
4
+ Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu
5
+ ```python
6
+ from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
7
+ model = AutoModelForQuestionAnswering.from_pretrained(model_name)
8
+ tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
9
+ clean_text=False,
10
+ handle_chinese_chars=False,
11
+ strip_accents=False,
12
+ wordpieces_prefix='##')
13
+ nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
14
+ result = nlp({'question': question, 'context': context})
15
+ ```
16
+
17
+ ## Training data
18
+ I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa
19
+
20
+ PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.
21
+