tiya1012
/

distilka_applied

Text Classification

text-embeddings-inference

Model card Files Files and versions

tiya1012 commited on Nov 23, 2022

Commit

acc1f90

·

1 Parent(s): a835aa8

Create README.md

Files changed (1) hide show

README.md +21 -0

README.md CHANGED Viewed

	@@ -0,0 +1,21 @@

+# Telugu Question-Answering model trained on Tydiqa dataset from Google
+#### How to use
+Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu
+```python
+from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
+model = AutoModelForQuestionAnswering.from_pretrained(model_name)
+tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
+                                          clean_text=False,
+                                          handle_chinese_chars=False,
+                                          strip_accents=False,
+                                          wordpieces_prefix='##')
+nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
+result = nlp({'question': question, 'context': context})
+```
+## Training data
+I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa
+PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.