tiya1012 commited on
Commit
310de7a
·
1 Parent(s): acc1f90

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -13
README.md CHANGED
@@ -1,21 +1,15 @@
1
- # Telugu Question-Answering model trained on Tydiqa dataset from Google
2
 
3
  #### How to use
4
- Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu
5
  ```python
6
- from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
7
- model = AutoModelForQuestionAnswering.from_pretrained(model_name)
8
- tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
9
- clean_text=False,
10
- handle_chinese_chars=False,
11
- strip_accents=False,
12
- wordpieces_prefix='##')
13
- nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
14
- result = nlp({'question': question, 'context': context})
15
  ```
16
 
17
  ## Training data
18
- I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa
 
19
 
20
- PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.
21
 
 
1
+ # code-mixed Kannada-English word-level identification trained on Kanglish dataset from ICON2022
2
 
3
  #### How to use
4
+ Use the below script from your python terminal
5
  ```python
6
+ import transformers
7
+ from transformers import AutoModelForSequenceClassification
8
+ model AutoModelForSequenceClassification.from_pretrained("tiya1012/distilka_applied")
 
 
 
 
 
 
9
  ```
10
 
11
  ## Training data
12
+ I used code-mixed dataset from https://sites.google.com/view/kanglishicon2022/home
13
+
14
 
 
15