msislam
/

code-mixed-language-detection-XLMRoberta

Token Classification

Model card Files Files and versions

msislam commited on Jul 11, 2023

Commit

da4d090

·

1 Parent(s): 5c37b23

Update code

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -52,6 +52,7 @@ The training dataset is based on [The Multilingual Amazon Reviews Corpus](https:
 The model can be used as follows:
 ```python
 from transformers import AutoTokenizer, AutoModelForTokenClassification
 tokenizer = AutoTokenizer.from_pretrained("msislam/code-mixed-language-detection-XLMRoberta")
@@ -60,14 +61,14 @@ model = AutoModelForTokenClassification.from_pretrained("msislam/code-mixed-lang
 text = 'Hala Madrid y nada más. It means Go Madrid and nothing more.'
-tokens = tokenizer(text, add_special_tokens= False, return_tensors="pt")
 with torch.no_grad():
   logits = model(**inputs).logits
 labels_predicted = logits.argmax(-1)
-lang_tag_predicted = [model_best.config.id2label[t.item()] for t in labels_predicted[0]]
 lang_tag_predicted
 ```

 The model can be used as follows:
 ```python
+import torch
 from transformers import AutoTokenizer, AutoModelForTokenClassification
 tokenizer = AutoTokenizer.from_pretrained("msislam/code-mixed-language-detection-XLMRoberta")
 text = 'Hala Madrid y nada más. It means Go Madrid and nothing more.'
+inputs = tokenizer(text, add_special_tokens= False, return_tensors="pt")
 with torch.no_grad():
   logits = model(**inputs).logits
 labels_predicted = logits.argmax(-1)
+lang_tag_predicted = [model.config.id2label[t.item()] for t in labels_predicted[0]]
 lang_tag_predicted
 ```