Token Classification
Transformers
TensorBoard
Safetensors
xlm-roberta
Generated from Trainer
language-identification
codeswitching
Instructions to use polyglot-tagger/language-identification with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use polyglot-tagger/language-identification with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="polyglot-tagger/language-identification")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("polyglot-tagger/language-identification") model = AutoModelForTokenClassification.from_pretrained("polyglot-tagger/language-identification") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -123,10 +123,10 @@ datasets:
|
|
| 123 |
- wikimedia/wikipedia
|
| 124 |
- HuggingFaceFW/finetranslations
|
| 125 |
- google/smol
|
| 126 |
-
-
|
| 127 |
-
-
|
| 128 |
-
-
|
| 129 |
-
-
|
| 130 |
pipeline_tag: token-classification
|
| 131 |
---
|
| 132 |
|
|
|
|
| 123 |
- wikimedia/wikipedia
|
| 124 |
- HuggingFaceFW/finetranslations
|
| 125 |
- google/smol
|
| 126 |
+
- polyglot-tagger/nlp-noise-snippets
|
| 127 |
+
- polyglot-tagger/wikipedia-language-snippets-filtered
|
| 128 |
+
- polyglot-tagger/finetranslations-filtered
|
| 129 |
+
- polyglot-tagger/tatoeba-filtered
|
| 130 |
pipeline_tag: token-classification
|
| 131 |
---
|
| 132 |
|