language:
  - tr
license: apache-2.0
library_name: transformers
tags:
  - part-of-speech
  - token-classification
datasets:
  - universal_dependencies
metrics:
  - accuracy
model-index:
  - name: xlm-roberta-base-ft-udpos28-tr
    results:
      - task:
          type: token-classification
          name: Part-of-Speech Tagging
        dataset:
          name: Universal Dependencies v2.8
          type: universal_dependencies
        metrics:
          - type: accuracy
            value: 74.4
            name: English Test accuracy
          - type: accuracy
            value: 73.7
            name: Dutch Test accuracy
          - type: accuracy
            value: 73.5
            name: German Test accuracy
          - type: accuracy
            value: 73.2
            name: Italian Test accuracy
          - type: accuracy
            value: 71.4
            name: French Test accuracy
          - type: accuracy
            value: 71.1
            name: Spanish Test accuracy
          - type: accuracy
            value: 77.9
            name: Russian Test accuracy
          - type: accuracy
            value: 74.5
            name: Swedish Test accuracy
          - type: accuracy
            value: 69.2
            name: Norwegian Test accuracy
          - type: accuracy
            value: 73.8
            name: Danish Test accuracy
          - type: accuracy
            value: 45.8
            name: Low Saxon Test accuracy
          - type: accuracy
            value: 39.8
            name: Akkadian Test accuracy
          - type: accuracy
            value: 80.9
            name: Armenian Test accuracy
          - type: accuracy
            value: 62.9
            name: Welsh Test accuracy
          - type: accuracy
            value: 63.7
            name: Old East Slavic Test accuracy
          - type: accuracy
            value: 71.5
            name: Albanian Test accuracy
          - type: accuracy
            value: 62.3
            name: Slovenian Test accuracy
          - type: accuracy
            value: 41.3
            name: Guajajara Test accuracy
          - type: accuracy
            value: 68
            name: Kurmanji Test accuracy
          - type: accuracy
            value: 88.4
            name: Turkish Test accuracy
          - type: accuracy
            value: 81.1
            name: Finnish Test accuracy
          - type: accuracy
            value: 71.5
            name: Indonesian Test accuracy
          - type: accuracy
            value: 76.8
            name: Ukrainian Test accuracy
          - type: accuracy
            value: 74.3
            name: Polish Test accuracy
          - type: accuracy
            value: 76.7
            name: Portuguese Test accuracy
          - type: accuracy
            value: 81.1
            name: Kazakh Test accuracy
          - type: accuracy
            value: 68.2
            name: Latin Test accuracy
          - type: accuracy
            value: 47.5
            name: Old French Test accuracy
          - type: accuracy
            value: 62.6
            name: Buryat Test accuracy
          - type: accuracy
            value: 24.6
            name: Kaapor Test accuracy
          - type: accuracy
            value: 63.7
            name: Korean Test accuracy
          - type: accuracy
            value: 82
            name: Estonian Test accuracy
          - type: accuracy
            value: 72.3
            name: Croatian Test accuracy
          - type: accuracy
            value: 24.1
            name: Gothic Test accuracy
          - type: accuracy
            value: 41.1
            name: Swiss German Test accuracy
          - type: accuracy
            value: 23
            name: Assyrian Test accuracy
          - type: accuracy
            value: 45.2
            name: North Sami Test accuracy
          - type: accuracy
            value: 36
            name: Naija Test accuracy
          - type: accuracy
            value: 80
            name: Latvian Test accuracy
          - type: accuracy
            value: 55.9
            name: Chinese Test accuracy
          - type: accuracy
            value: 56.2
            name: Tagalog Test accuracy
          - type: accuracy
            value: 30
            name: Bambara Test accuracy
          - type: accuracy
            value: 81.2
            name: Lithuanian Test accuracy
          - type: accuracy
            value: 72.4
            name: Galician Test accuracy
          - type: accuracy
            value: 57
            name: Vietnamese Test accuracy
          - type: accuracy
            value: 80.2
            name: Greek Test accuracy
          - type: accuracy
            value: 69.1
            name: Catalan Test accuracy
          - type: accuracy
            value: 75.8
            name: Czech Test accuracy
          - type: accuracy
            value: 52.7
            name: Erzya Test accuracy
          - type: accuracy
            value: 50.8
            name: Bhojpuri Test accuracy
          - type: accuracy
            value: 49
            name: Thai Test accuracy
          - type: accuracy
            value: 77.9
            name: Marathi Test accuracy
          - type: accuracy
            value: 66.8
            name: Basque Test accuracy
          - type: accuracy
            value: 75.1
            name: Slovak Test accuracy
          - type: accuracy
            value: 43.1
            name: Kiche Test accuracy
          - type: accuracy
            value: 31.7
            name: Yoruba Test accuracy
          - type: accuracy
            value: 48.6
            name: Warlpiri Test accuracy
          - type: accuracy
            value: 79.5
            name: Tamil Test accuracy
          - type: accuracy
            value: 34.1
            name: Maltese Test accuracy
          - type: accuracy
            value: 58.5
            name: Ancient Greek Test accuracy
          - type: accuracy
            value: 68.9
            name: Icelandic Test accuracy
          - type: accuracy
            value: 33.6
            name: Mbya Guarani Test accuracy
          - type: accuracy
            value: 60.5
            name: Urdu Test accuracy
          - type: accuracy
            value: 69.6
            name: Romanian Test accuracy
          - type: accuracy
            value: 71.3
            name: Persian Test accuracy
          - type: accuracy
            value: 50.2
            name: Apurina Test accuracy
          - type: accuracy
            value: 44.4
            name: Japanese Test accuracy
          - type: accuracy
            value: 86.4
            name: Hungarian Test accuracy
          - type: accuracy
            value: 63.2
            name: Hindi Test accuracy
          - type: accuracy
            value: 36.3
            name: Classical Chinese Test accuracy
          - type: accuracy
            value: 51
            name: Komi Permyak Test accuracy
          - type: accuracy
            value: 59.5
            name: Faroese Test accuracy
          - type: accuracy
            value: 38.3
            name: Sanskrit Test accuracy
          - type: accuracy
            value: 65.4
            name: Livvi Test accuracy
          - type: accuracy
            value: 64.4
            name: Arabic Test accuracy
          - type: accuracy
            value: 38.9
            name: Wolof Test accuracy
          - type: accuracy
            value: 72.4
            name: Bulgarian Test accuracy
          - type: accuracy
            value: 49.1
            name: Akuntsu Test accuracy
          - type: accuracy
            value: 23.3
            name: Makurap Test accuracy
          - type: accuracy
            value: 46.5
            name: Kangri Test accuracy
          - type: accuracy
            value: 55.4
            name: Breton Test accuracy
          - type: accuracy
            value: 80.7
            name: Telugu Test accuracy
          - type: accuracy
            value: 54.3
            name: Cantonese Test accuracy
          - type: accuracy
            value: 42.9
            name: Old Church Slavonic Test accuracy
          - type: accuracy
            value: 70.5
            name: Karelian Test accuracy
          - type: accuracy
            value: 67.1
            name: Upper Sorbian Test accuracy
          - type: accuracy
            value: 58.3
            name: South Levantine Arabic Test accuracy
          - type: accuracy
            value: 47.6
            name: Komi Zyrian Test accuracy
          - type: accuracy
            value: 60.3
            name: Irish Test accuracy
          - type: accuracy
            value: 50
            name: Nayini Test accuracy
          - type: accuracy
            value: 41.9
            name: Munduruku Test accuracy
          - type: accuracy
            value: 37.5
            name: Manx Test accuracy
          - type: accuracy
            value: 47.4
            name: Skolt Sami Test accuracy
          - type: accuracy
            value: 71.3
            name: Afrikaans Test accuracy
          - type: accuracy
            value: 53.4
            name: Old Turkish Test accuracy
          - type: accuracy
            value: 53.6
            name: Tupinamba Test accuracy
          - type: accuracy
            value: 76.9
            name: Belarusian Test accuracy
          - type: accuracy
            value: 72.2
            name: Serbian Test accuracy
          - type: accuracy
            value: 50
            name: Moksha Test accuracy
          - type: accuracy
            value: 70.5
            name: Western Armenian Test accuracy
          - type: accuracy
            value: 54.1
            name: Scottish Gaelic Test accuracy
          - type: accuracy
            value: 50
            name: Khunsari Test accuracy
          - type: accuracy
            value: 79.2
            name: Hebrew Test accuracy
          - type: accuracy
            value: 70.8
            name: Uyghur Test accuracy
          - type: accuracy
            value: 40.8
            name: Chukchi Test accuracy

Model Card for XLM-RoBERTa base Universal Dependencies v2.8 POS tagging: Turkish

Model Details

Model Description

  • Developed by: Wietse de Vries
  • Shared by [Optional]: Hugging Face
  • Model type: Token Classification
  • Language(s) (NLP): tr
  • License: apache-2.0
  • Related Models: xlm-roberta
    • Parent Model:
  • Resources for more information:

Uses

Direct Use

The model can be used for token classification, specifically part-of-speech tagging.

Downstream Use [Optional]

More information needed.

Out-of-Scope Use

The model should not be used to intentionally create hostile or alienating environments for people.

Bias, Risks, and Limitations

Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). Predictions generated by the model may include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups.

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. More information is needed for further recommendations.

Training Details

Training Data

See the associated [Universal Dependencies v2.8 dataset card](https://huggingface.co/datasets/universal_dependencies) for further details.

Training Procedure

Preprocessing

More information needed.

Speeds, Sizes, Times

More information needed.

Evaluation

Testing Data, Factors & Metrics

Testing Data

See the associated Universal Dependencies v2.8 dataset card for further details.

Factors

Metrics

Accuracy
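
The card does not state how accuracy was computed. A minimal sketch, assuming the usual token-level definition for POS tagging (the percentage of tokens whose predicted tag matches the gold tag), could look like the following. The function name `token_accuracy` and the toy tags are illustrative and are not taken from the original evaluation code.

```python
from typing import Sequence


def token_accuracy(
    gold: Sequence[Sequence[str]], pred: Sequence[Sequence[str]]
) -> float:
    """Return the percentage of tokens whose predicted tag equals the gold tag.

    Assumption: accuracy is pooled over all tokens of all sentences.
    """
    correct = 0
    total = 0
    for gold_sent, pred_sent in zip(gold, pred):
        if len(gold_sent) != len(pred_sent):
            raise ValueError("gold and predicted sentences must have the same length")
        correct += sum(g == p for g, p in zip(gold_sent, pred_sent))
        total += len(gold_sent)
    if total == 0:
        raise ValueError("no tokens to score")
    return 100.0 * correct / total


# Toy example: 4 of the 5 tokens are tagged correctly.
gold = [["PRON", "VERB", "PUNCT"], ["NOUN", "VERB"]]
pred = [["PRON", "NOUN", "PUNCT"], ["NOUN", "VERB"]]
print(token_accuracy(gold, pred))  # 80.0
```

Under this assumption, the per-language values reported under Results would be read as percentages of correctly tagged test tokens.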

Results

  - type: accuracy
    name: English Test accuracy
    value: 74.4
  - type: accuracy
    name: Dutch Test accuracy
    value: 73.7
  - type: accuracy
    name: German Test accuracy
    value: 73.5
  - type: accuracy
    name: Italian Test accuracy
    value: 73.2
  - type: accuracy
    name: French Test accuracy
    value: 71.4
  - type: accuracy
    name: Spanish Test accuracy
    value: 71.1
  - type: accuracy
    name: Russian Test accuracy
    value: 77.9
  - type: accuracy
    name: Swedish Test accuracy
    value: 74.5
  - type: accuracy
    name: Norwegian Test accuracy
    value: 69.2
  - type: accuracy
    name: Danish Test accuracy
    value: 73.8
  - type: accuracy
    name: Low Saxon Test accuracy
    value: 45.8
  - type: accuracy
    name: Akkadian Test accuracy
    value: 39.8
  - type: accuracy
    name: Armenian Test accuracy
    value: 80.9
  - type: accuracy
    name: Welsh Test accuracy
    value: 62.9
  - type: accuracy
    name: Old East Slavic Test accuracy
    value: 63.7
  - type: accuracy
    name: Albanian Test accuracy
    value: 71.5
  - type: accuracy
    name: Slovenian Test accuracy
    value: 62.3
  - type: accuracy
    name: Guajajara Test accuracy
    value: 41.3
  - type: accuracy
    name: Kurmanji Test accuracy
    value: 68.0
  - type: accuracy
    name: Turkish Test accuracy
    value: 88.4
  - type: accuracy
    name: Finnish Test accuracy
    value: 81.1
  - type: accuracy
    name: Indonesian Test accuracy
    value: 71.5
  - type: accuracy
    name: Ukrainian Test accuracy
    value: 76.8
  - type: accuracy
    name: Polish Test accuracy
    value: 74.3
  - type: accuracy
    name: Portuguese Test accuracy
    value: 76.7
  - type: accuracy
    name: Kazakh Test accuracy
    value: 81.1
  - type: accuracy
    name: Latin Test accuracy
    value: 68.2
  - type: accuracy
    name: Old French Test accuracy
    value: 47.5
  - type: accuracy
    name: Buryat Test accuracy
    value: 62.6
  - type: accuracy
    name: Kaapor Test accuracy
    value: 24.6
  - type: accuracy
    name: Korean Test accuracy
    value: 63.7
  - type: accuracy
    name: Estonian Test accuracy
    value: 82.0
  - type: accuracy
    name: Croatian Test accuracy
    value: 72.3
  - type: accuracy
    name: Gothic Test accuracy
    value: 24.1
  - type: accuracy
    name: Swiss German Test accuracy
    value: 41.1
  - type: accuracy
    name: Assyrian Test accuracy
    value: 23.0
  - type: accuracy
    name: North Sami Test accuracy
    value: 45.2
  - type: accuracy
    name: Naija Test accuracy
    value: 36.0
  - type: accuracy
    name: Latvian Test accuracy
    value: 80.0
  - type: accuracy
    name: Chinese Test accuracy
    value: 55.9
  - type: accuracy
    name: Tagalog Test accuracy
    value: 56.2
  - type: accuracy
    name: Bambara Test accuracy
    value: 30.0
  - type: accuracy
    name: Lithuanian Test accuracy
    value: 81.2
  - type: accuracy
    name: Galician Test accuracy
    value: 72.4
  - type: accuracy
    name: Vietnamese Test accuracy
    value: 57.0
  - type: accuracy
    name: Greek Test accuracy
    value: 80.2
  - type: accuracy
    name: Catalan Test accuracy
    value: 69.1
  - type: accuracy
    name: Czech Test accuracy
    value: 75.8
  - type: accuracy
    name: Erzya Test accuracy
    value: 52.7
  - type: accuracy
    name: Bhojpuri Test accuracy
    value: 50.8
  - type: accuracy
    name: Thai Test accuracy
    value: 49.0
  - type: accuracy
    name: Marathi Test accuracy
    value: 77.9
  - type: accuracy
    name: Basque Test accuracy
    value: 66.8
  - type: accuracy
    name: Slovak Test accuracy
    value: 75.1
  - type: accuracy
    name: Kiche Test accuracy
    value: 43.1
  - type: accuracy
    name: Yoruba Test accuracy
    value: 31.7
  - type: accuracy
    name: Warlpiri Test accuracy
    value: 48.6
  - type: accuracy
    name: Tamil Test accuracy
    value: 79.5
  - type: accuracy
    name: Maltese Test accuracy
    value: 34.1
  - type: accuracy
    name: Ancient Greek Test accuracy
    value: 58.5
  - type: accuracy
    name: Icelandic Test accuracy
    value: 68.9
  - type: accuracy
    name: Mbya Guarani Test accuracy
    value: 33.6
  - type: accuracy
    name: Urdu Test accuracy
    value: 60.5
  - type: accuracy
    name: Romanian Test accuracy
    value: 69.6
  - type: accuracy
    name: Persian Test accuracy
    value: 71.3
  - type: accuracy
    name: Apurina Test accuracy
    value: 50.2
  - type: accuracy
    name: Japanese Test accuracy
    value: 44.4
  - type: accuracy
    name: Hungarian Test accuracy
    value: 86.4
  - type: accuracy
    name: Hindi Test accuracy
    value: 63.2
  - type: accuracy
    name: Classical Chinese Test accuracy
    value: 36.3
  - type: accuracy
    name: Komi Permyak Test accuracy
    value: 51.0
  - type: accuracy
    name: Faroese Test accuracy
    value: 59.5
  - type: accuracy
    name: Sanskrit Test accuracy
    value: 38.3
  - type: accuracy
    name: Livvi Test accuracy
    value: 65.4
  - type: accuracy
    name: Arabic Test accuracy
    value: 64.4
  - type: accuracy
    name: Wolof Test accuracy
    value: 38.9
  - type: accuracy
    name: Bulgarian Test accuracy
    value: 72.4
  - type: accuracy
    name: Akuntsu Test accuracy
    value: 49.1
  - type: accuracy
    name: Makurap Test accuracy
    value: 23.3
  - type: accuracy
    name: Kangri Test accuracy
    value: 46.5
  - type: accuracy
    name: Breton Test accuracy
    value: 55.4
  - type: accuracy
    name: Telugu Test accuracy
    value: 80.7
  - type: accuracy
    name: Cantonese Test accuracy
    value: 54.3
  - type: accuracy
    name: Old Church Slavonic Test accuracy
    value: 42.9
  - type: accuracy
    name: Karelian Test accuracy
    value: 70.5
  - type: accuracy
    name: Upper Sorbian Test accuracy
    value: 67.1
  - type: accuracy
    name: South Levantine Arabic Test accuracy
    value: 58.3
  - type: accuracy
    name: Komi Zyrian Test accuracy
    value: 47.6
  - type: accuracy
    name: Irish Test accuracy
    value: 60.3
  - type: accuracy
    name: Nayini Test accuracy
    value: 50.0
  - type: accuracy
    name: Munduruku Test accuracy
    value: 41.9
  - type: accuracy
    name: Manx Test accuracy
    value: 37.5
  - type: accuracy
    name: Skolt Sami Test accuracy
    value: 47.4
  - type: accuracy
    name: Afrikaans Test accuracy
    value: 71.3
  - type: accuracy
    name: Old Turkish Test accuracy
    value: 53.4
  - type: accuracy
    name: Tupinamba Test accuracy
    value: 53.6
  - type: accuracy
    name: Belarusian Test accuracy
    value: 76.9
  - type: accuracy
    name: Serbian Test accuracy
    value: 72.2
  - type: accuracy
    name: Moksha Test accuracy
    value: 50.0
  - type: accuracy
    name: Western Armenian Test accuracy
    value: 70.5
  - type: accuracy
    name: Scottish Gaelic Test accuracy
    value: 54.1
  - type: accuracy
    name: Khunsari Test accuracy
    value: 50.0
  - type: accuracy
    name: Hebrew Test accuracy
    value: 79.2
  - type: accuracy
    name: Uyghur Test accuracy
    value: 70.8
  - type: accuracy
    name: Chukchi Test accuracy
    value: 40.8

Model Examination

More information needed

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: More information needed
  • Hours used: More information needed
  • Cloud Provider: More information needed
  • Compute Region: More information needed
  • Carbon Emitted: More information needed

Technical Specifications [optional]

Model Architecture and Objective

More information needed

Compute Infrastructure

More information needed

Hardware

More information needed

Software

More information needed

Citation

BibTeX:

More information needed

APA:

More information needed

Glossary [optional]

More information needed

More Information [optional]

More information needed

Model Card Authors [optional]

Wietse de Vries in collaboration with Ezi Ozoani and the Hugging Face team.

Model Card Contact

More information needed

How to Get Started with the Model

Use the code below to get started with the model.

from transformers import AutoTokenizer, AutoModelForTokenClassification

# Load the tokenizer and the fine-tuned POS tagging model from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("wietsedv/xlm-roberta-base-ft-udpos28-tr")
model = AutoModelForTokenClassification.from_pretrained("wietsedv/xlm-roberta-base-ft-udpos28-tr")
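
Loading the model does not yet produce tags. The sketch below shows one possible way to tag a pre-tokenized sentence. It assumes that each word receives the label predicted for its first subword, which is a common convention but is not confirmed by this card. The helper names `first_subword_tags` and `tag_words` are hypothetical, and `tag_words` needs `torch`, `transformers`, and network access to download the model.

```python
from typing import Dict, List, Optional, Sequence

MODEL_ID = "wietsedv/xlm-roberta-base-ft-udpos28-tr"


def first_subword_tags(
    word_ids: Sequence[Optional[int]],
    label_ids: Sequence[int],
    id2label: Dict[int, str],
) -> List[str]:
    """Keep one tag per word: the label predicted for its first subword.

    Assumption: first-subword labelling; special tokens have word id None.
    """
    tags: List[str] = []
    seen = set()
    for word_id, label_id in zip(word_ids, label_ids):
        if word_id is None or word_id in seen:
            continue  # skip special tokens and non-initial subwords
        seen.add(word_id)
        tags.append(id2label[label_id])
    return tags


def tag_words(words: Sequence[str]) -> List[str]:
    """Tag a list of words with the model (downloads it on first use)."""
    # Imported lazily so the helper above can be used without these packages.
    import torch
    from transformers import AutoModelForTokenClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForTokenClassification.from_pretrained(MODEL_ID)
    model.eval()

    encoding = tokenizer(list(words), is_split_into_words=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**encoding).logits
    label_ids = logits.argmax(dim=-1)[0].tolist()
    return first_subword_tags(
        encoding.word_ids(batch_index=0), label_ids, model.config.id2label
    )


# Offline check of the alignment helper with made-up ids and labels.
toy_tags = first_subword_tags(
    [None, 0, 0, 1, None], [0, 1, 2, 2, 0], {0: "X", 1: "NOUN", 2: "VERB"}
)
print(toy_tags)  # ['NOUN', 'VERB']
```

With the dependencies installed, a call such as `tag_words(["Bugün", "hava", "çok", "güzel", "."])` should return one tag per input word. For quick experiments, `transformers.pipeline("token-classification", model=MODEL_ID)` is a shorter alternative, although it reports predictions per subword token unless an aggregation strategy is set.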