abhilash1910
/

albert-german-ner

Model card Files Files and versions

abhilash1910 commited on Mar 7, 2021

Commit

0ec3178

·

1 Parent(s): b59f050

Readme

Files changed (1) hide show

Readme.md +82 -0

Readme.md ADDED Viewed

	@@ -0,0 +1,82 @@

+## German NER Albert Model
+This is a trained Albert model for Token Classification in German ,[Germeval](https://sites.google.com/site/germeval2014ner/) and can be used for Inference.
+## Model Specifications
+- MAX_LENGTH=128
+- MODEL='albert-base-v1'
+- BATCH_SIZE=32
+- NUM_EPOCHS=3
+- SAVE_STEPS=750
+- SEED=1
+- SAVE_STEPS = 100
+- LOGGING_STEPS = 100
+- SEED = 42
+### Usage Specifications
+This model is trained on Tensorflow version and is compatible with the 'ner' pipeline of huggingface.
+```python
+from transformers import AutoTokenizer,TFAutoModelForTokenClassification
+from transformers import pipeline
+model=TFAutoModelForTokenClassification.from_pretrained('abhilash1910/albert-german-ner')
+tokenizer=AutoTokenizer.from_pretrained('abhilash1910/albert-german-ner')
+ner_model = pipeline('ner', model=model, tokenizer=tokenizer)
+seq='Berlin ist die Hauptstadt von Deutschland'
+ner_model(seq)
+```
+The Tensorflow version of Albert is used for training the model and the output for the above mentioned segment is as follows:
+```
+[{'entity': 'B-PERderiv',
+  'index': 1,
+  'score': 0.09580112248659134,
+  'word': '▁berlin'},
+ {'entity': 'B-ORGpart',
+  'index': 2,
+  'score': 0.08364498615264893,
+  'word': '▁is'},
+ {'entity': 'B-LOCderiv',
+  'index': 3,
+  'score': 0.07593920826911926,
+  'word': 't'},
+ {'entity': 'B-PERderiv',
+  'index': 4,
+  'score': 0.09574996680021286,
+  'word': '▁die'},
+ {'entity': 'B-LOCderiv',
+  'index': 5,
+  'score': 0.07097965478897095,
+  'word': '▁'},
+ {'entity': 'B-PERderiv',
+  'index': 6,
+  'score': 0.07122448086738586,
+  'word': 'haupt'},
+ {'entity': 'B-PERderiv',
+  'index': 7,
+  'score': 0.12397754937410355,
+  'word': 'stadt'},
+ {'entity': 'I-OTHderiv',
+  'index': 8,
+  'score': 0.0818650871515274,
+  'word': '▁von'},
+ {'entity': 'I-LOCderiv',
+  'index': 9,
+  'score': 0.08271490037441254,
+  'word': '▁'},
+ {'entity': 'B-LOCderiv',
+  'index': 10,
+  'score': 0.08616268634796143,
+  'word': 'deutschland'}]
+  ```
+## Resources
+For all resources , please look into [huggingface](https://huggingface.com).