How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("token-classification", model="nlpso/m0_flat_ner_ocr_cmbert_io")
# Load model directly
from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
model = AutoModelForTokenClassification.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
Quick Links

m0_flat_ner_ocr_cmbert_io

Introduction

This model is a fine-tuned verion from Jean-Baptiste/camembert-ner for nested NER task on a nested NER Paris trade directories dataset.

Dataset

Abbreviation Description
O Outside of a named entity
PER Person or company name
ACT Person or company professional activity
TITRE Distinction
LOC Street name
CARDINAL Street number
FT Geographical feature

Experiment parameter

  • Pretrained-model : Jean-Baptiste/camembert-ner
  • Dataset : noisy (Pero OCR)
  • Tagging format : IO
  • Recognised entities : All (flat entities)

Load model from the HuggingFace

from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
model = AutoModelForTokenClassification.from_pretrained("nlpso/m0_flat_ner_ocr_cmbert_io")
Downloads last month
4
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train nlpso/m0_flat_ner_ocr_cmbert_io

Collection including nlpso/m0_flat_ner_ocr_cmbert_io