starkdv123 commited on
Commit
1b1f82f
·
verified ·
1 Parent(s): 4fded9a

Add model card

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ tags:
4
+ - transformers
5
+ - token-classification
6
+ - ner
7
+ - bert
8
+ - conll2003
9
+ license: apache-2.0
10
+ datasets:
11
+ - conll2003
12
+ language:
13
+ - en
14
+ pipeline_tag: token-classification
15
+ authors:
16
+ - Karan D Vasa (https://huggingface.co/starkdv123)
17
+ ---
18
+
19
+ # BERT (base-cased) for CoNLL-2003 NER — Full Fine-Tune
20
+
21
+ This repository contains a **BERT base cased** model fine-tuned on **CoNLL-2003** (parquet version).
22
+ Evaluated with **seqeval** (entity-level F1).
23
+
24
+ ## 📊 Result (this run)
25
+ - **Entity Macro F1**: 0.9192
26
+
27
+ ## Usage
28
+ ```python
29
+ from transformers import pipeline
30
+ clf = pipeline("token-classification", model="starkdv123/conll2003-bert-ner-full", aggregation_strategy="simple")
31
+ clf("Chris Hoiles hit his 22nd homer for Baltimore.")
32
+ ```
33
+
34
+ ## Training summary
35
+
36
+ * Base: `bert-base-cased`
37
+ * Epochs: 3, LR: 3e-5, batch 16/32, max_len 256, weight_decay 0.01, fp16
38
+ * Label alignment: -100 for subword continuations
39
+ * Metric: seqeval F1 (entity-level)
40
+
41
+ ## Confusion Matrix
42
+ ```
43
+ LOC MISC O ORG PER
44
+ LOC 411 6 21 32 3
45
+ MISC 9 2213 51 76 14
46
+ O 67 110 38063 58 17
47
+ ORG 31 77 32 2353 10
48
+ PER 3 42 15 24 2689
49
+ ```