Shah1st commited on
Commit
68b1f59
·
verified ·
1 Parent(s): ccfffed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -8
README.md CHANGED
@@ -7,12 +7,18 @@ tags: []
7
 
8
  <!-- Provide a quick summary of what the model is/does. -->
9
 
10
- This project involves fine-tuning a BERT-based model (dslim/bert-large-NER) to perform Named Entity Recognition (NER) on mountain names in text. The model has been trained to identify mentions of mountain names and differentiate them from other geographic entities or non-entities.
 
 
11
 
12
  Features:
 
13
  Fine-tuned on a custom dataset that includes sentences both with and without mountain names.
 
14
  Uses focal loss to handle class imbalance, which ensures the model focuses on correctly classifying rare mountain names.
 
15
  Token-level classification for identifying the B-MOUNTAIN, I-MOUNTAIN, and O (non-entity) labels.
 
16
  Balances training between sentences with mountains (80%) and without mountains (20%).
17
 
18
 
@@ -20,7 +26,7 @@ Balances training between sentences with mountains (80%) and without mountains (
20
 
21
  <!-- Provide a longer summary of what this model is. -->
22
 
23
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
24
 
25
  - **Developed by:** Oleksandr Kharytonov
26
  - **Model type:** BERT
@@ -38,7 +44,7 @@ This is the model card of a 🤗 transformers model that has been pushed on the
38
 
39
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
40
 
41
- ```
42
  from transformers import AutoTokenizer, AutoModelForTokenClassification
43
 
44
  tokenizer = AutoTokenizer.from_pretrained('./saved_model')
@@ -50,13 +56,13 @@ model = AutoModelForTokenClassification.from_pretrained('./saved_model')
50
 
51
  <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
52
 
53
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
54
 
55
  ## How to Get Started with the Model
56
 
57
- Use the code below to get started with the model.
58
 
59
- [More Information Needed]
60
 
61
  ## Training Details
62
 
@@ -74,8 +80,7 @@ Filter for sentences with 'fine_ner_tags' == 24 (mountains)
74
 
75
  <!-- This section describes the evaluation protocols and provides the results. -->
76
 
77
- 'eval_loss': 0.009154710918664932, "eval_precision_'O'": 0.9846412638045784, "eval_recall_'O'": 0.9890537760799295, "eval_f1_'O'": 0.9868425875022907,
78
- 'eval_macro_f1': 0.8952192988290304, 'eval_accuracy': 0.9746226793108054
79
 
80
  ### Testing Data, Factors & Metrics
81
 
 
7
 
8
  <!-- Provide a quick summary of what the model is/does. -->
9
 
10
+ This project involves fine-tuning a BERT-based model (dslim/bert-large-NER) to perform Named Entity Recognition (NER) on mountain names in text.
11
+
12
+ The model has been trained to identify mentions of mountain names and differentiate them from other geographic entities or non-entities.
13
 
14
  Features:
15
+
16
  Fine-tuned on a custom dataset that includes sentences both with and without mountain names.
17
+
18
  Uses focal loss to handle class imbalance, which ensures the model focuses on correctly classifying rare mountain names.
19
+
20
  Token-level classification for identifying the B-MOUNTAIN, I-MOUNTAIN, and O (non-entity) labels.
21
+
22
  Balances training between sentences with mountains (80%) and without mountains (20%).
23
 
24
 
 
26
 
27
  <!-- Provide a longer summary of what this model is. -->
28
 
29
+ This is the model card of a 🤗 transformers model that has been pushed on the Hub.
30
 
31
  - **Developed by:** Oleksandr Kharytonov
32
  - **Model type:** BERT
 
44
 
45
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
46
 
47
+ ```python
48
  from transformers import AutoTokenizer, AutoModelForTokenClassification
49
 
50
  tokenizer = AutoTokenizer.from_pretrained('./saved_model')
 
56
 
57
  <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
58
 
59
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.
60
 
61
  ## How to Get Started with the Model
62
 
63
+ Use the github below to get started with the model.
64
 
65
+ https://github.com/Shah1st/mountain-ner
66
 
67
  ## Training Details
68
 
 
80
 
81
  <!-- This section describes the evaluation protocols and provides the results. -->
82
 
83
+ 'eval_loss': 0.009154710918664932, 'eval_macro_f1': 0.8952192988290304, 'eval_accuracy': 0.9746226793108054
 
84
 
85
  ### Testing Data, Factors & Metrics
86