ele-sage commited on
Commit
e0bcccf
·
verified ·
1 Parent(s): 952a7ce

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +44 -8
README.md CHANGED
@@ -4,6 +4,9 @@ license: mit
4
  base_model: microsoft/mdeberta-v3-base
5
  tags:
6
  - generated_from_trainer
 
 
 
7
  metrics:
8
  - accuracy
9
  - precision
@@ -12,14 +15,19 @@ metrics:
12
  model-index:
13
  - name: mdeberta-v3-base-name-classifier-v2
14
  results: []
 
 
 
 
 
15
  ---
16
 
17
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
- should probably proofread and complete it, then remove this comment. -->
19
 
20
  # mdeberta-v3-base-name-classifier-v2
21
 
22
- This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on an unknown dataset.
 
 
23
  It achieves the following results on the evaluation set:
24
  - Loss: 0.0732
25
  - Accuracy: 0.9946
@@ -27,17 +35,45 @@ It achieves the following results on the evaluation set:
27
  - Recall: 0.9913
28
  - F1: 0.9951
29
 
 
30
  ## Model description
31
 
32
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
 
34
- ## Intended uses & limitations
35
 
36
- More information needed
37
 
38
- ## Training and evaluation data
 
 
39
 
40
- More information needed
41
 
42
  ## Training procedure
43
 
 
4
  base_model: microsoft/mdeberta-v3-base
5
  tags:
6
  - generated_from_trainer
7
+ - name
8
+ - person
9
+ - company
10
  metrics:
11
  - accuracy
12
  - precision
 
15
  model-index:
16
  - name: mdeberta-v3-base-name-classifier-v2
17
  results: []
18
+ datasets:
19
+ - ele-sage/person-company-names-classification
20
+ language:
21
+ - fr
22
+ - en
23
  ---
24
 
 
 
25
 
26
  # mdeberta-v3-base-name-classifier-v2
27
 
28
+ This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on [ele-sage/person-company-names-classification](https://huggingface.co/ele-sage/person-company-names-classification).
29
+
30
+
31
  It achieves the following results on the evaluation set:
32
  - Loss: 0.0732
33
  - Accuracy: 0.9946
 
35
  - Recall: 0.9913
36
  - F1: 0.9951
37
 
38
+
39
  ## Model description
40
 
41
+ This model is a high-performance binary text classifier, fine-tuned from `mdeberta-v3-base`.
42
+ Its purpose is to distinguish between a **person's name** and a **company/organization name** with high accuracy.
43
+
44
+ ### Direct Use
45
+
46
+ This model is intended to be used for text classification. Given a string, it will return a label indicating whether the string is a `Person` or a `Company`.
47
+
48
+ ```python
49
+ from transformers import pipeline
50
+
51
+ classifier = pipeline("text-classification", model="ele-sage/mdeberta-v3-base-name-classifier-v2")
52
+
53
+ results = classifier([
54
+ "Satya Nadella",
55
+ "Global Innovations Inc.",
56
+ "Martinez, Alonso"
57
+ ])
58
+
59
+ for result in results:
60
+ print(f"Text: '{result['text']}', Prediction: {result['label']}, Score: {result['score']:.4f}")
61
+ ```
62
+
63
+ ### Downstream Use
64
+
65
+ This model is a key component of a two-stage name processing pipeline. It is designed to be used as a fast, efficient "gatekeeper" to first identify person names before passing them to a more complex parsing model, such as `ele-sage/distilbert-base-uncased-name-splitter`.
66
+
67
+ ### Out-of-Scope Use
68
 
69
+ - This model is not a general-purpose classifier. It is highly specialized for distinguishing persons from companies and will not perform well on other classification tasks (e.g., sentiment analysis).
70
 
71
+ ## Bias, Risks, and Limitations
72
 
73
+ - **Geographic & Cultural Bias:** The training data is heavily biased towards North American (Canadian) person names and Quebec-based company names. The model will be less accurate when classifying names from other cultural or geographic origins.
74
+ - **Ambiguity:** Certain names can legitimately be both a person's name and a company's name (e.g., "Ford"). In these cases, the model makes a statistical guess based on its training data, which may not always align with the specific context.
75
+ - **Data Source:** The person name data is derived from a Facebook data leak and contains noise. While a rigorous cleaning process was applied, the model may have learned from some spurious data.
76
 
 
77
 
78
  ## Training procedure
79