ele-sage commited on
Commit
23a8616
·
verified ·
1 Parent(s): 2aba960

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -8
README.md CHANGED
@@ -4,6 +4,9 @@ license: mit
4
  base_model: microsoft/mdeberta-v3-base
5
  tags:
6
  - generated_from_trainer
 
 
 
7
  metrics:
8
  - accuracy
9
  - precision
@@ -12,14 +15,19 @@ metrics:
12
  model-index:
13
  - name: mdeberta-v3-base-name-classifier-v2
14
  results: []
 
 
 
 
 
15
  ---
16
 
17
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
- should probably proofread and complete it, then remove this comment. -->
19
 
20
  # mdeberta-v3-base-name-classifier-v2
21
 
22
- This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on an unknown dataset.
 
 
23
  It achieves the following results on the evaluation set:
24
  - Loss: 0.0215
25
  - Accuracy: 0.9943
@@ -29,15 +37,42 @@ It achieves the following results on the evaluation set:
29
 
30
  ## Model description
31
 
32
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
 
34
- ## Intended uses & limitations
35
 
36
- More information needed
37
 
38
- ## Training and evaluation data
 
 
39
 
40
- More information needed
41
 
42
  ## Training procedure
43
 
 
4
  base_model: microsoft/mdeberta-v3-base
5
  tags:
6
  - generated_from_trainer
7
+ - name
8
+ - person
9
+ - company
10
  metrics:
11
  - accuracy
12
  - precision
 
15
  model-index:
16
  - name: mdeberta-v3-base-name-classifier-v2
17
  results: []
18
+ datasets:
19
+ - ele-sage/person-company-names-classification
20
+ language:
21
+ - fr
22
+ - en
23
  ---
24
 
 
 
25
 
26
  # mdeberta-v3-base-name-classifier-v2
27
 
28
+ This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on [ele-sage/person-company-names-classification](https://huggingface.co/ele-sage/person-company-names-classification).
29
+
30
+
31
  It achieves the following results on the evaluation set:
32
  - Loss: 0.0215
33
  - Accuracy: 0.9943
 
37
 
38
  ## Model description
39
 
40
+ This model is a high-performance binary text classifier, fine-tuned from `mdeberta-v3-base`.
41
+ Its purpose is to distinguish between a **person's name** and a **company/organization name** with high accuracy.
42
+
43
+ ### Direct Use
44
+
45
+ This model is intended to be used for text classification. Given a string, it will return a label indicating whether the string is a `Person` or a `Company`.
46
+
47
+ ```python
48
+ from transformers import pipeline
49
+
50
+ classifier = pipeline("text-classification", model="ele-sage/mdeberta-v3-base-name-classifier-v2")
51
+
52
+ results = classifier([
53
+ "Satya Nadella",
54
+ "Global Innovations Inc.",
55
+ "Martinez, Alonso"
56
+ ])
57
+
58
+ for result in results:
59
+ print(f"Text: '{result['text']}', Prediction: {result['label']}, Score: {result['score']:.4f}")
60
+ ```
61
+
62
+ ### Downstream Use
63
+
64
+ This model is a key component of a two-stage name processing pipeline. It is designed to be used as a fast, efficient "gatekeeper" to first identify person names before passing them to a more complex parsing model, such as `ele-sage/distilbert-base-uncased-name-splitter`.
65
+
66
+ ### Out-of-Scope Use
67
 
68
+ - This model is not a general-purpose classifier. It is highly specialized for distinguishing persons from companies and will not perform well on other classification tasks (e.g., sentiment analysis).
69
 
70
+ ## Bias, Risks, and Limitations
71
 
72
+ - **Geographic & Cultural Bias:** The training data is heavily biased towards North American (Canadian) person names and Quebec-based company names. The model will be less accurate when classifying names from other cultural or geographic origins.
73
+ - **Ambiguity:** Certain names can legitimately be both a person's name and a company's name (e.g., "Ford"). In these cases, the model makes a statistical guess based on its training data, which may not always align with the specific context.
74
+ - **Data Source:** The person name data is derived from a Facebook data leak and contains noise. While a rigorous cleaning process was applied, the model may have learned from some spurious data.
75
 
 
76
 
77
  ## Training procedure
78