SallySims commited on
Commit
72b5b20
Β·
verified Β·
1 Parent(s): 2b4256b

Add detailed model card

Browse files
Files changed (1) hide show
  1. README.md +131 -0
README.md ADDED
@@ -0,0 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: apache-2.0
4
+ tags:
5
+ - pytorch
6
+ - text-classification
7
+ - dei
8
+ - dei-washing
9
+ - equibert
10
+ metrics:
11
+ - mae
12
+ - rmse
13
+ ---
14
+
15
+ # EquiBERT β€” DEI Washing Detector
16
+
17
+ **Model ID:** `SallySims/equibert-washing-detector`
18
+
19
+ Scores leadership communications and DEI statements on genuine
20
+ commitment vs performative or superficial DEI washing.
21
+
22
+ ## Output Dimensions (all 0.0–1.0, higher = more genuine)
23
+
24
+ | Index | Dimension | Low score means... |
25
+ |-------|-----------|-------------------|
26
+ | 0 | `commitment_score` | Overall washing risk (main score) |
27
+ | 1 | `specificity` | Vague language, no concrete details |
28
+ | 2 | `accountability` | No named owner, no consequences |
29
+ | 3 | `evidence` | No data, no measurable outcomes |
30
+ | 4 | `action_density` | Aspirational language only, no actions |
31
+ | 5 | `contradiction` | Claims contradict observable behaviour |
32
+
33
+ ## Washing Risk Levels
34
+
35
+ | Score | Risk | Example |
36
+ |-------|------|---------|
37
+ | 0.0–0.2 | High | "Diversity is core to who we are" |
38
+ | 0.2–0.5 | Medium | "We are working to improve diversity" |
39
+ | 0.5–0.8 | Low | "We set a 35% diverse hiring target" |
40
+ | 0.8–1.0 | None | "We achieved 27% BIPOC hiring, up from 18%" |
41
+
42
+ ## Usage
43
+
44
+ ```python
45
+ text = "Diversity and inclusion are core to who we are. We believe our differences make us stronger."
46
+ inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)
47
+ # outputs = model(**inputs)
48
+ # commitment_score = outputs.logits[:, 0]
49
+ # dimensions = outputs.logits[:, 1:]
50
+ ```
51
+
52
+ ## Model Description
53
+
54
+ EquiBERT is a multi-task DEI (Diversity, Equity and Inclusion) transformer
55
+ built on a dual-encoder backbone that fuses **RoBERTa-base** and
56
+ **DeBERTa-v3-base** via a learned weighted sum (Ξ± parameter).
57
+ The fused representation is fed into task-specific heads covering
58
+ 17 distinct DEI analysis tasks.
59
+
60
+ **Organisation:** [SallySims](https://huggingface.co/SallySims)
61
+ **Framework:** PyTorch + HuggingFace Transformers
62
+ **Backbone:** RoBERTa-base + DeBERTa-v3-base (dual encoder, fused)
63
+ **Language:** English
64
+ **Domain:** Organisational DEI text β€” HR communications, policies,
65
+ job descriptions, performance reviews, leadership statements, reports
66
+
67
+ ## Architecture
68
+
69
+ ```
70
+ Input Text
71
+ β”‚
72
+ β”œβ”€β”€β–Ά RoBERTa-base encoder ──▢ Linear projection
73
+ β”‚ β”‚
74
+ └──▢ DeBERTa-v3-base encoder ──▢ Linear projection
75
+ β”‚
76
+ Weighted fusion (learned Ξ±)
77
+ β”‚
78
+ Layer Norm + Dropout
79
+ β”‚
80
+ Task-specific head (see below)
81
+ ```
82
+
83
+ ## Training Data
84
+
85
+ Trained on synthetic DEI organisational text generated by the
86
+ EquiBERT synthetic data pipeline, covering 20 DEI categories
87
+ across HR, policy, leadership, and workforce analytics domains.
88
+ For production use, fine-tune on real labelled DEI data.
89
+
90
+ ## Limitations
91
+
92
+ - Trained on synthetic data β€” predictions should be validated
93
+ before use in real HR or policy decisions.
94
+ - English-only.
95
+ - Not a substitute for qualified DEI practitioners or legal advice.
96
+ - May reflect biases present in the training corpus.
97
+
98
+ ## Citation
99
+
100
+ If you use EquiBERT in your research, please cite:
101
+
102
+ ```bibtex
103
+ @misc{equibert2024,
104
+ author = {SallySims},
105
+ title = {EquiBERT: A Multi-Task DEI Transformer},
106
+ year = {2024},
107
+ publisher = {HuggingFace},
108
+ url = {https://huggingface.co/SallySims}
109
+ }
110
+ ```
111
+
112
+ ## All EquiBERT Models
113
+
114
+ | Model | Task | Primary Metric |
115
+ |-------|------|---------------|
116
+ | [equibert-bias-classifier](https://huggingface.co/SallySims/equibert-bias-classifier) | Bias Detection | Macro F1 |
117
+ | [equibert-microaggression](https://huggingface.co/SallySims/equibert-microaggression) | Microaggression Detection | Macro F1 |
118
+ | [equibert-category-tagger](https://huggingface.co/SallySims/equibert-category-tagger) | DEI Category Tagging | Macro F1 |
119
+ | [equibert-event-exclusion](https://huggingface.co/SallySims/equibert-event-exclusion) | Event Exclusion Classification | Macro F1 |
120
+ | [equibert-inclusive-language](https://huggingface.co/SallySims/equibert-inclusive-language) | Inclusive Language Scoring | Span F1 |
121
+ | [equibert-review-auditor](https://huggingface.co/SallySims/equibert-review-auditor) | Performance Review Auditing | Span F1 |
122
+ | [equibert-washing-detector](https://huggingface.co/SallySims/equibert-washing-detector) | DEI Washing Detection | MAE |
123
+ | [equibert-framing-scorer](https://huggingface.co/SallySims/equibert-framing-scorer) | Report Framing Scoring | MAE |
124
+ | [equibert-awareness-scorer](https://huggingface.co/SallySims/equibert-awareness-scorer) | DEI Awareness Scoring | MAE |
125
+ | [equibert-similarity](https://huggingface.co/SallySims/equibert-similarity) | Semantic Similarity | Accuracy |
126
+ | [equibert-ner](https://huggingface.co/SallySims/equibert-ner) | DEI Entity Recognition | Span F1 |
127
+ | [equibert-relation-extraction](https://huggingface.co/SallySims/equibert-relation-extraction) | Relation Extraction | Macro F1 |
128
+ | [equibert-qa](https://huggingface.co/SallySims/equibert-qa) | Extractive QA | Span EM |
129
+ | [equibert-search](https://huggingface.co/SallySims/equibert-search) | Semantic Search | MRR@10 |
130
+ | [equibert-nli](https://huggingface.co/SallySims/equibert-nli) | NLI / Textual Entailment | Macro F1 |
131
+ | [equibert-generator](https://huggingface.co/SallySims/equibert-generator) | DEI Text Generation | ROUGE-L |