rebeccaqian commited on
Commit
b706467
·
1 Parent(s): d67ae66

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -18
README.md CHANGED
@@ -57,7 +57,7 @@ The perturber is a seq2seq controlled generation model that rewrites text along
57
  The perturber takes in (i) a source text snippet, (ii) a word in the snippet referring to a demographic group, and (iii) a new target demographic attribute, and generates a perturbed snippet that refers to the target demographic attribute, while preserving overall meaning.
58
 
59
  - **Repository:** https://github.com/facebookresearch/ResponsibleNLP/
60
- - **Paper:** https://arxiv.org/abs/2205.12586
61
  - **Point of Contact:** rebeccaqian@meta.com, ccross@meta.com, douwe@huggingface.co, adinawilliams@meta.com
62
  - **License:** MIT
63
 
@@ -77,47 +77,72 @@ Below we show some example inputs and outputs for the perturber rewriting text a
77
  Model inputs follow the format `[selected_word][target_attribute] <PERT_SEP> [input_text]`, where `selected_word` is a word that contains demographic information, `target_attribute` is a demographic attribute such as "man" or "asian", and `input_text` is the text sequence to rewrite.
78
 
79
  Currently the perturber supports text rewriting along three axes and several attributes:
80
- - **gender:** man, woman, non-binary
81
- - **race:** black, white, asian, hispanic, native-american, pacific-islander
82
- - **age:** child, young, middle-aged, senior, adult
83
 
84
  ### Gender
85
  _Input:_
86
- his, woman <PERT_SEP> Jack was passionate about rock climbing and his love for the sport was infectious to all men around him.
87
 
88
  _Output:_
89
- Jackie was passionate about rock climbing and her love for the sport was infectious to all men around her.
 
90
  <br/>
91
-
92
  _Input:_
93
- Alice, man <PERT_SEP> To her girlfriend Jen, Alice was a doting mother, loving girlfriend and talented actress.
94
 
95
  _Output:_
96
- To his girlfriend Jen, Alan was a doting father, loving partner and talented actor.
97
  <br/>
98
  <br/>
99
-
100
  _Input:_
101
- his, non-binary <PERT_SEP> Jack was passionate about rock climbing and his love for the sport was infectious to all men around him.
102
 
103
  _Output:_
104
- Jack was passionate about rock climbing and their love for the sport was infectious to all men around them.
105
  <br/>
106
  <br/>
107
  ### Age
108
  _Input:_
109
- child, senior <PERT_SEP> The young child is naive and his innocence must be protected at all costs.
110
 
111
  _Output:_
112
- The elderly person is naive and his innocence must be protected at all costs.
113
 
114
  ### Race/Ethnicity
115
  _Input:_
116
- Asian, black <PERT_SEP> The Asian students association often hosted anime nights and boba events on campus.
117
 
118
  _Output:_
119
- The Black students association often hosted anime nights and boba events on campus.
120
-
121
- ### Contributions
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
122
 
123
  Thanks to [@Rebecca-Qian](https://github.com/Rebecca-Qian) for adding this model.
 
57
  The perturber takes in (i) a source text snippet, (ii) a word in the snippet referring to a demographic group, and (iii) a new target demographic attribute, and generates a perturbed snippet that refers to the target demographic attribute, while preserving overall meaning.
58
 
59
  - **Repository:** https://github.com/facebookresearch/ResponsibleNLP/
60
+ - **Paper:** https://aclanthology.org/2022.emnlp-main.646/
61
  - **Point of Contact:** rebeccaqian@meta.com, ccross@meta.com, douwe@huggingface.co, adinawilliams@meta.com
62
  - **License:** MIT
63
 
 
77
  Model inputs follow the format `[selected_word][target_attribute] <PERT_SEP> [input_text]`, where `selected_word` is a word that contains demographic information, `target_attribute` is a demographic attribute such as "man" or "asian", and `input_text` is the text sequence to rewrite.
78
 
79
  Currently the perturber supports text rewriting along three axes and several attributes:
80
+ - **gender:** `man`, `woman`, `non-binary`
81
+ - **race:** `black`, `white`, `asian`, `hispanic`, `native-american`, `pacific-islander`
82
+ - **age:** `child`, `young`, `middle-aged`, `senior`, `adult`
83
 
84
  ### Gender
85
  _Input:_
86
+ `his, woman <PERT_SEP> Jack was passionate about rock climbing and his love for the sport was infectious to all men around him.`
87
 
88
  _Output:_
89
+ `Jackie was passionate about rock climbing and her love for the sport was infectious to all men around her.`
90
+ <br/>
91
  <br/>
 
92
  _Input:_
93
+ `Alice, man <PERT_SEP> To her girlfriend Jen, Alice was a doting mother, loving girlfriend and talented actress.`
94
 
95
  _Output:_
96
+ `To his girlfriend Jen, Alan was a doting father, loving partner and talented actor.`
97
  <br/>
98
  <br/>
 
99
  _Input:_
100
+ `his, non-binary <PERT_SEP> Jack was passionate about rock climbing and his love for the sport was infectious to all men around him.`
101
 
102
  _Output:_
103
+ `Jack was passionate about rock climbing and their love for the sport was infectious to all men around them.`
104
  <br/>
105
  <br/>
106
  ### Age
107
  _Input:_
108
+ `child, senior <PERT_SEP> The young child is naive and his innocence must be protected at all costs.`
109
 
110
  _Output:_
111
+ `The elderly person is naive and his innocence must be protected at all costs.`
112
 
113
  ### Race/Ethnicity
114
  _Input:_
115
+ `Asian, black <PERT_SEP> The Asian students association often hosted anime nights and boba events on campus.`
116
 
117
  _Output:_
118
+ `The Black students association often hosted anime nights and boba events on campus.`
119
+
120
+ ## Bias, Risks & Limitations
121
+ Limitations of the perturber include inherent biases in demographic categorization, data sourcing and crowdsourced data collection, and the ambiguous nature of fairness and perturbability. Ambiguous instances include names, where annotators may have different preconceptions about whether they contain ethnic information. Our crowdworkers and researchers are primarily English speaking and US-based, which may introduce additional cultural biases.
122
+
123
+ For an in-depth discussion of bias, risks and limitations, see the Limitations section of [our paper](https://aclanthology.org/2022.emnlp-main.646/).
124
+
125
+ ## Citation
126
+ ```
127
+ @inproceedings{qian-etal-2022-perturbation,
128
+ title = "Perturbation Augmentation for Fairer {NLP}",
129
+ author = "Qian, Rebecca and
130
+ Ross, Candace and
131
+ Fernandes, Jude and
132
+ Smith, Eric Michael and
133
+ Kiela, Douwe and
134
+ Williams, Adina",
135
+ booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
136
+ month = dec,
137
+ year = "2022",
138
+ address = "Abu Dhabi, United Arab Emirates",
139
+ publisher = "Association for Computational Linguistics",
140
+ url = "https://aclanthology.org/2022.emnlp-main.646",
141
+ pages = "9496--9521",
142
+ abstract = "Unwanted and often harmful social biases are becoming ever more salient in NLP research, affecting both models and datasets. In this work, we ask whether training on demographically perturbed data leads to fairer language models. We collect a large dataset of human annotated text perturbations and train a neural perturbation model, which we show outperforms heuristic alternatives. We find that (i) language models (LMs) pre-trained on demographically perturbed corpora are typically more fair, and (ii) LMs finetuned on perturbed GLUE datasets exhibit less demographic bias on downstream tasks, and (iii) fairness improvements do not come at the expense of performance on downstream tasks. Lastly, we discuss outstanding questions about how best to evaluate the (un)fairness of large language models. We hope that this exploration of neural demographic perturbation will help drive more improvement towards fairer NLP.",
143
+ }
144
+ ```
145
+
146
+ ### Model Card Contact
147
 
148
  Thanks to [@Rebecca-Qian](https://github.com/Rebecca-Qian) for adding this model.