Text Generation
PEFT
English
hate speech
conversational
palomapiot committed on
Commit 7d593b0 · verified
1 Parent(s): 9d85a4f

Update README.md

Files changed (1)
  1. README.md +10 -12
README.md CHANGED
@@ -13,6 +13,8 @@ tags:

  # LLaMA2 Fine-Tuned on not Engaging with Hate Speech

+ This model was created as part of the work "Decoding Hate: Exploring Language Models' Reactions to Hate Speech," which was accepted for the main conference of NAACL 2025.
+
  ## Model Description
  This model is a fine-tuned version of `meta-llama/Llama-2-13b-chat-hf` on a hate speech dataset using the PEFT approach, to prevent the model from exacerbating hate discourse.

@@ -113,18 +115,14 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
  If you use this model, please cite the following reference:

  ```bibtex
- @article{Piot_Martín-Rodilla_Parapar_2024,
- title={MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection},
- volume={18},
- url={https://ojs.aaai.org/index.php/ICWSM/article/view/31445},
- DOI={10.1609/icwsm.v18i1.31445},
- abstractNote={Hate speech represents a pervasive and detrimental form of online discourse, often manifested through an array of slurs, from hateful tweets to defamatory posts. As such speech proliferates, it connects people globally and poses significant social, psychological, and occasionally physical threats to targeted individuals and communities. Current computational linguistic approaches for tackling this phenomenon rely on labelled social media datasets for training. For unifying efforts, our study advances in the critical need for a comprehensive meta-collection, advocating for an extensive dataset to help counteract this problem effectively. We scrutinized over 60 datasets, selectively integrating those pertinent into MetaHate. This paper offers a detailed examination of existing collections, highlighting their strengths and limitations. Our findings contribute to a deeper understanding of the existing datasets, paving the way for training more robust and adaptable models. These enhanced models are essential for effectively combating the dynamic and complex nature of hate speech in the digital realm.},
- number={1},
- journal={Proceedings of the International AAAI Conference on Web and Social Media},
- author={Piot, Paloma and Martín-Rodilla, Patricia and Parapar, Javier},
- year={2024},
- month={May},
- pages={2025-2039}
+ @misc{piot2024decodinghateexploringlanguage,
+ title={Decoding Hate: Exploring Language Models' Reactions to Hate Speech},
+ author={Paloma Piot and Javier Parapar},
+ year={2024},
+ eprint={2410.00775},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2410.00775},
  }
  ```