Guscode
/

DKbert-hatespeech-detection

Text Classification

Model card Files Files and versions

DKbert-hatespeech-detection / README.md

Guscode's picture

Update README.md

4b56b80 almost 5 years ago

|

1.05 kB

	---
	language:

	- Danish

	tags:
	- Hatespeech
	- Danish
	- BERT
	license: MIT
	datasets:
	- DKHate - OffensEval2020
	Classes:
	- Hateful
	- Not Hateful
	---


	# DKbert-hatespeech-classification

	Use this model to detect hatespeech in Danish. For details, guide and command line tool see [DK hate github](https://github.com/Guscode/DKbert-hatespeech-detection)

	## Training data

	Training data is from OffensEval2020 which can be found [here]( https://figshare.com/articles/dataset/Danish_Hate_Speech_Abusive_Language_data/12220805)

	## Performance

	The model achieves a macro F1-score of 0.78

	Precision hateful: 0.77

	Recall hateful: 0.49

	See more on [DK hate github](https://github.com/Guscode/DKbert-hatespeech-detection)

	## Training procedure

	- BOTXO Nordic Bert
	- Learning rate: 1e-5,
	- Batch size: 16
	- Max sequence length: 128

	## Project information

	This model was made in collaboration between [Johan Horsmans](https://github.com/JohanHorsmans) and [Gustav Aarup Lauridsen](https://github.com/Guscode) for their Cultural Data Science Exam.