DominicTWHV commited on
Commit
6de4027
·
verified ·
1 Parent(s): 4730bdc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +81 -1
README.md CHANGED
@@ -12,4 +12,84 @@ datasets:
12
  - lmsys/toxic-chat
13
  - tasksource/jigsaw_toxicity
14
  - KoalaAI/Text-Moderation-Multilingual
15
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  - lmsys/toxic-chat
13
  - tasksource/jigsaw_toxicity
14
  - KoalaAI/Text-Moderation-Multilingual
15
+ ---
16
+
17
+ # Constellation-One-Text-001
18
+
19
+ An experimental text classification model fine-tuned from Microsoft/DeBERTa-V3 base for [Cockatoo](https://cockatoo.dev/)
20
+
21
+ > [!Warning]
22
+ > Highly experimental, may not produce production-grade accuracies
23
+
24
+ This model is licensed under the `Apache-2.0` license.
25
+
26
+ **Resources:**
27
+
28
+ Training/Inferencing server: https://github.com/DominicTWHV/Cockatoo_ML_Training/
29
+
30
+ Training Metrics: https://cockatoo.dev/ml-training.html
31
+
32
+ ## Datasets Used | Citations
33
+
34
+ | Dataset | License | Link |
35
+ | --- | --- | --- |
36
+ | **Phishing Dataset** | MIT | [Hugging Face](https://huggingface.co/datasets/ealvaradob/phishing-dataset) |
37
+ | **Measuring Hate Speech** | CC-BY-4.0 | [Hugging Face](https://huggingface.co/datasets/ucberkeley-dlab/measuring-hate-speech) |
38
+ | **Tweet Eval (SemEval-2019)** | [See Citation]* | [Hugging Face](https://huggingface.co/datasets/cardiffnlp/tweet_eval) |
39
+ | **Toxic Chat** | CC-BY-NC-4.0 | [Hugging Face](https://huggingface.co/datasets/lmsys/toxic-chat) |
40
+ | **Jigsaw Toxicity** | Apache-2.0 | [Hugging Face](https://huggingface.co/datasets/tasksource/jigsaw_toxicity) |
41
+ | **Text Moderation Multilingual** | Apache-2.0 | [Hugging Face](https://huggingface.co/datasets/KoalaAI/Text-Moderation-Multilingual) |
42
+
43
+ ---
44
+
45
+ ### Citation: ucberkeley-dlab/measuring-hate-speech
46
+
47
+ ```bibtex
48
+ @article{kennedy2020constructing,
49
+ title={Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application},
50
+ author={Kennedy, Chris J and Bacon, Geoff and Sahn, Alexander and von Vacano, Claudia},
51
+ journal={arXiv preprint arXiv:2009.10277},
52
+ year={2020}
53
+ }
54
+ ```
55
+
56
+ ### Citation: cardiffnlp/tweet_eval
57
+
58
+ ```bibtex
59
+ @inproceedings{basile-etal-2019-semeval,
60
+ title = "{S}em{E}val-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in {T}witter",
61
+ author = "Basile, Valerio and Bosco, Cristina and Fersini, Elisabetta and Nozza, Debora and Patti, Viviana and Rangel Pardo, Francisco Manuel and Rosso, Paolo and Sanguinetti, Manuela",
62
+ booktitle = "Proceedings of the 13th International Workshop on Semantic Evaluation",
63
+ year = "2019",
64
+ address = "Minneapolis, Minnesota, USA",
65
+ publisher = "Association for Computational Linguistics",
66
+ url = "https://www.aclweb.org/anthology/S19-2007",
67
+ doi = "10.18653/v1/S19-2007",
68
+ pages = "54--63"
69
+ }
70
+
71
+ ```
72
+
73
+ ### Citation: lmsys/toxic-chat
74
+
75
+ ```bibtex
76
+ @misc{lin2023toxicchat,
77
+ title={ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation},
78
+ author={Zi Lin and Zihan Wang and Yongqi Tong and Yangkun Wang and Yuxin Guo and Yujia Wang and Jingbo Shang},
79
+ year={2023},
80
+ eprint={2310.17389},
81
+ archivePrefix={arXiv},
82
+ primaryClass={cs.CL}
83
+ }
84
+ ```
85
+
86
+ ### Citation: KoalaAI/Text-Moderation-Multilingual
87
+
88
+ ```bibtex
89
+ @misc{text-moderation-large,
90
+ title={Text-Moderation-Multilingual: A Multilingual Text Moderation Dataset},
91
+ author={[KoalaAI]},
92
+ year={2025},
93
+ note={Aggregated from ifmain's and OpenAI's moderation datasets}
94
+ }
95
+ ```