You need data that is already labeled. You can find an example among my datasets (jigsaw-toxic-comment-classification).
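To make "already labeled" concrete, here is a minimal sketch of what such data can look like once loaded. It assumes a CSV layout with one text column (`comment_text`) and one 0/1 column per label; the sample rows, the reduced label set, and the helper name `load_labeled_rows` are made up for illustration and are not taken from the dataset itself.

```python
import csv
import io

# Hypothetical sample in the assumed layout: one text column plus one
# 0/1 column per label. The rows are invented for illustration only.
SAMPLE_CSV = """id,comment_text,toxic,insult
1,"Thanks for the helpful edit.",0,0
2,"You are an idiot.",1,1
3,"I disagree with this change.",0,0
"""

# Assumed label columns; a real dataset may have more or different ones.
LABEL_COLUMNS = ["toxic", "insult"]


def load_labeled_rows(csv_file):
    """Return (text, labels) pairs, where labels maps label name -> 0 or 1."""
    rows = []
    for record in csv.DictReader(csv_file):
        labels = {name: int(record[name]) for name in LABEL_COLUMNS}
        rows.append((record["comment_text"], labels))
    return rows


# In practice you would pass an open file instead of the in-memory sample.
examples = load_labeled_rows(io.StringIO(SAMPLE_CSV))
for text, labels in examples:
    print(labels, text)
```

Each row pairs the input text with its known answer, which is what a supervised classifier needs for training.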