hate_speech

This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.7742
Model Preparation Time: 0.0024
Accuracy: 0.8037
Auc Score: 0.8861
F1: 0.8318
Precision: 0.8010
Recall: 0.8651

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 3e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
num_epochs: 3

Training results

Training Loss	Epoch	Step	Validation Loss	Model Preparation Time	Accuracy	Auc Score	F1	Precision	Recall
0.6024	0.1054	100	0.6052	0.0024	0.6806	0.7925	0.6496	0.8453	0.5274
0.5439	0.2107	200	0.5120	0.0024	0.7514	0.8372	0.7941	0.7419	0.8542
0.515	0.3161	300	0.5180	0.0024	0.7538	0.8469	0.8035	0.7278	0.8969
0.5225	0.4215	400	0.5000	0.0024	0.7698	0.8393	0.7863	0.8210	0.7544
0.4935	0.5269	500	0.5008	0.0024	0.768	0.8457	0.7961	0.7855	0.8070
0.5196	0.6322	600	0.5069	0.0024	0.7674	0.8473	0.8023	0.767	0.8410
0.4918	0.7376	700	0.5011	0.0024	0.7655	0.8565	0.8109	0.7407	0.8958
0.5182	0.8430	800	0.4873	0.0024	0.7902	0.8616	0.8150	0.8067	0.8235
0.4749	0.9484	900	0.4606	0.0024	0.7815	0.8674	0.8109	0.7886	0.8344
0.4042	1.0537	1000	0.5453	0.0024	0.7852	0.8735	0.8211	0.7709	0.8783
0.3593	1.1591	1100	0.5650	0.0024	0.7791	0.8745	0.8193	0.7572	0.8925
0.3911	1.2645	1200	0.5108	0.0024	0.8025	0.8783	0.8264	0.8154	0.8377
0.3445	1.3699	1300	0.6231	0.0024	0.7902	0.8815	0.8265	0.7711	0.8904
0.4027	1.4752	1400	0.5336	0.0024	0.8062	0.8796	0.8239	0.8404	0.8081
0.3058	1.5806	1500	0.6094	0.0024	0.7957	0.8760	0.8232	0.8002	0.8476
0.3535	1.6860	1600	0.5834	0.0024	0.7951	0.8810	0.8254	0.7910	0.8629
0.3713	1.7914	1700	0.5286	0.0024	0.7969	0.8817	0.8278	0.7898	0.8695
0.359	1.8967	1800	0.5292	0.0024	0.8086	0.8819	0.8290	0.8313	0.8268
0.3762	2.0021	1900	0.5222	0.0024	0.8037	0.8814	0.8297	0.8085	0.8520
0.2101	2.1075	2000	0.6738	0.0024	0.8055	0.8793	0.8271	0.8253	0.8289
0.2307	2.2129	2100	0.7485	0.0024	0.8012	0.8845	0.8324	0.7901	0.8794
0.2403	2.3182	2200	0.7186	0.0024	0.8049	0.8818	0.8322	0.8045	0.8618
0.221	2.4236	2300	0.7233	0.0024	0.8074	0.8818	0.8334	0.8097	0.8586
0.2112	2.5290	2400	0.7259	0.0024	0.8123	0.8844	0.8345	0.8260	0.8432
0.2155	2.6344	2500	0.7302	0.0024	0.8117	0.8854	0.8342	0.8244	0.8443
0.1997	2.7397	2600	0.7658	0.0024	0.8074	0.8832	0.8289	0.8266	0.8311
0.2761	2.8451	2700	0.7838	0.0024	0.8037	0.8869	0.8334	0.7956	0.875
0.1878	2.9505	2800	0.7742	0.0024	0.8037	0.8861	0.8318	0.8010	0.8651

Framework versions

Transformers 4.53.0
Pytorch 2.7.1+cu126
Datasets 3.6.0
Tokenizers 0.21.2

Downloads last month: -

Safetensors

Model size

0.1B params

Tensor type

F32