Commit 6460378 by andriadze · verified · 1 Parent(s): 90c7d49

Training in progress, epoch 1

Files changed (3)
  1. README.md +21 -31
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -14,35 +14,26 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # ai-chat-censor6

- The primary focus of the model is detecting sexual/minors category of messages.
- Main goal of the model is to detect and prevent illegal use of uncensored chatbots, because of this main focus was detecting "underage" comments.
- Model is trigger happy about this specific tag.
- For example: "Can you roleplay as 16 year old girl" will be tagged as "underage" by this model, meanwhile openai omni-moderation does not flag same message.
-
-
-
- Possible flags are: regular, racist, underage, sexual


- # BEWARE

- The model might categorize any talk about race as racism, for example: "Black people suffer so much in America" will be flagged as "racist".


  ## Training and evaluation data

- The model uses variety of datasets, mostly focusing on casual conversation and sexual content.
- The dataset contains around 50k messages.
- Due to a lack of data, underage comments and requests were synthetically generated by uncensored qwen2-72b.
-
-
- This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.0637
- - Accuracy: 0.9903


  ### Training hyperparameters

@@ -53,23 +44,22 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 6

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 0.0471 | 1.0 | 1175 | 0.0729 | 0.9854 |
- | 0.0282 | 2.0 | 2350 | 0.0529 | 0.9900 |
- | 0.0105 | 3.0 | 3525 | 0.0680 | 0.9888 |
- | 0.0079 | 4.0 | 4700 | 0.0558 | 0.9911 |
- | 0.0017 | 5.0 | 5875 | 0.0595 | 0.9902 |
- | 0.0001 | 6.0 | 7050 | 0.0637 | 0.9903 |


  ### Framework versions

- - Transformers 4.44.2
- - Pytorch 2.4.0+cu121
- - Datasets 3.0.0
- - Tokenizers 0.19.1
 
@@ -14,35 +14,26 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

+ # ai-chat-censor

+ This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.1202
+ - Accuracy: 0.9879

+ ## Model description

+ More information needed

+ ## Intended uses & limitations

+ More information needed

  ## Training and evaluation data

+ More information needed

+ ## Training procedure

  ### Training hyperparameters

@@ -53,23 +44,22 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 5

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | 0.1279 | 1.0 | 1636 | 0.0890 | 0.9832 |
+ | 0.0743 | 2.0 | 3272 | 0.1128 | 0.9861 |
+ | 0.0373 | 3.0 | 4908 | 0.1098 | 0.9878 |
+ | 0.007 | 4.0 | 6544 | 0.1353 | 0.9886 |
+ | 0.0018 | 5.0 | 8180 | 0.1202 | 0.9879 |


  ### Framework versions

+ - Transformers 4.45.2
+ - Pytorch 2.3.1
+ - Datasets 3.0.1
+ - Tokenizers 0.20.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:be391ebd402747377c681f845e350e380b18beba0bd04014dcbf70f98372b207
  size 267838720

  version https://git-lfs.github.com/spec/v1
+ oid sha256:0b1e69aaa57a4ef79f298cd928d4412dbd0a3d39416fa48e26c61942d70f8246
  size 267838720
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e50b41dc4b25e4145f04e98320e0954e138e9c7e19324dd73b363c26b9b54a83
  size 5176

  version https://git-lfs.github.com/spec/v1
+ oid sha256:1e7b3b464157ee272958ff03945b69cf348484b701eb667b396853bc4cab6131
  size 5176
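
Note on the two binary files: `model.safetensors` and `training_args.bin` appear in this diff only as Git LFS pointers (a `version` line, an `oid sha256:…` line, and a `size` line), so the diff shows that the content hash changed while the byte size stayed the same. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, chosen for illustration, and are not part of this repository or of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict.

    Hypothetical helper: it assumes the three-line layout shown in the
    diff above (version, oid, size).
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>", e.g. "sha256:0b1e...".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    data_path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Read in 1 MiB chunks so large model files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

To use it, pass the three pointer lines for the new `model.safetensors` to `parse_lfs_pointer`, then call `matches_pointer` with the local path of the downloaded file. A `False` result would suggest a truncated download or a file from a different commit.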