ribesstefano commited on
Commit
ad8e9f5
·
verified ·
1 Parent(s): 54eac4a

Initial version

Browse files
README.md CHANGED
@@ -18,41 +18,41 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [seyonec/ChemBERTa-zinc-base-v1](https://huggingface.co/seyonec/ChemBERTa-zinc-base-v1) on the ailab-bio/PROTAC-Splitter-Dataset dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.0003
22
- - Reassembly: 0.9991
23
- - Tanimoto Similarity: 0.0
24
- - Poi Valid: 1.0
25
- - E3 Tanimoto Similarity: 0.0
26
- - Reassembly Nostereo: 0.9991
27
- - Linker Has Attachment Point(s): 1.0
28
- - Heavy Atoms Difference Norm: -0.0000
29
- - Linker Valid: 1.0
30
- - E3 Heavy Atoms Difference: 0.0068
31
- - Linker Heavy Atoms Difference Norm: -0.0023
32
- - Num Fragments: 3.0
33
  - Valid: 1.0
34
- - Has Three Substructures: 1.0
 
 
35
  - E3 Graph Edit Distance Norm: inf
36
  - Poi Graph Edit Distance: inf
37
- - Poi Tanimoto Similarity: 0.0
38
- - Poi Graph Edit Distance Norm: inf
 
 
 
39
  - Linker Graph Edit Distance Norm: inf
40
- - All Ligands Equal: 0.9874
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
  - E3 Valid: 1.0
 
 
 
42
  - Poi Has Attachment Point(s): 1.0
43
- - Linker Tanimoto Similarity: 0.0
44
- - Linker Equal: 0.9878
45
- - Poi Equal: 0.9928
46
- - Poi Heavy Atoms Difference Norm: 0.0003
47
- - Heavy Atoms Difference: 0.0
48
- - Poi Heavy Atoms Difference: 0.0118
49
- - E3 Graph Edit Distance: inf
50
- - E3 Has Attachment Point(s): 1.0
51
- - Linker Graph Edit Distance: 0.1165
52
- - E3 Equal: 0.9944
53
- - E3 Heavy Atoms Difference Norm: 0.0003
54
- - Linker Heavy Atoms Difference: -0.0186
55
- - Has All Attachment Points: 1.0
56
 
57
  ## Model description
58
 
@@ -78,17 +78,33 @@ The following hyperparameters were used during training:
78
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
79
  - lr_scheduler_type: cosine_with_restarts
80
  - lr_scheduler_warmup_steps: 699
81
- - training_steps: 100000
82
  - mixed_precision_training: Native AMP
83
 
84
  ### Training results
85
 
86
- | Training Loss | Epoch | Step | Validation Loss | Reassembly | Tanimoto Similarity | Poi Valid | E3 Tanimoto Similarity | Reassembly Nostereo | Linker Has Attachment Point(s) | Heavy Atoms Difference Norm | Linker Valid | E3 Heavy Atoms Difference | Linker Heavy Atoms Difference Norm | Num Fragments | Valid | Has Three Substructures | E3 Graph Edit Distance Norm | Poi Graph Edit Distance | Poi Tanimoto Similarity | Poi Graph Edit Distance Norm | Linker Graph Edit Distance Norm | All Ligands Equal | E3 Valid | Poi Has Attachment Point(s) | Linker Tanimoto Similarity | Linker Equal | Poi Equal | Poi Heavy Atoms Difference Norm | Heavy Atoms Difference | Poi Heavy Atoms Difference | E3 Graph Edit Distance | E3 Has Attachment Point(s) | Linker Graph Edit Distance | E3 Equal | E3 Heavy Atoms Difference Norm | Linker Heavy Atoms Difference | Has All Attachment Points |
87
- |:-------------:|:------:|:------:|:---------------:|:----------:|:-------------------:|:---------:|:----------------------:|:-------------------:|:------------------------------:|:---------------------------:|:------------:|:-------------------------:|:----------------------------------:|:-------------:|:------:|:-----------------------:|:---------------------------:|:-----------------------:|:-----------------------:|:----------------------------:|:-------------------------------:|:-----------------:|:--------:|:---------------------------:|:--------------------------:|:------------:|:---------:|:-------------------------------:|:----------------------:|:--------------------------:|:----------------------:|:--------------------------:|:--------------------------:|:--------:|:------------------------------:|:-----------------------------:|:-------------------------:|
88
- | 0.0008 | 6.5543 | 70000 | 0.0003 | 0.9976 | 0.0 | 1.0 | 0.0 | 0.9978 | 1.0 | -0.0000 | 1.0 | 0.0063 | -0.0030 | 3.0 | 1.0 | 1.0 | inf | inf | 0.0 | inf | inf | 0.9846 | 1.0 | 1.0 | 0.0 | 0.9859 | 0.9915 | 0.0002 | -0.0009 | 0.0118 | inf | 1.0 | 0.1331 | 0.9935 | 0.0003 | -0.0189 | 1.0 |
89
- | 0.0011 | 7.4906 | 80000 | 0.0004 | 0.9968 | 0.0 | 0.9998 | 0.0 | 0.9968 | 1.0 | 0.0003 | 1.0 | 0.0048 | -0.0026 | 3.0 | 0.9998 | 1.0 | inf | inf | 0.0 | inf | inf | 0.9849 | 1.0 | 0.9998 | 0.0 | 0.9869 | 0.9916 | 0.0005 | 0.0326 | 0.0318 | inf | 1.0 | inf | 0.9937 | 0.0002 | -0.0146 | 1.0 |
90
- | 0.0006 | 8.4270 | 90000 | 0.0003 | 0.9990 | 0.0 | 1.0 | 0.0 | 0.9990 | 1.0 | 0.0000 | 1.0 | 0.0047 | -0.0018 | 3.0 | 1.0 | 1.0 | inf | inf | 0.0 | inf | inf | 0.9864 | 1.0 | 1.0 | 0.0 | 0.9867 | 0.9925 | -0.0005 | 0.0009 | -0.0036 | inf | 1.0 | inf | 0.9938 | 0.0002 | -0.0003 | 1.0 |
91
- | 0.0005 | 9.3633 | 100000 | 0.0003 | 0.9991 | 0.0 | 1.0 | 0.0 | 0.9991 | 1.0 | -0.0000 | 1.0 | 0.0068 | -0.0023 | 3.0 | 1.0 | 1.0 | inf | inf | 0.0 | inf | inf | 0.9874 | 1.0 | 1.0 | 0.0 | 0.9878 | 0.9928 | 0.0003 | 0.0 | 0.0118 | inf | 1.0 | 0.1165 | 0.9944 | 0.0003 | -0.0186 | 1.0 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
92
 
93
 
94
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [seyonec/ChemBERTa-zinc-base-v1](https://huggingface.co/seyonec/ChemBERTa-zinc-base-v1) on the ailab-bio/PROTAC-Splitter-Dataset dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0002
 
 
 
 
 
 
 
 
 
 
 
22
  - Valid: 1.0
23
+ - Linker Equal: 0.9882
24
+ - Linker Graph Edit Distance: 0.1158
25
+ - Linker Tanimoto Similarity: 0.0
26
  - E3 Graph Edit Distance Norm: inf
27
  - Poi Graph Edit Distance: inf
28
+ - E3 Has Attachment Point(s): 1.0
29
+ - Has Three Substructures: 1.0
30
+ - Reassembly: 0.9997
31
+ - E3 Heavy Atoms Difference Norm: 0.0002
32
+ - Linker Heavy Atoms Difference Norm: -0.0027
33
  - Linker Graph Edit Distance Norm: inf
34
+ - E3 Tanimoto Similarity: 0.0
35
+ - Poi Heavy Atoms Difference: 0.0124
36
+ - Linker Valid: 1.0
37
+ - Heavy Atoms Difference: -0.0001
38
+ - Num Fragments: 3.0
39
+ - All Ligands Equal: 0.9880
40
+ - Has All Attachment Points: 1.0
41
+ - E3 Graph Edit Distance: inf
42
+ - Linker Has Attachment Point(s): 1.0
43
+ - Poi Heavy Atoms Difference Norm: 0.0003
44
+ - Heavy Atoms Difference Norm: -0.0000
45
+ - Linker Heavy Atoms Difference: -0.0166
46
+ - Tanimoto Similarity: 0.0
47
+ - E3 Equal: 0.9951
48
+ - Poi Valid: 1.0
49
  - E3 Valid: 1.0
50
+ - Poi Tanimoto Similarity: 0.0
51
+ - Poi Graph Edit Distance Norm: inf
52
+ - Poi Equal: 0.9930
53
  - Poi Has Attachment Point(s): 1.0
54
+ - E3 Heavy Atoms Difference: 0.0041
55
+ - Reassembly Nostereo: 0.9997
 
 
 
 
 
 
 
 
 
 
 
56
 
57
  ## Model description
58
 
 
78
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
79
  - lr_scheduler_type: cosine_with_restarts
80
  - lr_scheduler_warmup_steps: 699
81
+ - training_steps: 500000
82
  - mixed_precision_training: Native AMP
83
 
84
  ### Training results
85
 
86
+ | Training Loss | Epoch | Step | All Ligands Equal | E3 Equal | E3 Graph Edit Distance | E3 Graph Edit Distance Norm | E3 Has Attachment Point(s) | E3 Heavy Atoms Difference | E3 Heavy Atoms Difference Norm | E3 Tanimoto Similarity | E3 Valid | Has All Attachment Points | Has Three Substructures | Heavy Atoms Difference | Heavy Atoms Difference Norm | Linker Equal | Linker Graph Edit Distance | Linker Graph Edit Distance Norm | Linker Has Attachment Point(s) | Linker Heavy Atoms Difference | Linker Heavy Atoms Difference Norm | Linker Tanimoto Similarity | Linker Valid | Validation Loss | Num Fragments | Poi Equal | Poi Graph Edit Distance | Poi Graph Edit Distance Norm | Poi Has Attachment Point(s) | Poi Heavy Atoms Difference | Poi Heavy Atoms Difference Norm | Poi Tanimoto Similarity | Poi Valid | Reassembly | Reassembly Nostereo | Tanimoto Similarity | Valid |
87
+ |:-------------:|:-------:|:------:|:-----------------:|:--------:|:----------------------:|:---------------------------:|:--------------------------:|:-------------------------:|:------------------------------:|:----------------------:|:--------:|:-------------------------:|:-----------------------:|:----------------------:|:---------------------------:|:------------:|:--------------------------:|:-------------------------------:|:------------------------------:|:-----------------------------:|:----------------------------------:|:--------------------------:|:------------:|:---------------:|:-------------:|:---------:|:-----------------------:|:----------------------------:|:---------------------------:|:--------------------------:|:-------------------------------:|:-----------------------:|:---------:|:----------:|:-------------------:|:-------------------:|:------:|
88
+ | 0.0008 | 6.5543 | 70000 | 0.9846 | 0.9935 | inf | inf | 1.0 | 0.0063 | 0.0003 | 0.0 | 1.0 | 1.0 | 1.0 | -0.0009 | -0.0000 | 0.9859 | 0.1331 | inf | 1.0 | -0.0189 | -0.0030 | 0.0 | 1.0 | 0.0003 | 3.0 | 0.9915 | inf | inf | 1.0 | 0.0118 | 0.0002 | 0.0 | 1.0 | 0.9976 | 0.9978 | 0.0 | 1.0 |
89
+ | 0.0011 | 7.4906 | 80000 | 0.9849 | 0.9937 | inf | inf | 1.0 | 0.0048 | 0.0002 | 0.0 | 1.0 | 1.0 | 1.0 | 0.0326 | 0.0003 | 0.9869 | inf | inf | 1.0 | -0.0146 | -0.0026 | 0.0 | 1.0 | 0.0004 | 3.0 | 0.9916 | inf | inf | 0.9998 | 0.0318 | 0.0005 | 0.0 | 0.9998 | 0.9968 | 0.9968 | 0.0 | 0.9998 |
90
+ | 0.0006 | 8.4270 | 90000 | 0.9864 | 0.9938 | inf | inf | 1.0 | 0.0047 | 0.0002 | 0.0 | 1.0 | 1.0 | 1.0 | 0.0009 | 0.0000 | 0.9867 | inf | inf | 1.0 | -0.0003 | -0.0018 | 0.0 | 1.0 | 0.0003 | 3.0 | 0.9925 | inf | inf | 1.0 | -0.0036 | -0.0005 | 0.0 | 1.0 | 0.9990 | 0.9990 | 0.0 | 1.0 |
91
+ | 0.0005 | 9.3633 | 100000 | 0.9874 | 0.9944 | inf | inf | 1.0 | 0.0068 | 0.0003 | 0.0 | 1.0 | 1.0 | 1.0 | 0.0 | -0.0000 | 0.9878 | 0.1165 | inf | 1.0 | -0.0186 | -0.0023 | 0.0 | 1.0 | 0.0003 | 3.0 | 0.9928 | inf | inf | 1.0 | 0.0118 | 0.0003 | 0.0 | 1.0 | 0.9991 | 0.9991 | 0.0 | 1.0 |
92
+ | 0.0004 | 32.7715 | 350000 | 0.0002 | 1.0 | 0.9888 | 0.1068 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9996 | 0.0003 | -0.0037 | inf | 0.0 | 0.0260 | 1.0 | 0.0004 | 3.0 | 0.9886 | 1.0 | inf | 1.0 | 0.0009 | 0.0000 | -0.0323 | 0.0 | 0.9949 | 1.0 | 1.0 | 0.0 | inf | 0.9937 | 1.0 | 0.0066 | 0.9996 |
93
+ | 0.0003 | 33.7079 | 360000 | 0.0002 | 1.0 | 0.9880 | 0.1166 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9994 | 0.0002 | -0.0030 | inf | 0.0 | 0.0190 | 1.0 | 0.0021 | 3.0 | 0.9877 | 1.0 | inf | 1.0 | 0.0005 | 0.0000 | -0.0203 | 0.0 | 0.9948 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0034 | 0.9994 |
94
+ | 0.0003 | 34.6442 | 370000 | 0.0002 | 1.0 | 0.9880 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9995 | 0.0002 | -0.0026 | inf | 0.0 | 0.0116 | 1.0 | -0.0012 | 3.0 | 0.9879 | 1.0 | inf | 1.0 | 0.0003 | -0.0000 | -0.0175 | 0.0 | 0.9953 | 1.0 | 1.0 | 0.0 | inf | 0.9927 | 1.0 | 0.0047 | 0.9995 |
95
+ | 0.0003 | 35.5805 | 380000 | 0.0002 | 1.0 | 0.9884 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9997 | 0.0002 | -0.0032 | inf | 0.0 | 0.0189 | 1.0 | -0.0005 | 3.0 | 0.9883 | 1.0 | inf | 1.0 | 0.0005 | -0.0000 | -0.0239 | 0.0 | 0.9951 | 1.0 | 1.0 | 0.0 | inf | 0.9933 | 1.0 | 0.0045 | 0.9997 |
96
+ | 0.0003 | 36.5169 | 390000 | 0.0002 | 1.0 | 0.9890 | 0.1027 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9996 | 0.0002 | -0.0030 | inf | 0.0 | 0.0237 | 1.0 | -0.0002 | 3.0 | 0.9887 | 1.0 | inf | 1.0 | 0.0008 | -0.0000 | -0.0285 | 0.0 | 0.9950 | 1.0 | 1.0 | 0.0 | inf | 0.9936 | 1.0 | 0.0046 | 0.9996 |
97
+ | 0.0003 | 37.4532 | 400000 | 0.0002 | 1.0 | 0.9890 | 0.1059 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9995 | 0.0002 | -0.0025 | inf | 0.0 | 0.0115 | 1.0 | -0.0002 | 3.0 | 0.9887 | 1.0 | inf | 1.0 | 0.0002 | -0.0000 | -0.0148 | 0.0 | 0.9949 | 1.0 | 1.0 | 0.0 | inf | 0.9938 | 1.0 | 0.0031 | 0.9995 |
98
+ | 0.0003 | 38.3895 | 410000 | 0.0002 | 1.0 | 0.9885 | 0.1101 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9995 | 0.0002 | -0.0033 | inf | 0.0 | 0.0214 | 1.0 | -0.0002 | 3.0 | 0.9882 | 1.0 | inf | 1.0 | 0.0006 | -0.0000 | -0.0250 | 0.0 | 0.9950 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0034 | 0.9995 |
99
+ | 0.0003 | 39.3258 | 420000 | 0.0002 | 0.9999 | 0.9887 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9993 | 0.0002 | -0.0026 | inf | 0.0 | 0.0184 | 1.0 | 0.0064 | 3.0 | 0.9881 | 1.0 | inf | 1.0 | 0.0005 | 0.0001 | -0.0207 | 0.0 | 0.9950 | 0.9999 | 1.0 | 0.0 | inf | 0.9930 | 0.9999 | 0.0047 | 0.9993 |
100
+ | 0.0003 | 40.2622 | 430000 | 0.0002 | 1.0 | 0.9884 | 0.1125 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9999 | 0.0002 | -0.0034 | inf | 0.0 | 0.0236 | 1.0 | 0.0 | 3.0 | 0.9883 | 1.0 | 0.0333 | 1.0 | 0.0007 | 0.0 | -0.0272 | 0.0 | 0.9949 | 1.0 | 1.0 | 0.0 | inf | 0.9933 | 1.0 | 0.0036 | 0.9999 |
101
+ | 0.0003 | 41.1985 | 440000 | 0.0002 | 1.0 | 0.9882 | 0.1105 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9993 | 0.0002 | -0.0032 | 0.0022 | 0.0 | 0.0229 | 1.0 | -0.0008 | 3.0 | 0.9877 | 1.0 | inf | 1.0 | 0.0007 | -0.0000 | -0.0275 | 0.0 | 0.9943 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0038 | 0.9994 |
102
+ | 0.0003 | 42.1348 | 450000 | 0.0002 | 1.0 | 0.9880 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9994 | 0.0002 | -0.0027 | inf | 0.0 | 0.0171 | 1.0 | 0.0009 | 3.0 | 0.9877 | 1.0 | inf | 1.0 | 0.0005 | 0.0000 | -0.0201 | 0.0 | 0.9949 | 1.0 | 1.0 | 0.0 | inf | 0.9929 | 1.0 | 0.0039 | 0.9995 |
103
+ | 0.0003 | 43.0712 | 460000 | 0.0002 | 1.0 | 0.9883 | 0.1074 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9997 | 0.0002 | -0.0033 | inf | 0.0 | 0.0244 | 1.0 | 0.0009 | 3.0 | 0.9882 | 1.0 | inf | 1.0 | 0.0008 | 0.0000 | -0.0282 | 0.0 | 0.9950 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0047 | 0.9997 |
104
+ | 0.0003 | 44.0075 | 470000 | 0.0002 | 1.0 | 0.9887 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9994 | 0.0002 | -0.0032 | inf | 0.0 | 0.0232 | 1.0 | 0.0009 | 3.0 | 0.9885 | 1.0 | inf | 1.0 | 0.0007 | 0.0000 | -0.0272 | 0.0 | 0.9950 | 1.0 | 1.0 | 0.0 | inf | 0.9936 | 1.0 | 0.0049 | 0.9994 |
105
+ | 0.0003 | 44.9438 | 480000 | 0.0002 | 1.0 | 0.9892 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9997 | 0.0002 | -0.0037 | inf | 0.0 | 0.0295 | 1.0 | -0.0001 | 3.0 | 0.9890 | 1.0 | inf | 1.0 | 0.0010 | -0.0000 | -0.0341 | 0.0 | 0.9951 | 1.0 | 1.0 | 0.0 | inf | 0.9938 | 1.0 | 0.0044 | 0.9997 |
106
+ | 0.0003 | 45.8801 | 490000 | 0.0002 | 1.0 | 0.9884 | inf | 0.0 | inf | inf | 1.0 | 1.0 | 0.9997 | 0.0002 | -0.0027 | inf | 0.0 | 0.0108 | 1.0 | -0.0001 | 3.0 | 0.9882 | 1.0 | inf | 1.0 | 0.0002 | -0.0000 | -0.0148 | 0.0 | 0.9951 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0040 | 0.9997 |
107
+ | 0.0003 | 46.8165 | 500000 | 0.0002 | 1.0 | 0.9882 | 0.1158 | 0.0 | inf | inf | 1.0 | 1.0 | 0.9997 | 0.0002 | -0.0027 | inf | 0.0 | 0.0124 | 1.0 | -0.0001 | 3.0 | 0.9880 | 1.0 | inf | 1.0 | 0.0003 | -0.0000 | -0.0166 | 0.0 | 0.9951 | 1.0 | 1.0 | 0.0 | inf | 0.9930 | 1.0 | 0.0041 | 0.9997 |
108
 
109
 
110
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fb6c56954e78dc0009a37a49227c9d09ca6b44ee0bf58f8054ecb3de57d1f0ca
3
  size 409608164
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fd86a5661c5814794bc18e7a100b4262437940c6729b336b0ae599d33e7f2a77
3
  size 409608164
runs/Jan23_17-42-34_alvis4-22/events.out.tfevents.1737650571.alvis4-22.1725594.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:523b9260b3a00e23f8455455a9246a3bd30f935c67be26eae7a1e71329502725
3
- size 133036
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e2157d8519e0098cbd0a9e89a6cd327d1436b6d15b4f152cbdbfea415d0f1028
3
+ size 138076