Update README.md
Browse files
README.md
CHANGED
|
@@ -144,16 +144,16 @@ outputs = tokenizer.batch_encode_plus(smiles_list, padding=True, truncation=True
|
|
| 144 |
- [ChemMiniQ3-HoriFIE](https://github.com/gbyuvd/ChemMiniQ3-HoriFIE)
|
| 145 |
|
| 146 |
|
| 147 |
-
## 📚 Early VAE Evaluation (vs. ChemBERTa's) [WIP
|
| 148 |
-
1st Epoch, on
|
| 149 |
-
|
| 150 |
-
Planned: 8K samples, 10 epochs
|
| 151 |
|
| 152 |
Latent Space Visualization based on SMILES Interpolation Validity
|
| 153 |
|
| 154 |
-

|
| 145 |
|
| 146 |
|
| 147 |
+
## 📚 Early VAE Evaluation (vs. ChemBERTa's) [WIP for Scaling]
|
| 148 |
+
1st Epoch, on ~14K samples of len(token_ids)<=25; embed_dim=64, hidden_dim=128, latent_dim=64, num_layers=2; batch_size= 16 * 4 (grad acc)
|
|
|
|
|
|
|
| 149 |
|
| 150 |
Latent Space Visualization based on SMILES Interpolation Validity
|
| 151 |
|
| 152 |
+

|
| 153 |
+
|
| 154 |
+
using smitok (with tails)
|
| 155 |
|
| 156 |
+

|
| 157 |
|
| 158 |
```text
|
| 159 |
Loaded 8106 SMILES (assumed pre-canonicalized)
|