Update README.md
README.md
CHANGED
|
@@ -18,7 +18,6 @@ base_model:
|
|
| 18 |
# ChemSolubilityBERTa
|
| 19 |
## Model Description
|
| 20 |
ChemSolubilityBERTa is a prototype model that predicts the aqueous solubility of chemical compounds from their SMILES representations. It is based on ChemBERTa, a BERT-like transformer architecture pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility by fine-tuning it on the ESOL (Estimated SOLubility) dataset, a water-solubility dataset of 1,128 samples. Given a SMILES string as input, the model outputs a log solubility value (log mol/L).
|
| 21 |
-
|
| 22 |
You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
|
| 23 |
|
| 24 |
## Fine-Tuning Details
|
|
@@ -42,9 +41,17 @@ inputs = tokenizer(smiles_string, return_tensors='pt')
|
|
| 42 |
outputs = model(**inputs)
|
| 43 |
solubility = outputs.logits.item()
|
| 44 |
print(f"Predicted solubility: {solubility}")
|
| 45 |
|
| 46 |
-
|
| 47 |
-
|
| 48 |
-
This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).
|
| 49 |
|
| 50 |
|
| 18 |
# ChemSolubilityBERTa
|
| 19 |
## Model Description
|
| 20 |
ChemSolubilityBERTa is a prototype model that predicts the aqueous solubility of chemical compounds from their SMILES representations. It is based on ChemBERTa, a BERT-like transformer architecture pre-trained on 77M SMILES strings for molecular property prediction. We adapted ChemBERTa to predict solubility by fine-tuning it on the ESOL (Estimated SOLubility) dataset, a water-solubility dataset of 1,128 samples. Given a SMILES string as input, the model outputs a log solubility value (log mol/L).
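Because the output is a logarithm, it can help to convert a prediction back into a concentration. The following is a minimal sketch, assuming the logarithm is base 10 (the convention used by ESOL). The function names, the example prediction, and the molar mass are hypothetical and chosen only for illustration; they are not part of the model or its README.

```python
def log_s_to_mol_per_l(log_s: float) -> float:
    """Convert a log solubility value to mol/L, assuming a base-10 logarithm."""
    return 10.0 ** log_s


def mol_per_l_to_g_per_l(mol_per_l: float, molar_mass_g_per_mol: float) -> float:
    """Convert a molar concentration (mol/L) to a mass concentration (g/L)."""
    return mol_per_l * molar_mass_g_per_mol


if __name__ == "__main__":
    # Hypothetical values for illustration only, not real model output.
    example_log_s = -2.0        # stands in for a predicted log solubility
    example_molar_mass = 100.0  # g/mol, illustrative molar mass

    molar = log_s_to_mol_per_l(example_log_s)
    mass = mol_per_l_to_g_per_l(molar, example_molar_mass)
    print(f"log S = {example_log_s} corresponds to {molar:.4g} mol/L ({mass:.4g} g/L)")
```

A more negative log solubility means a less soluble compound: each unit decrease corresponds to a tenfold drop in molar solubility.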
|
|
|
|
| 21 |
You can read the full paper [here](./01_ChemSolubilityBERTa.pdf).
|
| 22 |
|
| 23 |
## Fine-Tuning Details
|
|
|
|
| 41 |
outputs = model(**inputs)
|
| 42 |
solubility = outputs.logits.item()
|
| 43 |
print(f"Predicted solubility: {solubility}")
|
| 44 |
+
```
|
| 45 |
+
## Citation and Usage
|
| 46 |
|
| 47 |
+
If you use ChemSolubilityBERTa in your research or projects, please cite the following:
|
| 48 |
|
| 49 |
+
@misc{ChemSolubilityBERTa,
|
| 50 |
+
author = {Farooq Khan},
|
| 51 |
+
title = {ChemSolubilityBERTa: A Transformer-Based Model for Predicting Aqueous Solubility from SMILES},
|
| 52 |
+
year = {2024},
|
| 53 |
+
url = {https://huggingface.co/khanfs/ChemSolubilityBERTa}
|
| 54 |
+
}
|
| 55 |
|
| 56 |
+
## License
|
| 57 |
+
This model is licensed under the [MIT License](https://opensource.org/licenses/MIT).
|