DD 2.7: BBB Permeability Predictor

Fine-tuned ChemBERTa-77M-MLM on the BBBP dataset for binary classification of blood-brain barrier (BBB) permeability (permeable vs non-permeable).

Key Results (Scaffold Split Test Set, 13 Epochs)

Accuracy: 84.48%
F1 Score: 90.38%
ROC-AUC: 0.8321
PR-AUC: 0.9413

Training was done manually with PyTorch on a MacBook Air M1 (CPU only, due to accelerate/MPS compatibility issues).

Live Interactive Demo

Try the model right now without any code:
Gradio Space → BBB Predictor Demo

Features:

Enter any SMILES string
Get prediction + permeable probability
See molecule visualization
Pre-loaded examples (Ethanol, Aspirin, Caffeine, Doxorubicin)

How to Use

from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_id = "Yousuf7/ChemBERT-BBB-Permeability"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=2)

Downloads last month: 1

Safetensors

Model size

3.43M params

Tensor type

F32

Yousuf7
/

ChemBERT-BBB-Permeability

DD 2.7: BBB Permeability Predictor

Key Results (Scaffold Split Test Set, 13 Epochs)

Live Interactive Demo

How to Use

Space using Yousuf7/ChemBERT-BBB-Permeability 1