DD 2.7: BBB Permeability Predictor

Fine-tuned ChemBERTa-77M-MLM on the BBBP dataset for binary classification of blood-brain barrier (BBB) permeability (permeable vs non-permeable).

Key Results (Scaffold Split Test Set, 13 Epochs)

  • Accuracy: 84.48%
  • F1 Score: 90.38%
  • ROC-AUC: 0.8321
  • PR-AUC: 0.9413

Training was done manually with PyTorch on a MacBook Air M1 (CPU only, due to accelerate/MPS compatibility issues).

Live Interactive Demo

Try the model right now without any code:
Gradio Space โ†’ BBB Predictor Demo

Features:

  • Enter any SMILES string
  • Get prediction + permeable probability
  • See molecule visualization
  • Pre-loaded examples (Ethanol, Aspirin, Caffeine, Doxorubicin)

How to Use

from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_id = "Yousuf7/ChemBERT-BBB-Permeability"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=2)
Downloads last month
1
Safetensors
Model size
3.43M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using Yousuf7/ChemBERT-BBB-Permeability 1