Add performance metrics and training details

- Fix dataset description (CATlas ATAC, not ChromHMM)
- Add test and validation metrics (accuracy, AUROC, avg precision, F1)
- Add training hyperparameters (lr, batch size, epochs, loss, optimizer)
- Add parameter count (72M)
- Add model architecture details (EnformerPretrainedModel)
- Add grelu version used for training

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Files changed (1) hide show

README.md +40 -5

README.md CHANGED Viewed

@@ -14,11 +14,46 @@ base_model:
 # human-atac-catlas-model
 ## Model Description
-This model is a multi-task classifier trained to predict the binary accessibility of genomic DNA sequences in 204 cell types. It was trained by fine-tuning the Enformer model using the `grelu` library on top of the human ChromHMM fullstack annotation dataset.
-- **Architecture:** Fine-tuned Enformer
 - **Input:** Genomic sequences (hg38)
-- **Output:** Probability of accessibility in 204 cell types.
 ## Repository Content
 1. `model.ckpt`: The trained model weights and hyperparameters (PyTorch Lightning checkpoint).
@@ -33,10 +68,10 @@ from grelu.lightning import LightningModel
 from huggingface_hub import hf_hub_download
 ckpt_path = hf_hub_download(
-    repo_id="Genentech/human-atac-catlas-model",
     filename="model.ckpt"
 )
 model = LightningModel.load_from_checkpoint(ckpt_path, weights_only=False)
 model.eval()
-```

 # human-atac-catlas-model
 ## Model Description
+This model is a multi-task classifier trained to predict the binary accessibility of genomic DNA sequences in 204 cell types. It was trained by fine-tuning the Enformer model using the `grelu` library on the human ATAC CATlas dataset.
+- **Architecture:** Fine-tuned Enformer (EnformerPretrainedModel)
 - **Input:** Genomic sequences (hg38)
+- **Output:** Probability of accessibility in 204 cell types
+- **Parameters:** 72M total (all trainable)
+## Performance
+Metrics are computed per cell type and averaged across all 204 cell types.
+### Test Set
+| Metric | Mean | Std | Min | Max |
+|--------|------|-----|-----|-----|
+| Accuracy | 0.9416 | 0.0175 | 0.8959 | 0.9743 |
+| AUROC | 0.9053 | 0.0167 | 0.8634 | 0.9467 |
+| Average Precision | 0.6097 | 0.0374 | 0.4545 | 0.7008 |
+| Best F1 | 0.5716 | 0.0289 | 0.4704 | 0.6395 |
+### Validation Set
+| Metric | Mean | Std | Min | Max |
+|--------|------|-----|-----|-----|
+| Accuracy | 0.9482 | 0.0172 | 0.9071 | 0.9789 |
+| AUROC | 0.8935 | 0.0190 | 0.8350 | 0.9379 |
+| Average Precision | 0.5524 | 0.0370 | 0.4168 | 0.6888 |
+| Best F1 | 0.5253 | 0.0299 | 0.4285 | 0.6309 |
+## Training Details
+| Parameter | Value |
+|-----------|-------|
+| Task | Binary classification |
+| Loss | Binary Cross-Entropy |
+| Optimizer | Adam |
+| Learning rate | 0.0001 |
+| Batch size | 3072 |
+| Max epochs | 10 |
+| n_transformers | 1 |
+| crop_len | 0 |
+| grelu version | 1.0.4.post1.dev39 |
 ## Repository Content
 1. `model.ckpt`: The trained model weights and hyperparameters (PyTorch Lightning checkpoint).
 from huggingface_hub import hf_hub_download
 ckpt_path = hf_hub_download(
+    repo_id="Genentech/human-atac-catlas-model",
     filename="model.ckpt"
 )
 model = LightningModel.load_from_checkpoint(ckpt_path, weights_only=False)
 model.eval()
+```