Upload README.md with huggingface_hub
Browse files
README.md
ADDED
|
@@ -0,0 +1,42 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
tags:
|
| 4 |
+
- crystal-generation
|
| 5 |
+
- diffusion-model
|
| 6 |
+
- materials-science
|
| 7 |
+
- probe-gradient-guidance
|
| 8 |
+
library_name: pytorch
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# Crystalite 10K (Alex-MP-20)
|
| 12 |
+
|
| 13 |
+
Crystalite checkpoint trained for 10K steps on the full Alex-MP-20 dataset (540K structures, 97.9% metals). This is the diversity-optimized model used for the Pareto sweep experiments.
|
| 14 |
+
|
| 15 |
+
**Architecture**: 67.8M-parameter Diffusion Transformer with subatomic tokenizer and GEM attention bias ([Crystalite](https://arxiv.org/abs/2604.02270), Hadzi Veljkovic et al.).
|
| 16 |
+
|
| 17 |
+
## Key results with probe-gradient guidance
|
| 18 |
+
|
| 19 |
+
| Guidance weight | In-window (4-6 eV) | Uniqueness | Metal % |
|
| 20 |
+
|---|---|---|---|
|
| 21 |
+
| 0 (baseline) | 0.1% | 99.7% | 96.9% |
|
| 22 |
+
| 10 | 31.8% | 99.7% | 0.1% |
|
| 23 |
+
| 15 | 33.7% | 99.6% | 0.0% |
|
| 24 |
+
|
| 25 |
+
Every guidance weight Pareto-dominates the baseline. 18,432 structures across 6 weights, 3 seeds, 1,024 per batch. No mode collapse.
|
| 26 |
+
|
| 27 |
+
Band gap probe AUROC: 0.957 (256 parameters, trained on atom-mean hidden states).
|
| 28 |
+
|
| 29 |
+
## Usage
|
| 30 |
+
|
| 31 |
+
Requires the [Crystalite](https://github.com/joshrosie/crystalite) codebase and [probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance) scripts.
|
| 32 |
+
|
| 33 |
+
```python
|
| 34 |
+
from scripts.train_probe import load_model
|
| 35 |
+
model = load_model("final.pt", device="cuda")
|
| 36 |
+
```
|
| 37 |
+
|
| 38 |
+
## Links
|
| 39 |
+
|
| 40 |
+
- **Blog post**: [Scaling Test-Time Verification for Novel Materials](https://dynamicalsystems.ai/blog/scaling-test-time-verification)
|
| 41 |
+
- **Code**: [Dynamical-Systems-Research/probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance)
|
| 42 |
+
- **Crystalite paper**: [arXiv:2604.02270](https://arxiv.org/abs/2604.02270)
|