crystalite-base / README.md
Jarrodbarnes's picture
Upload README.md with huggingface_hub
4bac69a verified
---
license: mit
tags:
- crystal-generation
- diffusion-model
- materials-science
- probe-gradient-guidance
library_name: pytorch
---
# Crystalite 10K (Alex-MP-20)
Crystalite checkpoint trained for 10K steps on the full Alex-MP-20 dataset (540K structures, 97.9% metals). This is the diversity-optimized model used for the Pareto sweep experiments.
**Architecture**: 67.8M-parameter Diffusion Transformer with subatomic tokenizer and GEM attention bias ([Crystalite](https://arxiv.org/abs/2604.02270), Hadzi Veljkovic et al.).
## Key results with probe-gradient guidance
| Guidance weight | In-window (4-6 eV) | Uniqueness | Metal % |
|---|---|---|---|
| 0 (baseline) | 0.1% | 99.7% | 96.9% |
| 10 | 31.8% | 99.7% | 0.1% |
| 15 | 33.7% | 99.6% | 0.0% |
Every guidance weight Pareto-dominates the baseline. 18,432 structures across 6 weights, 3 seeds, 1,024 per batch. No mode collapse.
Band gap probe AUROC: 0.957 (256 parameters, trained on atom-mean hidden states).
## Usage
Requires the [Crystalite](https://github.com/joshrosie/crystalite) codebase and [probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance) scripts.
```python
from scripts.train_probe import load_model
model = load_model("final.pt", device="cuda")
```
## Links
- **Blog post**: [Scaling Test-Time Verification for Novel Materials](https://dynamicalsystems.ai/blog/scaling-test-time-verification)
- **Code**: [Dynamical-Systems-Research/probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance)
- **Crystalite paper**: [arXiv:2604.02270](https://arxiv.org/abs/2604.02270)
## Used In
This checkpoint was used as an upstream generation asset in the open-world environment pipeline for **Training Scientific Judgment with Verified Environments for Autonomous Science**.
- **Scientific judgment blog post**: [Training Scientific Judgment](https://dynamicalsystems.ai/blog/training-scientific-judgment)
- **Public repo**: [Dynamical-Systems-Research/training-scientific-judgment](https://github.com/Dynamical-Systems-Research/training-scientific-judgment)
- **Paper PDF**: [Training Scientific Judgment with Verified Environments for Autonomous Science](https://github.com/Dynamical-Systems-Research/training-scientific-judgment/blob/main/paper/training-scientific-judgment.pdf)