File size: 2,348 Bytes
2324335
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4bac69a
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
---
license: mit
tags:
  - crystal-generation
  - diffusion-model
  - materials-science
  - probe-gradient-guidance
library_name: pytorch
---

# Crystalite 10K (Alex-MP-20)

Crystalite checkpoint trained for 10K steps on the full Alex-MP-20 dataset (540K structures, 97.9% metals). This is the diversity-optimized model used for the Pareto sweep experiments.

**Architecture**: 67.8M-parameter Diffusion Transformer with subatomic tokenizer and GEM attention bias ([Crystalite](https://arxiv.org/abs/2604.02270), Hadzi Veljkovic et al.).

## Key results with probe-gradient guidance

| Guidance weight | In-window (4-6 eV) | Uniqueness | Metal % |
|---|---|---|---|
| 0 (baseline) | 0.1% | 99.7% | 96.9% |
| 10 | 31.8% | 99.7% | 0.1% |
| 15 | 33.7% | 99.6% | 0.0% |

Every guidance weight Pareto-dominates the baseline. 18,432 structures across 6 weights, 3 seeds, 1,024 per batch. No mode collapse.

Band gap probe AUROC: 0.957 (256 parameters, trained on atom-mean hidden states).

## Usage

Requires the [Crystalite](https://github.com/joshrosie/crystalite) codebase and [probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance) scripts.

```python
from scripts.train_probe import load_model
model = load_model("final.pt", device="cuda")
```

## Links

- **Blog post**: [Scaling Test-Time Verification for Novel Materials](https://dynamicalsystems.ai/blog/scaling-test-time-verification)
- **Code**: [Dynamical-Systems-Research/probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance)
- **Crystalite paper**: [arXiv:2604.02270](https://arxiv.org/abs/2604.02270)

## Used In

This checkpoint was used as an upstream generation asset in the open-world environment pipeline for **Training Scientific Judgment with Verified Environments for Autonomous Science**.

- **Scientific judgment blog post**: [Training Scientific Judgment](https://dynamicalsystems.ai/blog/training-scientific-judgment)
- **Public repo**: [Dynamical-Systems-Research/training-scientific-judgment](https://github.com/Dynamical-Systems-Research/training-scientific-judgment)
- **Paper PDF**: [Training Scientific Judgment with Verified Environments for Autonomous Science](https://github.com/Dynamical-Systems-Research/training-scientific-judgment/blob/main/paper/training-scientific-judgment.pdf)