Dynamical-Systems
/

crystalite-base

crystal-generation

diffusion-model

materials-science

probe-gradient-guidance

Model card Files Files and versions

crystalite-base / README.md

Jarrodbarnes's picture

Upload README.md with huggingface_hub

4bac69a verified 2 days ago

|

history blame contribute delete

2.35 kB

	---
	license: mit
	tags:
	- crystal-generation
	- diffusion-model
	- materials-science
	- probe-gradient-guidance
	library_name: pytorch
	---

	# Crystalite 10K (Alex-MP-20)

	Crystalite checkpoint trained for 10K steps on the full Alex-MP-20 dataset (540K structures, 97.9% metals). This is the diversity-optimized model used for the Pareto sweep experiments.

	Architecture: 67.8M-parameter Diffusion Transformer with subatomic tokenizer and GEM attention bias ([Crystalite](https://arxiv.org/abs/2604.02270), Hadzi Veljkovic et al.).

	## Key results with probe-gradient guidance

	\| Guidance weight \| In-window (4-6 eV) \| Uniqueness \| Metal % \|
	\|---\|---\|---\|---\|
	\| 0 (baseline) \| 0.1% \| 99.7% \| 96.9% \|
	\| 10 \| 31.8% \| 99.7% \| 0.1% \|
	\| 15 \| 33.7% \| 99.6% \| 0.0% \|

	Every guidance weight Pareto-dominates the baseline. 18,432 structures across 6 weights, 3 seeds, 1,024 per batch. No mode collapse.

	Band gap probe AUROC: 0.957 (256 parameters, trained on atom-mean hidden states).

	## Usage

	Requires the [Crystalite](https://github.com/joshrosie/crystalite) codebase and [probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance) scripts.

	```python
	from scripts.train_probe import load_model
	model = load_model("final.pt", device="cuda")
	```

	## Links

	- Blog post: [Scaling Test-Time Verification for Novel Materials](https://dynamicalsystems.ai/blog/scaling-test-time-verification)
	- Code: [Dynamical-Systems-Research/probe-gradient-guidance](https://github.com/Dynamical-Systems-Research/probe-gradient-guidance)
	- Crystalite paper: [arXiv:2604.02270](https://arxiv.org/abs/2604.02270)

	## Used In

	This checkpoint was used as an upstream generation asset in the open-world environment pipeline for Training Scientific Judgment with Verified Environments for Autonomous Science.

	- Scientific judgment blog post: [Training Scientific Judgment](https://dynamicalsystems.ai/blog/training-scientific-judgment)
	- Public repo: [Dynamical-Systems-Research/training-scientific-judgment](https://github.com/Dynamical-Systems-Research/training-scientific-judgment)
	- Paper PDF: [Training Scientific Judgment with Verified Environments for Autonomous Science](https://github.com/Dynamical-Systems-Research/training-scientific-judgment/blob/main/paper/training-scientific-judgment.pdf)