Upload folder using huggingface_hub

41dab69 verified 3 days ago

4.65 kB

license: apache-2.0
tags:
  - materials
  - diffusion
  - crystal-structure-prediction
  - editing
  - mattergen

ALM Edit · crystal-structure prediction + text-conditioned editing

ALM Edit conditions a from-scratch CSP-mode MatterGen decoder (csp_backbone) on an input structure + text instruction via a producer-consumer bridge: K=8 [atoms_i] soft tokens plus prompt context feed a learnable-query producer (M=16), whose tokens the cross-attention consumer reads inside the decoder. It performs crystal-structure prediction and the ALM Bench editing tasks.

This is a full-finetuning run: it bundles a fully fine-tuned Qwen3-8B (llm_full_ft/qwen3_state_dict.pt, ~16 GB), which the loader auto-detects and uses in place of any LoRA. Contents: atoms_mapper.pt (producer-consumer bridge) + llm_full_ft/

projector_and_state.pt + the csp_backbone/ decoder dir (config + last.ckpt).

Crystal-structure prediction: match rate MR (%, ↑) and RMSE (Å, ↓) to the MP-20 and MPTS-52 test sets at K=1 and best-of-K=20 (95% CIs in the paper):

Model	MP-20 MR@1	RMSE@1	MR@20	RMSE@20	MPTS-52 MR@1	RMSE@1	MR@20	RMSE@20
CDVAE	33.90	0.1045	66.95	0.1026	5.34	0.2106	20.79	0.2085
DiffCSP	51.49	0.0631	77.93	0.0492	12.19	0.1786	34.02	0.1749
FlowMM	61.39	0.0566	n/a	n/a	17.54	0.1726	n/a	n/a
CrystaLLM-large	58.70	0.0408	73.97	0.0349	19.21	0.1110	33.75	0.1059
CrystalFlow	62.02	0.0710	78.34	0.0577	21.00	0.1613	37.81	0.1584
OMatG	63.75	0.0720	n/a	n/a	25.15	0.1931	n/a	n/a
MCFlow-L	64.08	0.0561	76.08	0.0383	27.16	0.1401	41.45	0.1296
ALM Edit	45.6	0.021	83.2	0.034	22.7	0.022	45.7	0.038
ALM Gen + T2C-FK	22.3	0.025	41.0	0.012	6.0	0.040	10.0	0.011

ALM Edit sets SoTA RMSE and best-of-K=20 match rate on both benchmarks (it learns a valid polymorph distribution); it sees composition + space group at train time, only composition at inference.

ALM Bench directional editing per property (E_f, ρ, V) and direction (↑/↓), plus polymorph/doping/strain (N=7×1000; every metric scores invalid generations, trivial lattice rescalings, and unphysical relabelings as failures). Frontier LLMs were prompted to read/write CIFs:

Model	E_f↑	E_f↓	ρ↑	ρ↓	V↑	V↓	Polymorph	Doping	Strain
ALM Edit	0.613	0.624	0.353	0.367	0.451	0.355	0.224	0.879	0.151
GPT-4o	0.505	0.469	0.024	0.127	0.081	0.018	0.040	0.007	0.000
GPT-4.1	0.465	0.496	0.007	0.239	0.276	0.040	0.083	0.003	0.000
GPT-5.2	0.437	0.414	0.058	0.244	0.006	0.032	0.118	0.002	0.000

Text-conditioned generation (Application = LLM-judged fit; Describe/OOD = composition & structure consistency, N=7×1000):

Model	Application	Describe (Comp.)	Describe (Struct.)	OOD (Comp.)	OOD (Struct.)
ALM Edit	0.423	0.730	0.412	0.474	0.231
GPT-4o	0.131	0.279	0.121	0.130	0.025
GPT-4.1	0.224	0.254	0.090	0.168	0.035
GPT-5.2	0.252	0.356	0.162	0.263	0.075

Edit / generate a structure (inference):

# composition/description -> structure, using the CSP-mode backbone
alm-generate generate --alm_checkpoint alm-edit \
    --atoms_mapper alm-edit/atoms_mapper.pt --mattergen_model_path alm-edit/csp_backbone \
    --prompt "An orthorhombic perovskite of calcium and titanium." --num_samples 8 --out_dir gen_out

Evaluate (CSP M@20 / RMSE, and the ALM Bench editing tasks):

alm-eval-csp --ckpt_dir alm-edit/csp_backbone \
    --guidance_factor 0.5 --out_dir out/csp
alm-eval-almbench --alm_checkpoint alm-edit \
    --atoms_mapper alm-edit/atoms_mapper.pt --bridge_lora_dir none \
    --mattergen_model_path alm-edit/csp_backbone --guidance_factor 0.5

Links

Paper: arXiv · HuggingFace · Code: GitHub

License

Apache-2.0.

Citation

@article{edamadaka2026atomistic,
  title   = {Atomistic Language Models Understand and Generate Materials},
  author  = {Edamadaka, Sathya and Ramesh, Krithik and Li, Ju and G\'omez-Bombarelli, Rafael},
  journal = {arXiv preprint arXiv:2606.21395},
  year    = {2026}
}