c-bone commited on
Commit
2c9f973
·
verified ·
1 Parent(s): 98bc6b2

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +99 -0
README.md ADDED
@@ -0,0 +1,99 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ library_name: transformers
6
+ tags:
7
+ - materials-science
8
+ - crystallography
9
+ - generative-ai
10
+ - inverse-design
11
+ - chemistry
12
+ - photovoltaics
13
+ datasets:
14
+ - c-bone/mpdb-slme-full
15
+ base_model: c-bone/CrystaLLM-pi_base
16
+ pipeline_tag: text-generation
17
+ ---
18
+
19
+ # Model Card for CrystaLLM-pi_SLME
20
+
21
+ ## Model Details
22
+
23
+ ### Model Description
24
+
25
+ **CrystaLLM-pi_SLME** is a conditional generative model designed for the discovery of high-performance photovoltaic materials. It is a fine-tuned version of the `CrystaLLM-pi` framework, based on a GPT-2 decoder-only architecture. This variant employs the **Property-Key-Value (PKV)** attention mechanism to condition the generation of Crystallographic Information Files (CIFs) on the **Spectroscopic Limited Maximum Efficiency (SLME)** metric.
26
+
27
+ The model generates crystal structures based on a single target scalar property:
28
+ 1. **SLME** (%) - A theoretical maximum efficiency metric for photovoltaic absorbers.
29
+
30
+ - **Developed by:** Bone et al. (University College London)
31
+ - **Model type:** Autoregressive Transformer with Prefix Attention Conditioning
32
+ - **Language(s):** CIF (Crystallographic Information File) syntax
33
+ - **License:** MIT
34
+ - **Finetuned from model:** `c-bone/CrystaLLM-pi_base`
35
+
36
+ ### Model Sources
37
+
38
+ - **Repository:** [GitHub: CrystaLLM-pi](https://github.com/C-Bone-UCL/CrystaLLM-pi)
39
+ - **Paper:** [Discovery and recovery of crystalline materials with property-conditioned transformers (arXiv:2511.21299)](https://arxiv.org/abs/2511.21299)
40
+ - **Dataset:** [HuggingFace: c-bone/mpdb-slme-full](https://huggingface.co/datasets/c-bone/mpdb-slme-full)
41
+
42
+ ## Uses
43
+
44
+ ### Direct Use
45
+
46
+ The model is intended for the exploration of chemical space for new photovoltaic candidates. Users can condition generation on high SLME values (e.g., >25%) to discover novel materials with optimal optical and electronic properties for solar energy conversion.
47
+
48
+ ### Out-of-Scope Use
49
+
50
+ - **Large Unit Cells:** Context window limit applies (~1024 tokens).
51
+ - **Production Deployment:** Generated structures are theoretical predictions. Verification via Hybrid-DFT calculations and experimental synthesis is required.
52
+
53
+ ## Bias, Risks, and Limitations
54
+
55
+ - **Implicit Learning:** The model was not explicitly trained on band gap data, but implicitly learned to target the optimal Shockley-Queisser range (1.2-1.4 eV) via the SLME metric. It may be less effective at targeting SLME values driven by mechanisms outside the primary training distribution.
56
+ - **Data Scarcity:** The model was fine-tuned on a relatively small dataset (~5.3K materials).
57
+
58
+ ## How to Get Started with the Model
59
+
60
+ For instructions on how to load and run generation with this model, please refer to the `_load_and_generate.py` script in the [CrystaLLM-pi GitHub Repository](https://github.com/C-Bone-UCL/CrystaLLM-pi).
61
+
62
+ ## Training Details
63
+
64
+ ### Training Data
65
+
66
+ The model was fine-tuned on the **MP SLME** dataset, containing inorganic structures labeled with their calculated Spectroscopic Limited Maximum Efficiency.
67
+
68
+ - **Source:** Materials Project / Derived from Walker and Butler (via `c-bone/mpdb-slme-full`)
69
+ - **Preprocessing:** CIFs are augmented, tokenized, and SLME values are normalized.
70
+
71
+ ### Training Procedure
72
+
73
+ - **Architecture:** GPT-2 with Property-Key-Value (PKV) encoder layers. (~38.7M parameters)
74
+ - **Mechanism:** Prefix Tuning (PKV) is used to inject the SLME target directly into the attention mechanism.
75
+
76
+ ## Evaluation
77
+
78
+ ### Metrics
79
+
80
+ The model is evaluated based on:
81
+ 1. **Hit-Rate:** Fraction of generated materials with predicted SLME values near the target.
82
+ 2. **VSUN:** Validity, Stability, Uniqueness, and Novelty of the generated candidates.
83
+
84
+ ### Results
85
+
86
+ The model successfully generated stable, novel candidates (e.g., $Rb_2(NbBr_3)_3$) with high predicted efficiencies, demonstrating the ability to map complex structure-property relationships from limited data.
87
+
88
+ ## Citation
89
+
90
+ ```bibtex
91
+ @misc{bone2025discoveryrecoverycrystallinematerials,
92
+ title={Discovery and recovery of crystalline materials with property-conditioned transformers},
93
+ author={Cyprien Bone and Matthew Walker and Kuangdai Leng and Luis M. Antunes and Ricardo Grau-Crespo and Amil Aligayev and Javier Dominguez and Keith T. Butler},
94
+ year={2025},
95
+ eprint={2511.21299},
96
+ archivePrefix={arXiv},
97
+ primaryClass={cond-mat.mtrl-sci},
98
+ url={[https://arxiv.org/abs/2511.21299](https://arxiv.org/abs/2511.21299)},
99
+ }