File size: 3,795 Bytes
9891700 1b36117 bce7856 1b36117 12ca4c7 907362e 1b36117 0ac1cf8 1b36117 58ac2b4 1b36117 12ca4c7 1b36117 2ca7b9c 1b36117 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 |
# π SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars
[](https://arxiv.org/abs/2507.01939)
[](https://github.com/Xiaosheng-Zhao/SpecCLIP)
[](https://github.com/Xiaosheng-Zhao/SpecCLIP/blob/main/LICENSE)
**SpecCLIP** is a contrastive + domain-preserving foundation model designed to align **LAMOST LRS** spectra with **Gaia XP** spectrophotometric data.
It learns a **general-purpose spectral embedding (768-dim)** that supports:
* **Stellar parameter estimation**
* **Cross-survey spectral translation** (LAMOST LRS β· Gaia XP)
* **Similarity retrieval** across LAMOST LRS and GAIA XP spectra
For full documentation, installation instructions, examples, and end-to-end usage, please visit the **GitHub repository**:
π [https://github.com/Xiaosheng-Zhao/SpecCLIP](https://github.com/Xiaosheng-Zhao/SpecCLIP)
---
## π§ Available Models
The following pretrained weights are included in this model repository:
| File | Description | Embedding Dim | Param |
| -------------------------------------------- | ------------------------------------- | ------------- | ------|
| `encoders/lrs_encoder.ckpt` | LAMOST LRS masked transformer encoder | 768 | 43M |
| `encoders/xp_encoder.ckpt` | Gaia XP masked transformer encoder | 768 | 43M |
| `encoders/xp_encoder_mlp.ckpt` | Gaia XP autoencoder (MLP head) | 768 | 43M |
| `specclip/specclip_model_base.ckpt` | Gaia XP β· LAMOST contrastive | 768 | 100M |
| `specclip/specclip_model_predrecon_mlp.ckpt` | CLIP alignment + pred+recon | 768 | 168M |
| `specclip/specclip_model_split_mlp.ckpt` | CLIP alignment + split pred/recon | 768 | 126M |
---
## π§ What the Model Does
SpecCLIP consists of:
* **Two masked transformer encoders**
β LAMOST LRS
β Gaia XP
* **Contrastive alignment loss (CLIP-style)**
* **Domain-preserving prediction & reconstruction heads**
* **Cross-modal decoder** for spectrum translation
It produces **shared embeddings** enabling multi-survey astrophysical analysis.
---
## π Full Documentation
To keep the Hugging Face card concise, **all detailed instructions**, including:
* Installation
* Parameter prediction
* Spectral translation
* Retrieval
* Full examples (Python + figures)
* Acknowledgments
are available at the GitHub repo:
π **[https://github.com/Xiaosheng-Zhao/SpecCLIP](https://github.com/Xiaosheng-Zhao/SpecCLIP)**
---
## π Citation
```bibtex
@ARTICLE{2025arXiv250701939Z,
author = {{Zhao}, Xiaosheng and {Huang}, Yang and {Xue}, Guirong and {Kong}, Xiao and
{Liu}, Jifeng and {Tang}, Xiaoyu and {Beers}, Timothy C. and
{Ting}, Yuan-Sen and {Luo}, A-Li},
title = "{SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars}",
journal = {arXiv e-prints},
keywords = {Instrumentation and Methods for Astrophysics, Solar and Stellar Astrophysics,
Artificial Intelligence, Machine Learning},
year = 2025,
month = jul,
eid = {arXiv:2507.01939},
pages = {arXiv:2507.01939},
doi = {10.48550/arXiv.2507.01939},
archivePrefix = {arXiv},
eprint = {2507.01939},
primaryClass = {astro-ph.IM},
}
```
---
## π¬ Contact
* GitHub Issues: [https://github.com/Xiaosheng-Zhao/SpecCLIP/issues](https://github.com/Xiaosheng-Zhao/SpecCLIP/issues)
* Email: [xzhao113@jh.edu](mailto:xzhao113@jh.edu)
|