--- license: apache-2.0 datasets: - samfatnassi/gaia-dr3 language: - en metrics: - accuracy library_name: transformers pipeline_tag: feature-extraction doi: 10.5281/zenodo.18684894 --- # SADIM-54M: Stellar Intelligence Framework **SADIM-54M** is a specialized AI model trained on the **Gaia Data Release 3 (DR3)** catalog. It bridges the gap between massive raw astronomical observations and actionable scientific insights. **Research Objectives** • Galactic Archaeology: Identifying stellar streams and ancient structures. • Big Data Optimization: Providing an AI-ready interface for 1B+ records. • Scalability: Real-time stellar analysis for future space surveys. ### 1. Technical Feature Map The model is designed to process 13 fundamental astronomical parameters: | Feature Name | Description | | :--- | :--- | | **source_id** | Unique Gaia DR3 Identifier | | **ra / dec** | Celestial Equatorial Coordinates | | **l / b** | Galactic Coordinates (Disk Alignment) | | **pmra / pmdec** | Kinematics (Proper Motion Velocity) | | **d_pc** | Distance in Parsecs (1/parallax) | | **x, y, z** | 3D Heliocentric Cartesian Mapping | | **abs_m** | Absolute Magnitude (Intrinsic Brightness) | | **bp_rp** | Color Index (Temperature Indicator) | ### 2. When to use the Model vs. the Dataset? * **Use the SADIM-54M Model:** For fast inference, predicting missing stellar properties, or automating the classification of new astronomical data. * **Use the Gaia-DR3 Dataset:** For deep-dive research, historical record querying, or training your own custom neural networks. * **Dataset Link:** [samfatnassi/gaia-dr3](https://huggingface.co/datasets/samfatnassi/gaia-dr3) ### 3. Quick Start (Python) Since the dataset contains over **1 Billion records**, we recommend using **Streaming Mode**: ```python from datasets import load_dataset from transformers import AutoModel # 1. Access the Data dataset = load_dataset("samfatnassi/gaia-dr3", split="train", streaming=True) # 2. Load the Model model = AutoModel.from_pretrained("KilmaAI/SADIM-54M") # Fetch a sample star sample_star = next(iter(dataset)) print(f"Analyzing Star ID: {sample_star['source_id']}")