Update README.md
Browse files
README.md
CHANGED
|
@@ -125,12 +125,12 @@ ensure that the dataset is loaded with the corresponding seed for consistency.
|
|
| 125 |
## Using your own data
|
| 126 |
|
| 127 |
Note that the data in the AstroM3Processed dataset is already pre-processed.
|
| 128 |
-
|
| 129 |
|
| 130 |
1. **Spectra**: Each spectrum is interpolated to a fixed wavelength grid (3850–9000 Å), normalized using mean and MAD, and log-MAD is added as an auxiliary feature.
|
| 131 |
2. **Photometry**: Light curves are deduplicated, sorted by time, normalized using mean and MAD, time-scaled to [0, 1], and augmented with auxiliary features like log-MAD and time span.
|
| 132 |
3. **Metadata**: Scalar metadata is transformed via domain-specific functions (e.g., absolute magnitude, log, sin/cos), then normalized using dataset-level statistics.
|
| 133 |
|
| 134 |
For a detailed description, read the [paper](https://arxiv.org/abs/2411.08842).
|
| 135 |
-
To see exactly how we performed this preprocessing,
|
| 136 |
|
|
|
|
| 125 |
## Using your own data
|
| 126 |
|
| 127 |
Note that the data in the AstroM3Processed dataset is already pre-processed.
|
| 128 |
+
If you want to use the model with your own data, you must pre-process it in the same way:
|
| 129 |
|
| 130 |
1. **Spectra**: Each spectrum is interpolated to a fixed wavelength grid (3850–9000 Å), normalized using mean and MAD, and log-MAD is added as an auxiliary feature.
|
| 131 |
2. **Photometry**: Light curves are deduplicated, sorted by time, normalized using mean and MAD, time-scaled to [0, 1], and augmented with auxiliary features like log-MAD and time span.
|
| 132 |
3. **Metadata**: Scalar metadata is transformed via domain-specific functions (e.g., absolute magnitude, log, sin/cos), then normalized using dataset-level statistics.
|
| 133 |
|
| 134 |
For a detailed description, read the [paper](https://arxiv.org/abs/2411.08842).
|
| 135 |
+
To see exactly how we performed this preprocessing, refer to [`preprocess.py`](https://huggingface.co/datasets/AstroFOMO/AstroM3Dataset/blob/main/preprocess.py) in the AstroM3Dataset repo.
|
| 136 |
|