Safetensors
astronomy
multimodal
classification
MeriDK commited on
Commit
ff54988
·
verified ·
1 Parent(s): f589b80

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -125,12 +125,12 @@ ensure that the dataset is loaded with the corresponding seed for consistency.
125
  ## Using your own data
126
 
127
  Note that the data in the AstroM3Processed dataset is already pre-processed.
128
- So if you want to use the model, you must pre-process your data in the same way:
129
 
130
  1. **Spectra**: Each spectrum is interpolated to a fixed wavelength grid (3850–9000 Å), normalized using mean and MAD, and log-MAD is added as an auxiliary feature.
131
  2. **Photometry**: Light curves are deduplicated, sorted by time, normalized using mean and MAD, time-scaled to [0, 1], and augmented with auxiliary features like log-MAD and time span.
132
  3. **Metadata**: Scalar metadata is transformed via domain-specific functions (e.g., absolute magnitude, log, sin/cos), then normalized using dataset-level statistics.
133
 
134
  For a detailed description, read the [paper](https://arxiv.org/abs/2411.08842).
135
- To see exactly how we performed this preprocessing, reference [`preprocess.py`](https://huggingface.co/datasets/AstroFOMO/AstroM3Dataset/blob/main/preprocess.py) from the AstroM3Dataset repo.
136
 
 
125
  ## Using your own data
126
 
127
  Note that the data in the AstroM3Processed dataset is already pre-processed.
128
+ If you want to use the model with your own data, you must pre-process it in the same way:
129
 
130
  1. **Spectra**: Each spectrum is interpolated to a fixed wavelength grid (3850–9000 Å), normalized using mean and MAD, and log-MAD is added as an auxiliary feature.
131
  2. **Photometry**: Light curves are deduplicated, sorted by time, normalized using mean and MAD, time-scaled to [0, 1], and augmented with auxiliary features like log-MAD and time span.
132
  3. **Metadata**: Scalar metadata is transformed via domain-specific functions (e.g., absolute magnitude, log, sin/cos), then normalized using dataset-level statistics.
133
 
134
  For a detailed description, read the [paper](https://arxiv.org/abs/2411.08842).
135
+ To see exactly how we performed this preprocessing, refer to [`preprocess.py`](https://huggingface.co/datasets/AstroFOMO/AstroM3Dataset/blob/main/preprocess.py) in the AstroM3Dataset repo.
136