# FrontierTS - Frontier Time Series Forecasting Model
## Architecture
- **Type**: Decoder-only transformer with patch-based input
- **Features**: RoPE, SwiGLU, sinh⁻¹ (arcsinh) normalization, quantile regression (9 quantiles), multi-token prediction, contiguous patch masking (CPM)
- **Optimizer**: Muon for matrix parameters (about 2x the compute efficiency of AdamW, as reported in the Moonlight paper) + AdamW for non-matrix parameters
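To make three of the listed features concrete, here is a minimal standard-library sketch of sinh⁻¹ scaling, patching, and the quantile (pinball) loss. It is not the FrontierTS implementation: the function names are hypothetical, standardizing before applying sinh⁻¹ is an assumption, and the evenly spaced levels 0.1 to 0.9 are assumed because the source only states that there are 9 quantiles.

```python
import math

# Nine evenly spaced quantile levels (assumed; the source only says "9 quantiles").
QUANTILES = [i / 10 for i in range(1, 10)]


def asinh_scale(series):
    """Standardize a series, then compress outliers with sinh^-1 (assumed order)."""
    mean = sum(series) / len(series)
    var = sum((x - mean) ** 2 for x in series) / len(series)
    std = math.sqrt(var) or 1.0  # fall back to 1.0 for constant series
    return [math.asinh((x - mean) / std) for x in series], (mean, std)


def asinh_unscale(scaled, stats):
    """Invert asinh_scale using the stored (mean, std)."""
    mean, std = stats
    return [math.sinh(z) * std + mean for z in scaled]


def to_patches(series, patch_len):
    """Split into non-overlapping patches, dropping the oldest leftover points."""
    start = len(series) % patch_len
    return [series[i:i + patch_len] for i in range(start, len(series), patch_len)]


def pinball_loss(target, preds, quantiles=QUANTILES):
    """Average pinball loss of one target against one prediction per quantile."""
    total = 0.0
    for q, pred in zip(quantiles, preds):
        diff = target - pred
        # Under-prediction is weighted by q, over-prediction by (1 - q).
        total += max(q * diff, (q - 1) * diff)
    return total / len(quantiles)
```

In this sketch the model would consume the patches of the scaled series, emit one value per quantile for each future step, and `asinh_unscale` would map those outputs back to the original units.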
## Training Data
- `autogluon/chronos_datasets` (M4, Electricity, Dominick, etc.)
- KernelSynth-style synthetic time series
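As a rough illustration of what "KernelSynth-style" generation can look like, the sketch below draws a series from a Gaussian process prior whose kernel is a random sum or product of a few base kernels. The kernel bank, its hyperparameters, the composition depth, and all function names are assumptions chosen for illustration; the generator actually used for FrontierTS is not described in this document.

```python
import math
import random


def rbf(length_scale):
    """Squared-exponential kernel: smooth local variation."""
    return lambda a, b: math.exp(-((a - b) ** 2) / (2 * length_scale ** 2))


def periodic(period):
    """Periodic kernel: repeating seasonal structure."""
    return lambda a, b: math.exp(-2 * math.sin(math.pi * abs(a - b) / period) ** 2)


def linear():
    """Constant-plus-linear kernel: trends."""
    return lambda a, b: 1.0 + a * b


def compose(k1, k2, op):
    """Combine two kernels by addition or multiplication (both stay valid kernels)."""
    if op == "+":
        return lambda a, b: k1(a, b) + k2(a, b)
    return lambda a, b: k1(a, b) * k2(a, b)


def cholesky(matrix):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                # Clamp guards against tiny negative pivots from rounding.
                lower[i][j] = math.sqrt(max(matrix[i][i] - s, 1e-12))
            else:
                lower[i][j] = (matrix[i][j] - s) / lower[j][j]
    return lower


def sample_series(length, rng, max_kernels=3, jitter=1e-6):
    """Draw one synthetic series from a GP prior with a randomly composed kernel."""
    bank = [rbf(0.1), rbf(1.0), periodic(0.25), periodic(0.5), linear()]
    kernel = rng.choice(bank)
    for _ in range(rng.randint(0, max_kernels - 1)):
        kernel = compose(kernel, rng.choice(bank), rng.choice("+*"))
    times = [i / length for i in range(length)]
    # Jitter on the diagonal keeps the covariance numerically positive definite.
    cov = [[kernel(a, b) + (jitter if i == j else 0.0)
            for j, b in enumerate(times)] for i, a in enumerate(times)]
    lower = cholesky(cov)
    noise = [rng.gauss(0.0, 1.0) for _ in range(length)]
    # Multiplying the Cholesky factor by white noise yields a draw with covariance cov.
    return [sum(lower[i][k] * noise[k] for k in range(i + 1)) for i in range(length)]
```

Calling `sample_series(32, random.Random(0))` returns a list of 32 floats, and the seeded generator makes the draw reproducible. The pure-Python Cholesky is cubic in the series length, so a practical generator would more likely rely on a numerical library.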
## References
- TiRex (arXiv:2505.23719)
- Chronos-2 (arXiv:2510.15821)
- Moirai 2.0 (arXiv:2511.11698)
- Moonlight/Muon (arXiv:2502.16982)
## Checkpoint: step 5 (final)