On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 17 days ago • 23
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 17 days ago • 23
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 56
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 16 days ago • 172
view article Article xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy BobWue • Jun 4, 2025 • 12