| # Model Summary | |
| - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1) | |
| - **Paper:** TODO | |
| # Use | |
| This is the token-conditional control model for our paper. You can evaluate it by following the instructions [here](https://github.com/simplescaling/s1?tab=readme-ov-file#evaluation). | |
| # Training information | |
| [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/hashimoto-group/o1/runs/xaantfal) | |
| - TRL: 0.13.0 | |
| - Transformers: 4.48.0 | |
| - PyTorch: 2.3.1 | |
| - Datasets: 3.0.1 | |
| - Tokenizers: 0.21.0 | |
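To reproduce or evaluate in a comparable environment, it can help to check a local install against the versions listed above. The following is a minimal sketch, not part of the s1 repository: the helper names `installed_version` and `find_mismatches` are hypothetical, and it assumes the libraries are installed under the pip distribution names `trl`, `transformers`, `torch`, `datasets`, and `tokenizers`.

```python
from importlib import metadata

# Versions listed in the training information above.
EXPECTED_VERSIONS = {
    "trl": "0.13.0",
    "transformers": "4.48.0",
    "torch": "2.3.1",
    "datasets": "3.0.1",
    "tokenizers": "0.21.0",
}


def installed_version(package):
    """Return the installed version of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(expected, lookup=installed_version):
    """Map each package whose installed version differs to (expected, found)."""
    mismatches = {}
    for package, wanted in expected.items():
        found = lookup(package)
        # Local build suffixes such as "2.3.1+cu121" count as a match.
        if found is None or found.split("+")[0] != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for package, (wanted, found) in find_mismatches(EXPECTED_VERSIONS).items():
        print(f"{package}: expected {wanted}, found {found or 'not installed'}")
```

A mismatch does not necessarily mean evaluation will fail; the listed versions only describe the environment used for training.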
| # Citation | |
| ```bibtex | |
| TODO | |
| ``` |