| license: mit | |
| A very small "base model" that's very good at addition, regardless of how the input is tokenized. See the post for more details. |
| license: mit | |
| A very small "base model" that's very good at addition, regardless of how the input is tokenized. See the post for more details. |