A newer version of this model is available: qikp/kite-2.6-13m

Kite

Kite is a small language model of roughly 1 million parameters (1.65M in the safetensors metadata), trained without any special optimizations.

Training

It was trained on this dataset for 3,500 steps (1 epoch, batch size 1) using the GPT-2 tokenizer.
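With batch size 1 and a single epoch, the reported step count implies the training set held roughly 3,500 examples. A minimal sketch of that arithmetic, assuming no gradient accumulation and no dropped final batch (neither is stated on the card):

```python
# Relationship between steps, epochs, and batch size for one pass over the data.
# Assumes no gradient accumulation and that partial batches still count as a
# step (assumptions; the card does not specify either).

def examples_seen(steps: int, batch_size: int) -> int:
    """Total training examples consumed across all steps."""
    return steps * batch_size

def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Steps needed for one epoch, counting a final partial batch as a step."""
    return -(-num_examples // batch_size)  # ceiling division

# Kite's reported settings: 3,500 steps, batch size 1, 1 epoch.
print(examples_seen(3500, 1))      # examples consumed in training
print(steps_per_epoch(3500, 1))    # steps implied by a 3,500-example epoch
```

At batch size 1 the two quantities coincide, which is why the step count doubles as an estimate of the dataset size.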

Limitations

Due to its size, the model will produce near-nonsensical output and is not suitable for production workloads.

Safetensors
Model size: 1.65M params · Tensor type: BF16

Dataset used to train qikp/kite-1m