A newer version of this model is available: qikp/kite-3.1-20m

Kite

🎉 You are looking at Kite 1.6, which is now trained using pika!

Kite is a small, trained, 1 million parameter language model, without any special optimizations.

Training

It was trained on this dataset using 20000 steps, 1 epoch, 1 batch size, and the pika tokenizer.

Due to its size, the model is not suitable for production workloads.

Safetensors

Model size

999k params

Tensor type

BF16

Base model

Finetuned

(3)

this model