
AXL-Chat-Pro

An advanced conversational AI model: 12.8M parameters, perplexity 1.34, 2048-byte context window.

13M Parameters · 1.34 Perplexity · 10 min Training · 26 MB GGUF
Property            Value
Architecture        Multi-Scale Transformer
d_model             ?
Attention Heads     ?
Layers per Scale    ?
Context Window      2048 bytes
Downsample Factors  [1, 2, 4]
Vocab Size          258 (byte-level)
Optimizer           Lion
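The table's byte-level vocabulary (256 byte values plus, presumably, two special tokens) and the downsample factors can be sketched as follows. This is a hedged illustration of what those two properties imply, not the model's actual tokenizer; the `BOS`/`EOS` assignments and the helper names are assumptions.

```python
BOS, EOS = 256, 257  # assumed special tokens filling out the 258-entry vocab

def byte_tokenize(text):
    # Byte-level tokenization: every UTF-8 byte is its own token (0-255)
    return [BOS] + list(text.encode("utf-8")) + [EOS]

def scale_lengths(seq_len, factors=(1, 2, 4)):
    # Each scale of the multi-scale transformer sees the sequence
    # downsampled by its factor
    return [seq_len // f for f in factors]
```

For a 2048-byte context and factors [1, 2, 4], the three scales would process sequences of 2048, 1024, and 512 positions respectively.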
Rewritten from NumPy to PyTorch and trained with the Lion optimizer on 10 MB of chat pairs: 208 steps in 10 minutes.
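For reference, Lion updates a parameter by stepping in the sign of an interpolation between the momentum and the current gradient, with decoupled weight decay. Below is a minimal scalar sketch of that update rule, not the training code used here; the hyperparameter defaults are typical published values, not this model's settings.

```python
import math

def lion_step(p, g, m, lr=1e-4, beta1=0.9, beta2=0.99, wd=0.0):
    # Step direction: sign of the beta1-interpolation of momentum and gradient
    interp = beta1 * m + (1 - beta1) * g
    update = math.copysign(1.0, interp) if interp != 0 else 0.0
    # Decoupled weight decay, then a fixed-magnitude step
    p = p * (1 - lr * wd) - lr * update
    # Momentum is an EMA of gradients with beta2
    m = beta2 * m + (1 - beta2) * g
    return p, m
```

Because the update magnitude is always `lr` regardless of gradient scale, Lion typically uses a smaller learning rate than Adam.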
Metric          Value
Final Loss      0.3106
Perplexity      1.34
Training Steps  208
Training Time   10 min

Usage

ollama create axl-chat-pro -f Modelfile
ollama run axl-chat-pro "def fibonacci():"
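The `ollama create` command above expects a Modelfile pointing at the GGUF weights. A minimal sketch is shown below; the GGUF filename and parameter values are assumptions, not taken from this release.

```text
# Hypothetical Modelfile; the GGUF filename is an assumption
FROM ./axl-chat-pro-f16.gguf
PARAMETER num_ctx 2048
PARAMETER temperature 0.7
```

`num_ctx 2048` matches the model's 2048-byte context window.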
Better quality than AXL-Chat-Lion (PPL 1.34 vs 1.52).
File         Size   Format
F16 GGUF     26 MB  Full precision
Q4_K_M GGUF  15 MB  4-bit quantized
GGUF files work with Ollama and llama.cpp. Despite the 4-bit weights, Q4_K_M is only about 1.7x smaller than F16 here (15 MB vs 26 MB), since small models spend a large fraction of their file on tensors kept in higher precision.
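As a quick sanity check on the F16 figure: F16 stores each parameter in 2 bytes, so the weights alone account for nearly all of the 26 MB file. The helper below is illustrative and ignores GGUF metadata overhead.

```python
def f16_weights_mb(n_params):
    # F16 = 2 bytes per parameter; metadata overhead not included
    return n_params * 2 / 1e6

size = f16_weights_mb(12.8e6)  # ~25.6 MB, close to the 26 MB F16 GGUF
```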