SGD Optimized

AXL-Code-1B

SGD baseline: 318M parameters, perplexity 31.22, 256-byte context, trained in 30 minutes, 636 MB F16 GGUF.

| Property | Value |
|---|---|
| Architecture | Multi-Scale Transformer |
| d_model | ? |
| Attention Heads | ? |
| Layers per Scale | ? |
| Context Window | 256 bytes |
| Downsample Factors | [1, 2, 4] |
| Vocab Size | 258 (byte-level) |
| Optimizer | SGD |
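Several hyperparameters are undocumented in the table above. The sketch below is a minimal, hypothetical configuration object that records the known values and leaves the unknowns as placeholders; the class and field names are illustrative and not taken from the released code.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class AXLCodeConfig:
    """Hypothetical config for the multi-scale byte-level transformer.

    Only the documented values are filled in; d_model, attention heads,
    and layers per scale are not published, so they stay as None.
    """
    vocab_size: int = 258                     # byte-level: 256 bytes + 2 specials (assumed split)
    context_window: int = 256                 # bytes
    downsample_factors: List[int] = field(default_factory=lambda: [1, 2, 4])
    d_model: Optional[int] = None             # undocumented
    n_heads: Optional[int] = None             # undocumented
    n_layers_per_scale: Optional[int] = None  # undocumented
    optimizer: str = "sgd"
```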
Trained with vanilla SGD on 50 MB of Python code for 1012 steps (about 30 minutes). This run serves as the baseline for the Lion optimizer comparison; a sketch of the setup follows the metrics table below.
| Metric | Value |
|---|---|
| Final Loss | 2.9391 |
| Perplexity | 31.22 |
| Training Steps | 1012 |
| Training Time | 30 min |
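A minimal sketch of what this baseline amounts to in PyTorch: a plain SGD loop (no momentum, no weight decay, no scheduler) over byte-level next-token prediction. The model, dataloader, and learning rate here are placeholders, not values from the actual training run.

```python
import torch
import torch.nn.functional as F

def train_sgd(model, train_loader, steps=1012, lr=0.1, device="cpu"):
    """Vanilla SGD baseline loop (illustrative, not the released code).

    Assumes each batch is a LongTensor of byte IDs with shape
    (batch, seq_len) and the model returns logits of shape
    (batch, seq_len, vocab_size).
    """
    model.to(device).train()
    opt = torch.optim.SGD(model.parameters(), lr=lr)

    step = 0
    while step < steps:
        for batch in train_loader:
            batch = batch.to(device)
            inputs, targets = batch[:, :-1], batch[:, 1:]
            logits = model(inputs)
            loss = F.cross_entropy(
                logits.reshape(-1, logits.size(-1)), targets.reshape(-1)
            )
            opt.zero_grad()
            loss.backward()
            opt.step()
            step += 1
            if step >= steps:
                break
    return loss.item()
```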

Usage

```bash
ollama create axl-code-1b -f Modelfile
ollama run axl-code-1b "def fibonacci():"
```
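The model can also be queried programmatically once it has been created in Ollama. A minimal sketch using the ollama Python package, assuming the package is installed and a local Ollama server is running:

```python
import ollama

# Assumes `ollama create axl-code-1b -f Modelfile` has already been run.
response = ollama.generate(
    model="axl-code-1b",
    prompt="def fibonacci():",
)
print(response["response"])  # generated completion text
```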
This model is the SGD baseline; AXL-Code-1B-Lion achieves 16x better perplexity.
| File | Size | Format |
|---|---|---|
| F16 GGUF | 636 MB | Full precision |
| Q4_K_M GGUF | 197 MB | 4-bit quantized |
GGUF files work with Ollama and llama.cpp. Q4_K_M is about 3x smaller than F16.
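For direct use with llama.cpp, the GGUF files can also be loaded through the llama-cpp-python bindings. A minimal sketch, using an illustrative file name for the Q4_K_M quant (substitute the actual path):

```python
from llama_cpp import Llama

# n_ctx=256 matches the model's 256-byte context window.
llm = Llama(model_path="axl-code-1b.Q4_K_M.gguf", n_ctx=256)

out = llm("def fibonacci():", max_tokens=64)
print(out["choices"][0]["text"])
```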