Lion Optimized

AXL-Comment-5M

Code commenting. 7.2M params. PPL 1.16. Context 2048 bytes.

Parameters	7M
Perplexity	1.16
Training	10 min
GGUF	14 MB

Property	Value
Architecture	Multi-Scale Transformer
d_model	?
Attention Heads	?
Layers per Scale	?
Context Window	2048 bytes
Downsample Factors	[1, 2, 4]
Vocab Size	258 (byte-level)
Optimizer	Lion
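
As a rough illustration of what the byte-level vocabulary and the downsample factors imply, the sketch below encodes a string as raw UTF-8 bytes and computes the sequence length seen at each scale. It assumes the two ids beyond the 256 byte values are reserved for special tokens, and that each scale shortens the sequence by its factor; the specs confirm neither detail. The names `encode_bytes` and `scale_lengths` are hypothetical.

```python
CONTEXT_BYTES = 2048
DOWNSAMPLE_FACTORS = [1, 2, 4]
VOCAB_SIZE = 258  # 256 byte values + 2 ids assumed to be special tokens


def encode_bytes(text: str) -> list[int]:
    """Map text to byte ids (0-255), truncated to the context window."""
    return list(text.encode("utf-8"))[:CONTEXT_BYTES]


def scale_lengths(n_bytes: int) -> list[int]:
    """Sequence length at each scale, assuming ceiling division by the factor."""
    return [-(-n_bytes // factor) for factor in DOWNSAMPLE_FACTORS]


ids = encode_bytes("def fibonacci():")
print(len(ids), max(ids) < VOCAB_SIZE)
print(scale_lengths(CONTEXT_BYTES))
```

Under these assumptions a full 2048-byte context would be processed at lengths 2048, 1024, and 512.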

Retrained with the Lion optimizer on 20 MB of code-commenting pairs. Training ran for 263 steps in 10 minutes.

Metric	Value
Final Loss	0.1476
Perplexity	1.16
Training Steps	263
Training Time	10 min
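
The reported perplexity is consistent with the final loss if the loss is a mean cross-entropy in nats, which is an assumption here since the page does not state the loss definition. A minimal check, which also derives the average time per step from the figures in the table:

```python
import math

final_loss = 0.1476  # from the metrics table
steps = 263
minutes = 10

# Perplexity is the exponential of the mean cross-entropy (assumed to be in nats).
perplexity = math.exp(final_loss)
seconds_per_step = minutes * 60 / steps

print(f"perplexity ~ {perplexity:.2f}")
print(f"seconds per step ~ {seconds_per_step:.2f}")
```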

Usage

ollama create axl-comment-5m -f Modelfile
ollama run axl-comment-5m "def fibonacci():"

Adds inline comments to explain code logic.
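
For scripted use, one option is to call the model through Ollama's local HTTP API instead of the CLI. The sketch below assumes an Ollama server is running at its default address and that the model was created under the name `axl-comment-5m` as shown above. The helper names `build_payload` and `comment_code` are hypothetical.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local address


def build_payload(code: str, model: str = "axl-comment-5m") -> dict:
    """Request body for a single, non-streaming generation."""
    return {"model": model, "prompt": code, "stream": False}


def comment_code(code: str) -> str:
    """Send code to the local Ollama server and return the generated text."""
    data = json.dumps(build_payload(code)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


# Example (requires a running Ollama server):
# print(comment_code("def fibonacci():"))
```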

File	Size	Format
F16 GGUF	14 MB	16-bit float (unquantized)
Q4_K_M GGUF	14 MB	4-bit quantized

GGUF files work with Ollama and llama.cpp. Q4_K_M is about 3x smaller than F16.
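
A back-of-the-envelope size estimate, assuming 2 bytes per F16 weight and ignoring GGUF metadata and any tensors kept at higher precision. It uses only the 7.2M parameter count and the "about 3x" ratio quoted above, so treat the output as a rough estimate, not a measured file size:

```python
params = 7.2e6     # parameter count from the model summary
bytes_per_f16 = 2  # a 16-bit float occupies 2 bytes

f16_mb = params * bytes_per_f16 / 1e6
q4_mb = f16_mb / 3  # applying the "about 3x smaller" figure

print(f"F16 ~ {f16_mb:.1f} MB, Q4_K_M ~ {q4_mb:.1f} MB")
```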
