Lion Optimized

AXL-Comment-Lion

Code commenting. 7.2M params. PPL 1.20. Context 2048 bytes.

7M

Parameters

1.20

Perplexity

10 min

Training

14 MB

GGUF

Specs Training Usage Download

Property	Value
Architecture	Multi-Scale Transformer
d_model	?
Attention Heads	?
Layers per Scale	?
Context Window	2048 bytes
Downsample Factors	[1, 2, 4]
Vocab Size	258 (byte-level)
Optimizer	Lion

Trained on 20MB uncommented-to-commented pairs. 263 steps in 10 min.

Metric	Value
Final Loss	0.1949
Perplexity	1.20
Training Steps	244
Training Time	10 min

Usage

ollama create axl-comment-lion -f Modelfile
ollama run axl-comment-lion "def fibonacci():"

Adds inline comments to explain code logic.

File	Size	Format
F16 GGUF	14 MB	Full precision
Q4_K_M GGUF	5 MB	4-bit quantized

GGUF files work with Ollama and llama.cpp. Q4_K_M is about 3x smaller than F16.

← All AXL Models