Lion Optimized

AXL-Comment-Lion

Code commenting. 7.2M params. PPL 1.20. Context 2048 bytes.

Parameters: 7M
Perplexity: 1.20
Training: 10 min
GGUF: 14 MB
Property             Value
Architecture         Multi-Scale Transformer
d_model              ?
Attention Heads      ?
Layers per Scale     ?
Context Window       2048 bytes
Downsample Factors   [1, 2, 4]
Vocab Size           258 (byte-level)
Optimizer            Lion
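To make the byte-level vocabulary and the downsample factors concrete, here is a minimal Python sketch. It assumes the 258-entry vocabulary is the 256 byte values plus two special tokens; the ids 256 and 257 and their start/end roles are hypothetical, not taken from the model. It also assumes each scale simply divides the sequence length by its factor.

```python
CONTEXT_BYTES = 2048            # context window from the table
DOWNSAMPLE_FACTORS = [1, 2, 4]  # downsample factors from the table
BOS_ID, EOS_ID = 256, 257       # hypothetical special-token ids (assumption)


def encode(text: str) -> list[int]:
    """Map text to byte ids 0..255, wrapped in the assumed start/end tokens.

    The byte sequence is truncated so the result fits the context window.
    """
    ids = list(text.encode("utf-8"))[: CONTEXT_BYTES - 2]
    return [BOS_ID, *ids, EOS_ID]


def scale_lengths(seq_len: int) -> list[int]:
    """Sequence length at each scale, assuming length / factor, rounded up."""
    return [-(-seq_len // factor) for factor in DOWNSAMPLE_FACTORS]
```

Under these assumptions a full 2048-byte context would be processed at lengths 2048, 1024, and 512 across the three scales.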
Trained on 20 MB of uncommented-to-commented code pairs. 263 steps in 10 min.
Metric           Value
Final Loss       0.1949
Perplexity       1.20
Training Steps   244
Training Time    10 min
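Loss and perplexity are usually related by an exponential. The sketch below assumes the final loss is a mean cross-entropy in nats per byte, which this page does not state. Under that assumption the conversion lands near, but not exactly on, the reported 1.20; the page does not say how either figure was measured.

```python
import math

final_loss = 0.1949  # final loss from the table, assumed to be in nats

# Perplexity is the exponential of the mean cross-entropy.
perplexity = math.exp(final_loss)

print(f"perplexity = {perplexity:.3f}")
```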

Usage

ollama create axl-comment-lion -f Modelfile
ollama run axl-comment-lion "def fibonacci():"
Adds inline comments to explain code logic.
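The same model can also be called from a script. This is a sketch, assuming a local Ollama server on its default port and the model name created above; `build_payload` and `comment_code` are illustrative helper names, not part of any AXL tooling.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_NAME = "axl-comment-lion"                     # name passed to `ollama create`


def build_payload(code: str) -> bytes:
    """Build a non-streaming generate request body for the given code snippet."""
    body = {"model": MODEL_NAME, "prompt": code, "stream": False}
    return json.dumps(body).encode("utf-8")


def comment_code(code: str, timeout: float = 60.0) -> str:
    """Send the snippet to the local Ollama server and return the generated text."""
    request = urllib.request.Request(
        OLLAMA_URL,
        data=build_payload(code),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

With the server running, `comment_code("def fibonacci():")` should return the model's commented version of the snippet.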
File          Size    Format
F16 GGUF      14 MB   Full precision
Q4_K_M GGUF   5 MB    4-bit quantized
GGUF files work with Ollama and llama.cpp. Q4_K_M is about 3x smaller than F16.
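The size figures can be sanity-checked with a little arithmetic. This sketch assumes 1 MB = 10^6 bytes and ignores GGUF metadata overhead, so the results are rough estimates from the rounded sizes quoted here.

```python
PARAMS = 7.2e6  # parameter count from the summary
F16_MB = 14     # F16 GGUF size from the table
Q4_MB = 5       # Q4_K_M GGUF size from the table

# Size ratio between the two files.
ratio = F16_MB / Q4_MB

# F16 stores 2 bytes per parameter, so the expected file size is:
expected_f16_mb = PARAMS * 2 / 1e6

# Effective bits per parameter implied by the quantized file size.
q4_bits_per_param = Q4_MB * 1e6 * 8 / PARAMS

print(f"ratio = {ratio:.1f}x")
print(f"expected F16 size = {expected_f16_mb:.1f} MB")
print(f"Q4_K_M effective bits per parameter = {q4_bits_per_param:.2f}")
```

The ratio comes out at 2.8x, which matches the "about 3x" figure, and the F16 estimate of 14.4 MB is consistent with the listed 14 MB.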