AXE Specialists
Collection
Fine-tuned models, each purpose-built for a specific lane in the inference pipeline. • 7 items • Updated
Our first complete training cycle, end to end. SFT followed by GRPO policy optimization. ~1B parameters of weights we own.
Caesar isn't a fine-tune of someone else's model. It's a full training run we executed on our own pipeline — supervised fine-tuning on tool-calling data, then GRPO over 600+ iterations to optimize the policy. The point isn't that 1B beats the frontier on benchmarks. It's that we control every byte of how it got there.
| Property | Value |
|---|---|
| Developer | AXE Technologies |
| Training | Custom SFT + GRPO (600+ iterations) |
| Parameters | ~1B |
| Context | 8K tokens |
| Quantization | GGUF |
| License | Apache 2.0 |
ollama pull axetechnologies/caesar-1.0
Five models, each tuned for a different lane in the inference pipeline:
| Model | Lane | What it does |
|---|---|---|
| Casanova 1.2 | Agency | Tool-calling, multi-step workflows. 27B dense. |
| Geralt 1.3 | Reasoning at scale | 26B parameters of capability, 4B of inference cost. MoE. |
| Pegasus 1.0 | Visible work | Chain-of-thought you can audit. 12B dense. |
| Artemis 1.0 | Speed | Loads in seconds. 4B for edge hardware. |
| Caesar 1.0 | First principles | Our own training cycle. ~1B, end-to-end on our pipeline. |
Canadian in-house AI infrastructure. Built on Apple Silicon. The models run on hardware you can audit — no cloud dependency, no third-party model in the data path.
Website: axetechnologies.ca