there was originally going to be a better logo but i couldnt get any image model working. so this is what you all deserve
Info
Lune Mamba 3B is a Claude-OSS series model based on Granite 4.0 H(ybrid) Micro.
Claude-OSS is a (non-affiliated with Anthropic!) attempt to replicate the style of Anthropic's Claude model on top of open source bases.
| Benchmarks | Granite 4.0 H Micro | Lune Mamba 3B | Lune Mamba 3B GRPO_IF |
|---|---|---|---|
| MMLU | 63.7860 | 64.2338 | 64.3443 |
| IFEval* | 80.2218 | 75.0462 | 77.4492 |
| * IFEval numbers calculated from prompt loose accuracy |
Artifacts
- SFT checkpoint: allura-forge/claumba-micro-sft
- KTO checkpoint: You are here!
- GRPO (on IFeval) checkpoint: allura-org/Lune-Mamba-3B-v1-GRPO_IF
- Downloads last month
- 7
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for allura-org/Lune-Mamba-3B-v1
Base model
ibm-granite/granite-4.0-h-micro