Mamba-2 2.7B -- UltraCompress v3 (5-bit, near-lossless)

Compressed by: Sipsa Labs
Codec: UltraCompress v0.6+ (BUSL-1.1)
Base model: state-spaces/mamba2-2.7b

Headline numbers

| Metric | Value |
| --- | --- |
| PPL ratio (vs bf16 baseline) | 1.0133x |
| Baseline PPL (bf16) | 25.028 |
| Compressed PPL (5-bit) | 25.362 |
| Effective bpw | ~5.1 |
| Architecture class | SSM (Mamba-2) |
| Peak VRAM (single-GPU eval) | ~12.9 GB |
| Verified matrix | cross-architecture (dense LLM, MoE, SSM, ViT) |
| Verifier hash | SHA256_MANIFEST.json |

PPL is measured on a held-out FineWeb-edu corpus, n=30 prompts of 1024 tokens each, against the bf16 reference. Methodology details and reproducible measurement scripts ship with the verifier -- see below.
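As a quick sanity check, the headline ratio can be recomputed from the two perplexities in the table. The sketch below is not the shipped measurement script: the helper names are hypothetical, and `perplexity` assumes the usual definition (the exponential of the mean per-token negative log-likelihood in nats). How the actual scripts aggregate over the 30 prompts is not stated here.

```python
import math


def perplexity(token_nlls):
    """Perplexity as exp(mean per-token negative log-likelihood, in nats).

    Assumption: the standard definition; the shipped scripts may aggregate
    across prompts differently.
    """
    if not token_nlls:
        raise ValueError("need at least one token NLL")
    return math.exp(sum(token_nlls) / len(token_nlls))


def ppl_ratio(baseline_ppl, compressed_ppl):
    """Ratio of compressed to baseline perplexity (1.0 means no degradation)."""
    return compressed_ppl / baseline_ppl


# Values taken from the headline table.
ratio = ppl_ratio(25.028, 25.362)
print(f"PPL ratio: {ratio:.4f}x")
```

A ratio of 1.0 would mean the compressed model matches the bf16 reference exactly on this corpus; values above 1.0 indicate a perplexity increase.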

This is the second state-space model (SSM) architecture in the Sipsa Labs verified matrix, following Mamba-2.8B. Together they demonstrate cross-architecture generalization from Mamba-1 to Mamba-2 with zero format changes.

Important: upstream config.json fix required

The upstream state-spaces/mamba2-2.7b checkpoint ships with an incorrect config.json. Four fields must be corrected at load time for HF Transformers Mamba2ForCausalLM to produce valid outputs:

| Field | Upstream (wrong) | Correct |
| --- | --- | --- |
| hidden_size | 4096 | 2560 |
| num_heads | 128 | 80 |
| state_size | 128 | 16 |
| vocab_size | 50277 | 50288 |

Additionally, the embedding key is mismatched (backbone.embedding.weight in the checkpoint vs backbone.embeddings.weight in the model). Our evaluation scripts apply these corrections automatically. No mamba_ssm library install is required -- the HF Transformers naive SSM fallback path handles inference.
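For readers loading the checkpoint outside the provided evaluation scripts, the corrections can be sketched with plain dictionaries. This is an illustration under stated assumptions, not the scripts' actual code: `fix_config` and `fix_state_dict_keys` are hypothetical helper names, and the config and state dict are assumed to be already loaded as Python mappings.

```python
# Corrected values from the table above.
CONFIG_FIXES = {
    "hidden_size": 2560,
    "num_heads": 80,
    "state_size": 16,
    "vocab_size": 50288,
}

# Embedding key as stored in the checkpoint vs. as expected by the model.
CHECKPOINT_KEY = "backbone.embedding.weight"
MODEL_KEY = "backbone.embeddings.weight"


def fix_config(config):
    """Return a copy of the config dict with the four fields overridden."""
    fixed = dict(config)
    fixed.update(CONFIG_FIXES)
    return fixed


def fix_state_dict_keys(state_dict):
    """Return a copy of the state dict with the embedding key renamed."""
    return {
        (MODEL_KEY if key == CHECKPOINT_KEY else key): value
        for key, value in state_dict.items()
    }


# Example with the upstream (wrong) values and a placeholder tensor.
upstream = {"hidden_size": 4096, "num_heads": 128, "state_size": 128,
            "vocab_size": 50277, "model_type": "mamba2"}
print(fix_config(upstream))
print(list(fix_state_dict_keys({CHECKPOINT_KEY: "tensor"})))
```

Both helpers return new mappings rather than mutating their inputs, so the upstream values stay available for comparison.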

Comparator note

This pack uses an SSM-compatible comparator -- the same architecture-matched reconstruction comparator used for the Mamba-1 record. The transformer comparator pipeline (used for transformer LLM records in the registry) is architecture-incompatible with SSMs. Treat this record as the canonical SSM near-lossless 5-bit number; do not directly compare against transformer records without reading this note.

Verify the reconstruction yourself

pip install ultracompress
uc verify SipsaLabs/mamba2-2.7b-uc-v3-bpw5
# SHA-256 match: True
# PPL ratio (vs base): 1.0133x

The verifier downloads this artifact, decompresses to an HF folder, and checks that every weight tensor is np.allclose to the original at float32 precision.
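The hash half of that check can be approximated by hand with the standard library. The sketch below is not the `uc verify` implementation: it assumes, hypothetically, that the manifest maps file names to hex SHA-256 digests (the real layout of SHA256_MANIFEST.json may differ), and `verify_manifest` is an illustrative name.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 and return the hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(folder, manifest):
    """Return the names of files that are missing or whose digest differs.

    Assumption: `manifest` is a dict of {relative file name: hex digest}.
    An empty return value means every listed file matched.
    """
    mismatched = []
    for name, expected in manifest.items():
        path = Path(folder) / name
        if not path.is_file() or sha256_of_file(path) != expected.lower():
            mismatched.append(name)
    return mismatched
```

Reading in chunks keeps memory use flat even for multi-gigabyte weight shards.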

When to use this

  • You want a Mamba-2 2.7B-class state-space model at reduced storage footprint
  • You need reproducible, cryptographically verifiable reconstruction (a deterministic decode to the SHA-256-pinned validated artifact), not opaque "approximately equivalent" PTQ
  • You're integrating into your own stack and want a drop-in HF folder after uc unpack
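To get a rough sense of the storage reduction in the first point, the weight footprint can be estimated from the effective bits per weight. This is back-of-the-envelope arithmetic only: it assumes a nominal 2.7e9 parameters (taken from the model name, not an exact count) and that the ~5.1 bpw figure applies uniformly to all weights, neither of which is confirmed here.

```python
def weight_footprint_gb(num_params, bits_per_weight):
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


NOMINAL_PARAMS = 2.7e9  # assumption: nominal size implied by "2.7B"

compressed_gb = weight_footprint_gb(NOMINAL_PARAMS, 5.1)
bf16_gb = weight_footprint_gb(NOMINAL_PARAMS, 16)
print(f"~{compressed_gb:.2f} GB at 5.1 bpw vs ~{bf16_gb:.2f} GB at bf16")
```

This covers weights on disk only; it says nothing about runtime memory, which also includes activations and SSM state.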

When NOT to use this

  • You need raw fp16/bf16 -- fall back to state-spaces/mamba2-2.7b
  • You're on hardware without 5-bit unpacking support -- currently CUDA / Apple Metal / x86 with AVX2

License

  • This artifact: BUSL-1.1 with Additional Use Grant for research, individuals, and companies under $1M ARR
  • Auto-converts to Apache 2.0 four years after release date
  • Companies above $1M ARR running this in commercial production: contact founder@sipsalabs.com
  • Patent posture: two USPTO provisional applications filed April 2026

Citation

@software{sipsa_ultracompress_2026,
  title  = {UltraCompress: near-lossless 5-bit model compression at scale},
  author = {Sipsa Labs, Inc.},
  year   = {2026},
  url    = {https://github.com/sipsalabs/ultracompress},
  note   = {Two USPTO provisional applications filed April 2026}
}

Contact


This artifact is part of the SipsaLabs cross-architecture verified matrix. Browse the full collection on the org page.
