You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Configuration Parsing Warning:Invalid JSON for config file config.json

Mamba-2 2.7B -- UltraCompress v3 (5-bit, near-lossless)

Compressed by: Sipsa Labs Codec: UltraCompress v0.6+ (BUSL-1.1) Base model: state-spaces/mamba2-2.7b

Headline numbers

Metric	Value
PPL ratio (vs bf16 baseline)	1.0133x
Baseline PPL (bf16)	25.028
Compressed PPL (5-bit)	25.362
Effective bpw	~5.1
Architecture class	SSM (Mamba-2)
Peak VRAM (single-GPU eval)	~12.9 GB
Verified matrix	cross-architecture (dense LLM, MoE, SSM, ViT)
Verifier hash	SHA256_MANIFEST.json

PPL is measured on a held-out FineWeb-edu corpus, n=30 prompts of 1024 tokens each, against the bf16 reference. Methodology details and reproducible measurement scripts ship with the verifier -- see below.

This is the second state-space model (SSM) architecture in the Sipsa Labs verified matrix, following Mamba-2.8B. Together they demonstrate cross-architecture generalization from Mamba-1 to Mamba-2 with zero format changes.

Important: upstream config.json fix required

The upstream state-spaces/mamba2-2.7b checkpoint ships with an incorrect config.json. Four fields must be corrected at load time for HF Transformers Mamba2ForCausalLM to produce valid outputs:

Field	Upstream (wrong)	Correct
`hidden_size`	4096	2560
`num_heads`	128	80
`state_size`	128	16
`vocab_size`	50277	50288

Additionally, the embedding key is mismatched (backbone.embedding.weight in the checkpoint vs backbone.embeddings.weight in the model). Our evaluation scripts apply these corrections automatically. No mamba_ssm library install is required -- the HF Transformers naive SSM fallback path handles inference.

Comparator note

This pack uses an SSM-compatible comparator -- the same architecture-matched reconstruction comparator used for the Mamba-1 record. The transformer comparator pipeline (used for transformer LLM records in the registry) is architecture-incompatible with SSMs. Treat this record as the canonical SSM near-lossless 5-bit number; do not directly compare against transformer records without reading this note.

Verify the reconstruction yourself

pip install ultracompress
uc verify SipsaLabs/mamba2-2.7b-uc-v3-bpw5
# SHA-256 match: True
# PPL ratio (vs base): 1.0133x

The verifier downloads this artifact, decompresses to an HF folder, and checks that every weight tensor is np.allclose to the original at float32 precision.

When to use this

You want a Mamba-2 2.7B-class state-space model at reduced storage footprint
You need reproducible, cryptographically verifiable reconstruction (a deterministic decode to the SHA-256-pinned validated artifact), not opaque "approximately equivalent" PTQ
You're integrating into your own stack and want a drop-in HF folder after uc unpack

When NOT to use this

You need raw fp16/bf16 -- fall back to state-spaces/mamba2-2.7b
You're on hardware without 5-bit unpacking support -- currently CUDA / Apple Metal / x86 with AVX2

License

This artifact: BUSL-1.1 with Additional Use Grant for research, individuals, and companies under $1M ARR
Auto-converts to Apache 2.0 four years after release date
Above $1M ARR running this in commercial production: founder@sipsalabs.com
Patent posture: two USPTO provisional applications filed April 2026

Citation

@software{sipsa_ultracompress_2026,
  title  = {UltraCompress: near-lossless 5-bit model compression at scale},
  author = {Sipsa Labs, Inc.},
  year   = {2026},
  url    = {https://github.com/sipsalabs/ultracompress},
  note   = {Two USPTO provisional applications filed April 2026}
}

Contact

Commercial license: founder@sipsalabs.com
Security: security@sipsalabs.com
General: founder@sipsalabs.com

This artifact is part of the SipsaLabs cross-architecture verified matrix. Browse the full collection on the org page.

Downloads last month: -

Model tree for SipsaLabs/mamba2-2.7b-uc-v3-bpw5

Base model

state-spaces/mamba2-2.7b

Finetuned

(5)

this model