---
title: Sipsa Labs
emoji: 🧊
colorFrom: indigo
colorTo: blue
sdk: static
pinned: false
---
# Sipsa Labs
> **Sipsa Labs** is an experimental deep-tech software company. We invent and ship across a wide range of tech and software.
>
> **Two products live today:**
>
> 1. **UltraCompress**: lossless 5-bit transformer compression. Flagship. `pip install ultracompress`.
> 2. **Sipsa Inference**: OpenAI-compatible serving API for our compressed weights. `https://api.sipsalabs.com/v1`. Pro $99/mo + Team $499/mo.
>
> **Compression-as-a-Service** engagements are open ($5K Phase 0 POC). More products in flight; we don't pre-announce.
---
## v0.6.9 shipped 2026-05-15: security release + new Mistral 7B record
`pip install --upgrade ultracompress`: RCE-class fix on `torch.load()` paths + MANIFEST-scrubbed sdist. v0.6.7 + v0.6.8 yanked from PyPI. Drop-in replacement, no code changes required.
**Mistral-7B-v0.3 hits 1.00548×**, a strong dense 7B-class lossless 5-bit result.
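For context on the `torch.load()` fix above, this is the general shape of the mitigation for that vulnerability class (an illustrative pattern, not the actual UltraCompress patch):

```python
import torch

# torch.load() deserializes pickles by default, which can execute arbitrary
# code from an untrusted checkpoint. weights_only=True restricts loading to
# tensor data and rejects arbitrary pickled objects.
# "pack/weights.pt" is a placeholder path for illustration.
state = torch.load("pack/weights.pt", map_location="cpu", weights_only=True)
```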
---
## Product 1: UltraCompress
Production-grade lossless 5-bit transformer compression across 22 shipped architectures (14 PPL-verified end-to-end), dense + Mixture-of-Experts + state-space, 0.6B to 405B parameters. Mathematically lossless customer-side reconstruction: SHA-256 over reconstructed tensor bytes matches the trainer's measurement, verified by `uc verify`.
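The check itself is conceptually simple. A minimal sketch of the idea, with a made-up manifest layout (the supported way to run it is `uc verify`, whose internals may differ):

```python
import hashlib
import numpy as np

def tensor_digest(t: np.ndarray) -> str:
    # SHA-256 over the tensor's raw bytes: a single flipped bit changes the digest.
    return hashlib.sha256(t.tobytes()).hexdigest()

# Trainer side: record one digest per tensor at compression time.
rng = np.random.default_rng(42)
original = {"layers.0.attn.w": rng.standard_normal((4, 4)).astype(np.float32)}
manifest = {name: tensor_digest(t) for name, t in original.items()}

# Customer side: reconstruct the weights, then compare digest-for-digest.
reconstructed = {name: t.copy() for name, t in original.items()}  # stand-in for the real decode step
ok = all(tensor_digest(reconstructed[n]) == d for n, d in manifest.items())
print("VERIFY: PASS" if ok else "VERIFY: FAIL")
```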
### Tightest verified PPL ratios at 5 bpw
*(perplexity ratio = compressed PPL / bf16 baseline PPL; FineWeb-edu held-out tail; seq_len = 1024; n = 30-50; seed = 42)*
| Model | Params | Type | **PPL ratio** | Notes |
|---|---:|:---|---:|:---|
| **Mixtral-8x7B** | 47B | MoE | **1.00368×** | tightest MoE result |
| Qwen3-1.7B-Base | 1.7B | dense | **1.00401×** | small-decoder record |
| Qwen3-14B | 14B | dense | **1.00403×** | 14B-class record |
| Yi-1.5-9B | 8.8B | dense | **1.00414×** | >8B record |
| Qwen3-8B | 8B | dense | **1.00440×** | 8B-class record |
| **Mistral-7B-v0.3** | 7B | dense | **1.00548×** | NEW: a strong dense 7B-class result |
| Phi-3-mini-4k | 3.8B | dense | **1.00262×** | tightest dense ratio (seq_len=128 caveat) |
| **Hermes-3-Llama-3.1-405B** | 405B | dense | **1.0066×** | 405B-class lossless on a single 32 GB consumer GPU |
| Qwen3-0.6B | 0.6B | dense | **1.0069×** | |
| OLMo-2-0425-1B | 1B | dense | **1.0073×** | |
| SmolLM2-1.7B-Instruct | 1.7B | dense | **1.0075×** | |
| SmolLM2-1.7B | 1.7B | dense | **1.0085×** | |
| Mamba-2.8B | 2.8B | SSM | _compression validated; PPL eval pending_ | state-space model |
| Llama-3.1-8B | 8B | dense | **1.0125×** | |
22 architectures shipped, 14 PPL-verified end-to-end. Full matrix at github.com/sipsalabs/ultracompress.
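For concreteness, the ratio in the table is plain perplexity division: PPL = exp(mean token negative log-likelihood) on the held-out set, and the ratio divides the compressed model's PPL by the bf16 baseline's. A toy calculation with illustrative NLL values (not measured numbers):

```python
import math

# PPL = exp(mean token NLL); ratio = PPL(compressed) / PPL(bf16).
# The NLL values below are illustrative placeholders, not measurements.
baseline_ppl = math.exp(2.3500)    # bf16 baseline
compressed_ppl = math.exp(2.3554)  # 5-bit reconstruction
print(f"PPL ratio = {compressed_ppl / baseline_ppl:.5f}x")  # -> 1.00541x
```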
### Try UltraCompress (3 commands)
```bash
pip install ultracompress
hf download SipsaLabs/qwen3-1.7b-base-uc-v3-bpw5 --local-dir ./pack
uc verify ./pack
```
Expected: `VERIFY: PASS - bit-identical reconstruction guaranteed.`
---
## Product 2: Sipsa Inference
OpenAI-compatible inference API serving our compressed weights. Drop-in replacement for OpenAI's `base_url`: the same `openai` Python SDK works unchanged.
```bash
export OPENAI_BASE_URL=https://api.sipsalabs.com/v1
curl $OPENAI_BASE_URL/models
```
22 models live in the catalog with `sipsa-*` prefix (e.g. `model="sipsa-hermes-3-llama-3.1-405b"`). Backed by dual RTX 5090 over Cloudflare Tunnel.
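Because the API is OpenAI-compatible, the stock `openai` SDK should work once `base_url` points at the endpoint. A minimal sketch (`SIPSA_API_KEY` is an assumed variable name; use whatever key the dashboard issues):

```python
import os
from openai import OpenAI

# Stock OpenAI SDK, pointed at the Sipsa endpoint instead of api.openai.com.
client = OpenAI(
    base_url="https://api.sipsalabs.com/v1",
    api_key=os.environ["SIPSA_API_KEY"],  # assumed env var name
)
resp = client.chat.completions.create(
    model="sipsa-hermes-3-llama-3.1-405b",  # catalog name from above
    messages=[{"role": "user", "content": "Say hello."}],
)
print(resp.choices[0].message.content)
```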
**Pricing**: Free $5 credit on signup (no card). **Pro $99/mo** (600 RPM, $100 included credit). **Team $499/mo** (2400 RPM, $500 included credit). Full pricing + bill estimator at sipsalabs.com/pricing.
---
## Service: Compression-as-a-Service (CaaS)
Bring a model; we deliver a verified-lossless 5-bit pack you can run on your hardware. **Phase 0 POC is $5K / 5 business days / customer-picked model**. The Day 7 deliverable is a pack you self-verify with `uc verify` and benchmark with `uc bench`. Acceptance gate: `uc verify PASS` + PPL ratio within 1.5% on your eval set.
Email founder@sipsalabs.com.
---
## Why this matters
Defense / FDA-regulated healthcare / SR 11-7 model validation / frontier-lab red-team evals: places where "good enough on MMLU" isn't enough. Bit-identical reconstruction is the actual reason to talk to us in those settings. As a side effect, the streaming compression path lets us put a 405B-parameter model through a single 32 GB consumer GPU without renting an H100 cluster.
---
## License + IP
- **PyPI v0.6+** under BUSL-1.1 with **Additional Use Grant**: free for sub-$1M ARR companies, research, and individuals. Auto-converts to Apache 2.0 four years after each release.
- **v0.5.x** stays under Apache-2.0 forever on the `legacy/0.5.x` branch.
- Codec internals are patent-protected, with continuation filings running through 2027.
---
## Contact
- **Commercial / Phase 0 POC** → founder@sipsalabs.com
- **Patents / licensing** → legal@sipsalabs.com
- **Press / media** → press@sipsalabs.com
- **Security disclosure** → security@sipsalabs.com
- **General** → hello@sipsalabs.com
[sipsalabs.com](https://sipsalabs.com) · [GitHub](https://github.com/sipsalabs) · [PyPI](https://pypi.org/project/ultracompress/) · [Pricing](https://sipsalabs.com/pricing)