How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull HOLOGRAMTECH/q-bitnet-2b
Run and chat with the model
lemonade run user.q-bitnet-2b-{{QUANT_TAG}}
List all available models
lemonade list
Quick Links

Hologram 路 BitNet-2B-4T

Native 1.58-bit ternary brain

t2 路 1.58-bit ternary0.69 GB 路 streamed to Q as a key-addressable .holo object

HologramLive SpaceOrganizationCode


What this is

Microsoft's BitNet b1.58 2B4T, the first natively 1.58-bit trained model at scale, re-encoded to Hologram's ternary key format. Q's default brain: fast, tiny, and coherent.

This repository is not a GGUF or Transformers checkpoint. It is a Hologram key object: the weights of microsoft/bitnet-b1.58-2B-4T re-encoded into Hologram's content-addressed .holo format so they stream, one verified block at a time, into Q, the on-device brain of the Hologram web OS. It runs in the browser on WebGPU, serverless, with nothing to install.

How it streams

The object is laid out for cold streaming from an untrusted CDN:

File Role
manifest.json the root. Names every tensor and the key (content hash) of its block.
b/sha256_*.gz the tensor blocks. Each filename is the SHA-256 of its bytes.
tokenizer.gguf bundled header, so loading is fully serverless.

Q fetches the manifest, then pulls each block by its key and re-derives sha256(block) on arrival. If a byte is wrong, the block is rejected. Nothing is trusted; everything is proven.

Verify (Law L5)

The object's identity is the SHA-256 of its manifest, pinned in Q's catalog before a single byte of weight is trusted:

did:holo:sha256:fcf835659d88d2fe6f683cf1ab8de6a6ba6214ea0deeee4b1bcf3da1a4c05412
curl -sL https://huggingface.co/HOLOGRAMTECH/q-bitnet-2b/resolve/main/manifest.json | sha256sum

Specifications

Architecture BitNet b1.58 (Llama 3 template)
Precision t2 路 1.58-bit ternary
Object size 0.69 GB
Hidden size 2560
Layers 30
Heads (Q / KV) 20 / 5 (GQA)
FFN 6912
Vocab 128256
Context 3000
Format holo-2bit/1

Provenance and license

Derived from microsoft/bitnet-b1.58-2B-4T. Inherits the MIT license from microsoft/bitnet-b1.58-2B-4T. The re-encoding is content-addressed at the key level: the object either re-derives to its pinned identity or it is refused.

Run it

These weights load through Q, not a standard runtime. Open the Live Space or visit gethologram.ai to run Hologram, then pick BitNet-2B-4T from Q's model list.

Composed on the golden ratio. One key, everything.
Downloads last month
-
GGUF
Model size
2B params
Architecture
bitnet-b1.58
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for HOLOGRAMTECH/q-bitnet-2b

Quantized
(7)
this model

Spaces using HOLOGRAMTECH/q-bitnet-2b 2