How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="HOLOGRAMTECH/q-dream-7b",
	filename="tokenizer.gguf",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Hologram · Dream-7B

Diffusion language model

q3f · diffusion · 2.9 GB · streamed to Q as a key-addressable .holo object

Hologram · Live Space · Organization · Code


What this is

A masked-diffusion LM: generation is iterative bidirectional unmasking over denoising steps, not left to right. Wall-clock is fixed by step count, not output length.

This repository is not a GGUF or Transformers checkpoint. It is a Hologram key object: the weights of Dream-org/Dream-v0-Instruct-7B re-encoded into Hologram's content-addressed .holo format so they stream, one verified block at a time, into Q, the on-device brain of the Hologram web OS. It runs in the browser on WebGPU, serverless, with nothing to install.

How it streams

The object is laid out for cold streaming from an untrusted CDN:

File Role
manifest.json the root. Names every tensor and the key (content hash) of its block.
b/sha256_*.gz the tensor blocks. Each filename is the SHA-256 of its bytes.
tokenizer.gguf bundled header (where present), so loading is fully serverless.

Q fetches the manifest, then pulls each block by its key and re-derives sha256(block) on arrival. If a byte is wrong, the block is rejected. Nothing is trusted; everything is proven.

Verify (Law L5)

The object's identity is the SHA-256 of its manifest, pinned in Q's catalog before a single byte of weight is trusted:

did:holo:sha256:7b862931ae088f348f1f7e9ea3adbd418924c2e07e6ddd134f926e5681ad760d
# the manifest hash equals the pinned identity above
curl -sL https://huggingface.co/HOLOGRAMTECH/q-dream-7b/resolve/main/manifest.json   | sha256sum

Specifications

Architecture Qwen2.5-7B backbone (masked diffusion)
Precision q3f · diffusion
Object size 2.9 GB
Hidden size 3584
Layers 28
Heads (Q / KV) 28 / 4 (GQA)
FFN 18944
Vocab 152064
Context 192
Format holo-2bit/1

Provenance and license

Derived from Dream-org/Dream-v0-Instruct-7B. Inherits its license from Dream-org/Dream-v0-Instruct-7B. The re-encoding is lossless-by-construction at the key level: every block is content-addressed, so the object either re-derives to its pinned identity or it is refused.

Run it

These weights load through Q, not a standard runtime. Open the Live Space or visit gethologram.ai to run Hologram, then pick Dream-7B from Q's model list.

Composed on the golden ratio. One key, everything.
Downloads last month
-
GGUF
Model size
8B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for HOLOGRAMTECH/q-dream-7b

Quantized
(11)
this model

Space using HOLOGRAMTECH/q-dream-7b 1