llama-70b-partial โ€” Resonance Knot (.rknot)

Spectral-residual repack of the dense .knot weight format. Drops non-standing-wave embedding dimensions, signed-quantizes the rest at amplitude-graded bit widths, ships as a single .rknot file with a header pointer table for mmap-style block access.

Compression

Artifact Bytes Notes
Resonance Knot .rknot 4496042838 (4.19 GiB) spectral-residual repack of the dense Q4_K_M .knot

Quantization profile (production defaults):

  • standing-wave coverage k = 0.30 ร— hidden_dim
  • Q/K: 4 bits, signed, per-block scale
  • V: 8 bits, signed, per-block scale
  • FFN: 4 bits, signed, per-block scale

Format

magic        :  6 bytes  RKNOT\\x02
header_len   :  4 bytes  u32 LE
header_json  :  N bytes  manifest, per-layer pointers, arch metadata
body         :  rest     concatenated quantized blocks

Generated by the forkjoin-ai/distributed-inference Cloud Build pipeline.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for forkjoin-ai/llama-70b-partial-rknot

Finetuned
(48)
this model