How to use from
Docker Model Runner
docker model run hf.co/programasweights/paw-4b-qwen3-0.6b
Quick Links

paw-4b-qwen3-0.6b β€” ProgramAsWeights "Standard" compiler

This is the Standard compiler from ProgramAsWeights (PAW). Given a natural-language spec (a description of a function), it emits a tiny per-task program β€” a LoRA adapter β€” that then runs locally on the Qwen3-0.6B interpreter.

It is the model invoked by paw.compile(spec, compiler="paw-4b-qwen3-0.6b") and powers programs on https://programasweights.com.

  • Compiler base model: Qwen/Qwen3-4B-Instruct-2507
  • Target interpreter: Qwen/Qwen3-0.6B
  • Snapshot: 20260407 (see git tag 20260407)

Contents

  • compiler/ β€” a finetuned Qwen3-4B-Instruct-2507 causal LM (the compiler).
  • lora_mapper.pt β€” the mapper head (trunk + coefficient head + learnable LoRA basis matrices) that turns the compiler's hidden states into a LoRA program.
  • meta.json β€” lora_rank=64, lora_alpha=16, lora_num_bases=64, prefix_steps=64, target modules [q,k,v,o,gate,up,down]_proj.

How it works

  1. The 4B compiler generates a short "pseudo-program" (a task description plus a few I/O examples) from the spec.
  2. The text chat_template(spec) + pseudo-program + 64 prefix tokens is run through the compiler; the mapper reads the 64 prefix hidden states and emits per-layer LoRA A/B matrices as a learned mixture of basis matrices.
  3. The resulting LoRA (about 22 MB) is the program. It loads onto Qwen3-0.6B and runs locally and offline (about 100 ms/call).

Status

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for programasweights/paw-4b-qwen3-0.6b

Adapter
(5498)
this model