stamsam's picture
Tighten model card
9593c4f verified
---
base_model: Qwen/Qwen3.5-9B
library_name: mlx
pipeline_tag: text-generation
license: apache-2.0
tags:
- mlx
- qwen3.5
- qwenjamin-franklin
- 4bit
- local-first
- text-generation
---
![Qwenjamin Franklin](./assets/qwen9b.png)
# Qwenjamin Franklin 4bit
Qwenjamin Franklin 4bit is the fused Apple Silicon release of the strongest everyday-use branch from the Qwenjamin Franklin workshop line.
It is built from Qwen 3.5 9B and tuned for compact coding help, stricter JSON and tool behavior, and stronger false-premise correction while staying local-first in MLX.
If you want the follow-on release with a more expanded model card, see [`stamsam/Qwenjamin_Franklin_V2`](https://huggingface.co/stamsam/Qwenjamin_Franklin_V2) and its compact sibling [`stamsam/Qwenjamin_Franklin_V2_4bit`](https://huggingface.co/stamsam/Qwenjamin_Franklin_V2_4bit).
## What This Release Is
- Fused MLX 4-bit model
- Base lineage: `Qwen/Qwen3.5-9B`
- Workshop branch lineage: `v14` broad-benchmark daily-driver
- Best fit: Apple Silicon local use where size and speed matter
## Base vs This Model
Internal workshop evals. These scores are project-specific and directional, not public leaderboard claims.
| Eval | Base `Qwen3.5-9B-MLX-4bit` | Qwenjamin Franklin 4bit |
|---|---:|---:|
| `workbench_local_agent_smoke` | 63/100 | 72/100 |
| `full40` | 309/400 | 325/400 |
| `json_hard` | 15/30 | 30/30 |
| `parser_gate` | 2/3, 1/3, 1/3 | 3/3, 3/3, 3/3 |
| `code_smoke` | 95/120 | 95/120 |
| `false_smoke` | 102/110 | 110/110 |
| `tool_schema_canary` | 50/175 | 106/175 |
| `no_tool_leakage` | 99/100 | 100/100 |
## Usage
```bash
python -m mlx_lm generate \
--model stamsam/Qwenjamin_Franklin_4bit \
--prompt "Return only valid JSON." \
--max-tokens 256 \
--temp 0.0
```
## Notes
- This is the compact Apple Silicon release.
- For strict JSON or code-only tasks, use explicit output instructions in the prompt.
- Verify important outputs before using them in high-stakes workflows.