ToPo-ToPo/Ornith-1.0-35B-mlx-4bit

MLX 4-bit conversion of deepreinforce-ai/Ornith-1.0-35B for Apple Silicon (mlx-vlm).

Provenance (self-converted from official weights)

  • Source: deepreinforce-ai/Ornith-1.0-35B (license: mit)
  • Tool: mlx-vlm 0.6.3, mlx_vlm.convert --hf-path deepreinforce-ai/Ornith-1.0-35B --mlx-path . -q --q-bits 4 --q-group-size 64
  • Effective: 4.649 bits/weight
  • Note (MoE): the source stores experts per-expert (experts.{i}.gate_proj/up_proj/down_proj), while mlx-vlm expects a fused/stacked experts.gate_up_proj. A sanitize monkeypatch fused them before conversion (gate then up, stacked over experts).
  • Validation: reproduced geometrically exact CAD output in an agentic CAD+FEM pipeline (volumes match the reference mlx-community conversion).
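
The expert fusion described in the MoE note can be sketched roughly as follows. This is not the actual monkeypatch used for the conversion. It is a minimal NumPy illustration that assumes each per-expert weight is an [out, in] matrix stored under a key ending in experts.{i}.<proj>.weight, that gate and up are concatenated along the output axis, and that down_proj is stacked the same way. The function name fuse_experts, the key regex, and the fused key names are hypothetical.

```python
import re
from collections import defaultdict

import numpy as np

# Hypothetical key pattern, modeled on the per-expert layout named in the note.
EXPERT_KEY = re.compile(
    r"^(?P<prefix>.*\.experts)\.(?P<idx>\d+)\."
    r"(?P<proj>gate_proj|up_proj|down_proj)\.weight$"
)


def fuse_experts(weights):
    """Return a new dict with per-expert weights fused into stacked tensors.

    Assumption: every expert matrix has shape [out, in].
    """
    fused = {}
    # prefix -> projection name -> expert index -> array
    grouped = defaultdict(lambda: defaultdict(dict))

    for key, value in weights.items():
        m = EXPERT_KEY.match(key)
        if m is None:
            fused[key] = value  # non-expert weights pass through unchanged
            continue
        grouped[m["prefix"]][m["proj"]][int(m["idx"])] = value

    for prefix, projs in grouped.items():
        n = len(projs["gate_proj"])
        # Gate then up, joined along the output axis of each expert matrix.
        gate_up = [
            np.concatenate([projs["gate_proj"][i], projs["up_proj"][i]], axis=0)
            for i in range(n)
        ]
        # Stack over experts: the result has shape [n_experts, 2 * out, in].
        fused[f"{prefix}.gate_up_proj"] = np.stack(gate_up, axis=0)
        # Assumed: down_proj is stacked over experts as well.
        fused[f"{prefix}.down_proj"] = np.stack(
            [projs["down_proj"][i] for i in range(n)], axis=0
        )
    return fused
```

Sorting by the integer expert index rather than by key string keeps expert 10 from landing before expert 2 in the stacked tensor, which would silently misroute tokens.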

Usage

from mlx_vlm import load, generate
model, processor = load("ToPo-ToPo/Ornith-1.0-35B-mlx-4bit")