--- license: mit library_name: mlx tags: - depth-estimation - mlx - apple-silicon base_model: robbyant/lingbot-depth-pretrain-vitl-14-v0.5 --- # lingbot-depth-mlx MLX weights for [LingBot-Depth](https://github.com/Robbyant/lingbot-depth) (`mdm`), pre-converted for the `mlx-native` backend of [lingbot-depth-viz](https://github.com/WT-MM/lingbot-depth-viz). These are the original `robbyant/lingbot-depth-pretrain-vitl-14-v0.5` weights, converted to MLX `safetensors` with the position embeddings precomputed, so the whole model runs on Apple Silicon through `mlx.core` alone. Hosting them lets the torch-free install run without downloading the torch checkpoint or running the conversion: ```bash uv sync --extra mlx uv run lingbot-depth-viz --backend mlx-native --mode benchmark # weights.safetensors is fetched from here on first run ``` ## Files - `weights.safetensors` — MLX arrays (bf16 encoder, fp32 decoder) plus precomputed position embeddings. - `config.json` — model dims and preprocessing constants read by the runtime. ## Provenance Produced by `--mode convert-mlx` (converter_version 1). Parity vs torch-MPS is 0.27–0.48% AbsRel end-to-end. See the repo's `results/mlx-backend.md`.