renezander030
/

browserground-mlx

Image-Text-to-Text

4-bit precision

Model card Files Files and versions

browserground-mlx / README.md

renezander030's picture

upload README.md

1424b94 verified 2 days ago

|

history blame contribute delete

1.06 kB

	---
	license: apache-2.0
	library_name: mlx
	base_model: renezander030/browserground
	tags:
	- mlx
	- apple-silicon
	- ui-grounding
	- browser-agent
	- qwen3-vl
	pipeline_tag: image-text-to-text
	---

	# browserground-mlx (Apple Silicon, 4-bit)

	MLX-converted 4-bit quant of [renezander030/browserground](https://huggingface.co/renezander030/browserground).
	Drop in the same model you'd use via `transformers`, but ~10× faster on Apple Silicon.

	## Use

	```python
	from mlx_vlm import load, generate
	model, processor = load("renezander030/browserground-mlx")
	out = generate(model, processor, image="screenshot.png", prompt="Locate: Submit button", max_tokens=64)
	print(out)
	```

	Or via the CLI:
	```bash
	npm install -g browserground
	IMGPARSE_MODEL=renezander030/browserground-mlx browserground parse screenshot.png --target "Submit button"
	```

	Numbers, training recipe, and the full positioning vs UI-TARS-2B-SFT are on the main model card: <https://huggingface.co/renezander030/browserground>.

	License: Apache 2.0 (inherits from `Qwen/Qwen3-VL-2B-Instruct`).