Instructions to use PlusMinus1/omniparser-icon-caption-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use PlusMinus1/omniparser-icon-caption-mlx with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("PlusMinus1/omniparser-icon-caption-mlx") config = load_config("PlusMinus1/omniparser-icon-caption-mlx") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
OmniParser icon_caption โ MLX
MLX (bfloat16) conversion of microsoft/OmniParser-v2.0's icon_caption โ a Florence-2 fine-tuned on UI elements, for captioning interactive icons in screenshots. Runs on Apple Silicon via mlx_vlm with no PyTorch.
License: MIT (ยฉ Microsoft Corporation) โ see LICENSE. This repo redistributes the original MIT-licensed weights converted to MLX format.
Usage
Needs a small no-torch patch on transformers 5.x (register florence2_language, route the image processor to CLIPImageProcessorPil). See the conversion recipe at the bottom.
# apply the florence2 no-torch patch first (see recipe), then:
from mlx_vlm import load, generate
model, processor = load("PlusMinus1/omniparser-icon-caption-mlx")
out = generate(model, processor, "<CAPTION>", image=["icon_crop.png"], max_tokens=20)
Provenance
- Base: microsoft/Florence-2-base (MIT)
- Fine-tune: microsoft/OmniParser-v2.0
icon_caption(MIT) - Conversion:
mlx_vlm.convert(bfloat16) + a transformers-5.x no-torch compatibility patch.
- Downloads last month
- 21
Model size
0.3B params
Tensor type
BF16
ยท
Hardware compatibility
Log In to add your hardware
Quantized
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for PlusMinus1/omniparser-icon-caption-mlx
Base model
microsoft/OmniParser-v2.0