UGround-V1-2B / README.md
teevee112's picture
Upload folder using huggingface_hub
610c2e7 verified
metadata
license: apache-2.0
language:
  - en
pipeline_tag: image-text-to-text
tags:
  - multimodal
  - mlx
library_name: transformers
base_model:
  - Qwen/Qwen2-VL-2B

mlx-community/UGround-V1-2B

This model was converted to MLX format from osunlp/UGround-V1-2B using mlx-vlm version 0.1.26. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/UGround-V1-2B --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>