metadata
license: apache-2.0
language:
- en
pipeline_tag: image-text-to-text
tags:
- multimodal
- mlx
library_name: transformers
base_model:
- Qwen/Qwen2-VL-2B
mlx-community/UGround-V1-2B
This model was converted to MLX format from osunlp/UGround-V1-2B using mlx-vlm version 0.1.26.
Refer to the original model card (osunlp/UGround-V1-2B) for more details on the model.
Use with MLX
pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/UGround-V1-2B --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>
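To run the same command over several images, one option is a small Python wrapper around the CLI. The following is a minimal sketch, not part of mlx-vlm: the helper names `build_generate_command` and `describe_images` are hypothetical, and the only flags used are those shown in the command above. It assumes mlx-vlm is installed in the same Python environment that runs the script.

```python
import shlex
import subprocess
import sys

MODEL_ID = "mlx-community/UGround-V1-2B"


def build_generate_command(image_path, prompt="Describe this image.",
                           max_tokens=100, temperature=0.0):
    """Hypothetical helper: build the argv list for one mlx_vlm.generate run.

    Only the flags from the command in this model card are used.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",  # same interpreter as this script
        "--model", MODEL_ID,
        "--max-tokens", str(max_tokens),
        "--temperature", str(temperature),
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def describe_images(image_paths):
    """Hypothetical helper: run the CLI once per image, stopping on the first failure."""
    for path in image_paths:
        cmd = build_generate_command(path)
        print("Running:", shlex.join(cmd))
        subprocess.run(cmd, check=True)


# Example call (requires mlx-vlm and local image files):
# describe_images(["screenshot1.png", "screenshot2.png"])
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when a prompt or file path contains spaces.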