metadata
license: mit
pipeline_tag: image-text-to-text
library_name: transformers
base_model:
- OpenGVLab/InternViT-300M-448px-V2_5
- Qwen/Qwen2.5-0.5B-Instruct
base_model_relation: merge
language:
- multilingual
tags:
- internvl
- custom_code
- mlx
datasets:
- HuggingFaceFV/finevideo
mlx-community/InternVL2_5-1B-4bit
This model was converted to MLX format from OpenGVLab/InternVL2_5-1B using mlx-vlm version 0.3.3.
Refer to the original model card for more details on the model.
Use with mlx
pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/InternVL2_5-1B-4bit --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>