metadata
license: other
language:
  - multilingual
pipeline_tag: image-text-to-text
library_name: transformers
base_model:
  - tencent/HunyuanOCR
tags:
  - ocr
  - hunyuan
  - vision-language
  - image-to-text
  - 1B
  - end-to-end
  - mlx

hadeseus/HunyuanOCR-mlx

This model was converted to MLX format from tencent/HunyuanOCR using mlx-vlm version 0.3.12. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model hadeseus/HunyuanOCR-mlx --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>
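
To run the same command over several images, one option is to wrap the CLI call in a small script. The sketch below is illustrative only: `build_generate_command` and `run_on_folder` are hypothetical helper names, the `*.png` pattern is an assumption, and the only flags used are the ones shown in the command above. It shells out to `mlx_vlm.generate` rather than relying on any particular Python API of mlx-vlm.

```python
import subprocess
import sys
from pathlib import Path

# Model repository named in this README.
MODEL_ID = "hadeseus/HunyuanOCR-mlx"


def build_generate_command(image_path, prompt="Describe this image.", max_tokens=100):
    """Return the argv list for one mlx_vlm.generate call (hypothetical helper).

    Only the flags shown in the README command are used.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", MODEL_ID,
        "--max-tokens", str(max_tokens),
        "--temperature", "0.0",
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def run_on_folder(folder, pattern="*.png"):
    """Run the CLI once per matching image and return {filename: raw stdout}.

    The glob pattern is an assumption; adjust it to the image types you have.
    """
    results = {}
    for image_path in sorted(Path(folder).glob(pattern)):
        completed = subprocess.run(
            build_generate_command(image_path),
            capture_output=True,
            text=True,
            check=True,  # raise if the CLI exits with a non-zero status
        )
        results[image_path.name] = completed.stdout
    return results
```

Calling `run_on_folder("scans")` (a hypothetical folder name) would invoke the CLI once per image and collect whatever each run prints to standard output. Because `--max-tokens` caps the number of generated tokens, a text-heavy page may need a larger value than 100, which can be passed through the `max_tokens` argument of `build_generate_command`.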