---
license: other
language:
  - multilingual
pipeline_tag: image-text-to-text
library_name: transformers
base_model:
  - tencent/HunyuanOCR
tags:
  - ocr
  - hunyuan
  - vision-language
  - image-to-text
  - 1B
  - end-to-end
  - mlx
---

# hadeseus/HunyuanOCR-mlx

This model was converted to MLX format from tencent/HunyuanOCR using mlx-vlm version 0.3.12. Refer to the original model card for more details on the model.

## Use with mlx

```bash
pip install -U mlx-vlm
```

```bash
python -m mlx_vlm.generate --model hadeseus/HunyuanOCR-mlx --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>
```
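To process several images, one option is to wrap the same command in a small Python script. The sketch below is an illustration, not part of the original model card: `build_generate_command` and `run_on_folder` are hypothetical helper names, the `*.png` pattern is an assumption, and only the CLI flags already shown above are used. Note that each call starts a new process and reloads the model, so this is convenient rather than fast.

```python
import subprocess
import sys
from pathlib import Path

MODEL_ID = "hadeseus/HunyuanOCR-mlx"


def build_generate_command(image_path, prompt="Describe this image.",
                           max_tokens=100, temperature=0.0):
    """Return the argv list for the mlx_vlm.generate command shown above.

    Hypothetical helper: it only assembles the flags from this README.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", MODEL_ID,
        "--max-tokens", str(max_tokens),
        "--temperature", str(temperature),
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def run_on_folder(folder, pattern="*.png"):
    """Run the CLI once per matching image and return {filename: stdout}.

    Assumes mlx-vlm is installed in the current Python environment.
    """
    results = {}
    for image_path in sorted(Path(folder).glob(pattern)):
        completed = subprocess.run(
            build_generate_command(image_path),
            capture_output=True, text=True, check=True,
        )
        results[image_path.name] = completed.stdout
    return results
```

For example, `run_on_folder("scans")` would run the model on every PNG in a `scans` directory (a placeholder name) and collect the raw CLI output per file.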