---
library_name: mlx
license: other
license_name: lfm1.0
license_link: LICENSE
language:
  - en
  - ja
  - ko
  - fr
  - es
  - de
  - it
  - pt
  - ar
  - zh
pipeline_tag: image-text-to-text
tags:
  - liquid
  - lfm2.5-vl
  - lfm2.5
  - edge
  - mlx
base_model: LiquidAI/LFM2.5-VL-450M
---

LFM2.5-VL-450M-MLX-bf16

MLX export of LFM2.5-VL-450M for Apple Silicon inference.

LFM2.5-VL-450M is a vision-language model built on the LFM2.5-350M backbone with a SigLIP2 NaFlex vision encoder (86M). It supports OCR, document comprehension, multilingual vision understanding, bounding box prediction, and function calling.

Model Details

| Property | Value |
|---|---|
| Parameters | 450M |
| Precision | bfloat16 |
| Size | 0.84 GB |
| Context Length | 32K |
| Vision Encoder | SigLIP2 NaFlex (86M) |
| Native Resolution | up to 512x512 |
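
As a rough sanity check on the figures above, the listed size is consistent with storing each parameter in 2 bytes (bfloat16). The sketch below assumes a nominal count of exactly 450 million parameters and reads "GB" as binary gigabytes (GiB); both are assumptions, and the real checkpoint will differ slightly.

```python
# Nominal parameter count from the table (assumed exact for this estimate).
num_params = 450_000_000

# bfloat16 stores each value in 16 bits, i.e. 2 bytes.
bytes_per_param = 2

size_bytes = num_params * bytes_per_param
size_gib = size_bytes / 2**30  # binary gigabytes (assumption about the unit)

print(f"Estimated weight size: {size_gib:.2f} GiB")
```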

Quickstart

Install mlx-vlm from a shell:

uv pip install 'mlx-vlm==0.3.9'

Then run inference from Python:

from mlx_vlm import load, generate
from mlx_vlm.utils import load_image

model, processor = load("LiquidAI/LFM2.5-VL-450M-MLX-bf16")

image = load_image("photo.jpg")

# Apply chat template (required for LFM2.5-VL)
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "What do you see in this image?"},
]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)

result = generate(
    model,
    processor,
    prompt,
    [image],
    temp=0.1,
    min_p=0.15,
    repetition_penalty=1.05,
    verbose=True,
)
print(result.text)
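
The message structure used in the Quickstart can be wrapped in a small helper. This is an illustrative sketch only: `build_messages` is a hypothetical name and not part of mlx-vlm, and using more than one image placeholder is an assumption that should be checked against the model's chat template.

```python
def build_messages(question, num_images=1):
    """Build a single-turn user message: image placeholders first, then the text.

    Hypothetical helper, not part of mlx-vlm.
    """
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    # One {"type": "image"} placeholder per image, as in the Quickstart.
    content = [{"type": "image"} for _ in range(num_images)]
    content.append({"type": "text", "text": question})
    return [{"role": "user", "content": content}]


messages = build_messages("What do you see in this image?")
```

The resulting list can be passed to `processor.apply_chat_template(messages, add_generation_prompt=True)` exactly as in the Quickstart.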

Recommended Sampling Parameters

| Parameter | Value |
|---|---|
| temperature | 0.1 |
| min_p | 0.15 |
| repetition_penalty | 1.05 |
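
To give a feel for what `temperature` and `min_p` do, here is a minimal standard-library sketch of temperature scaling followed by min-p filtering on a toy set of logits. It is not mlx-vlm's implementation: the function name `min_p_filter` is hypothetical, the order of the two steps is an assumption (libraries differ), and the repetition penalty is left out.

```python
import math


def min_p_filter(logits, temperature=0.1, min_p=0.15):
    """Return renormalized probabilities after temperature scaling and min-p filtering.

    Illustrative only; real samplers may apply these steps in a different order.
    """
    # Lower temperature sharpens the distribution around the top token.
    scaled = [x / temperature for x in logits]
    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # min-p keeps tokens whose probability is at least min_p times the top probability.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


# Toy logits for three candidate tokens (made-up values).
probs = min_p_filter([2.0, 1.9, 1.0])
print(probs)
```

With these toy values the third token falls below the min-p threshold and is dropped, while the remaining two are renormalized to sum to 1.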

License

This model is released under the LFM 1.0 License.