Instructions to use beshkenadze/moondream3-preview-mlx-8bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use beshkenadze/moondream3-preview-mlx-8bit with MLX:
```python
# Make sure mlx-vlm is installed
# pip install --upgrade mlx-vlm
from mlx_vlm import load, generate
from mlx_vlm.prompt_utils import apply_chat_template
from mlx_vlm.utils import load_config

# Load the model
model, processor = load("beshkenadze/moondream3-preview-mlx-8bit")
config = load_config("beshkenadze/moondream3-preview-mlx-8bit")

# Prepare input
image = ["http://images.cocodataset.org/val2017/000000039769.jpg"]
prompt = "Describe this image."

# Apply chat template
formatted_prompt = apply_chat_template(
    processor, config, prompt, num_images=1
)

# Generate output
output = generate(model, processor, formatted_prompt, image)
print(output)
```

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
```json
{
  "architectures": [
    "HfMoondream"
  ],
  "auto_map": {
    "AutoConfig": "hf_moondream.HfConfig",
    "AutoModelForCausalLM": "hf_moondream.HfMoondream"
  },
  "config": {
    "skills": [
      "query",
      "caption",
      "detect",
      "point"
    ]
  },
  "model_type": "moondream3",
  "quantization": {
    "group_size": 64,
    "bits": 8,
    "mode": "affine"
  },
  "quantization_config": {
    "group_size": 64,
    "bits": 8,
    "mode": "affine"
  },
  "torch_dtype": "bfloat16",
  "transformers_version": "4.51.1"
}
```
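The `quantization` and `quantization_config` entries in the config above carry the same three settings: 8 bits, groups of 64 weights, and `affine` mode. The sketch below is only an illustration of what those numbers imply, not the code MLX runs. It assumes that affine mode stores one scale and one bias per group, and that both are kept in a 16-bit float type (the hypothetical `meta_bits=16` default). The helper names `effective_bits_per_weight`, `quantize_group`, and `dequantize_group` are invented for this example.

```python
import json

# Quantization fields copied from the config above.
CONFIG_TEXT = """
{
  "model_type": "moondream3",
  "quantization": {"group_size": 64, "bits": 8, "mode": "affine"},
  "quantization_config": {"group_size": 64, "bits": 8, "mode": "affine"}
}
"""


def effective_bits_per_weight(bits, group_size, meta_bits=16):
    """Storage per weight, counting one scale and one bias per group.

    meta_bits=16 is an assumption: scale and bias held in a 16-bit float type.
    """
    return bits + 2 * meta_bits / group_size


def quantize_group(values, bits):
    """Illustrative affine quantization of one group of weights."""
    levels = (1 << bits) - 1           # 255 distinct steps for 8 bits
    bias = min(values)                 # smallest value maps to code 0
    span = max(values) - bias
    scale = span / levels if span else 1.0
    codes = [round((v - bias) / scale) for v in values]
    return codes, scale, bias


def dequantize_group(codes, scale, bias):
    """Map integer codes back to approximate float weights."""
    return [c * scale + bias for c in codes]


config = json.loads(CONFIG_TEXT)
quant = config["quantization"]

# Both keys should describe the same settings.
settings_match = quant == config["quantization_config"]
bits_per_weight = effective_bits_per_weight(quant["bits"], quant["group_size"])

# Round-trip one synthetic group of 64 evenly spaced weights in [-0.5, 0.5].
group = [i / 63 - 0.5 for i in range(quant["group_size"])]
codes, scale, bias = quantize_group(group, quant["bits"])
restored = dequantize_group(codes, scale, bias)
max_error = max(abs(a - b) for a, b in zip(group, restored))

print("settings match:", settings_match)
print("bits per weight:", bits_per_weight)
print("max round-trip error:", max_error)
```

Under these assumptions the per-group scale and bias add 32 bits for every 64 weights, so each weight costs 8.5 bits rather than 8, and the round-trip error for a group stays within half a quantization step.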