moondream
/

md3p-int4

Model card Files Files and versions

err805 commited on 22 days ago

Commit

c6e79eb

·

verified ·

1 Parent(s): 2ad961f

Update README.md

Files changed (1) hide show

README.md +0 -9

README.md CHANGED Viewed

@@ -17,15 +17,6 @@ Pre-quantized version of Moondream 3 Preview for MLX inference.
 - **Other weights**: bf16 (unchanged)
 - **Memory savings**: ~60% reduction in MoE weight memory
-## Usage
-This model is designed for use with the moondream-station MLX backend.
-```python
-# In moondream-station, use with:
-moondream-station serve --backend mlx
-```
 ## Source
 Quantized from [moondream/moondream3-preview](https://huggingface.co/moondream/moondream3-preview)

 - **Other weights**: bf16 (unchanged)
 - **Memory savings**: ~60% reduction in MoE weight memory
 ## Source
 Quantized from [moondream/moondream3-preview](https://huggingface.co/moondream/moondream3-preview)