PierrunoYT/moondream3-preview


vikhyatk committed on Sep 17, 2025

Commit 638927a · verified · 1 Parent(s): d1b7c10

Update README.md

Files changed (1)

README.md +16 -9


@@ -19,14 +19,21 @@ For more details, please refer to our ||coming soon release blog post||. Or try
 Load the model and prepare it for inference. We use [FlexAttention for inference](https://pytorch.org/blog/flexattention-for-inference/), so calling `.compile()` is critical for fast decoding. Our `compile` implementation also handles warmup, so you can start making requests directly once it returns.
-```
-    moondream = AutoModelForCausalLM.from_pretrained(
-        "moondream/moondream3-preview",
-        trust_remote_code=True,
-        dtype=torch.bfloat16,
-        device_map={"": "cuda"},
-    )
-    moondream.compile()
 ```
-* TODO: Add usage examples

 Load the model and prepare it for inference. We use [FlexAttention for inference](https://pytorch.org/blog/flexattention-for-inference/), so calling `.compile()` is critical for fast decoding. Our `compile` implementation also handles warmup, so you can start making requests directly once it returns.
+```python
+import torch
+from transformers import AutoModelForCausalLM
+moondream = AutoModelForCausalLM.from_pretrained(
+    "moondream/moondream3-preview",
+    trust_remote_code=True,
+    dtype=torch.bfloat16,
+    device_map={"": "cuda"},
+)
+moondream.compile()
 ```
+* TODO: Add usage examples
+  * Query
+  * Caption
+  * Detect
+  * Point