Video-Text-to-Text
Transformers
Safetensors
English
molmo2
image-text-to-text
multimodal
olmo
molmo
custom_code
Instructions to use allenai/Molmo2-VideoPoint-4B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use allenai/Molmo2-VideoPoint-4B with Transformers:
# Load model directly from transformers import AutoModelForImageTextToText model = AutoModelForImageTextToText.from_pretrained("allenai/Molmo2-VideoPoint-4B", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
update allenai/Molmo2-VideoPoint4B to allenai/Molmo2-VideoPoint-4B
Browse files
README.md
CHANGED
|
@@ -59,7 +59,7 @@ import torch
|
|
| 59 |
from molmo_utils import process_vision_info
|
| 60 |
import re
|
| 61 |
|
| 62 |
-
model_id="allenai/Molmo2-
|
| 63 |
|
| 64 |
# load the processor
|
| 65 |
processor = AutoProcessor.from_pretrained(
|
|
|
|
| 59 |
from molmo_utils import process_vision_info
|
| 60 |
import re
|
| 61 |
|
| 62 |
+
model_id="allenai/Molmo2-VideoPoint-4B"
|
| 63 |
|
| 64 |
# load the processor
|
| 65 |
processor = AutoProcessor.from_pretrained(
|