Robotics
Transformers
Safetensors
qwen2_5_vl
image-text-to-text
vision-language-action-model
vision-language-model
text-generation-inference
Instructions to use InternRobotics/InternVLA-M1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use InternRobotics/InternVLA-M1 with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("InternRobotics/InternVLA-M1") model = AutoModelForImageTextToText.from_pretrained("InternRobotics/InternVLA-M1") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -23,7 +23,7 @@ library_name: transformers
|
|
| 23 |
## Citation
|
| 24 |
```
|
| 25 |
@misc{internvla2024,
|
| 26 |
-
title = {InternVLA-M1:
|
| 27 |
author = {InternVLA-M1 Contributors},
|
| 28 |
year = {2025},
|
| 29 |
booktitle={arXiv},
|
|
|
|
| 23 |
## Citation
|
| 24 |
```
|
| 25 |
@misc{internvla2024,
|
| 26 |
+
title = {InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy},
|
| 27 |
author = {InternVLA-M1 Contributors},
|
| 28 |
year = {2025},
|
| 29 |
booktitle={arXiv},
|