Robotics
Transformers
Safetensors
qwen2_5_vl
image-text-to-text
vision-language-action-model
vision-language-model
text-generation-inference
Instructions to use InternRobotics/RoboInter-VLM with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use InternRobotics/RoboInter-VLM with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("InternRobotics/RoboInter-VLM") model = AutoModelForImageTextToText.from_pretrained("InternRobotics/RoboInter-VLM") - Notebooks
- Google Colab
- Kaggle
Upload RoboInterVLM_llava_one_vision_7B/added_tokens.json with huggingface_hub
Browse files
RoboInterVLM_llava_one_vision_7B/added_tokens.json
ADDED
|
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"<image>": 151646,
|
| 3 |
+
"<|endoftext|>": 151643,
|
| 4 |
+
"<|im_end|>": 151645,
|
| 5 |
+
"<|im_start|>": 151644
|
| 6 |
+
}
|