Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
InternRobotics
/
RoboInter-VLM
like
5
Follow
Intern Robotics
306
Robotics
Transformers
Safetensors
qwen2_5_vl
image-text-to-text
vision-language-action-model
vision-language-model
text-generation-inference
arxiv:
2602.09973
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
b2d3689
RoboInter-VLM
/
RoboInterVLM_llava_one_vision_7B
13 MB
1 contributor
History:
25 commits
JeasLee
Delete RoboInterVLM_llava_one_vision_7B/special_tokens_map.json with huggingface_hub
b2d3689
verified
15 days ago
tokenizer.json
7.03 MB
Upload RoboInterVLM_llava_one_vision_7B/tokenizer.json with huggingface_hub
18 days ago
tokenizer_config.json
1.54 kB
Upload RoboInterVLM_llava_one_vision_7B/tokenizer_config.json with huggingface_hub
18 days ago
trainer_state.json
3.16 MB
Upload RoboInterVLM_llava_one_vision_7B/trainer_state.json with huggingface_hub
19 days ago
training_args.bin
7.86 kB
xet
Upload RoboInterVLM_llava_one_vision_7B/training_args.bin with huggingface_hub
18 days ago
vocab.json
2.78 MB
Upload RoboInterVLM_llava_one_vision_7B/vocab.json with huggingface_hub
18 days ago