Robotics
Transformers
Safetensors
English
qwen2_5_vl
image-text-to-text
RDT
rdt
RDT 2
Vision-Language-Action
Bimanual
Manipulation
Zero-shot
UMI
text-generation-inference
Instructions to use robotics-diffusion-transformer/RDT2-VQ with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use robotics-diffusion-transformer/RDT2-VQ with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("robotics-diffusion-transformer/RDT2-VQ") model = AutoModelForMultimodalLM.from_pretrained("robotics-diffusion-transformer/RDT2-VQ") - Notebooks
- Google Colab
- Kaggle
Add Arxiv link to metadata and improve model card
#1
by nielsr HF Staff - opened
Hi! I'm Niels from the Hugging Face community team.
I've updated the model card to include the arxiv metadata tag, linking this repository to its research paper: RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization.
Linking artifacts to their respective papers helps researchers and users find and cite your work more easily. I also made a small fix to the Quickstart code snippet to ensure it runs correctly and added links to the project page and paper in the header.