Spirit-v1.5-for-RoboChallenge-move-objects-into-box

Introduction

This is a fine-tuned model based on the Spirit-v1.5 base model, specifically for the RoboChallenge task: move_objects_into_box.

Base model:

  • Spirit-AI-robotics/Spirit-v1.5: https://huggingface.co/Spirit-AI-robotics/Spirit-v1.5

Spirit-v1.5 is built on:

  • A VLM backbone: Qwen/Qwen3-VL-4B-Instruct
  • A DiT (Diffusion Transformer) action head
  • A policy inference API

For implementation details and usage examples, please refer to:

  • GitHub: https://github.com/Spirit-AI-Team/spirit-v1.5

For the model announcement / overview, see:

  • Blog: https://www.spirit-ai.com/en/blog/spirit-v1-5

Requirements

Recommended Python: 3.11+. GPU environment is recommended.

Pinned dependency versions used in the open-source implementation:

  • torch==2.9.1
  • torchvision==0.24.1
  • transformers==4.57.3
  • diffusers==0.36.0
  • safetensors==0.7.0
  • numpy==2.4.0
  • pillow==12.1.0
  • scipy==1.16.3

Quickstart

This model card is hosted separately on Hugging Face. For the latest runnable commands and end-to-end usage, please follow:

  • GitHub repo README: https://github.com/Spirit-AI-Team/spirit-v1.5

When running the RoboChallenge wrapper, make sure you select the task name:

  • TASK_NAME=move_objects_into_box

Citation

If you use this model / code in your work, please cite and link to:

  • GitHub: https://github.com/Spirit-AI-Team/spirit-v1.5
  • Blog: https://www.spirit-ai.com/en/blog/spirit-v1-5

Bibtex

@article{spiritai2026spiritv15,
  author = {Spirit AI Team},
  title = {Spirit-v1.5: Clean Data Is the Enemy of Great Robot Foundation Models},
  journal = {Spirit AI Blog},
  year = {2026},
  note = {https://www.spirit-ai.com/en/blog/spirit-v1-5},
}
Downloads last month
12
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Spirit-AI-robotics/Spirit-v1.5-for-RoboChallenge-move-objects-into-box

Finetuned
(1)
this model