CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion

Paper | Project Page | Code

DiT-Dec base checkpoint from "CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion", pretrained on LIBERO-90.

CLARE is a general, parameter-efficient framework for exemplar-free continual learning with Vision-Language-Action (VLA) models. It introduces lightweight modular adapters into selected feedforward layers and autonomously expands the model only where necessary when learning a new task, guided by layer-wise feature similarity. During deployment, an autoencoder-based routing mechanism dynamically activates the most relevant adapters without requiring task labels.

BibTeX

@article{romer2026clare,
  title={CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion},
  author={Ralf R{\"o}mer and Yi Zhang and Angela P. Schoellig},
  journal={arXiv preprint arXiv:2601.09512},
  year={2026}
}

Downloads last month: 16

Video Preview

Robotics

Paper for continuallearning/dit_flow_mt_libero_90_pretrain

CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion

Paper • 2601.09512 • Published Jan 14 • 4