CASA - a kyutai Collection

kyutai 's Collections

MIRA World Model

Interactivity Alignment

MoshiRAG Release

Moshi v0.1 Release

CASA

updated 1 day ago

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs

Running

Agents

4

CASA Gallery

🏠

4

Video Gallery for CASA: Cross-Attention over Self-Attention
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published Dec 22, 2025 • 13
kyutai/CASA-Helium1-VL-2B

Image-Text-to-Text • 3B • Updated Mar 9 • 13 • 9
kyutai/CASA-Qwen2_5-VL-3B

Image-Text-to-Text • 4B • Updated Dec 23, 2025 • 10 • 3
kyutai/CASA-Qwen2_5-VL-3B-LiveCC

Video-Text-to-Text • 4B • Updated Dec 23, 2025 • 23 • 5
kyutai/Helium1-VL-2B

Image-Text-to-Text • 3B • Updated Dec 23, 2025 • 9 • 2