CAST: Modeling Visual State Transitions for Consistent Video Retrieval Paper • 2603.08648 • Published 16 days ago • 4