Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining
Paper • 2605.14747 • Published • 144
None defined yet.
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report