D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI • Paper • 2510.05684 • Published Oct 7, 2025
Generalist IDM • Agents • Process gameplay videos to predict keyboard and mouse actions