D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks Paper β’ 2505.11881 β’ Published May 17, 2025 β’ 4
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
ESREAL: Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models Paper β’ 2403.16167 β’ Published Mar 24, 2024 β’ 1
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper β’ 2510.05684 β’ Published Oct 7, 2025 β’ 143
Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data Paper β’ 2509.15389 β’ Published Sep 18, 2025 β’ 3
Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data Paper β’ 2509.15389 β’ Published Sep 18, 2025 β’ 3
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA Paper β’ 2505.12805 β’ Published May 19, 2025 β’ 22