SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors Paper • 2411.18966 • Published 3 days ago • 3
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 3 days ago • 10
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL Paper • 2604.28123 • Published 6 days ago • 35
SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment Paper • 2605.04012 • Published 2 days ago • 4
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories Paper • 2605.04036 • Published 2 days ago • 37
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs Paper • 2605.00814 • Published 6 days ago • 16
T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning Paper • 2605.02178 • Published 3 days ago • 5
Motion-Aware Caching for Efficient Autoregressive Video Generation Paper • 2605.01725 • Published 4 days ago • 3
AcademiClaw: When Students Set Challenges for AI Agents Paper • 2605.02661 • Published 3 days ago • 8
Perceptual Flow Network for Visually Grounded Reasoning Paper • 2605.02730 • Published 3 days ago • 3
PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments Paper • 2605.02240 • Published 3 days ago • 6
ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models Paper • 2405.13729 • Published 8 days ago • 9
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 3 days ago • 196
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 4 days ago • 133
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer Paper • 2605.00503 • Published 6 days ago • 5
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 6 days ago • 77
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper • 2604.24026 • Published 10 days ago • 16
Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization Paper • 2604.24952 • Published 10 days ago • 5
Step-level Optimization for Efficient Computer-use Agents Paper • 2604.27151 • Published 8 days ago • 16
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 10 days ago • 19