UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG Paper • 2510.03663 • Published Oct 4, 2025 • 16
Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning Paper • 2510.20150 • Published Oct 23, 2025 • 7
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published about 1 month ago • 245
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published 18 days ago • 22
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 6 days ago • 144
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper • 2604.24026 • Published 12 days ago • 19