Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 22 days ago • 117
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents Paper • 2509.06917 • Published Sep 8, 2025 • 44
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 193
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Paper • 2512.19682 • Published Dec 22, 2025 • 19
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 126
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 264
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL Paper • 2512.04069 • Published Dec 3, 2025 • 24
RAISECity: A Multimodal Agent Framework for Reality-Aligned 3D World Generation at City-Scale Paper • 2511.18005 • Published Nov 22, 2025 • 1
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 94
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization Paper • 2511.15705 • Published Nov 19, 2025 • 98
CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning Paper • 2510.22282 • Published Oct 25, 2025 • 3
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Paper • 2501.08983 • Published Jan 15, 2025 • 22
CityBench: Evaluating the Capabilities of Large Language Model as World Model Paper • 2406.13945 • Published Jun 20, 2024 • 1
CityGPT: Empowering Urban Spatial Cognition of Large Language Models Paper • 2406.13948 • Published Jun 20, 2024 • 1