MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper • 2603.09652 • Published 3 days ago • 12
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 3 days ago • 56
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published 3 days ago • 36
view article Article Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge 4 days ago • 8
Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs Paper • 2603.09095 • Published 3 days ago • 23
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 13 days ago • 32
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 8 days ago • 16
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 8 days ago • 47
Running on Zero Featured 934 MMAudio — generating synchronized audio from video/text 🔊 934 Generate synchronized audio for videos from text prompts
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 27B • Updated 5 days ago • 117k • 184
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 15 days ago • 40
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 10 days ago • 88
Helios: Real Real-Time Long Video Generation Model Paper • 2603.04379 • Published 9 days ago • 160