Bridging VideoQA and Video-Guided Agentic Tasks via Generalized Keyframe Extraction Paper • 2606.29445 • Published 3 days ago • 21
Pixal3D 🏆 High-fidelity pixel-aligned image-to-3D generation. Running on Zero • Agents • Featured • 322
Hy3-preview ⚡ Multi-turn streaming chat with function calling. Running • Agents • Featured • 54
Making LLMs Optimize Multi-Scenario CUDA Kernels Like Experts Paper • 2603.07169 • Published Mar 7 • 2
PresentBench: A Fine-Grained Rubric-Based Benchmark for Slide Generation Paper • 2603.07244 • Published Mar 7 • 2
One Model to Rig Them All: Diverse Skeleton Rigging with UniRig Paper • 2504.12451 • Published Apr 16, 2025
Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task Paper • 2512.10359 • Published Dec 11, 2025 • 4
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published Dec 17, 2025 • 45
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published Nov 4, 2025 • 60 • 2