InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search • Paper • 2512.18745 • Published Dec 21, 2025 • 12
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning • Paper • 2512.22120 • Published Dec 26, 2025 • 15
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion • Paper • 2512.17504 • Published Dec 19, 2025 • 99
lmstudio-community/DeepSeek-R1-Distill-Llama-8B-GGUF • Text Generation • 8B • Updated Jan 20, 2025 • 5.81k • 46