Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling Paper • 2508.03404 • Published Aug 5, 2025 • 5
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published Apr 15 • 127
Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows Paper • 2603.21210 • Published Mar 22 • 1
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 12 items • Updated 11 days ago • 158
view article Article Introducing OptiMind, a research model designed for optimization microsoft • Jan 15 • 35
FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention Paper • 2509.23733 • Published Sep 28, 2025 • 1
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published Jul 8, 2025 • 60
HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing Paper • 2507.11971 • Published Jul 16, 2025 • 1
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting Paper • 2410.04803 • Published Oct 7, 2024 • 2
view article Article Introducing smolagents: simple agents that write actions in code. +1 m-ric, merve, thomwolf • Dec 31, 2024 • 1.2k
view article Article SmolVLM Grows Smaller – Introducing the 256M & 500M Models! +1 andito, mfarre, merve • Jan 23, 2025 • 192
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Paper • 2409.05591 • Published Sep 9, 2024 • 31
Llemma: An Open Language Model For Mathematics Paper • 2310.10631 • Published Oct 16, 2023 • 57
Improving Token-Based World Models with Parallel Observation Prediction Paper • 2402.05643 • Published Feb 8, 2024 • 1