view article Article Introducing the agentic robotics appstore for 10,000 Reachy Minis 2 days ago β’ 28
Valley Collection Valley Family: Exploring Scalable Vision-Language Design for Multimodal Understanding and Reasoning β’ 7 items β’ Updated 9 days ago β’ 3
Ling-2.6 Collection Ling-2.6 series is designed for real-world agents that require fast responses, strong execution, and high token efficiency, with several sized SKUs. β’ 4 items β’ Updated 8 days ago β’ 13
SenseNova-U1 Collection SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture β’ 5 items β’ Updated about 3 hours ago β’ 43
Building a Precise Video Language with Human-AI Oversight Paper β’ 2604.21718 β’ Published 16 days ago β’ 16
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper β’ 2604.11804 β’ Published 25 days ago β’ 71
TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification Paper β’ 2604.14531 β’ Published 22 days ago β’ 7
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper β’ 2604.14268 β’ Published 23 days ago β’ 118
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex β’ 7 items β’ Updated 6 days ago β’ 55
Seedance 2.0: Advancing Video Generation for World Complexity Paper β’ 2604.14148 β’ Published 23 days ago β’ 155
Geometric Context Transformer for Streaming 3D Reconstruction Paper β’ 2604.14141 β’ Published 23 days ago β’ 19