BAAI/Video-XL-2
Video-Text-to-Text • 8B • Updated • 109 • 55
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models