MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published Dec 18, 2025 • 2
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published Dec 18, 2025 • 2
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation Paper • 2511.01163 • Published Nov 3, 2025 • 32
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14, 2025 • 45
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14, 2025 • 45
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation Paper • 2511.01163 • Published Nov 3, 2025 • 32
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation Paper • 2511.01163 • Published Nov 3, 2025 • 32 • 1