Zhi Hou's picture

Zhi Hou

zhihou

·

zhihou7

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 11 days ago

FASTER: Rethinking Real-Time Flow VLAs

new activity 8 months ago

InternRobotics/InternData-BridgeV2:Dataset Viewer issue: JobManagerCrashedError

upvoted a paper 9 months ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

View all activity

Organizations

upvoted a paper 11 days ago

FASTER: Rethinking Real-Time Flow VLAs

Paper • 2603.19199 • Published Mar 19 • 60

upvoted a paper 9 months ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

Paper • 2510.11027 • Published Oct 13, 2025 • 23

upvoted a paper 10 months ago

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28, 2025 • 143

upvoted 3 papers about 1 year ago

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30, 2025 • 35

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

Paper • 2505.09723 • Published May 14, 2025 • 24

EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models

Paper • 2505.09694 • Published May 14, 2025 • 20

upvoted 2 papers over 1 year ago

Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Paper • 2503.19757 • Published Mar 25, 2025 • 51

Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning

Paper • 2503.11646 • Published Mar 14, 2025 • 34