Zhixuan Liang

Liang-ZX

https://liang-zx.github.io/

Recent Activity

upvoted a paper 16 days ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

authored a paper 17 days ago

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

authored a paper 17 days ago

Benchmarking Generalizable Bimanual Manipulation: RoboTwin Dual-Arm Collaboration Challenge at CVPR 2025 MEIS Workshop

upvoted a paper 16 days ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Paper • 2606.11182 • Published 16 days ago • 18

upvoted a paper 17 days ago

AHA-WAM: Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing

Paper • 2606.09811 • Published 18 days ago • 15

upvoted a paper 28 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 29 days ago • 146

upvoted 2 papers 2 months ago

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Paper • 2604.19747 • Published Apr 21 • 40

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Paper • 2604.14125 • Published Apr 15 • 21

upvoted an article 3 months ago

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

Article • drmapavone • Jan 5 • 26

upvoted a paper 3 months ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published Mar 16 • 7

upvoted 3 papers 4 months ago

UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data

Paper • 2603.05312 • Published Mar 5 • 7

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 186

BiManiBench: A Hierarchical Benchmark for Evaluating Bimanual Coordination of Multimodal Large Language Models

Paper • 2602.08392 • Published Feb 9 • 3

upvoted a paper 8 months ago

VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing

Paper • 2510.05213 • Published Oct 6, 2025 • 6

upvoted 2 papers 10 months ago

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 62

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32

upvoted a paper 11 months ago

HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents

Paper • 2508.02629 • Published Aug 4, 2025 • 6

upvoted 3 papers about 2 years ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 33

Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots

Paper • 2405.07990 • Published May 13, 2024 • 20
