Chang Liu
changliu816
·
AI & ML interests
None yet
Recent Activity
updated a collection 3 days ago
benchmark updated a collection 3 days ago
ComputerUseAgent updated a collection 9 days ago
AgenticOrganizations
None yet
ComputerUseAgent
-
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
Paper • 2603.26648 • Published • 42 -
OpenGame: Open Agentic Coding for Games
Paper • 2604.18394 • Published • 78 -
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
Paper • 2603.24440 • Published • 98 -
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
Paper • 2604.27419 • Published • 13
VLM
-
Perception Encoder: The best visual embeddings are not at the output of the network
Paper • 2504.13181 • Published • 36 -
VGR: Visual Grounded Reasoning
Paper • 2506.11991 • Published • 20 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 153 -
BabyVision: Visual Reasoning Beyond Language
Paper • 2601.06521 • Published • 201
papers
VisualGeneration
Agentic
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 204 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 304 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290
reasoning
-
AdaptThink: Reasoning Models Can Learn When to Think
Paper • 2505.13417 • Published • 83 -
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Paper • 2504.20571 • Published • 98 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 231 -
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
Paper • 2602.08354 • Published • 264
benchmark
VisualGeneration
ComputerUseAgent
-
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification
Paper • 2603.26648 • Published • 42 -
OpenGame: Open Agentic Coding for Games
Paper • 2604.18394 • Published • 78 -
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents
Paper • 2603.24440 • Published • 98 -
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
Paper • 2604.27419 • Published • 13
Agentic
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 204 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 304 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290
VLM
-
Perception Encoder: The best visual embeddings are not at the output of the network
Paper • 2504.13181 • Published • 36 -
VGR: Visual Grounded Reasoning
Paper • 2506.11991 • Published • 20 -
Qwen3-Omni Technical Report
Paper • 2509.17765 • Published • 153 -
BabyVision: Visual Reasoning Beyond Language
Paper • 2601.06521 • Published • 201
reasoning
-
AdaptThink: Reasoning Models Can Learn When to Think
Paper • 2505.13417 • Published • 83 -
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Paper • 2504.20571 • Published • 98 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 231 -
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
Paper • 2602.08354 • Published • 264
papers