3 16 1

Yao

Huaxiu

https://www.huaxiuyao.io/

HuaxiuYaoML

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

VisualClaw: A Real-Time, Personalized Agent for the Physical World

authored a paper about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

upvoted a paper about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

View all activity

Organizations

upvoted a paper 10 days ago

VisualClaw: A Real-Time, Personalized Agent for the Physical World

Paper • 2606.16295 • Published 12 days ago • 28

upvoted 2 papers about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published May 13 • 24

upvoted 4 papers 3 months ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published Apr 5 • 37

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published Apr 2 • 31

PRBench: End-to-end Paper Reproduction in Physics Research

Paper • 2603.27646 • Published Mar 29 • 29

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 141

upvoted a paper 4 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

upvoted 3 papers 5 months ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 53

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 22

upvoted an article 6 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr

•

Feb 11, 2025

• 126

upvoted a paper 6 months ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 38

upvoted 2 papers 7 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published Nov 25, 2025 • 49

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

upvoted a paper 8 months ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

Yao

AI & ML interests

Recent Activity

Organizations

Huaxiu's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment