Xie

Zhihui

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 2 days ago • 95

upvoted a paper 2 days ago

Self-Compacting Language Model Agents

Paper • 2606.23525 • Published 3 days ago • 15

liked a model 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 3 days ago • 2.05M • 5.05k

upvoted a paper 2 months ago

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Paper • 2604.15093 • Published Apr 16 • 30

liked a dataset 2 months ago

nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated Dec 18, 2025 • 4.58M • 5.42k • 91

upvoted 2 papers 2 months ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 68

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published Apr 10 • 16

upvoted a paper 3 months ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 122

liked a dataset 3 months ago

claw-eval/Claw-Eval

Benchmark • Updated May 8 • 3.59k • 28

upvoted a paper 3 months ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

liked a model 4 months ago

Qwen/Qwen3.5-35B-A3B-Base

Image-Text-to-Text • 36B • Updated Apr 23 • 123k • 134

liked a dataset 4 months ago

InternScience/SGI-Reasoning

Viewer • Updated 23 days ago • 291 • 258 • 8

upvoted a collection 4 months ago

SGI-Bench

Collection

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 12 items • Updated May 6 • 35

liked a dataset 5 months ago

ellisbrown/SIMS-VSI

Viewer • Updated Nov 7, 2025 • 242k • 280 • 7

liked a model 7 months ago

EssentialAI/rnj-1-instruct

Text Generation • 8B • Updated Dec 24, 2025 • 893 • 318

liked a Space 7 months ago

CUA - Computer Use Agent 2.0

🤖

157

Launch an interactive web interface

liked a dataset 7 months ago

rl-research/dr-tulu-rl-data

Viewer • Updated Nov 25, 2025 • 4.88k • 493 • 13

liked a model 7 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 80.6k • 535

upvoted a paper 8 months ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 73

liked a dataset 8 months ago

zjunlp/DataMind-Data

Preview • Updated Oct 11, 2025 • 82 • 2
