Xiaochuan Li PRO

lixiaochuan2020

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

lixiaochuan2020/bcp_env

published a dataset about 2 months ago

lixiaochuan2020/bcp_env

upvoted a paper 29 days ago

Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Paper • 2605.05724 • Published about 1 month ago • 15

upvoted a paper 3 months ago

Benchmark Test-Time Scaling of General LLM Agents

Paper • 2602.18998 • Published Feb 22 • 9

upvoted a paper 6 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 95

upvoted a paper 8 months ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 28

upvoted a paper 10 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 33

upvoted a collection 10 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Collection

These are the official versions of the OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 7 items • Updated Mar 2 • 25

upvoted 2 papers about 1 year ago

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 34

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19, 2025 • 46

upvoted a paper over 1 year ago

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Paper • 2410.14208 • Published Oct 18, 2024 • 3

upvoted an article over 1 year ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Article • loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 114

upvoted an article almost 2 years ago

Fine-tuning Llama 2 70B using PyTorch FSDP

Article • smangrul, sgugger, lewtun, philschmid • Sep 13, 2023 • 32

upvoted a paper about 2 years ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 52