15 36 35

Daixuan Cheng

daixuancheng

https://cdxeve.github.io

DaixuanC45443

AI & ML interests

I work on LLMs across pre-training, post-training, and agents.

Recent Activity

updated a dataset 12 days ago

RUC-AIBOX/ClawGym-Bench

updated a dataset 12 days ago

RUC-AIBOX/ClawGym-Trajectory

updated a dataset 12 days ago

RUC-AIBOX/ClawGym-Task

View all activity

Organizations

commented a paper about 2 months ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

Paper • 2604.26904 • Published Apr 29 • 54 •

New activity in daixuancheng/llm-in-sandbox-rl 5 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

New activity in daixuancheng/llm-in-sandbox-bench 5 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

commented a paper 9 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 119 •

commented 3 papers 12 months ago

New activity in daixuancheng/dapo_yes_suffix 12 months ago

[bot] Conversion to Parquet

#1 opened 12 months ago by

parquet-converter

New activity in daixuancheng/dapo 12 months ago

[bot] Conversion to Parquet

#1 opened 12 months ago by

parquet-converter

commented 2 papers about 1 year ago

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published Jun 17, 2025 • 30 •

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published Jun 17, 2025 • 30 •

commented 2 papers over 1 year ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53 •

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 30 •

commented a paper almost 2 years ago

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 82 •

commented 2 papers about 2 years ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 96 •

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 96 •

Daixuan Cheng

AI & ML interests

Recent Activity

Organizations

daixuancheng's activity

[bot] Conversion to Parquet

[bot] Conversion to Parquet

[bot] Conversion to Parquet

[bot] Conversion to Parquet