ChengweiLiu

lcw888

·

https://github.com/ChavesLiu

ChavesLiu

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 6 months ago

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

upvoted 12 papers 12 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 169

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 257

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 264

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 213

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 277

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6, 2025 • 164

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 240

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83

upvoted a paper about 1 year ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 87