1 9 1

youyou

shiyingcheng

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago

Tongyi DeepResearch Technical Report

authored a paper 15 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

authored a paper 15 days ago

ESPO: Early-Stopping Proximal Policy Optimization

View all activity

Organizations

None yet

authored 4 papers 15 days ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 104

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 113

ESPO: Early-Stopping Proximal Policy Optimization

Paper • 2605.29860 • Published about 1 month ago • 20

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

Paper • 2606.03108 • Published 25 days ago • 11

upvoted a paper 16 days ago

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

Paper • 2606.03108 • Published 25 days ago • 11

submitted a paper to Daily Papers 16 days ago

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

Paper • 2606.03108 • Published 25 days ago • 11

upvoted a paper 25 days ago

ESPO: Early-Stopping Proximal Policy Optimization

Paper • 2605.29860 • Published about 1 month ago • 20

upvoted a paper 6 months ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 113

upvoted a paper 12 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8, 2025 • 48

authored 2 papers about 1 year ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 89

QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization

Paper • 2505.18092 • Published May 23, 2025 • 43

liked a model about 1 year ago

Tongyi-Zhiwen/QwenLong-L1-32B

Text Generation • 33B • Updated Jun 9, 2025 • 166 • • 167

upvoted 2 papers about 1 year ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 89

QwenLong-CPRS: Towards infty-LLMs with Dynamic Context Optimization

Paper • 2505.18092 • Published May 23, 2025 • 43

upvoted 2 papers over 1 year ago

WritingBench: A Comprehensive Benchmark for Generative Writing

Paper • 2503.05244 • Published Mar 7, 2025 • 22

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published Nov 9, 2024 • 21

upvoted a paper almost 2 years ago

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

Paper • 2406.17419 • Published Jun 25, 2024 • 17

youyou

AI & ML interests

Recent Activity

Organizations

shiyingcheng's activity