TanYuQiao's picture

TanYuQiao

Trae1ounG

·

Trae1ounG

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

upvoted a paper 2 months ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

upvoted a paper 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

View all activity

Organizations

None yet

upvoted a paper 15 days ago

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 16 days ago • 67

upvoted 3 papers 2 months ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 87

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 166

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

Paper • 2604.14142 • Published Apr 15 • 30

upvoted a paper 4 months ago

MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning

Paper • 2603.02024 • Published Mar 2 • 47

upvoted 3 papers 5 months ago

Self-Distillation Enables Continual Learning

Paper • 2601.19897 • Published Jan 27 • 41

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 50

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

upvoted a paper 6 months ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 66

upvoted a paper 7 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

upvoted a paper 8 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 88

upvoted a paper 11 months ago

Better wit than wealth: Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

Paper • 2503.23895 • Published Mar 31, 2025 • 1