Tianle Wang

wtl666wtl

https://wtl666wtl.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 11 hours ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

commented on a paper 2 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

upvoted a paper 3 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Organizations

None yet

upvoted a paper 3 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 4 days ago • 11

upvoted a paper 3 months ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 53

upvoted a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15