In a Training Loop 🔄

Tianle Wang

wtl666wtl

https://wtl666wtl.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper about 5 hours ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

upvoted a paper about 18 hours ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

submitted a paper about 19 hours ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Organizations

None yet

submitted a paper to Daily Papers about 19 hours ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 2 days ago • 8

authored a paper 9 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15