Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tianle Wang's picture
In a Training Loop 🔄
1 3

Tianle Wang

wtl666wtl
https://wtl666wtl.github.io/
  • wtl666wtl

AI & ML interests

None yet

Recent Activity

authored a paper about 11 hours ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
commentedon a paper 2 days ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
upvoted a paper 3 days ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
View all activity

Organizations

None yet

upvoted a paper 3 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 4 days ago • 11
upvoted a paper 3 months ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 53
upvoted a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs