TIANYI
BIMU233
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 4 hours ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance authored a paper about 4 hours ago
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks upvoted a paper about 5 hours ago
From Word to World: Can Large Language Models be Implicit Text-based World Models?Organizations
None yet