- Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
  Paper • 2503.22675 • Published • 36
- Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
  Paper • 2503.22230 • Published • 45
- ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
  Paper • 2509.13313 • Published • 80
- WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
  Paper • 2509.13309 • Published • 67
bypan
bypan123
AI & ML interests
None yet
Recent Activity
liked a model about 1 month ago: UBTECH-Robotics/Thinker-4B
updated a model about 1 month ago: UBTECH-Robotics/Thinker-4B
updated a model about 1 month ago: UBTECH-Robotics/Thinker-4B