fanxilai's picture

5

fanxilai

fanxilai

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

upvoted a paper 13 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

upvoted an article 2 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 12 days ago • 108

upvoted a paper 13 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 17 days ago • 38

upvoted an article 2 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

42

upvoted 2 papers 3 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 44

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

updated a dataset 8 months ago

fanxilai/huggingface-smol-course-instruction-tuning-dataset

Viewer • Updated May 14, 2025 • 1 • 3

published a dataset 8 months ago

fanxilai/huggingface-smol-course-instruction-tuning-dataset

Viewer • Updated May 14, 2025 • 1 • 3

updated a model 8 months ago

fanxilai/sft_output

Updated May 14, 2025

published 3 models 8 months ago

fanxilai/sft_output

Updated May 14, 2025

fanxilai/SmolLM2-FT-DPO

Updated May 12, 2025

fanxilai/SmolLM2-FT-MyDataset

Updated May 12, 2025