Keyu Duan
vermouthdky
AI & ML interests
LLM Reasoning and Safety
Recent Activity
liked a model 8 days ago
MiniMaxAI/MiniMax-M3 authored a paper 3 months ago
In-Context Reinforcement Learning for Tool Use in Large Language Models upvoted an article 4 months ago
Forge: Scalable Agent RL Framework and Algorithm