arxiv:2402.12663
Varad Pimpalkhute
DaoistKalki
AI & ML interests
Few-shot learning, generalization, multi-modality
Recent Activity
upvoted a paper about 4 hours ago
Critique of Agent Model upvoted a paper about 1 month ago
Efficient Agentic Reasoning Through Self-Regulated Simulative Planning upvoted a paper 7 months ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices