Andy Andurkar

AndyAndurkar

AI & ML interests

None yet

Recent Activity

upvoted an article 23 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a Space about 2 months ago

Vchitect/VBench_Leaderboard

upvoted an article about 2 months ago

Vision Language Models Explained

View all activity

Organizations

None yet

upvoted an article 23 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

•

393

liked a Space about 2 months ago

VBench Leaderboard

📊

345

Upload and evaluate video generation models with detailed scoring

upvoted an article about 2 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

512

upvoted 4 articles 7 months ago

Article

🦸🏻#11: How Do Agents Plan and Reason?

Feb 24, 2025

•

Article

🦸🏻#10: Does Present-Day GenAI Actually Reason?

Feb 15, 2025

•

Article

Everything You Need to Know about Knowledge Distillation

Mar 6, 2025

•

Article

Inside the family of Smol models

Feb 27, 2025

•

commented on DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge 11 months ago

A very well written article with clear explanations!

upvoted an article 12 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

273

Andy Andurkar

AI & ML interests

Recent Activity

Organizations

AndyAndurkar's activity

Illustrating Reinforcement Learning from Human Feedback (RLHF)

VBench Leaderboard

Vision Language Models Explained

🦸🏻#11: How Do Agents Plan and Reason?

🦸🏻#10: Does Present-Day GenAI Actually Reason?

Everything You Need to Know about Knowledge Distillation

Inside the family of Smol models

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge