pawl_jack's picture

3 1

pawl_jack

pangdingjack

·

AI & ML interests

None yet

Organizations

None yet

upvoted 3 articles 10 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 294

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 342

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene

•

Jun 3, 2025

• 349