hu-po (hu-po)

upvoted an article 7 months ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia

•

Dec 15, 2025

• 113

upvoted an article 10 months ago

Article

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

+9

fracapuano, aractingi, lhoestq, CarolinePascal, pepijn223, jadechoghari, cadene, aliberts, AdilZtn, nepyope, imstevenpmwork

•

Sep 16, 2025

• 56

upvoted a paper 10 months ago

Reinforcement Learning Foundations for Deep Research Systems: A Survey

Paper • 2509.06733 • Published Sep 8, 2025 • 32

upvoted an article 10 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 360

upvoted a collection 11 months ago

MolmoAct

Collection

All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated May 4 • 37

upvoted an article about 1 year ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

nvidia

•

Jun 11, 2025

• 134

upvoted 2 articles over 1 year ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

+2

danaaubakirova, Molbap, mshukor, cadene

•

Feb 4, 2025

• 191

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 890

upvoted a collection over 1 year ago

DeepSeek-R1

Collection

10 items • Updated Nov 27, 2025 • 857

upvoted a paper over 1 year ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 289

upvoted 4 papers almost 2 years ago

FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1, 2024 • 33

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15, 2024 • 45

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 62

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 175

upvoted 3 papers about 2 years ago

Thermodynamic Natural Gradient Descent

Paper • 2405.13817 • Published May 22, 2024 • 16

Body Design and Gait Generation of Chair-Type Asymmetrical Tripedal Low-rigidity Robot

Paper • 2404.05932 • Published Apr 9, 2024 • 1

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107

upvoted 3 papers over 2 years ago

Humanoid Locomotion as Next Token Prediction

V-IRL: Grounding Virtual Intelligence in Real Life

Drivable 3D Gaussian Avatars
