1 76 8

Sweker

Swekerr

AI & ML interests

None yet

Recent Activity

upvoted an article 20 days ago

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

upvoted an article about 2 months ago

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

upvoted a paper 2 months ago

OpenGame: Open Agentic Coding for Games

View all activity

Organizations

upvoted an article 20 days ago

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 120

upvoted an article about 2 months ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

Apr 28

• 62

upvoted a paper 2 months ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 84

upvoted an article 3 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 164

upvoted 2 articles 6 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

Article

LLM based Audio models

YatharthS

•

Dec 18, 2025

• 59

upvoted an article 9 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted 3 articles 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

thomwolf, matthieu-lapeyre

•

Jul 9, 2025

• 803

Article

Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

tiiuae

•

Jul 4, 2025

• 11

upvoted an article about 1 year ago

Article

🐯 Liger GRPO meets TRL

shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321

•

May 25, 2025

• 54

upvoted a paper about 1 year ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 102

upvoted 3 articles about 1 year ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 260

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 123

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 538

upvoted a paper about 1 year ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 96

upvoted 2 articles about 1 year ago

Article

Train your first Decision Transformer

edbeeching, ThomasSimonini

•

Sep 8, 2022

• 15

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 295

upvoted a paper about 1 year ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 126

upvoted an article about 1 year ago

Article

What is test-time compute and how to scale it?

Kseniase

•

Feb 6, 2025

• 123

Sweker

AI & ML interests

Recent Activity

Organizations

Swekerr's activity

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

LLM based Audio models

Smol2Operator: Post-Training GUI Agents for Computer Use

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

🐯 Liger GRPO meets TRL

nanoVLM: The simplest repository to train your VLM in pure PyTorch

The Transformers Library: standardizing model definitions

Vision Language Models Explained

Train your first Decision Transformer

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

What is test-time compute and how to scale it?