
yyx

RuggingHace

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago

Continuous batching from first principles

new activity 2 months ago

facebook/layerskip-llama2-13B:layerskip-llama2-13B access denied

upvoted an article 2 months ago

Ulysses Sequence Parallelism: Training with Million-Token Contexts


Organizations

None yet

upvoted an article 6 days ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato • Nov 25, 2025 • 396

upvoted 2 articles 2 months ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

kashif, stas • Mar 9 • 30

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 157

upvoted an article 3 months ago

Article

Custom Kernels for All from Codex and Claude

burtenshaw, sayakpaul, ariG23498, evalstate • Feb 13 • 78

upvoted 2 articles 4 months ago

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Photoroom • Feb 3 • 73

Article

Text-to-image Architectural Experiments

Photoroom • Nov 13, 2025 • 57

upvoted a paper 4 months ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 107

upvoted 2 articles 4 months ago

Article

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

huggingface • Jan 27 • 45

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27 • 75

upvoted 2 articles 7 months ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF • Nov 25, 2024 • 482

Article

What is test-time compute and how to scale it?

Kseniase • Feb 6, 2025 • 121

upvoted 2 articles 9 months ago

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

cbensimon, sayakpaul, linoyts, multimodalart • Sep 2, 2025 • 77

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk • Aug 18, 2025 • 100

upvoted an article 11 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777

upvoted a paper 11 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 278
