Taha Akbari

Taha1506

16 1

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Unlocking asynchronicity in continuous batching

upvoted an article 7 months ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

upvoted an article 8 months ago

Continuous batching from first principles

View all activity

Organizations

upvoted an article about 1 month ago

Article

Unlocking asynchronicity in continuous batching

ror, pcuenq, ariG23498

•

May 14

• 65

upvoted an article 7 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq

•

Dec 18, 2025

• 125

upvoted an article 8 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 423

upvoted a paper 11 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 79

upvoted an article 11 months ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

xhluca

•

Jul 9, 2024

• 85

upvoted 3 articles about 1 year ago

Article

The N Implementation Details of RLHF with PPO

vwxyzjn, tianlinliu0121, lvwerra

•

Oct 24, 2023

• 72

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 233

upvoted 7 articles over 1 year ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.16k

Article

Welcome Llama 4 Maverick & Scout on Hugging Face

burtenshaw, reach-vb, pcuenq, clem, rajatarya, jsulz, lysandre

•

Apr 5, 2025

• 149

Article

The NLP Course is becoming the LLM Course

burtenshaw, reach-vb, lewtun, fdaudens, pcuenq, tomaarsen, coyotte508, mishig, sergiopaniego, julien-c

•

Apr 3, 2025

• 107

Article

Open-R1: Update #1

open-r1

•

Feb 2, 2025

• 304

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 297

Article

Common AI Model Formats

ngxson

•

Feb 27, 2025

• 73

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

sirluk

•

Oct 7, 2024

• 71

Taha Akbari

AI & ML interests

Recent Activity

Organizations

Taha1506's activity

Unlocking asynchronicity in continuous batching

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Continuous batching from first principles

BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡

The N Implementation Details of RLHF with PPO

Vision Language Models (Better, faster, stronger)

Train 400x faster Static Embedding Models with Sentence Transformers

Mixture of Experts Explained

Welcome Llama 4 Maverick & Scout on Hugging Face

The NLP Course is becoming the LLM Course

Open-R1: Update #1

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Common AI Model Formats

Efficient LLM Pretraining: Packed Sequences and Masked Attention

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡