3 9 6

Arun Prakash

arun-AiBharat

Arunprakash-a

AI & ML interests

LLMs, OCR

Recent Activity

liked a Space 2 months ago

lm-provers/qed-nano-blogpost

upvoted an article 3 months ago

Gotchas in Tokenizer Behavior Every Developer Should Know

liked a model 4 months ago

google/translategemma-12b-it

View all activity

Organizations

liked a Space 2 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted an article 3 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

qgallouedec

•

Apr 18, 2025

• 72

liked a model 4 months ago

google/translategemma-12b-it

Image-Text-to-Text • Updated Jan 28 • 14.2k • 300

liked a Space 7 months ago

The Smol Training Playbook

📚

3.18k

The secrets to building world-class LLMs

upvoted an article 9 months ago

Article

The N Implementation Details of RLHF with PPO

vwxyzjn, tianlinliu0121, lvwerra

•

Oct 24, 2023

• 72

upvoted 2 articles 10 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 513

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 413

commented on Mixture of Experts Explained 10 months ago

Here is an illustration that helps you understand the routing process visually. Source

New activity in huggingface/documentation-images 10 months ago

A simple visualization of moe block

#524 opened 10 months ago by

arun-AiBharat

upvoted an article 10 months ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.85k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 11 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 776

upvoted 3 articles about 1 year ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 209

Article

Open LLM Leaderboard: DROP deep dive

clefourrier, cabreraalex, stellaathena, SaylorTwift, thomwolf

•

Dec 1, 2023

• 11

Article

What's going on with the Open LLM Leaderboard?

clefourrier, SaylorTwift, slippylolo, thomwolf

•

Jun 23, 2023

• 51

liked a dataset about 1 year ago

ai4bharat/MILU

Viewer • Updated Feb 9 • 88.5k • 784 • 23

updated 2 datasets over 1 year ago

arun-AiBharat/BookCorpus_Chunked_1K_Tokens_GPT2_Pretraining

Viewer • Updated Sep 30, 2024 • 1.06M • 137

arun-AiBharat/bookcorpus_tokenized_gpt2

Viewer • Updated Sep 30, 2024 • 74M • 140

liked a dataset over 1 year ago

ai4bharat/sangraha

Viewer • Updated Mar 5, 2025 • 268M • 7.59k • 72

New activity in arun-AiBharat/gpt-2-bookcorpus over 1 year ago

version-2-40k-steps

#2 opened over 1 year ago by

arun-AiBharat

Arun Prakash

AI & ML interests

Recent Activity

Organizations

arun-AiBharat's activity

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Gotchas in Tokenizer Behavior Every Developer Should Know

The Smol Training Playbook

The N Implementation Details of RLHF with PPO

Welcome GPT OSS, the new open-source model family from OpenAI!

Illustrating Reinforcement Learning from Human Feedback (RLHF)

A simple visualization of moe block

Mixture of Experts Explained

The Ultra-Scale Playbook

SmolLM3: smol, multilingual, long-context reasoner

Let's talk about LLM evaluation

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?

version-2-40k-steps