Omkar Pangarkar

omkarenator

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

LLM360/TxT360:Will the code/scripts be released?

upvoted an article 7 months ago

Mixture of Experts Explained

upvoted a collection 7 months ago

🤖 Agents

View all activity

Organizations

liked a Space 8 months ago

The Smol Training Playbook

📚

3.22k

The secrets to building world-class LLMs

liked a dataset 8 months ago

bigcode/the-stack-github-issues

Viewer • Updated Mar 20, 2023 • 31M • 381 • 50

liked a Space about 1 year ago

Predict Memory

🧮

110

Estimate model memory usage and see detailed plots

liked a dataset about 1 year ago

WebOrganizer/Corpus-200B

Viewer • Updated 2 days ago • 218M • 38.5k • 11

liked a Space about 1 year ago

TxT360: Trillion Extracted Text

📖

134

Explore the TxT360 LLM pre‑training dataset online

liked a model over 1 year ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 31

liked 2 Spaces over 1 year ago

The Ultra-Scale Playbook

🌌

3.91k

The ultimate guide to training LLM on large GPU Clusters

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

liked a dataset over 1 year ago

LLM360/TxT360

Updated May 26, 2025 • 16k • 263

liked a Space almost 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

liked 2 datasets almost 2 years ago

Trelis/touch-rugby-rules-memorisation

Viewer • Updated Feb 28, 2024 • 363 • 13 • 2

commoncrawl/statistics

Viewer • Updated 7 days ago • 637k • 594 • 27

liked 6 models over 2 years ago

liked a model about 3 years ago

stanfordnlp/backpack-gpt2

Text Generation • Updated Aug 14, 2023 • 252 • 16