🏗️ Building on HF

Daniel Bourke PRO

mrdbourke

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a model 2 days ago

Boogu/Boogu-Image-0.1-Turbo

liked a model 4 days ago

LiquidAI/LFM2.5-1.2B-Instruct

liked a model 4 days ago

zai-org/GLM-5.2

upvoted an article 8 days ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org • 10 days ago • 105

upvoted an article 11 days ago

Article

I fine-tuned a model for free from one prompt, with TRL and the Google Colab CLI

sergiopaniego • 11 days ago • 4

upvoted an article 12 days ago

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed • 14 days ago • 22

upvoted an article 22 days ago

Article

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

nvidia • 22 days ago • 66

upvoted a collection 23 days ago

Ideogram 4

Collection

8 items • Updated 22 days ago • 65

upvoted a collection 24 days ago

Verbatim RAG v1

Collection

Hallucination-free RAG and our state-of-the-art (SOTA) extractors • 8 items • Updated 24 days ago • 9

upvoted 2 articles 25 days ago

Article

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

nvidia • 26 days ago • 83

Article

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains • 25 days ago • 32

upvoted an article 30 days ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research • about 1 month ago • 17

upvoted a collection about 1 month ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated 29 days ago • 34

upvoted 6 articles about 1 month ago

Article

Running AI agents to automate outreach at scale

nielsr • Apr 27 • 15

Article

Relaunching PapersWithCode with new features

nielsr • May 24 • 12

Article

Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia

matthew-d-white • May 22 • 5

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498 • May 25 • 122

Article

Introducing the Ettin Reranker Family

tomaarsen • May 19 • 52

Article

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

ibm-granite • May 14 • 33

upvoted a collection about 2 months ago

MiniCPM-V 4.6

Collection

MLX variants of MiniCPM-V 4.6, 1.3B parameters (SigLIP2 400M vision encoder + Qwen3.5-0.8B LLM), repo: https://huggingface.co/openbmb/MiniCPM-V-4.6 • 7 items • Updated May 11 • 1

upvoted 3 articles about 2 months ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai • May 8 • 38

Article

Build a Domain-Specific Embedding Model in Under a Day

nvidia • Mar 20 • 74

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite • Apr 29 • 81
