Community Blog & Articles

Community Articles

Falcon-Arabic: A Breakthrough in Arabic Language Models

Exploring Quantization Backends in Diffusers

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Microsoft and Hugging Face expand collaboration

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

The Transformers Library: standardizing model definitions

Improving Hugging Face Model Access for Kaggle Users

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

How to Build an MCP Server with Gradio

The 4 Things Qwen-3’s Chat Template Teaches Us

Welcoming Llama Guard 4 on Hugging Face Hub

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Community Blog & Articles

Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine

Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets

Chitos: From Detection to Proof — An Autonomous Security AI That Actually Exploits

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

Does Your LLM Know *When It's About to Be Wrong*?

VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction

VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation

VLX-Go: Vision-Language Short-Horizon Waypoint Prediction for Embodied Navigation

Which tokens does a hybrid model predict better?

KV Caching Explained: Optimizing Transformer Inference Efficiency

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

Interhuman’s Goblin: “Yeah, Friday at Five”

Uncensor any LLM with abliteration

Introducing North Mini Code: Cohere’s First Model For Developers

Continuous batching for GRPO, now in TRL

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Introduction to State Space Models (SSM)

Code a simple RAG from scratch

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

OlmoLogic: Boosting Reasoning via RLVR with Inductive Logic Programming

Falcon-Arabic: A Breakthrough in Arabic Language Models

Exploring Quantization Backends in Diffusers

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Microsoft and Hugging Face expand collaboration

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

The Transformers Library: standardizing model definitions

Improving Hugging Face Model Access for Kaggle Users

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

How to Build an MCP Server with Gradio

The 4 Things Qwen-3’s Chat Template Teaches Us

Welcoming Llama Guard 4 on Hugging Face Hub

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine

Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets

Chitos: From Detection to Proof — An Autonomous Security AI That Actually Exploits

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

Does Your LLM Know *When It's About to Be Wrong*?

VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction

VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation

VLX-Go: Vision-Language Short-Horizon Waypoint Prediction for Embodied Navigation

Which tokens does a hybrid model predict better?

KV Caching Explained: Optimizing Transformer Inference Efficiency

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

Interhuman’s Goblin: “Yeah, Friday at Five”

Uncensor any LLM with abliteration

Introducing North Mini Code: Cohere’s First Model For Developers

Continuous batching for GRPO, now in TRL

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Introduction to State Space Models (SSM)

Code a simple RAG from scratch

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

OlmoLogic: Boosting Reasoning via RLVR with Inductive Logic Programming

Does Your LLM Know When It's About to Be Wrong?

Does Your LLM Know When It's About to Be Wrong?