🤝 Open to Collab

Kartikey Rawat

carrycooldude

1 24 23

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

qualcomm/HuggingFace-WavLM-Base-Plus

upvoted a changelog about 2 months ago

Introducing Kernels

liked a model about 2 months ago

google/gemma-4-26B-A4B-it

View all activity

Organizations

upvoted a changelog about 2 months ago

Hugging Face Changelog

Introducing Kernels

Apr 15

• 202

upvoted a collection 2 months ago

Qualcomm

Collection

Collection for models for Qualcomm hackathon • 8 items • Updated 7 days ago • 10

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted an article 4 months ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.15k

upvoted a paper 8 months ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 64

upvoted 4 changelogs 11 months ago

Hugging Face Changelog

JSON Support in the Dataset Viewer

Jul 23, 2025

• 54

Hugging Face Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30, 2025

• 204

Hugging Face Changelog

Trending Papers

Jul 28, 2025

• 107

Hugging Face Changelog

Introducing a better Hugging Face CLI

Jul 25, 2025

• 98

upvoted 6 articles about 1 year ago

Article

Why Maybe We're Measuring LLM Compression Wrong

rishiraj

•

Jun 21, 2025

• 16

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene

•

Jun 3, 2025

• 357

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers

•

Aug 17, 2022

• 136

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul

•

May 24, 2023

• 180

Article

Making LLMs lighter with AutoGPTQ and transformers

marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke

•

Aug 23, 2023

• 64

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

merve

•

Aug 25, 2023

• 40

upvoted 2 collections about 2 years ago

Instruction Pre-Training

Collection

8 items • Updated Jun 21, 2024 • 26

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 18 days ago • 164

upvoted a paper about 2 years ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

upvoted 2 articles about 2 years ago

Article

A Dive into Vision-Language Models

adirik, sayakpaul

•

Feb 3, 2023

• 84

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 538

Kartikey Rawat

AI & ML interests

Recent Activity

Organizations

carrycooldude's activity

Introducing Kernels

Welcome Gemma 4: Frontier multimodal intelligence on device

Mixture of Experts Explained

JSON Support in the Dataset Viewer

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Trending Papers

Introducing a better Hugging Face CLI

Why Maybe We're Measuring LLM Compression Wrong

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Making LLMs lighter with AutoGPTQ and transformers

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

A Dive into Vision-Language Models

Vision Language Models Explained