smokxy (Kartik)

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

+3

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

upvoted 4 articles over 1 year ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

mlabonne

•

Jul 29, 2024

• 373

Article

Finally, a Replacement for BERT: Introducing ModernBERT

+13

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo

•

Dec 19, 2024

• 748

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

smangrul, sayakpaul

•

Feb 10, 2023

• 121

Article

Introduction to ggml

+1

ngxson, ggerganov, slaren

•

Aug 13, 2024

• 295

upvoted a paper over 1 year ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 195

upvoted 3 articles over 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

+3

ybelkada, timdettmers, artidoro, sgugger, smangrul

•

May 24, 2023

• 180

Article

Introducing smolagents: simple agents that write actions in code.

+1

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

merve

•

Aug 25, 2023

• 40

upvoted 4 articles almost 2 years ago

Article

MTEB: Massive Text Embedding Benchmark

Muennighoff

•

Oct 19, 2022

• 94

Article

🪆 Introduction to Matryoshka Embedding Models

+1

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 211

Article

Convert Transformers to ONNX with Hugging Face Optimum

philschmid

•

Jun 22, 2022

• 10

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers

•

Aug 17, 2022

• 136

upvoted 2 articles about 2 years ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model

+9

HugoLaurencon, davanstrien, stas, Leyo, SaulLu, TimeRobber, skaramcheti, aps, giadap, yjernite, VictorSanh

•

Aug 22, 2023

• 37

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

sanchit-gandhi

•

Nov 3, 2022

• 373

Kartik

AI & ML interests

Organizations

Vision Language Models (Better, faster, stronger)

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Finally, a Replacement for BERT: Introducing ModernBERT

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Introduction to ggml

Executable Code Actions Elicit Better LLM Agents

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Introducing smolagents: simple agents that write actions in code.

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

MTEB: Massive Text Embedding Benchmark

🪆 Introduction to Matryoshka Embedding Models

Convert Transformers to ONNX with Hugging Face Optimum

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Kartik

AI & ML interests

Organizations

smokxy's activity

Vision Language Models (Better, faster, stronger)

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Finally, a Replacement for BERT: Introducing ModernBERT

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Introduction to ggml

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Introducing smolagents: simple agents that write actions in code.

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

MTEB: Massive Text Embedding Benchmark

🪆 Introduction to Matryoshka Embedding Models

Convert Transformers to ONNX with Hugging Face Optimum

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers