🔄 In a Training Loop

Bal Narendra Sapa

bnsapa

11 9 83

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 454

upvoted a paper 12 months ago

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 39

upvoted a changelog about 1 year ago

Hugging Face Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6, 2025 • 115

upvoted an article about 1 year ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101

upvoted 2 collections about 1 year ago

sarvam-m

Collection

Collection of all variations of the sarvam-m model • 3 items • Updated May 24, 2025 • 30

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 739

upvoted 3 articles almost 2 years ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers • Aug 17, 2022 • 136

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq • Jul 23, 2024 • 241
