Sanket Rai

sanketrai

https://sanketrai.xyz

AI & ML interests

ML, Distributed Systems

upvoted a collection 6 months ago

SpecBundle

A collection of production-grade draft models for speculative decoding • 18 items • Updated Apr 15 • 19

upvoted an article 7 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate • Dec 4, 2025 • 630

upvoted 3 articles about 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 418

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.15k

Article

You could have designed state of the art positional encoding

FL33TW00D-HF • Nov 25, 2024 • 488

upvoted 3 articles over 1 year ago

Article

Open R1: Update #3

open-r1 • Mar 11, 2025 • 298

Article

Open-R1: Update #1

open-r1 • Feb 2, 2025 • 305

Article

LLM Inference at scale with TGI

martinigoyanes • Sep 6, 2024 • 26

upvoted 2 collections almost 2 years ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 42 items • Updated Mar 2 • 81

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 712

upvoted a collection about 2 years ago

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 19 days ago • 47

upvoted a paper about 2 years ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 71