1 42 55

Haotian Shan

HenryShan

AI & ML interests

None yet

Organizations

upvoted 20 articles about 1 year ago

Article

Decoding Strategies in Large Language Models

mlabonne

•

Oct 29, 2024

• 114

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

sirluk

•

Oct 7, 2024

• 71

Article

Document Similarity Search with ColPali

fsommers

•

Sep 21, 2024

• 52

Article

The Environmental Impacts of AI -- Primer

sasha

•

Sep 3, 2024

• 45

Article

RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples

airabbitX

•

Aug 16, 2024

• 10

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

SivilTaram

•

Jul 11, 2024

• 16

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

AviSoori1x

•

May 7, 2024

• 122

Article

Merge Large Language Models with mergekit

mlabonne

•

Jan 9, 2024

• 157

Article

Deploying Your FastAPI Applications on Huggingface Via Docker

HemanthSai7

•

Dec 11, 2023

• 42

Article

4D masks support in Transformers

poedator

•

Jan 8, 2024

• 31

Article

Better RAG 3: The text is your friend

hrishioa

•

Mar 14, 2024

• 14

Article

Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA

sirluk

•

Jan 22, 2024

• 26

Article

🕳️ Attention Sinks in LLMs for endless fluency

tomaarsen

•

Oct 9, 2023

• 37

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

merve

•

Aug 25, 2023

• 40

Article

Sensitivity Aware Mixed Precision Quantization V1

badaoui

•

Jun 13, 2025

• 26

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Hcompany

•

Jun 3, 2025

• 71

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

davidberenstein1957

•

May 7, 2025

• 42

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr

•

Feb 11, 2025

• 126

Article

G2P Shrinks Speech Models

hexgrad

•

Feb 5, 2025

• 97

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

dvgodoy

•

Feb 11, 2025

• 124

Haotian Shan

AI & ML interests

Organizations

HenryShan's activity

Decoding Strategies in Large Language Models

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Document Similarity Search with ColPali

The Environmental Impacts of AI -- Primer

RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples

RegMix: Data Mixture as Regression for Language Model Pre-training

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Merge Large Language Models with mergekit

Deploying Your FastAPI Applications on Huggingface Via Docker

4D masks support in Transformers

Better RAG 3: The text is your friend

Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA

🕳️ Attention Sinks in LLMs for endless fluency

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

Sensitivity Aware Mixed Precision Quantization V1

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

G2P Shrinks Speech Models

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face