Article: makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch (May 7, 2024)
Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
Model Merging Collection: Model merging is a very popular technique in the LLM space. Here is a chronological list of papers on the topic that will help you get started with it. (30 items, updated Jun 12, 2024)