16 11 84

UnstableLlama

https://medium.com/@unstablellama

UnstableLlama

AI & ML interests

Local inference, quantization, training. Philosophy of AI.

Recent Activity

new activity about 7 hours ago

UnstableLlama/preference:[bot] Conversion to Parquet

updated a collection about 14 hours ago

Semancer-12B

updated a collection about 14 hours ago

Semancer-12B

View all activity

Organizations

None yet

upvoted a changelog 28 days ago

Hugging Face Changelog

Filter Models page by Base Models only

30 days ago

• 171

upvoted a collection 30 days ago

My Models

Collection

4 items • Updated 28 days ago • 1

upvoted a changelog 4 months ago

Hugging Face Changelog

Public Storage Add-ons

Feb 26

• 168

upvoted 2 articles 5 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 158

Article

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

Doctor-Shotgun

•

Jan 30

• 28

upvoted a collection 6 months ago

Doc's Choice

Collection

Models that I personally recommend, periodically updated. • 6 items • Updated Apr 21 • 5

upvoted 3 papers 8 months ago

upvoted an article 8 months ago

Article

Meta-learning Meets Small Models

appvoid

•

Oct 21, 2025

• 1

upvoted a paper 9 months ago

Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

Paper • 2509.22921 • Published Sep 26, 2025 • 12

UnstableLlama

AI & ML interests

Recent Activity

Organizations

UnstableLlama's activity

Filter Models page by Base Models only

Public Storage Add-ons

We Got Claude to Build CUDA Kernels and teach open models!

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

Meta-learning Meets Small Models