view article Article Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp 11 days ago • 9
view article Article The Great Classification Showdown: OSS vs BERT on Consumer Hardware 15 days ago • 12
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 19 days ago • 186
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 12 days ago • 98
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published 15 days ago • 122
Load 4bit models 4x faster Collection Native bitsandbytes 4bit pre quantized models • 25 items • Updated 6 days ago • 60
Embedding Models Collection Run or fine-tune embedding models with Unsloth. • 14 items • Updated 6 days ago • 3