view article Article Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp Jan 30 • 15
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published Jan 22 • 190
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published Jan 29 • 102
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security Paper • 2601.18491 • Published Jan 26 • 125
Load 4bit models 4x faster Collection Native bitsandbytes 4bit pre quantized models • 25 items • Updated 18 days ago • 61
Embedding Models Collection Run or fine-tune embedding models with Unsloth. • 14 items • Updated 18 days ago • 3