Article Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models codelion • Jan 23 • 10
Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement codelion • Dec 3, 2025 • 14
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 20 days ago • 267
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7, 2025 • 65
Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth mlabonne • Jul 29, 2024 • 371
ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset Paper • 2405.10004 • Published May 16, 2024 • 1
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 45
MMMModal -- Multi-Images Multi-Audio Multi-turn Multi-Modal Paper • 2402.11297 • Published Feb 17, 2024 • 2