DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 2 days ago • 265
RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text Paper • 2305.13304 • Published May 22, 2023 • 2
Article Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ Apr 26, 2024 • 13