view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18 • 50
Higher Precision GGUFs / Imatrix Plus Collection Models compressed in higher precision with parts of the model compression remaining in F16/full precision. Increases overall quality in all tasks. • 53 items • Updated 19 days ago • 8
view article Article Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU! Apr 21, 2024 • 44