Functionality-Oriented LLM Merging on the Fisher--Rao Manifold Paper • 2603.04972 • Published 8 days ago • 2
GPT-2 models fine-tuned on tasks from GLUE Benchmark Collection if you find these models helpful, consider citing [our paper](https://arxiv.org/abs/2406.03280) • 7 items • Updated Aug 27, 2024 • 3
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29, 2025 • 229
story writing favourites Collection Models I personally liked for generating stories in the past. Not a recommendation, most of these are outdated. • 17 items • Updated 10 days ago • 95
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated about 23 hours ago • 94
miscii-14b-dev Collection Known stable releases of the miscii-1020 based models • 3 items • Updated 10 days ago • 2