Running 8 TurboQuant on Consumer GPUs — 100K Context on RTX 3090, 64K on RTX 4070 🚀 8 Extend LLM context to 100K tokens on consumer GPUs
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated Apr 6 • 117k • 605
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 16 days ago • 921k • 305
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 18 days ago • 380k • 247
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 226k • • 2.84k
Running on CPU Upgrade 234 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 234 Explore synthetic data experiments on a virtual bookshelf