Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Spaces:
ai-engineering-at
/
llama-cpp-turboquant-guide
Running

App Files Files Community
llama-cpp-turboquant-guide / results
8.86 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 5 commits
AI Engineering Lab
results: RTX 3090 consolidated β€” 4 runs, 15 measurements, avg -7.5% TPS
a99aedc about 2 months ago
  • turboquant-3090-all-runs-2026-04.json
    1.67 kB
    results: RTX 3090 consolidated β€” 4 runs, 15 measurements, avg -7.5% TPS about 2 months ago
  • turboquant-4070-laptop-2026-04-01.json
    1.65 kB
    results: add RTX 4070 Laptop 8GB benchmark (Llama-3.1 8B) about 2 months ago
  • turboquant-4070-results-2026-04-01.json
    667 Bytes
    results: add verified RTX 4070 Laptop benchmark + cross-GPU comparison table about 2 months ago
  • turboquant-rtx3090-2026-04-01-v2.json
    992 Bytes
    results: add v2 verification run β€” confirms v1 within measurement variance about 2 months ago
  • turboquant-rtx3090-2026-04-01.json
    3.89 kB
    Initial release: TurboQuant practical guide for consumer hardware about 2 months ago