Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YellowjacketGames 's Collections
[ data ] What a Dump!
non-EN Models
[ papers ] Sports Tech
[papers] Distillation
[papers] RAG$ to Riche$
[mixed] ORC_Assist "Work's Done!"
[mixed] Chess x AI
[papers] Gameplay Optimization
[models] RTX a6000 48gb
[models] GTX 1660 Super 6gb
[models] Sub-1gb for Edge Deployment
[models] iGPU-Capable < 512mb
[models] 100B+ Param, CPU-Offload + A6000x2
[mixed] Image Generation Stack

[models] 100B+ Param, CPU-Offload + A6000x2

updated 3 days ago

TPS can be as low as 1.0, seriously. its SLOW.

Upvote
1

  • unsloth/GLM-4.7-GGUF

    Text Generation • 358B • Updated Dec 27, 2025 • 122k • 187

    Note not deeply tested.


  • unsloth/DeepSeek-R1-0528-GGUF

    Text Generation • 671B • Updated Jun 15, 2025 • 4.18k • 193

    Note Max Precision Capacity: Q8_K_XL


  • unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

    Image-to-Text • 401B • Updated Jun 18, 2025 • 6.24k • 42

  • unsloth/MiniMax-M2.1-GGUF

    Text Generation • 229B • Updated Dec 26, 2025 • 137k • 159

  • unsloth/Kimi-K2-Thinking-GGUF

    1T • Updated about 16 hours ago • 4.39k • 112
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs