Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Ferr0 's Collections
Red-team & offensive LLMs
Defensive AI & code security
Structured output & tool-calling
Local-first LLMs

Local-first LLMs

updated about 22 hours ago

Small, capable models I run locally on a single RTX 3090 (Ollama / llama.cpp / transformers) — the backbone of self-hosted, sovereign AI.

Upvote
-

  • Qwen/Qwen3.6-27B

    Image-Text-to-Text • 28B • Updated Apr 24 • 5.26M • • 1.85k

  • Qwen/Qwen3.5-4B

    Image-Text-to-Text • 5B • Updated Mar 2 • 8.4M • • 702

  • google/gemma-4-12B-it

    Any-to-Any • 12B • Updated 26 days ago • 2.62M • 1.22k

  • Qwen/Qwen3-Coder-30B-A3B-Instruct

    Text Generation • 31B • Updated Dec 3, 2025 • 1.75M • • 1.14k

  • Qwen/Qwen3-Embedding-0.6B

    Feature Extraction • 0.6B • Updated Apr 20 • 10.1M • • 1.09k

  • yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

    Text Generation • 12B • Updated 12 days ago • 575k • 2.52k

  • deepreinforce-ai/Ornith-1.0-9B

    Text Generation • 1.47M • Updated 5 days ago • 26.2k • • 304
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs