Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Michael Goin's picture
104 21 21

Michael Goin

mgoin
dc-algo's profile picture nickandbro's profile picture srvm's profile picture
·
  • mgoin_
  • mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a model 17 days ago
google/gemma-4-E4B-it-qat-mobile-ct
updated a model 17 days ago
google/gemma-4-E2B-it-qat-mobile-ct
published a model 19 days ago
google/gemma-4-E4B-it-qat-mobile-ct
View all activity

Organizations

Neural Magic's profile picture garage-bAInd's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture NM Testing's profile picture Red Hat AI's profile picture Inference Optimization's profile picture gg-hf-qat's profile picture

mgoin 's collections 1

Nemotron in vLLM
Nemotron models that have been converted and/or quantized to work well in vLLM
  • mgoin/Nemotron-4-340B-Instruct-hf-FP8

    Text Generation • 341B • Updated Aug 8, 2024 • 11 • 3
  • mgoin/Nemotron-4-340B-Base-hf-FP8

    Text Generation • 341B • Updated Aug 8, 2024 • 30 • 2
  • mgoin/Nemotron-4-340B-Instruct-hf

    Text Generation • 341B • Updated Aug 8, 2024 • 2.69k • 4
  • mgoin/Nemotron-4-340B-Base-hf

    Text Generation • 341B • Updated Aug 8, 2024 • 10 • 1
Nemotron in vLLM
Nemotron models that have been converted and/or quantized to work well in vLLM
  • mgoin/Nemotron-4-340B-Instruct-hf-FP8

    Text Generation • 341B • Updated Aug 8, 2024 • 11 • 3
  • mgoin/Nemotron-4-340B-Base-hf-FP8

    Text Generation • 341B • Updated Aug 8, 2024 • 30 • 2
  • mgoin/Nemotron-4-340B-Instruct-hf

    Text Generation • 341B • Updated Aug 8, 2024 • 2.69k • 4
  • mgoin/Nemotron-4-340B-Base-hf

    Text Generation • 341B • Updated Aug 8, 2024 • 10 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs