Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nm-testing 's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models

LLM Compressor testing

updated Nov 17
Upvote
-

  • nm-testing/tinysmokellama-3.2

    354k • Updated Sep 17 • 35.6k

  • nm-testing/llama2.c-stories42M-pruned2.4

    Updated Oct 29 • 577

  • nm-testing/tinyllama-fp8-dynamic-compressed

    1B • Updated Oct 9, 2024 • 385

  • nm-testing/tinyllama-w4a16-compressed

    0.3B • Updated Oct 9, 2024 • 789

  • nm-testing/tinyllama-w8a8-compressed

    1B • Updated Oct 9, 2024 • 801

  • nm-testing/tinyllama-w8a16-dense

    1B • Updated Oct 9, 2024 • 258

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-compressed

    1B • Updated Jan 14 • 580

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-uncompressed

    1B • Updated Jan 14 • 156

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-compressed

    0.3B • Updated Jan 14 • 235

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-uncompressed

    1B • Updated Jan 14 • 74

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed

    1B • Updated Jan 14 • 248

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed

    1B • Updated Jan 14 • 82

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed

    0.4B • Updated Jan 14 • 561

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed

    1B • Updated Jan 14 • 149
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs