Granite 4.0 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 2 items • Updated 16 days ago
GPT-OSS Collection For the gpt-oss models, we recommend using the native mxfp4 quantization. • 3 items • Updated Sep 26, 2025
Qwen 2.5 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 8 items • Updated Sep 16, 2025
Qwen 3 Collection Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 14 items • Updated Sep 15, 2025