Granite 4.0 Collection: Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 2 items • Updated Jan 13
GPT-OSS Collection: For gpt-oss models we recommend using the native mxfp4 quantization. • 3 items • Updated Sep 26, 2025
Qwen 2.5 Collection: Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp • 8 items • Updated Sep 16, 2025
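Several of the collections above note that Ampere's Q4_K_4 / Q8R16 quantization formats require the Ampere-optimized llama.cpp image from Docker Hub. A minimal sketch of pulling and running that image follows; the image tag, host mount path, model filename, and the assumption that the container exposes llama.cpp-style flags are all illustrative, not taken from the listing:

```shell
# Pull the Ampere-optimized llama.cpp image
# ("latest" tag is an assumption; check the repo's Tags page).
docker pull amperecomputingai/llama.cpp:latest

# Run it against a locally downloaded GGUF file in one of
# Ampere's formats. The host path and model filename below
# are placeholders for your own download.
docker run --rm -it \
  -v /path/to/models:/models \
  amperecomputingai/llama.cpp:latest \
  -m /models/model.Q8R16.gguf -p "Hello"
```

The Q4_K_4 / Q8R16 models will not load in a stock upstream llama.cpp build, which is why the Ampere image is called out on each collection card.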