AI & ML interests

AI inference, AI in the cloud, AI on edge, software acceleration of AI workloads on hardware, efficient AI deployments, GPU-Free AI inference, AI model optimization.

Recent Activity

dkupnicki  published a model about 18 hours ago
AmpereComputing/bge-m3-gguf
dkupnicki  updated a model about 18 hours ago
AmpereComputing/bge-m3-gguf
jangrzybek  updated a model about 2 months ago
AmpereComputing/granite-4.0-h-small-gguf
View all activity