This repository provides a model compiled and optimized for Mobilint NPU hardware. The model is packaged for deployment on Mobilint’s acceleration stack and is intended to be used within that environment.
Downloads last month
2
Model tree for mobilint/Llama-3.1-8B-Instruct-Batch32