NPU - OpenVINO
Collection
leading models optimized for OpenVINO NPU • 24 items • Updated
slim-summary-tiny-npu-ov is a specialized function calling model that summarizes a given text and generates as output a Python list of summary points.
This is an OpenVino int4 quantized version of slim-summary-tiny, providing a very fast, very small inference implementation, optimized for AI PCs using Intel NPU.
Base model
llmware/slim-summary-tiny