Apple Neural Engine LLMs
Collection
CoreML LLMs optimized for Apple Neural Engine. • 3 items • Updated • 2
CoreML conversion of Llama 2 7B from smpanaro/Llama-2-7b-NuGPTQ.
Use this CLI to download and run inference. macOS 14 (Sonoma) is required.
Base model
meta-llama/Llama-2-7b-hf