smpanaro's picture
add model
691b3b7
---
license: llama3.2
base_model:
- meta-llama/Llama-3.2-3B-Instruct
tags:
- coreml
---
CoreML conversion of Llama-3.2-3B-Instruct with a 512 context length. Optimized for Apple Neural Engine.
Use [this CLI](https://github.com/smpanaro/coreml-llm-cli) to download and run inference. macOS 14 (Sonoma) is required.
> [!IMPORTANT]
> This model will likley run slowly or not at all on M1 Macs and phones. Consider trying the 1B model for those devices: [smpanaro/Llama-3.2-1B-Instruct-CoreML](https://huggingface.co/smpanaro/Llama-3.2-1B-Instruct-CoreML)