Llama-3.2-3B-Instruct-ET

ExecuTorch .pte model converted from meta-llama/Llama-3.2-3B-Instruct for on-device inference with ToMogo.

Model Details

Download all files into a single directory and load with ExecuTorch on Android:

val engine = ExecuTorchEngine(modelDir = "/path/to/Llama-3.2-3B-Instruct-ET")
engine.prefill(prompt) { token -> print(token) }

Auto-generated by ToMogo upload pipeline.

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support