Llama-3.2-3B-Instruct-ET
ExecuTorch .pte model converted from meta-llama/Llama-3.2-3B-Instruct for on-device inference with ToMogo.
Model Details
| Property | Value |
|---|---|
| Source | meta-llama/Llama-3.2-3B-Instruct |
| Format | executorch |
| Quantization | int4 weight-only |
| Backend | xnnpack |
| Fusion | disabled |
| Total Size | 4588.1 MB |
Files
| File | Size |
|---|---|
| README.md | 0.0 MB |
| chat_template.jinja | 0.0 MB |
| export_config.json | 0.0 MB |
| model.pte | 4571.6 MB |
| tokenizer.json | 16.4 MB |
| tokenizer_config.json | 0.0 MB |
Usage
Download all files into a single directory and load with ExecuTorch on Android:
val engine = ExecuTorchEngine(modelDir = "/path/to/Llama-3.2-3B-Instruct-ET")
engine.prefill(prompt) { token -> print(token) }
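
Because the engine expects every file in one directory, it can help to confirm the download is complete before constructing it. The following is a minimal sketch, not part of ToMogo or ExecuTorch: `requiredFiles` and `missingModelFiles` are hypothetical names, and the assumption that `README.md` is not needed at runtime is ours. The file names come from the Files table.

```kotlin
import java.io.File

// File names taken from the Files table. README.md is left out on the
// assumption that it is documentation only and not read at runtime.
val requiredFiles = listOf(
    "model.pte",
    "tokenizer.json",
    "tokenizer_config.json",
    "chat_template.jinja",
    "export_config.json",
)

// Hypothetical helper: returns the names of required files that are
// absent from modelDir, in the order they appear in requiredFiles.
fun missingModelFiles(modelDir: File): List<String> =
    requiredFiles.filterNot { File(modelDir, it).isFile }

fun main() {
    val modelDir = File("/path/to/Llama-3.2-3B-Instruct-ET")
    val missing = missingModelFiles(modelDir)
    if (missing.isEmpty()) {
        println("All model files present in $modelDir")
    } else {
        println("Missing files: ${missing.joinToString()}")
    }
}
```

If the returned list is empty, the directory can be passed as `modelDir` to the engine; otherwise the listed files need to be downloaded again.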
Auto-generated by ToMogo upload pipeline.