Qwen2-0.5B-Instruct LiteRT-LM Model

This repository contains LiteRT-LM variants of Qwen/Qwen2-0.5B-Instruct optimized for on-device text generation.

Available Artifact

File Quantization Recipe Context Size
Qwen2_0.5B_Instruct.litertlm dynamic_wi8_afp32 - 647.4 MB

Integration

Ready to integrate this into your product? Get started in the LiteRT-LM documentation.

Downloads last month
7
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for litert-community/Qwen2-0.5B-Instruct

Base model

Qwen/Qwen2-0.5B
Quantized
(88)
this model