Instructions to use opalitestudios/Qwen2.5-3B-Instruct-ONNX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers.js
How to use opalitestudios/Qwen2.5-3B-Instruct-ONNX with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('text-generation', 'opalitestudios/Qwen2.5-3B-Instruct-ONNX');
Qwen2.5-3B-Instruct ONNX
ONNX conversion of Qwen/Qwen2.5-3B-Instruct for use with Transformers.js.
Quantization
- q4f16: 4-bit weights with fp16 compute (~2.2 GB)
Usage
import { AutoTokenizer, AutoModelForCausalLM } from '@huggingface/transformers';
const tokenizer = await AutoTokenizer.from_pretrained('opalitestudios/Qwen2.5-3B-Instruct-ONNX');
const model = await AutoModelForCausalLM.from_pretrained('opalitestudios/Qwen2.5-3B-Instruct-ONNX', {
dtype: 'q4f16',
});
- Downloads last month
- 11