How to use onnx-community/Qwen2.5-Coder-1.5B-Instruct with Transformers.js:
// npm i @huggingface/transformers
import { pipeline } from '@huggingface/transformers';

// Allocate pipeline
const pipe = await pipeline('text-generation', 'onnx-community/Qwen2.5-Coder-1.5B-Instruct');
Model does not follow or acknowledge system prompts?
#3
by RonanMcGovern - opened
Is this expected? Passing a system message results in it being ignored...
BTW the tokenizer config has the system message in its chat template, so I'm not sure why it behaves this way. Could be the quantization...
FWIW, I can only seem to run q4fp16.
I tried int8 and uint8; they load but produce garbage.
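For anyone trying to reproduce this, here is a minimal sketch of how a system message might be passed and inspected with Transformers.js. The helper names (`buildMessages`, `renderPrompt`, `generate`) are hypothetical and only for illustration, and `'q4f16'` is assumed to be the `dtype` value corresponding to the "q4fp16" variant mentioned above. It has not been run against this model, so treat it as a starting point rather than a confirmed repro.

```javascript
// Model id from this thread. The default dtype used below ('q4f16') is an
// assumption about how the "q4fp16" variant is named in Transformers.js.
const MODEL_ID = 'onnx-community/Qwen2.5-Coder-1.5B-Instruct';

// Hypothetical helper: build a chat-style input with the system message first.
function buildMessages(systemPrompt, userPrompt) {
  return [
    { role: 'system', content: systemPrompt },
    { role: 'user', content: userPrompt },
  ];
}

// Hypothetical helper: render the prompt string the chat template produces,
// without running the model, so the system text can be checked by eye.
async function renderPrompt(messages) {
  const { AutoTokenizer } = await import('@huggingface/transformers');
  const tokenizer = await AutoTokenizer.from_pretrained(MODEL_ID);
  return tokenizer.apply_chat_template(messages, {
    tokenize: false,
    add_generation_prompt: true,
  });
}

// Hypothetical helper: run generation with a chosen quantization variant
// and return only the assistant's reply text.
async function generate(messages, dtype = 'q4f16') {
  const { pipeline } = await import('@huggingface/transformers');
  const pipe = await pipeline('text-generation', MODEL_ID, { dtype });
  const output = await pipe(messages, { max_new_tokens: 128 });
  // With chat input, generated_text is the message list; the last entry is the reply.
  return output[0].generated_text.at(-1).content;
}
```

If the string returned by `renderPrompt(buildMessages(...))` contains the system text, the chat template is probably fine and the problem is more likely the model weights or the quantization. Calling `generate` with different `dtype` values would then be one way to compare variants.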