Text Generation
Transformers.js
ONNX
multilingual
gemma3_text
gemma3
translation
no-vision
causal-lm
conversational
How to use willopcbeta/translategemma-4b-it-Text-ONNX with Transformers.js:
// npm i @huggingface/transformers
import { pipeline } from '@huggingface/transformers';

// Allocate pipeline
const pipe = await pipeline('text-generation', 'willopcbeta/translategemma-4b-it-Text-ONNX');
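Since this thread concerns the Q4 and Q4f16 variants, a quantized variant can presumably be requested through the `dtype` option of `pipeline`. The following is a minimal sketch, assuming the repository ships ONNX files for both variants; `buildPipelineOptions`, `loadGenerator`, and `SUPPORTED_VARIANTS` are hypothetical names introduced here for illustration and are not part of the library.

```javascript
// Sketch only: the helper names and the variant list are assumptions.
const MODEL_ID = 'willopcbeta/translategemma-4b-it-Text-ONNX';

// Variants discussed in this pull request (assumed to exist in the repo).
const SUPPORTED_VARIANTS = ['q4', 'q4f16'];

// Build the options object passed as the third argument to pipeline().
function buildPipelineOptions(variant = 'q4f16') {
  if (!SUPPORTED_VARIANTS.includes(variant)) {
    throw new Error(`Unsupported variant: ${variant}`);
  }
  return { dtype: variant };
}

// Load the text-generation pipeline for the chosen variant.
// The package is imported lazily so the helper above can be used without it.
async function loadGenerator(variant = 'q4f16') {
  const { pipeline } = await import('@huggingface/transformers');
  return pipeline('text-generation', MODEL_ID, buildPipelineOptions(variant));
}
```

Calling `await loadGenerator('q4')` would then return a pipeline for the Q4 weights, provided the package is installed and the model files can be downloaded.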
Modified "kv_cache_dtype" so that it now conforms to Q4/Q4f16. The original model's dtype is now "float16".
#4
by willopcbeta - opened
Fixed a bug in the Q4 variant that caused problems in dialogue (conversational) use. The error reported was:
OrtRun(). ERROR_CODE: 2, ERROR_MESSAGE: Unexpected input data type. Actual: (tensor(float)) , expected: (tensor(float16))
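The error above indicates that a float32 tensor was fed to an input the model declares as float16. One way to picture the fix is a per-variant lookup that decides which data type the key/value cache tensors should use. The sketch below is an illustration under stated assumptions, not the library's actual code: `resolveKvCacheDtype`, `matchesExpectedType`, and the example mapping values are hypothetical.

```javascript
// Illustrative mapping from weight variant to KV cache tensor type.
// The values here are assumed for the example, not read from the repository.
const exampleKvCacheDtype = { q4: 'float16', q4f16: 'float16' };

// Pick the cache tensor type for a variant; fall back to float32 when the
// variant has no entry (the situation that would produce a type mismatch
// if the model itself expects float16 inputs).
function resolveKvCacheDtype(mapping, variant, fallback = 'float32') {
  return (mapping && mapping[variant]) || fallback;
}

// Compare the type a model input expects with the type actually supplied.
function matchesExpectedType(expected, actual) {
  return expected === actual;
}

// Without an entry for 'q4', the cache would be created as float32.
const before = resolveKvCacheDtype({ q4f16: 'float16' }, 'q4');
// With the entry present, the cache is created as float16.
const after = resolveKvCacheDtype(exampleKvCacheDtype, 'q4');

console.log(before, matchesExpectedType('float16', before));
console.log(after, matchesExpectedType('float16', after));
```

Under these assumptions, a variant missing from the mapping ends up with a float32 cache, which is consistent with the "Actual: (tensor(float)), expected: (tensor(float16))" message.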
willopcbeta changed pull request status to merged