Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (9ff15c6a2d81c918839c1243c4591617b0420aa8)

Co-authored-by: Yuichiro Tachibana <whitphx@users.noreply.huggingface.co>

Files changed (7)

README.md +18 -0
onnx/decoder_model_q4f16.onnx +3 -0
onnx/decoder_with_past_model_q4f16.onnx +3 -0
onnx/encoder_model_q4f16.onnx +3 -0
onnx/model.onnx +3 -0
onnx/model_fp16.onnx +3 -0
onnx/model_q4f16.onnx +3 -0

README.md CHANGED

@@ -5,4 +5,22 @@ library_name: transformers.js
 https://huggingface.co/pszemraj/grammar-synthesis-small with ONNX weights to be compatible with Transformers.js.
+## Usage (Transformers.js)
+If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
+```bash
+npm i @huggingface/transformers
+```
+**Example:** Text-to-text generation.
+```js
+import { pipeline } from '@huggingface/transformers';
+const generator = await pipeline('text2text-generation', 'Xenova/grammar-synthesis-small');
+const output = await generator('how can I become more healthy?', {
+  max_new_tokens: 100,
+});
+```
 Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
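
The README example above loads the model with default settings. As a minimal sketch of how the files added in this commit might line up with the `dtype` option of Transformers.js v3, the hypothetical helper below (`onnxFileFor` is not part of the library) maps a `dtype` value to a file name. The suffix table is an assumption inferred from the file names in this commit, not a statement of the library's internal lookup.

```js
// Assumed dtype -> file-name suffix mapping, inferred from the files in this commit
// (model.onnx, model_fp16.onnx, model_q4f16.onnx).
const DTYPE_SUFFIX = new Map([
  ['fp32', ''],
  ['fp16', '_fp16'],
  ['q4f16', '_q4f16'],
]);

// Hypothetical helper: returns the repo-relative path assumed to hold the given variant.
function onnxFileFor(dtype, baseName = 'model') {
  if (!DTYPE_SUFFIX.has(dtype)) {
    throw new Error(`No file in this commit for dtype "${dtype}"`);
  }
  return `onnx/${baseName}${DTYPE_SUFFIX.get(dtype)}.onnx`;
}

console.log(onnxFileFor('q4f16'));
console.log(onnxFileFor('q4f16', 'encoder_model'));

// Requesting the 4-bit/fp16 variant from the library itself would then look like this
// (left commented out because it downloads the model):
// import { pipeline } from '@huggingface/transformers';
// const generator = await pipeline(
//   'text2text-generation',
//   'Xenova/grammar-synthesis-small',
//   { dtype: 'q4f16' },
// );
```

Which of the split (`encoder_model`, `decoder_model`, `decoder_with_past_model`) or single-file variants the library actually fetches is not stated in this commit, so the helper only illustrates the naming pattern.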

onnx/decoder_model_q4f16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1caf813ab653586fefffb87c9426e7b0f00744154915b9f2c96d336920c946e4
+size 56580249

onnx/decoder_with_past_model_q4f16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d0af4e497579f97eb317fa0ea767b3bf2efa91910f943eb586d262773a83d77d
+size 54772520

onnx/encoder_model_q4f16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:784e5a431dc3467e03218a2f8d57be7a3c779f1a162dfc256a7ea5ae330a42d3
+size 43669219

onnx/model.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:92e8aacbe6374d7103b074a351eeaf9f2dc56b552892d1398b70e56d724d9755
+size 232784389

onnx/model_fp16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:626f1e6f51c80432fb615c66e0efd788871d9afb757bbd286ccdf92e48acef3a
+size 116622030

onnx/model_q4f16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a2067e4f6b55f49d9de3534cc88b8c0802fbe3c7548a53d50bf414709aabd218
+size 56823126
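
Each `.onnx` entry above is a three-line Git LFS pointer (`version`, `oid`, `size`) rather than the binary itself. As a rough sketch of how such a pointer could be read, the hypothetical `parseLfsPointer` function below splits each line at its first space and checks that the three fields are present. It is an illustration only and does not implement the full Git LFS specification.

```js
// Hypothetical parser for a Git LFS pointer file such as the ones added in this commit.
function parseLfsPointer(text) {
  const fields = new Map();
  for (const line of text.split('\n')) {
    const trimmed = line.trim();
    if (trimmed === '') continue;
    // Each line is "<key> <value>", separated by the first space.
    const space = trimmed.indexOf(' ');
    if (space === -1) throw new Error(`Malformed pointer line: ${trimmed}`);
    fields.set(trimmed.slice(0, space), trimmed.slice(space + 1));
  }
  // The oid has the form "<algorithm>:<hex digest>".
  const [algorithm, digest] = (fields.get('oid') ?? '').split(':');
  const size = Number(fields.get('size'));
  if (!fields.has('version') || !digest || !Number.isInteger(size)) {
    throw new Error('Pointer is missing version, oid, or size');
  }
  return { version: fields.get('version'), algorithm, digest, size };
}

// Pointer contents copied from onnx/model_q4f16.onnx in this commit.
const pointer = parseLfsPointer([
  'version https://git-lfs.github.com/spec/v1',
  'oid sha256:a2067e4f6b55f49d9de3534cc88b8c0802fbe3c7548a53d50bf414709aabd218',
  'size 56823126',
].join('\n'));

console.log(pointer.algorithm, pointer.size);
```

After downloading the real file, comparing its byte length and SHA-256 digest against `pointer.size` and `pointer.digest` is one way to confirm the download is intact.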