Add/update the quantized ONNX model files and README.md for Transformers.js v3

by whitphx - opened Jul 22, 2025

←

Files changed (5) hide show

README.md CHANGED Viewed

@@ -7,18 +7,18 @@ https://huggingface.co/YituTech/conv-bert-medium-small with ONNX weights to be c
 ## Usage (Transformers.js)
-If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@xenova/transformers) using:
 ```bash
-npm i @xenova/transformers
 ```
 **Example:** Feature extraction w/ `Xenova/conv-bert-medium-small`.
 ```javascript
-import { pipeline } from '@xenova/transformers';
 // Create feature extraction pipeline
-const extractor = await pipeline('feature-extraction', 'Xenova/conv-bert-medium-small', { quantized: false });
 // Perform feature extraction
 const output = await extractor('This is a test sentence.');
@@ -33,5 +33,4 @@ console.log(output)
 ---
 Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).

 ## Usage (Transformers.js)
+If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
 ```bash
+npm i @huggingface/transformers
 ```
 **Example:** Feature extraction w/ `Xenova/conv-bert-medium-small`.
 ```javascript
+import { pipeline } from '@huggingface/transformers';
 // Create feature extraction pipeline
+const extractor = await pipeline('feature-extraction', 'Xenova/conv-bert-medium-small', { dtype: "fp32" });  // Options: "fp32", "fp16", "q8", "q4"
 // Perform feature extraction
 const output = await extractor('This is a test sentence.');
 ---
 Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).

onnx/model_bnb4.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:cb57c2671181d94853945e3fedc5f7c94463bfa6e55b9a3b6b2e225f37d90582
+size 51698508

onnx/model_q4.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:7563188c0022a199b3fbf16a7c35ad53c841929c31a58ab735b4965af87e3222
+size 52037992

onnx/model_q4f16.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f9db79bba23b0065948cfd0deba80fe8aa8bf2491583eca018f2732480836127
+size 27627345

onnx/model_uint8.onnx ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:26ad7dac3319744ee6969182d0b5090dfd10543615f6342d999ad042a09eb9e4
+size 18302304