Add/update the quantized ONNX model files and README.md for Transformers.js v3
#1
by
whitphx
HF Staff
- opened
- README.md +3 -4
- onnx/model_bnb4.onnx +3 -0
- onnx/model_q4.onnx +3 -0
- onnx/model_q4f16.onnx +3 -0
- onnx/model_uint8.onnx +3 -0
README.md
CHANGED
|
@@ -7,15 +7,15 @@ https://huggingface.co/YituTech/conv-bert-small with ONNX weights to be compatib
|
|
| 7 |
|
| 8 |
## Usage (Transformers.js)
|
| 9 |
|
| 10 |
-
If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@
|
| 11 |
```bash
|
| 12 |
-
npm i @
|
| 13 |
```
|
| 14 |
|
| 15 |
**Example:** Feature extraction w/ `Xenova/conv-bert-small`.
|
| 16 |
|
| 17 |
```javascript
|
| 18 |
-
import { pipeline } from '@
|
| 19 |
|
| 20 |
// Create feature extraction pipeline
|
| 21 |
const extractor = await pipeline('feature-extraction', 'Xenova/conv-bert-small');
|
|
@@ -33,5 +33,4 @@ console.log(output)
|
|
| 33 |
|
| 34 |
---
|
| 35 |
|
| 36 |
-
|
| 37 |
Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [π€ Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
|
|
|
|
| 7 |
|
| 8 |
## Usage (Transformers.js)
|
| 9 |
|
| 10 |
+
If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
|
| 11 |
```bash
|
| 12 |
+
npm i @huggingface/transformers
|
| 13 |
```
|
| 14 |
|
| 15 |
**Example:** Feature extraction w/ `Xenova/conv-bert-small`.
|
| 16 |
|
| 17 |
```javascript
|
| 18 |
+
import { pipeline } from '@huggingface/transformers';
|
| 19 |
|
| 20 |
// Create feature extraction pipeline
|
| 21 |
const extractor = await pipeline('feature-extraction', 'Xenova/conv-bert-small');
|
|
|
|
| 33 |
|
| 34 |
---
|
| 35 |
|
|
|
|
| 36 |
Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [π€ Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
|
onnx/model_bnb4.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2dd67430dd9e8eb8d768cfba3c888fc722783265c848b1dd1e247efcf6c2e441
|
| 3 |
+
size 23074731
|
onnx/model_q4.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ca17e48ecab7eb8093ae605568450f80301a1693d210ec3449cb190f8394b38a
|
| 3 |
+
size 23618403
|
onnx/model_q4f16.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2907762f58cc48cae8b22febba1a8443746ca7cbf362e7307110c6396a0b3beb
|
| 3 |
+
size 14208916
|
onnx/model_uint8.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d69ca530eb9b2cdb7a7608aeb7e19e6d4e9770abd1dd5471bd0c650860a1d88d
|
| 3 |
+
size 13855013
|