Xenova HF Staff whitphx commited on
Commit
ea46733
·
verified ·
1 Parent(s): e9a7d25

Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)

Browse files

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (9dbc2ff15309e3678a49299b525a449d6b1e58ad)


Co-authored-by: Yuichiro Tachibana <whitphx@users.noreply.huggingface.co>

README.md CHANGED
@@ -5,4 +5,22 @@ library_name: transformers.js
5
 
6
  https://huggingface.co/declare-lab/flan-alpaca-large with ONNX weights to be compatible with Transformers.js.
7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
 
5
 
6
  https://huggingface.co/declare-lab/flan-alpaca-large with ONNX weights to be compatible with Transformers.js.
7
 
8
+ ## Usage (Transformers.js)
9
+
10
+ If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
11
+ ```bash
12
+ npm i @huggingface/transformers
13
+ ```
14
+
15
+ **Example:** Text-to-text generation.
16
+
17
+ ```js
18
+ import { pipeline } from '@huggingface/transformers';
19
+
20
+ const generator = await pipeline('text2text-generation', 'Xenova/flan-alpaca-large');
21
+ const output = await generator('how can I become more healthy?', {
22
+ max_new_tokens: 100,
23
+ });
24
+ ```
25
+
26
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
onnx/decoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48f874fb491b1b7c99689451805c863c07fbabde3cfc3fbfc8cd1f6a047bfea6
3
+ size 381134915
onnx/decoder_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f0662b938d5070beb35b8acb689f380bcb8378f86cbfce91f994f385ca7e77e5
3
+ size 950300405
onnx/decoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2bbde49891735c383fb09391b12ee6f76d380d295542fbf02c288e5d377b17b0
3
+ size 476021985
onnx/decoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7f8ba3bbb1fed9f277a976c08050a544e4a420b5006de1966576733384bff00
3
+ size 408748029
onnx/decoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a47b9c06a034b2e2ccc76f6254f63898d9448287725b2514902e6c11702c907
3
+ size 315189312
onnx/decoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1edfc7191b2f61ca83c695324bfc0446d5c3149a732032b82300c7aa9a09388
3
+ size 476022121
onnx/decoder_with_past_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a5533931192458aec806375137e37b6e779e547db51b359c7ec3b7fe23b5a09
3
+ size 352682669
onnx/decoder_with_past_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03b676ed96ba52b3c6b8b277383fdc5fa64c9a8e012c90b23ca35d228dcf182e
3
+ size 849523706
onnx/decoder_with_past_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7b162a9832ed5e64b8a21870d626e5217669db92e89d7faa271c285f6951e41
3
+ size 425510472
onnx/decoder_with_past_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e61b183b0a75cbd517d158e3db885e380c2ec266a935fafed05ddd70161d165
3
+ size 377150439
onnx/decoder_with_past_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:087a26c11f42c1f00d7d2579159cbd7ac7c6fedd5cd52dcd6863d07f3cb03861
3
+ size 286757253
onnx/decoder_with_past_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5d38dd8f0de3aaaac9072207847d57392b65f3e792bf47eeac7c4197a3a9e173
3
+ size 425510584
onnx/encoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e0af4e86d260d7b20b8f7b290dc1af38fad5686196b4c79553c555ca0c4cdab4
3
+ size 305592841
onnx/encoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ca722e90eb967739db54b39a54d64da4ed5191cd0e76c34ae2a21b736fb5b2f6
3
+ size 341928528
onnx/encoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:407fc78d67cacf4978b3f74cfed3c17d877d244d0f4f9dccaf97d2a59af3b87a
3
+ size 324859081
onnx/encoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:421696d7c70eb151c5a1a9e731b39132559278cd6b38cc6a6674a34dcc57a0e1
3
+ size 239692729
onnx/encoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9959afd9bb8647e3ff00381e1346146018997e2a8c9114269f4aee8d6c3be21f
3
+ size 341928611
onnx/model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:893c585c6c94c9ab6138e889d1e8918c63a1bc3fbd2df36e0dc486e04e410881
3
+ size 1900612149
onnx/model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c823c54c6dd8629205c1718284bacdbf3ac8cfd2dfd10cafe3262e4346b4f426
3
+ size 381788986
onnx/model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e5d28e3a01386d547425fb1c2bbb10e1711f25d88285ebb22ea5392761b450ed
3
+ size 950954495
onnx/model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:425d84d1fcbae930b0c72d1ee0eca493fa40bda109cf894c424d35ae787d6b73
3
+ size 476913881
onnx/model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc4bfc69a5ac1adb09da7922cf6cbe73037d5b290e22763db86a01a0bb7fe9d8
3
+ size 409400147
onnx/model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:943c6a0bdbb50165e1924d67fb0e8e2f3d8c6f4ceeaca0b512011e911b777668
3
+ size 315873129
onnx/model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:43a4da96026759ce80adc9778d5339ae9ffce8a8d8201192cd58a7dd2d838c4b
3
+ size 476914017