Buckets:

rtrm's picture
|
download
raw
1.78 kB

Use custom models

By default, Transformers.js uses hosted pretrained models and precompiled WASM binaries, which should work out-of-the-box. You can customize this as follows:

Settings

import { env } from '@huggingface/transformers';

// Specify a custom location for models (defaults to '/models/').
env.localModelPath = '/path/to/models/';

// Disable the loading of remote models from the Hugging Face Hub:
env.allowRemoteModels = false;

// Set location of .wasm files. Defaults to use a CDN.
env.backends.onnx.wasm.wasmPaths = '/path/to/files/';

For a full list of available settings, check out the API Reference.

Convert your models to ONNX

We recommend using our conversion script to convert your PyTorch, TensorFlow, or JAX models to ONNX in a single command. Behind the scenes, it uses ๐Ÿค— Optimum to perform conversion and quantization of your model.

python -m scripts.convert --quantize --model_id 

For example, convert and quantize bert-base-uncased using:

python -m scripts.convert --quantize --model_id bert-base-uncased

This will save the following files to ./models/:

bert-base-uncased/
โ”œโ”€โ”€ config.json
โ”œโ”€โ”€ tokenizer.json
โ”œโ”€โ”€ tokenizer_config.json
โ””โ”€โ”€ onnx/
    โ”œโ”€โ”€ model.onnx
    โ””โ”€โ”€ model_quantized.onnx

For the full list of supported architectures, see the Optimum documentation.

Xet Storage Details

Size:
1.78 kB
ยท
Xet hash:
655417b54002e34aef34fd0de9af4eff45a6f82e095ac994c6d0e02c479b3218

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.