Buckets:

rtrm's picture
|
download
raw
2.58 kB
# Using TEI Container with Intel® Hardware
This guide explains how to build and deploy `text-embeddings-inference` containers optimized for Intel® hardware, including CPUs, XPUs, and HPUs.
## CPU
### Build Docker Image
To build a container optimized for Intel® CPUs, run the following command:
```shell
platform="cpu"
docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_cpu_ipex
```
### Deploy Docker Container
To deploy your model on an Intel® CPU, use the following command:
```shell
model='Qwen/Qwen3-Embedding-0.6B'
volume=$PWD/data
docker run -p 8080:80 -v $volume:/data tei_cpu_ipex --model-id $model
```
## XPU
### Build Docker Image
To build a container optimized for Intel® XPUs, run the following command:
```shell
platform="xpu"
docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_xpu_ipex
```
### Deploy Docker Container
To deploy your model on an Intel® XPU, use the following command:
```shell
model='Qwen/Qwen3-Embedding-0.6B'
volume=$PWD/data
docker run -p 8080:80 -v $volume:/data --device=/dev/dri -v /dev/dri/by-path:/dev/dri/by-path tei_xpu_ipex --model-id $model --dtype float16
```
## HPU
> [!WARNING]
> TEI is supported only on Gaudi 2 and Gaudi 3. Gaudi 1 is **not** supported.
### Build Docker Image
To build a container optimized for Intel® HPUs (Gaudi), run the following command:
```shell
platform="hpu"
docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_hpu
```
### Deploy Docker Container
To deploy your model on an Intel® HPU (Gaudi), use the following command:
```shell
model='Qwen/Qwen3-Embedding-0.6B'
volume=$PWD/data
docker run -p 8080:80 -v $volume:/data --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e MAX_WARMUP_SEQUENCE_LENGTH=512 tei_hpu --model-id $model --dtype bfloat16
```
## Prebuilt Docker Images
For convenience, prebuilt Docker images are available on GitHub Container Registry (GHCR). You can pull these images directly without the need to build them manually:
### CPU
To use the prebuilt image optimized for Intel® CPUs, run:
```shell
docker pull ghcr.io/huggingface/text-embeddings-inference:cpu-ipex-latest
```
### XPU
To use the prebuilt image optimized for Intel® XPUs, run:
```shell
docker pull ghcr.io/huggingface/text-embeddings-inference:xpu-ipex-latest
```
### HPU
> [!WARNING]
> TEI is supported only on Gaudi 2 and Gaudi 3. Gaudi 1 is **not** supported.
To use the prebuilt image optimized for Intel® HPUs (Gaudi), run:
```shell
docker pull ghcr.io/huggingface/text-embeddings-inference:hpu-latest
```

Xet Storage Details

Size:
2.58 kB
·
Xet hash:
4747caf7bcd7d4212ab8d12d9225d1c97ff2227054b15ad124024fe47c2e3bd5

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.