Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / text-embeddings-inference /pr_742 /en /intel_container.md

rtrm

about 2 months ago

preview code

download

raw

2.58 kB

	# Using TEI Container with Intel® Hardware

	This guide explains how to build and deploy `text-embeddings-inference` containers optimized for Intel® hardware, including CPUs, XPUs, and HPUs.

	## CPU

	### Build Docker Image

	To build a container optimized for Intel® CPUs, run the following command:

	```shell
	platform="cpu"

	docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_cpu_ipex
	```

	### Deploy Docker Container

	To deploy your model on an Intel® CPU, use the following command:

	```shell
	model='Qwen/Qwen3-Embedding-0.6B'
	volume=$PWD/data

	docker run -p 8080:80 -v $volume:/data tei_cpu_ipex --model-id $model
	```

	## XPU

	### Build Docker Image

	To build a container optimized for Intel® XPUs, run the following command:

	```shell
	platform="xpu"

	docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_xpu_ipex
	```

	### Deploy Docker Container

	To deploy your model on an Intel® XPU, use the following command:

	```shell
	model='Qwen/Qwen3-Embedding-0.6B'
	volume=$PWD/data

	docker run -p 8080:80 -v $volume:/data --device=/dev/dri -v /dev/dri/by-path:/dev/dri/by-path tei_xpu_ipex --model-id $model --dtype float16
	```

	## HPU

	> [!WARNING]
	> TEI is supported only on Gaudi 2 and Gaudi 3. Gaudi 1 is not supported.

	### Build Docker Image

	To build a container optimized for Intel® HPUs (Gaudi), run the following command:

	```shell
	platform="hpu"

	docker build . -f Dockerfile-intel --build-arg PLATFORM=$platform -t tei_hpu
	```

	### Deploy Docker Container

	To deploy your model on an Intel® HPU (Gaudi), use the following command:

	```shell
	model='Qwen/Qwen3-Embedding-0.6B'
	volume=$PWD/data

	docker run -p 8080:80 -v $volume:/data --runtime=habana -e HABANA_VISIBLE_DEVICES=all -e MAX_WARMUP_SEQUENCE_LENGTH=512 tei_hpu --model-id $model --dtype bfloat16
	```

	## Prebuilt Docker Images

	For convenience, prebuilt Docker images are available on GitHub Container Registry (GHCR). You can pull these images directly without the need to build them manually:

	### CPU
	To use the prebuilt image optimized for Intel® CPUs, run:
	```shell
	docker pull ghcr.io/huggingface/text-embeddings-inference:cpu-ipex-latest
	```

	### XPU
	To use the prebuilt image optimized for Intel® XPUs, run:
	```shell
	docker pull ghcr.io/huggingface/text-embeddings-inference:xpu-ipex-latest
	```

	### HPU

	> [!WARNING]
	> TEI is supported only on Gaudi 2 and Gaudi 3. Gaudi 1 is not supported.

	To use the prebuilt image optimized for Intel® HPUs (Gaudi), run:
	```shell
	docker pull ghcr.io/huggingface/text-embeddings-inference:hpu-latest
	```

Xet Storage Details

Size:: 2.58 kB
Xet hash:: 4747caf7bcd7d4212ab8d12d9225d1c97ff2227054b15ad124024fe47c2e3bd5

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.