Use this model with ROCm and docker

by cool9203 - opened Jan 22

Jan 22

Thanks for sharing such a great model!
I've documented my process of running it with ROCm and Docker below.

Verify ROCm version using the following command: sudo update-alternatives --display rocm
Pull the PyTorch ROCm image select the appropriate image from rocm/pytorch
Example used: rocm/pytorch:rocm7.1_ubuntu24.04_py3.12_pytorch_release_2.9.1
Start container, i use docker compose

services:
  pytorch-rocm:
    image: rocm/pytorch:rocm7.1_ubuntu24.04_py3.12_pytorch_release_2.9.1
    container_name: vibevoice-asr-rocm
    network_mode: "host"
    command: bash
    tty: true
    stdin_open: true
    devices:
      - /dev/kfd
      - /dev/dri
    group_add:
      - "video"
    ipc: host
    shm_size: "16gb"
    environment:
      TZ: "Asia/Taipei"
    mem_limit: "64g"
    restart: unless-stopped

Enter container and install inside container flash-attn rocm version

git clone https://github.com/ROCm/flash-attention.git &&\ 
cd flash-attention &&\
FLASH_ATTENTION_TRITON_AMD_ENABLE="TRUE" python setup.py install

Clone VibeVoice and install inside container
Run with gradio FLASH_ATTENTION_TRITON_AMD_ENABLE="TRUE" python demo/vibevoice_asr_gradio_demo.py --model_path microsoft/VibeVoice-ASR

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment