Working on DGX Spark (ARM64 + CUDA 13) - Setup Notes

#23

by logos-flux - opened Jan 4

Jan 4

Got VibeVoice-Realtime-0.5B running on DGX Spark with full GPU acceleration. Sharing setup notes since the official docs focus on x86_64.

The Issue:
PyTorch may not have CUDA enabled on Spark. You'll see CUDA available: False even though the GPU is there. This is a common issue.

The Fix:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu130

Performance:

Notes:

Built a full voice pipeline (Whisper + Ollama + VibeVoice) with sentence-level streaming that achieves ~766ms to first audio.

Happy to share code if there's interest.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment