Instructions to use microsoft/wavlm-base-plus-sv with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/wavlm-base-plus-sv with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForAudioXVector processor = AutoProcessor.from_pretrained("microsoft/wavlm-base-plus-sv") model = AutoModelForAudioXVector.from_pretrained("microsoft/wavlm-base-plus-sv") - Notebooks
- Google Colab
- Kaggle
Input padding?
#1
by cinjonr - opened
I get a lot of errors when running this of the variety: Given input with spatial size (1, 1), kernel_size=(3, 512), dilation=(3, 1), padding=(0, 0), calculated shape of the array of sliding blocks as (-5, 1), but its components must be at least one.
Does this model require a certain shape?