How to use openinterx/UGC-VideoCaptioner with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForTextToWaveform processor = AutoProcessor.from_pretrained("openinterx/UGC-VideoCaptioner") model = AutoModelForTextToWaveform.from_pretrained("openinterx/UGC-VideoCaptioner")