How to use deepmind/multimodal-perceiver with Transformers:
# Load model directly from transformers import AutoTokenizer, PerceiverForMultimodalAutoencoding tokenizer = AutoTokenizer.from_pretrained("deepmind/multimodal-perceiver") model = PerceiverForMultimodalAutoencoding.from_pretrained("deepmind/multimodal-perceiver")