wajahatalikhan/Multimodal_Autoencoder


---
license: apache-2.0
base_model:
- openai/clip-vit-base-patch32
---
# Multimodal Learning for Autoencoders

Repository for my SIGGRAPH Asia publication.
A multimodal autoencoder reconstructs the image from both image and text inputs, rather than from the image input alone.

Paper: https://dl.acm.org/doi/10.1145/3681756.3697974
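
The idea described above, reconstructing an image from image and text inputs together, can be illustrated with a minimal PyTorch sketch. This is not the code from the paper. The `FusionDecoder` class, the concatenation-based fusion, the layer sizes, and the 32x32 output resolution are all assumptions made for illustration. Random tensors stand in for the image and text embeddings that `openai/clip-vit-base-patch32` would produce (its projection dimension is 512).

```python
import torch
import torch.nn as nn

EMBED_DIM = 512  # projection size of openai/clip-vit-base-patch32


class FusionDecoder(nn.Module):
    """Hypothetical decoder: maps fused image+text embeddings to pixels."""

    def __init__(self, embed_dim: int = EMBED_DIM, image_size: int = 32):
        super().__init__()
        self.image_size = image_size
        self.net = nn.Sequential(
            nn.Linear(2 * embed_dim, 1024),  # input is the concatenated pair
            nn.ReLU(),
            nn.Linear(1024, 3 * image_size * image_size),
            nn.Sigmoid(),  # keep pixel values in [0, 1]
        )

    def forward(self, image_emb: torch.Tensor, text_emb: torch.Tensor) -> torch.Tensor:
        # Assumed fusion strategy: simple concatenation along the feature axis.
        fused = torch.cat([image_emb, text_emb], dim=-1)
        out = self.net(fused)
        return out.view(-1, 3, self.image_size, self.image_size)


torch.manual_seed(0)
batch = 4

# Stand-ins for CLIP embeddings; a real pipeline would encode actual
# images and captions with the CLIP image and text encoders.
image_emb = torch.randn(batch, EMBED_DIM)
text_emb = torch.randn(batch, EMBED_DIM)
target = torch.rand(batch, 3, 32, 32)  # placeholder ground-truth images

decoder = FusionDecoder()
recon = decoder(image_emb, text_emb)

# Pixel-wise reconstruction loss (an assumed choice of objective).
loss = nn.functional.mse_loss(recon, target)
print(recon.shape, loss.item())
```

An image-only autoencoder would feed just `image_emb` to the decoder. Here the text embedding supplies a second conditioning signal for the reconstruction.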