Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta Paper • 2603.02181 • Published 2 days ago
ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport Paper • 2602.22678 • Published 6 days ago