Anthropogenic Regional Adaptation in Multimodal Vision-Language Model Paper • 2604.11490 • Published 7 days ago • 11
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16, 2024 • 32
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation Paper • 2112.06223 • Published Dec 12, 2021
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset Paper • 2201.02419 • Published Jan 7, 2022
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition Paper • 2306.14517 • Published Jun 26, 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity Paper • 2302.04023 • Published Feb 8, 2023
Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue Paper • 2302.14680 • Published Feb 28, 2023 • 1
InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning Paper • 2305.13627 • Published May 23, 2023 • 1
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages Paper • 2309.10661 • Published Sep 19, 2023 • 1
Greenformer: Factorization Toolkit for Efficient Deep Neural Networks Paper • 2109.06762 • Published Sep 14, 2021 • 1