HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation Paper ⢠2504.12330 ⢠Published Apr 13, 2025 ⢠1
Whisper Models Dutch Language Collection This repo contains Dutch Whisper models finetuned on CV and other synthetic data, with different filtering options ⢠11 items ⢠Updated Sep 16, 2025 ⢠3
Whisper Models Portuguese Language Collection This Repo contains Whisper models trained on subsets of data like Common Voice 17(CV_17), Synthetic(Generated by OpenAI) + CV17 and Synthetic Only. ⢠13 items ⢠Updated Mar 2 ⢠2
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. ⢠14 items ⢠Updated Dec 10, 2025 ⢠23
Seamless: Multilingual Expressive and Streaming Speech Translation Paper ⢠2312.05187 ⢠Published Dec 8, 2023 ⢠14