CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Paper • 2409.19510 • Published Sep 29, 2024 • 1
CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation Paper • 2508.07295 • Published Aug 10, 2025
MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus Paper • 2601.09270 • Published Jan 14 • 1
Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion Paper • 2602.21646 • Published 7 days ago
MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages Paper • 2512.01512 • Published Dec 1, 2025
CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Paper • 2409.19510 • Published Sep 29, 2024 • 1
MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus Paper • 2601.09270 • Published Jan 14 • 1