Large-scale Pre-training for Grounded Video Caption Generation Paper โข 2503.10781 โข Published Mar 13, 2025 โข 16