LLaVAction: evaluating and training multi-modal large language models for action recognition Paper • 2503.18712 • Published Mar 24, 2025 • 4 • 2