Submitted by Gabriele Serussi 10 HERBench: A Benchmark for Multi-Evidence Integration in Video Question Answering INSIGHT Lab 3 3