TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering Paper • 2404.01476 • Published Apr 1, 2024 • 1
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning Paper • 2406.15334 • Published Jun 21, 2024 • 9