LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation Paper • 2412.15188 • Published Dec 19, 2024 • 2
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch • May 21, 2025 • 252
MM-DINOv2: Adapting Foundation Models for Multi-Modal Medical Image Analysis Paper • 2509.06617 • Published Sep 8, 2025 • 1