view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition Paper • 2311.12344 • Published Nov 21, 2023 • 2
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts Paper • 2403.09176 • Published Mar 14, 2024 • 2
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24, 2024 • 63
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D Paper • 2312.15980 • Published Dec 26, 2023 • 12