view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito ⢠May 12, 2025 ⢠614
Cosmos-Preidct1 Collection ā ļø This collection is archived. š https://huggingface.co/collections/nvidia/cosmos3 ⢠14 items ⢠Updated 15 days ago ⢠304
view article Article ColPali: Efficient Document Retrieval with Vision Language Models š manu ⢠Jul 5, 2024 ⢠321
Bark Collection Bark is a transformer-based text-to-audio model created by Suno. Currently, two checkpoints are supported: a small and a large version. ⢠3 items ⢠Updated Sep 14, 2023 ⢠20