Valley
Valley Family: Exploring Scalable Vision-Language Design for Multimodal Understanding and Reasoning
bytedance-research/Valley3-8B-Instruct 10B • Updated 9 days ago • 16 • 2
bytedance-research/Valley3-32B-Instruct 34B • Updated 9 days ago • 19 • 2
bytedance-research/Valley3-8B-Think 10B • Updated 9 days ago • 16 • 3
bytedance-research/Valley3-32B-Think 34B • Updated 9 days ago • 18 • 1

Vidi
Vidi model collection for multimodal video understanding and creation
bytedance-research/Vidi-7B 9B • Updated Dec 15, 2025 • 31 • 16
bytedance-research/Vidi1.5-9B 10B • Updated Jan 22 • 26 • 10