view article Article Getting More from Your Test-Time Compute Budget with Portfolio Beam Search 17 days ago • 8
view article Article Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel Sep 17, 2025 • 6
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper • 2404.01197 • Published Apr 1, 2024 • 31
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models Paper • 2404.03118 • Published Apr 3, 2024 • 25
FastRM: An efficient and automatic explainability framework for multimodal generative models Paper • 2412.01487 • Published Dec 2, 2024 • 1
LDM3D collection Collection This collection contains the models, papers, and demo associated with the LDM3D release. • 7 items • Updated Aug 23, 2024
The SPRIGHT T2I collection Collection This collection contains the datasets, model, paper, and demo associated with the SPRIGHT (SPatially RIGHT) release. • 5 items • Updated Apr 2, 2024 • 6