BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing • Paper • 2506.17450 • Published Jun 20 • 64
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index • Paper • 2506.12229 • Published Jun 13 • 3
DocRAG Datasets • Collection • Processed ("Unified") datasets used in DocRAG for training or inference purposes. • 12 items • Updated Jun 14 • 1
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs • Paper • 2504.15280 • Published Apr 21 • 25
Synthetic Object Compositions for Det / Seg / Grounding • Collection • Dataset Collections for paper: https://github.com/weikaih04/Synthetic-Detection-Segmentation-Grounding-Data • 10 items • Updated 21 days ago • 2
CoTA Datasets • Collection • This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated Oct 31 • 7
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models • Paper • 2408.08872 • Published Aug 16, 2024 • 101
TaskMeAnything • Collection • A collection of TaskMeAnything resources [https://github.com/JieyuZ2/TaskMeAnything] • 12 items • Updated Aug 4, 2024 • 3