- Kosmos-2: Grounding Multimodal Large Language Models to the World
  Paper • 2306.14824 • Published • 35
- Kosmos-G: Generating Images in Context with Multimodal Large Language Models
  Paper • 2310.02992 • Published • 4
- Kosmos-2.5: A Multimodal Literate Model
  Paper • 2309.11419 • Published • 56
- AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
  Paper • 2309.16058 • Published • 56
Collections
Discover the best community collections!
Collections trending this week
- LRM: Large Reconstruction Model for Single Image to 3D
  Paper • 2311.04400 • Published • 52
- 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
  Paper • 2311.04391 • Published • 14
- Drivable 3D Gaussian Avatars
  Paper • 2311.08581 • Published • 47
- Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
  Paper • 2312.13913 • Published • 24

- LLM4TS: Two-Stage Fine-Tuning for Time-Series Forecasting with Pre-Trained LLMs
  Paper • 2308.08469 • Published • 3
- TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series
  Paper • 2308.08241 • Published • 3
- Are Large Language Models Temporally Grounded?
  Paper • 2311.08398 • Published • 1
- NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models
  Paper • 2305.07766 • Published • 1