Collections
Discover the best community collections!
Collections trending this week
-
FMViT: A multiple-frequency mixing Vision Transformer
Paper • 2311.05707 • Published • 7 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 122 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
-
FMViT: A multiple-frequency mixing Vision Transformer
Paper • 2311.05707 • Published • 7 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 122 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90