view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... Jan 20, 2025 • 75
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 189