view article Article Introducing Cohere-transcribe: state-of-the-art speech recognition 3 days ago • 25
IndicConformer Collection A collection of ASR models for 22 scheduled languages of India • 23 items • Updated 27 days ago • 29
OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 8 items • Updated May 3, 2025 • 7
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 317
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 156
impira/layoutlm-document-qa Document Question Answering • 0.1B • Updated Mar 18, 2023 • 18.6k • 1.17k
google/pix2struct-docvqa-base Visual Question Answering • 0.3B • Updated Dec 24, 2023 • 2.87k • 44