google/pix2struct-ai2d-base Visual Question Answering β’ 0.3B β’ Updated Dec 24, 2023 β’ 1.63k β’ 43
impira/layoutlm-invoices Document Question Answering β’ 0.1B β’ Updated Mar 25, 2023 β’ 1.69k β’ 224
pyannote/speaker-diarization Automatic Speech Recognition β’ Updated May 10, 2024 β’ 763k β’ 1.25k
facebook/wav2vec2-large-960h-lv60-self Automatic Speech Recognition β’ Updated May 23, 2022 β’ 94.3k β’ 160