-
Open ASR Leaderboard configuration for Transformers š¤ models
šNormalize text and evaluate its quality with provided scripts
-
Open ASR Leaderboard configuration for NVIDIA NeMo ASR models
šNormalize text to a consistent, clean format
-
Open ASR Leaderboard configuration for Boson's Higgs Audio v3
šNormalize and clean text data for analysis
-
Open ASR Leaderboard configuration for API models
šRun model evaluation and get performance metrics
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
View all PapersA collection of ASR models supported in š¤ Transformers
-
openai/whisper-large-v2
Automatic Speech Recognition ⢠2B ⢠Updated ⢠101k ⢠1.8k -
facebook/wav2vec2-base-960h
Automatic Speech Recognition ⢠94.4M ⢠Updated ⢠1.32M ⢠398 -
facebook/wav2vec2-large-xlsr-53
Updated ⢠293k ⢠160 -
facebook/hubert-xlarge-ls960-ft
Automatic Speech Recognition ⢠1.0B ⢠Updated ⢠13.4k ⢠16
A collection of audio classification models supported in š¤ Transformers
A collection of codec and embedding models supported in š¤ Transformers.
-
laion/clap-htsat-unfused
Feature Extraction ⢠Updated ⢠461k ⢠⢠76 -
facebook/encodec_32khz
Feature Extraction ⢠59M ⢠Updated ⢠40.9k ⢠18 -
descript/dac_44khz
Feature Extraction ⢠76.6M ⢠Updated ⢠131k ⢠⢠11 -
descript/dac_24khz
Feature Extraction ⢠74.7M ⢠Updated ⢠4.17k ⢠⢠3
Transformer supported versions of X-Codec models: https://github.com/zhenye234/xcodec?tab=readme-ov-file#available-models
-
hf-audio/xcodec-hubert-general-balanced
Feature Extraction ⢠0.2B ⢠Updated ⢠1.17k ⢠1 -
hf-audio/xcodec-wavlm-more-data
Feature Extraction ⢠0.2B ⢠Updated ⢠2.09k ⢠1 -
hf-audio/xcodec-wavlm-mls
Feature Extraction ⢠0.2B ⢠Updated ⢠1.25k -
hf-audio/xcodec-hubert-general
Feature Extraction ⢠0.2B ⢠Updated ⢠4.68k
A collection of TTS models supported in š¤ Transformers.
A collection of music generation models supported in š¤ Transformers and š§Ø Diffusers
-
Open ASR Leaderboard configuration for Transformers š¤ models
šNormalize text and evaluate its quality with provided scripts
-
Open ASR Leaderboard configuration for NVIDIA NeMo ASR models
šNormalize text to a consistent, clean format
-
Open ASR Leaderboard configuration for Boson's Higgs Audio v3
šNormalize and clean text data for analysis
-
Open ASR Leaderboard configuration for API models
šRun model evaluation and get performance metrics
Transformer supported versions of X-Codec models: https://github.com/zhenye234/xcodec?tab=readme-ov-file#available-models
-
hf-audio/xcodec-hubert-general-balanced
Feature Extraction ⢠0.2B ⢠Updated ⢠1.17k ⢠1 -
hf-audio/xcodec-wavlm-more-data
Feature Extraction ⢠0.2B ⢠Updated ⢠2.09k ⢠1 -
hf-audio/xcodec-wavlm-mls
Feature Extraction ⢠0.2B ⢠Updated ⢠1.25k -
hf-audio/xcodec-hubert-general
Feature Extraction ⢠0.2B ⢠Updated ⢠4.68k
A collection of ASR models supported in š¤ Transformers
-
openai/whisper-large-v2
Automatic Speech Recognition ⢠2B ⢠Updated ⢠101k ⢠1.8k -
facebook/wav2vec2-base-960h
Automatic Speech Recognition ⢠94.4M ⢠Updated ⢠1.32M ⢠398 -
facebook/wav2vec2-large-xlsr-53
Updated ⢠293k ⢠160 -
facebook/hubert-xlarge-ls960-ft
Automatic Speech Recognition ⢠1.0B ⢠Updated ⢠13.4k ⢠16
A collection of TTS models supported in š¤ Transformers.
A collection of audio classification models supported in š¤ Transformers
A collection of music generation models supported in š¤ Transformers and š§Ø Diffusers
A collection of codec and embedding models supported in š¤ Transformers.
-
laion/clap-htsat-unfused
Feature Extraction ⢠Updated ⢠461k ⢠⢠76 -
facebook/encodec_32khz
Feature Extraction ⢠59M ⢠Updated ⢠40.9k ⢠18 -
descript/dac_44khz
Feature Extraction ⢠76.6M ⢠Updated ⢠131k ⢠⢠11 -
descript/dac_24khz
Feature Extraction ⢠74.7M ⢠Updated ⢠4.17k ⢠⢠3