Casimiro Ferreira
Jarbas
11 followers · 48 following
https://tigregotico.pt
JarbasAl
casimiro-ferreira-953783151
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
yuriyvnv/WAVe-1B-Multimodal-NL
reacted to yuriyvnv's post with 🔥 3 days ago
📄 The WAVe paper is officially out in the Information Sciences Journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation.

Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data.

📦 Resources
- Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220
- PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT
- NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL
- Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering
- Code: https://github.com/yuriyvnv/WAVe

If you train ASR on synthetic or back-translated data, I would like to see WAVe benchmarked on other languages.

@reach-vb @ylacombe @hf-audio @BramVanroy

#speech #asr #multimodal #syntheticdata #lowresource
liked a dataset 3 days ago
apptek-com/apptek_callcenter_dialogues
Organizations
Jarbas's datasets (31)
Jarbas/ovos-tts-bench
Updated Sep 30, 2024 • 15