Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
57.9
TFLOPS
3
1
92
Casimiro Ferreira
Jarbas
Follow
shtefcs's profile picture
webxos's profile picture
Lmagoncalo's profile picture
11 followers
·
48 following
https://tigregotico.pt
JarbasAl
casimiro-ferreira-953783151
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 hour ago
yuriyvnv/WAVe-1B-Multimodal-NL
reacted
to
yuriyvnv
's
post
with 🔥
about 1 hour ago
📄 The WAVe paper is officially out in the Information Sciences Journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation. Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data. 📦 Resources - Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220 - PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT - NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL - Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering - Code: https://github.com/yuriyvnv/WAVe If you train ASR on synthetic or back-translated data, would like to see WAVe benchmarked on other languages. @reach-vb @ylacombe @hf-audio @BramVanroy #speech #asr #multimodal #syntheticdata #lowresource
liked
a dataset
about 1 hour ago
apptek-com/apptek_callcenter_dialogues
View all activity
Organizations
Jarbas
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
collection
about 1 year ago
Intent Classification Datasets
Collection
21 items
•
Updated
13 days ago
•
2