57.9 TFLOPS
Casimiro Ferreira
Jarbas
11 followers · 48 following
https://tigregotico.pt
JarbasAl
casimiro-ferreira-953783151
AI & ML interests
None yet
Recent Activity
liked a model about 18 hours ago: yuriyvnv/WAVe-1B-Multimodal-NL
reacted to yuriyvnv's post with 🔥 about 18 hours ago:
The WAVe paper is officially out in the Information Sciences Journal. You saw the PT and NL model releases earlier this year. This is the peer-reviewed paper behind them, with the full method, ablations, and downstream ASR evaluation.

Quick recap: WAVe is a 1B multimodal embedding model that filters synthetic speech at the word level, not the sentence level. On Portuguese ASR it cuts training steps by 34%, improves cross-domain generalization by 50%, and matches WER with 30% less synthetic data.

Resources
- Paper: https://www.sciencedirect.com/science/article/pii/S0020025526005220
- PT model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT
- NL model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL
- Collection: https://huggingface.co/collections/yuriyvnv/multi-modal-embeddings-for-synthetic-transcript-filtering
- Code: https://github.com/yuriyvnv/WAVe

If you train ASR on synthetic or back-translated data, I would like to see WAVe benchmarked on other languages.

@reach-vb @ylacombe @hf-audio @BramVanroy #speech #asr #multimodal #syntheticdata #lowresource
liked a dataset about 19 hours ago: apptek-com/apptek_callcenter_dialogues
Organizations
Jarbas's models (126), sorted by recently updated
Jarbas/m2v-256-multilingual-e5-base • Updated May 20, 2025 • 11
Jarbas/m2v-256-multilingual-e5-small • Updated May 20, 2025 • 50
Jarbas/m2v-256-gervasio-7b-portuguese-ptpt-decoder • Updated May 20, 2025 • 16
Jarbas/m2v-256-serafim-335m-portuguese-pt-sentence-encoder-ir • Updated May 20, 2025 • 8
Jarbas/m2v-256-serafim-335m-portuguese-pt-sentence-encoder • Updated May 20, 2025 • 8
Jarbas/m2v-256-serafim-100m-portuguese-pt-sentence-encoder-ir • Updated May 20, 2025 • 8
Jarbas/m2v-256-serafim-100m-portuguese-pt-sentence-encoder • Updated May 20, 2025 • 9
Jarbas/m2v-256-albertina-100m-portuguese-ptbr-encoder • Updated May 20, 2025 • 10
Jarbas/m2v-256-albertina-100m-portuguese-ptpt-encoder • Updated May 20, 2025 • 13
Jarbas/m2v-256-bert-base-cased-squad-v1.1-portuguese • Updated May 20, 2025 • 8
Jarbas/m2v-256-bert-large-portuguese-cased • Updated May 20, 2025 • 7
Jarbas/m2v-256-bert-base-portuguese-cased • Updated May 20, 2025 • 16
Jarbas/m2v-256-roberta-base-ca-v2-cased-qa • Updated May 20, 2025 • 21
Jarbas/m2v-256-roberta-base-ca-cased-sts • Updated May 20, 2025 • 59
Jarbas/m2v-256-distilroberta-base-ca-v2 • Updated May 20, 2025 • 8
Jarbas/m2v-256-roberta-large-ca-v2 • Updated May 20, 2025 • 8
Jarbas/m2v-256-roberta-base-ca-v2-cased-te • Updated May 20, 2025 • 27
Jarbas/m2v-256-roberta-base-ca-v2-cawikitc • Updated May 20, 2025 • 26
Jarbas/m2v-256-roberta-large-ca-paraphrase • Updated May 20, 2025 • 10
Jarbas/m2v-256-roberta-large-ca-v2-massive • Updated May 20, 2025 • 8
Jarbas/m2v-256-roberta-base-ca-v2-massive • Updated May 20, 2025 • 27
Jarbas/m2v-256-roberta-base-ca-v2-cased-tc • Updated May 20, 2025 • 60
Jarbas/m2v-256-bert-base-multilingual-cased • Updated May 20, 2025 • 11
Jarbas/m2v-256-bertinho-gl-small-cased • Updated May 20, 2025 • 8 • 1
Jarbas/m2v-256-bertinho-gl-base-cased • Updated May 20, 2025 • 16 • 1
Jarbas/m2v-256-LaBSE • Updated May 14, 2025 • 13
Jarbas/m2v-256-distiluse-base-multilingual-cased-v2 • Updated May 14, 2025 • 16
Jarbas/m2v-256-paraphrase-multilingual-mpnet-base-v2 • Updated May 14, 2025 • 11 • 1
Jarbas/m2v-256-paraphrase-multilingual-MiniLM-L12-v2 • Updated May 14, 2025 • 16
Jarbas/stt_es_citrinet_512_onnx • Updated Jan 15, 2025