Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
90.5
TFLOPS
100
22
168
Bram Vanroy
PRO
BramVanroy
Follow
stefan-it's profile picture
sasha's profile picture
sytse06's profile picture
243 followers
·
167 following
https://bramvanroy.github.io/
BramVanroy
BramVanroy
bramvanroy
bramvanroy.bsky.social
AI & ML interests
Artificial intelligence, natural language processing, computational linguistics
Recent Activity
reacted
to
yuriyvnv
's
post
with 🚀
4 days ago
🎯 WAVe-1B-Multimodal-NL: Word-Level Speech Quality Assessment for Dutch Following the release of the Portuguese model, we're releasing the Dutch variant of WAVe — a 1B multimodal embedding model that assesses synthetic speech quality at the word level, thereby improving the quality of synthetically augmented datasets for training ASR models. Trained on CommonVoice 16.1 Dutch with 5 corruption strategies, this model catches mispronunciations, timing errors, and prosody issues in synthetic data that sentence-level embeddings miss entirely. Resources - Dutch model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL - Portuguese model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT - Code: https://github.com/yuriyvnv/WAVe This model builds on CommonVoice Dutch data — thanks to @mozilla and the CommonVoice community for making multilingual speech data accessible. Would be great to hear from the Dutch NLP community — @BramVanroy @GroNLP — especially if you're working on Dutch ASR or TTS pipelines where quality filtering could help. Also tagging @hf-audio as this sits at the intersection of speech processing and data curation.
liked
a model
23 days ago
hexgrad/Kokoro-82M
liked
a Space
2 months ago
antalvdb/olifant-explainability-demo
View all activity
Organizations
BramVanroy
's datasets
45
Sort: Recently updated
BramVanroy/finewiki-nl-30-to-24k-tokens
Viewer
•
Updated
Dec 18, 2025
•
821k
•
65
•
1
BramVanroy/finemath-4plus-seqlen36k
Viewer
•
Updated
Dec 5, 2025
•
2.85M
•
113
•
1
BramVanroy/synthetic-uner-ner-200-Qwen3-14B-AWQ
Viewer
•
Updated
Nov 26, 2025
•
200
•
32
BramVanroy/synthetic-uner-ner
Viewer
•
Updated
Nov 25, 2025
•
61.5k
•
75
BramVanroy/synthetic-uner-ner-20000-Qwen3-14B-AWQ
Viewer
•
Updated
Nov 24, 2025
•
20k
•
42
BramVanroy/universal_ner
Viewer
•
Updated
Nov 24, 2025
•
77.9k
•
339
BramVanroy/conll2002
Viewer
•
Updated
Nov 14, 2025
•
35.7k
•
62
BramVanroy/conll2003
Viewer
•
Updated
Nov 14, 2025
•
20.7k
•
308
•
1
BramVanroy/dutch-edu-classifier-training-v3
Viewer
•
Updated
Sep 4, 2025
•
274k
•
24
BramVanroy/CommonCrawl-CreativeCommons
Viewer
•
Updated
Aug 28, 2025
•
739M
•
957
•
34
BramVanroy/CommonCrawl-CreativeCommons-fine
Viewer
•
Updated
Aug 28, 2025
•
75.1M
•
355
•
2
BramVanroy/CommonCrawl-CreativeCommons-strict
Viewer
•
Updated
Aug 28, 2025
•
32.8M
•
94
•
1
BramVanroy/dutch-edu-classifier-training-v2
Viewer
•
Updated
Aug 18, 2025
•
500k
•
40
BramVanroy/dutch-edu-classifier-training
Viewer
•
Updated
Aug 14, 2025
•
744k
•
30
BramVanroy/fineweb-duckdbs
Updated
May 15, 2025
•
1.67k
•
1
BramVanroy/fineweb-2-duckdbs
Updated
Apr 28, 2025
•
4.58k
BramVanroy/fw2-nl-qwen2_5-72b-50k-merged-split
Viewer
•
Updated
Apr 25, 2025
•
95.7k
•
7
•
1
BramVanroy/belebele_dutch
Viewer
•
Updated
Apr 25, 2025
•
1.8k
•
18
BramVanroy/finewebs-copyright-domains
Viewer
•
Updated
Mar 26, 2025
•
361
•
8
•
1
BramVanroy/WildChat-1M-filtered-gpt-4
Viewer
•
Updated
Feb 17, 2025
•
136k
•
12
BramVanroy/fw2-nl-rm-qwen2_5-72b-50k
Viewer
•
Updated
Jan 20, 2025
•
50k
•
7
BramVanroy/fw2-nl-qwen2_5-72b-50k
Viewer
•
Updated
Jan 20, 2025
•
50k
•
9
BramVanroy/wikipedia_culturax_dutch
Viewer
•
Updated
Dec 23, 2024
•
1.3B
•
3.85k
•
6
BramVanroy/ultra_feedback_dutch
Viewer
•
Updated
Dec 6, 2024
•
53.6k
•
121
•
3
BramVanroy/no_robots_dutch
Viewer
•
Updated
Dec 6, 2024
•
8.61k
•
65
•
2
BramVanroy/ultra_feedback_dutch_cleaned
Viewer
•
Updated
Dec 6, 2024
•
183k
•
141
•
6
BramVanroy/orca_dpo_pairs_dutch_cleaned
Viewer
•
Updated
Dec 6, 2024
•
31.6k
•
83
•
3
BramVanroy/orca_dpo_pairs_dutch
Viewer
•
Updated
Dec 6, 2024
•
11k
•
45
•
6
BramVanroy/ultrachat_200k_dutch
Viewer
•
Updated
Dec 6, 2024
•
214k
•
97
•
8
BramVanroy/lmsys-20240814-nl
Viewer
•
Updated
Oct 21, 2024
•
2.75k
•
10
Previous
1
2
Next