AI & ML interests

None defined yet.

Recent Activity

Articles

model-dataset 
published an article 14 days ago
view article
Article

VoxCeleb Dataset: Real-World Speech for Speaker Recognition

model-dataset 
published an article about 1 month ago
view article
Article

Transformer-Based SSL Embeddings vs ECAPA-TDNN for Speaker Recognition

model-dataset 
published an article about 1 month ago
view article
Article

ECAPA vs X-Vector in Speaker Recognition: Comparing SpeechBrain’s spkrec-ecapa-voxceleb and spkrec-xvect-voxceleb

model-dataset 
published an article about 1 month ago
view article
Article

Synthesized vs Cloned Voices: A Comprehensive Comparison

model-dataset 
published an article about 1 month ago
view article
Article

Cloned Voice vs Deepfake Voice: What’s the Real Difference?

model-dataset 
published an article about 1 month ago
view article
Article

Building a Voice Authenticator with EnCodec Tokens (Bark-Style Voiceprints)

model-dataset 
published an article about 1 month ago
view article
Article

XTTS v2 vs YourTTS: A Comprehensive Voice Cloning Comparison

model-dataset 
published an article about 1 month ago
view article
Article

MelGAN, HiFi-GAN, PWG, and WaveGlow: What They Actually Do (and What They Don’t)

model-dataset 
published an article about 2 months ago
view article
Article

Male vs Female Voice Classification with Hugging Face Audio Pipelines

•
1
model-dataset 
published an article about 2 months ago
view article
Article

Speech vs Noise Classification with AST

•
1
model-dataset 
published an article about 2 months ago
view article
Article

Minimal Example for English Text-to-Speech with VITS: Female and Male Voices

model-dataset 
published an article about 2 months ago
view article
Article

Minimal, Practical Guide for Multilingual Voice Cloning with XTTS v2

model-dataset 
published an article about 2 months ago
view article
Article

Minimal Image Captioning with BLIP on Hugging Face

model-dataset 
published an article about 2 months ago
view article
Article

Simple Speaker Diarization with SpeechBrain X-Vectors

model-dataset 
published an article about 2 months ago
view article
Article

Types of Voice Deepfakes: Techniques, Tools, and Open-Source Methods

model-dataset 
published an article about 2 months ago
view article
Article

Faster-Whisper vs. NVIDIA Canary-Qwen-2.5B: Which One Should You Use for Speech-to-Text?