---
title: Diarized Speaker Segments Community-1
emoji: 🎙️
colorFrom: green
colorTo: blue
sdk: docker
app_port: 7860
suggested_hardware: t4-small
---

# Diarized Speaker Segments Community-1

This Space uses **repo-style transcription logic** from your attached codebase plus **pyannote/speaker-diarization-community-1**.

## What changed

- transcription now uses the attached repo's logic:
  - custom speech-window detection
  - ffmpeg audio enhancement
  - repo-style Hindi/Hinglish/English prompt
  - repo thresholds and dedupe flow
  - repo segment splitting by words
- diarization remains `community-1`
- cleanup remains exactly:
  - merge **only adjacent same-speaker** segments
  - otherwise do not touch

## Notes

- default ASR model is **medium**
- `large-v3` is available as a dropdown for evaluation
- default language is **hi** because the attached repo logic is Hindi-biased for Hinglish handling
- you can still test `auto` and `en`
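
The cleanup rule listed under "What changed" (merge only adjacent same-speaker segments, otherwise do not touch) can be sketched roughly as below. This is an illustrative sketch, not the Space's actual code: the `Segment` dataclass, its field names, and the `merge_adjacent_same_speaker` function are hypothetical, and it assumes segments arrive sorted by start time and that merged text is joined with a single space.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    # Hypothetical segment shape; the real app may use dicts or other fields.
    speaker: str
    start: float
    end: float
    text: str


def merge_adjacent_same_speaker(segments: List[Segment]) -> List[Segment]:
    """Merge a segment into its predecessor only when both share a speaker.

    Assumes `segments` is already sorted by start time. Segments from
    different speakers are copied through unchanged, and non-adjacent
    segments from the same speaker are never combined.
    """
    merged: List[Segment] = []
    for seg in segments:
        if merged and merged[-1].speaker == seg.speaker:
            prev = merged[-1]
            merged[-1] = Segment(
                speaker=prev.speaker,
                start=prev.start,
                end=max(prev.end, seg.end),  # extend to cover both segments
                text=f"{prev.text} {seg.text}".strip(),
            )
        else:
            # Different speaker (or first segment): keep as-is.
            merged.append(Segment(seg.speaker, seg.start, seg.end, seg.text))
    return merged


if __name__ == "__main__":
    demo = [
        Segment("SPEAKER_00", 0.0, 1.0, "namaste"),
        Segment("SPEAKER_00", 1.0, 2.5, "how are you"),
        Segment("SPEAKER_01", 2.5, 3.0, "fine"),
        Segment("SPEAKER_00", 3.0, 4.0, "ok"),
    ]
    for s in merge_adjacent_same_speaker(demo):
        print(s.speaker, s.start, s.end, s.text)
```

In this sketch the two leading `SPEAKER_00` segments collapse into one, while the final `SPEAKER_00` segment stays separate because a `SPEAKER_01` turn sits between them.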