---
title: Diarized Speaker Segments Community-1
emoji: 🎙️
colorFrom: green
colorTo: blue
sdk: docker
app_port: 7860
suggested_hardware: t4-small
---

# Diarized Speaker Segments Community-1

This Space combines the **repo-style transcription logic** from the attached codebase with **pyannote/speaker-diarization-community-1** for speaker diarization.

## What changed
- transcription now uses the attached repo's logic:
  - custom speech-window detection
  - ffmpeg audio enhancement
  - repo-style Hindi/Hinglish/English prompt
  - repo thresholds and dedupe flow
  - repo segment splitting by words
- diarization remains `community-1`
- cleanup remains exactly this:
  - merge **only adjacent same-speaker** segments
  - leave everything else untouched
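
The cleanup rule above can be sketched as follows. This is a minimal illustration, not the Space's actual code: the `Segment` dataclass, its field names, and the `merge_adjacent_same_speaker` helper are hypothetical, and joining text with a single space is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    # Hypothetical segment shape; the real app may use different fields.
    speaker: str  # diarization label, e.g. "SPEAKER_00" (assumed format)
    start: float  # seconds
    end: float    # seconds
    text: str


def merge_adjacent_same_speaker(segments: List[Segment]) -> List[Segment]:
    """Merge runs of consecutive segments that share a speaker label.

    Segments from the same speaker that are separated by another
    speaker's turn are left as they are.
    """
    merged: List[Segment] = []
    for seg in segments:
        if merged and merged[-1].speaker == seg.speaker:
            prev = merged[-1]
            # Replace the previous entry with a combined segment.
            merged[-1] = Segment(
                speaker=prev.speaker,
                start=prev.start,
                end=max(prev.end, seg.end),
                text=f"{prev.text} {seg.text}".strip(),
            )
        else:
            merged.append(seg)
    return merged


example = [
    Segment("SPEAKER_00", 0.0, 1.5, "namaste"),
    Segment("SPEAKER_00", 1.5, 3.0, "how are you"),
    Segment("SPEAKER_01", 3.0, 4.0, "fine"),
    Segment("SPEAKER_00", 4.0, 5.0, "good"),
]
cleaned = merge_adjacent_same_speaker(example)
```

Here the first two `SPEAKER_00` segments collapse into one, while the final `SPEAKER_00` segment stays separate because a `SPEAKER_01` turn sits between them.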

## Notes
- default ASR model is **medium**
- `large-v3` is available in the model dropdown for evaluation
- default language is **hi** because the attached repo logic is Hindi-biased for Hinglish handling
- you can still test `auto` and `en`