metadata
title: ShiblASR-v1.0
emoji: 🎙️
colorFrom: purple
colorTo: gray
sdk: gradio
sdk_version: 5.49.1
app_file: asr_disfluency.py
pinned: false
license: mit
🎙️ ShiblASR v1.0 - Verbatim Disfluent Transcription
Multilingual ASR system with advanced disfluency detection, pause timestamping, and grammar analysis.
Features
- ✅ Multilingual Support - All Whisper-supported languages
- ✅ Disfluency Detection - Fillers, pauses, repetitions
- ✅ Pause Timestamping - Exact timing of silent periods
- ✅ Grammar Analysis - Highlighted grammar issues
- ✅ Offline Processing - No external API calls
- ✅ Export Results - HTML and TXT downloads
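To illustrate what filler and repetition detection can look like on a transcript, here is a minimal sketch. It is not the app's actual implementation: the filler inventory, the `[repetition: ...]` tag, and the `tag_disfluencies` helper are assumptions made for this example, and a real system would need per-language filler lists.

```python
import re

# Hypothetical English filler inventory; the app's real list is not documented here.
FILLERS = ("um", "uh", "er", "erm", "hmm")

_FILLER_RE = re.compile(r"\b(" + "|".join(FILLERS) + r")\b", re.IGNORECASE)
# A word immediately repeated one or more times, e.g. "the the" or "I I I".
_REPEAT_RE = re.compile(r"\b(\w+)(?:\s+\1\b)+", re.IGNORECASE)


def tag_disfluencies(text: str) -> str:
    """Wrap repeated words in [repetition: ...] and fillers in [filler: ...]."""
    # Tag repetitions first so the inserted markup is not re-scanned as a repeat.
    text = _REPEAT_RE.sub(lambda m: f"[repetition: {m.group(0)}]", text)
    return _FILLER_RE.sub(lambda m: f"[filler: {m.group(0)}]", text)


if __name__ == "__main__":
    print(tag_disfluencies("um I I think the the plan works"))
```

Keeping the original words inside the tags preserves the verbatim transcript, so a clean version can still be derived later by dropping the tagged spans.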
Usage
- Record or upload an audio file (WAV, MP3, etc.)
- Click "Start Transcription"
- View results in two tabs:
  - Verbatim: Original transcription with disfluencies highlighted
  - Clean: Grammar-corrected version
- Download results for further analysis
Output Details
- Detected Language: Automatically identified language
- ⏸️ Detected Pauses: Timestamped silent periods (≥3 seconds)
- 🗣️ Verbatim Transcript:
  - [filler] - Detected filler words
  - [pause] - Short pauses (3-5 seconds)
  - [LONG PAUSE] - Extended silences (>5 seconds)
  - grammar - Highlighted grammar issues
- Clean Transcript: Grammar-corrected text
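The pause thresholds listed above can be sketched as a small labelling step over timestamped segments. This is an illustrative sketch only: the `(start, end, text)` segment shape and the `label_pauses` helper are assumptions, and treating a gap of exactly 5 seconds as a short pause is one reading of the "3-5 seconds" range.

```python
from typing import List, Optional, Tuple

SHORT_PAUSE_S = 3.0  # pauses are reported from 3 seconds upward
LONG_PAUSE_S = 5.0   # silences longer than 5 seconds count as long pauses


def label_pauses(segments: List[Tuple[float, float, str]]) -> str:
    """Join (start, end, text) segments, inserting pause tags for the gaps between them."""
    parts: List[str] = []
    prev_end: Optional[float] = None
    for start, end, text in segments:
        if prev_end is not None:
            gap = start - prev_end
            if gap > LONG_PAUSE_S:
                parts.append("[LONG PAUSE]")
            elif gap >= SHORT_PAUSE_S:
                parts.append("[pause]")
        parts.append(text.strip())
        prev_end = end
    return " ".join(parts)


if __name__ == "__main__":
    demo = [(0.0, 1.0, "hello"), (4.5, 5.0, "there"), (11.0, 12.0, "friend")]
    print(label_pauses(demo))
```

Gaps shorter than 3 seconds are left unmarked, which matches the stated reporting threshold.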
Technical Details
- Based on the OpenAI Whisper base model
- PyTorch backend (CPU/GPU compatible)
- LanguageTool for grammar analysis
- LibROSA for audio processing
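As a rough sketch of how these components might fit together, the example below transcribes a file with the Whisper base model and derives silent periods from librosa's non-silent intervals. It is an assumption about the wiring, not the contents of `asr_disfluency.py`: the function names, the 16 kHz sample rate, and the `top_db=30.0` silence threshold are all hypothetical choices.

```python
from typing import Iterable, List, Tuple


def silent_gaps(voiced: Iterable[Tuple[int, int]], sr: int, total_samples: int,
                min_gap_s: float = 3.0) -> List[Tuple[float, float]]:
    """Turn non-silent sample intervals into (start_s, end_s) silences of at least min_gap_s."""
    gaps: List[Tuple[float, float]] = []
    cursor = 0  # sample index where the previous voiced interval ended
    for start, end in voiced:
        start, end = int(start), int(end)
        if (start - cursor) / sr >= min_gap_s:
            gaps.append((cursor / sr, start / sr))
        cursor = end
    # Trailing silence after the last voiced interval.
    if (total_samples - cursor) / sr >= min_gap_s:
        gaps.append((cursor / sr, total_samples / sr))
    return gaps


def transcribe_with_pauses(path: str, top_db: float = 30.0) -> dict:
    """Transcribe an audio file and list its silent periods (hypothetical helper)."""
    # Imported lazily so silent_gaps can be used without the heavy dependencies.
    import librosa
    import whisper

    model = whisper.load_model("base")   # uses the GPU when available, otherwise the CPU
    result = model.transcribe(path)      # language is auto-detected when not specified
    y, sr = librosa.load(path, sr=16000)
    voiced = librosa.effects.split(y, top_db=top_db)  # non-silent [start, end) sample ranges
    return {
        "language": result["language"],
        "text": result["text"],
        "pauses": silent_gaps(voiced, sr, len(y)),
    }
```

Calling `transcribe_with_pauses("sample.wav")` (a placeholder file name) would return the detected language, the raw text, and a list of pause intervals in seconds. The `top_db` value strongly affects what counts as silence and would need tuning per recording setup.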