ShiblASR-v1.0 / README.md
shibly100's picture
Update README.md
0babf14 verified

A newer version of the Gradio SDK is available: 6.14.0

Upgrade
metadata
title: ShiblASR-v1.0
emoji: πŸŽ™οΈ
colorFrom: purple
colorTo: gray
sdk: gradio
sdk_version: 5.49.1
app_file: asr_disfluency.py
pinned: false
license: mit

πŸŽ™οΈ ShiblASR v1.0 β€” Verbatim Disfluent Transcription

Multilingual ASR system with advanced disfluency detection, pause timestamping, and grammar analysis.

Features

βœ… Multilingual Support - All Whisper-supported languages
βœ… Disfluency Detection - Fillers, pauses, repetitions
βœ… Pause Timestamping - Exact timing of silent periods
βœ… Grammar Analysis - Highlighted grammar issues
βœ… Offline Processing - No external API calls
βœ… Export Results - HTML and TXT downloads

Usage

  1. Record or upload audio file (WAV, MP3, etc.)
  2. Click "Start Transcription"
  3. View results in two tabs:
    • Verbatim: Original transcription with disfluencies highlighted
    • Clean: Grammar-corrected version
  4. Download results for further analysis

Output Details

  • 🌐 Detected Language: Automatically identified language
  • ⏸️ Detected Pauses: Timestamped silence periods (β‰₯3 seconds)
  • πŸ—£οΈ Verbatim Transcript:
    • [filler] - Detected filler words
    • [pause] - Short pauses (3-5 seconds)
    • [LONG PAUSE] - Extended silences (>5 seconds)
    • grammar - Highlighted grammar issues
  • 🧠 Clean Transcript: Grammar-corrected text

Technical Details

  • Based on OpenAI Whisper Base model
  • PyTorch backend (CPU/GPU compatible)
  • LanguageTool for grammar analysis
  • LibROSA for audio processing