--- title: Basic STT Transcript Cleanup emoji: 🎤 colorFrom: blue colorTo: green sdk: gradio sdk_version: 4.44.0 app_file: app.py pinned: false short_description: Clean up speech-to-text transcripts with AI --- # Basic STT Transcript Cleanup Tool (Version 3) A foundational speech-to-text transcript remediation tool that provides purpose-agnostic text cleanup instructions. This is the **daily workhorse** for cleaning up raw speech-to-text transcripts that naturally contain undesirable material. ## Purpose & Philosophy This tool implements **Version 3** of the Basic Speech-to-Text Cleanup prompt - a carefully crafted system prompt that provides sufficiently deterministic guidance without overstepping into actual content editing. The challenge in developing this prompt was ensuring it cleans up technical artifacts of speech-to-text conversion while preserving the authentic voice and intent of the original speaker. ## Foundational Design This basic cleanup prompt serves as a **foundation layer** that can be combined with specialized text transformation prompts: - **Standalone Use**: Perfect for general transcript cleanup - **Modular Design**: Can be concatenated with purpose-specific prompts from extensive libraries - **Purpose-Agnostic**: Works across all content types and domains - **Extensible**: Hundreds of specialized transformation prompts can be layered on top ## Features - **AI-Powered Cleanup**: Uses OpenAI's GPT models with a refined system prompt - **BYOK (Bring Your Own Key)**: Secure - uses your own OpenAI API key - **Copy to Clipboard**: Easy copying of cleaned text - **Re-run Capability**: Quickly re-process the same text - **System Prompt Viewer**: Transparent - see exactly how the AI processes your text - **Deterministic Processing**: Consistent, predictable cleanup results ## How to Use 1. **Enter API Key**: Provide your OpenAI API key (required for processing) 2. **Paste Transcript**: Add your raw speech-to-text transcript 3. **Process**: Click "Clean Up Transcript" to apply remediation 4. **Copy Results**: Use the cleaned output or re-run if needed ## What It Does The tool applies these **foundational improvements** to your transcripts: ### Core Remediations - **Removes filler words** (like "um") - **Adds punctuation, sentence structure, and paragraph spacing** - **Fixes obvious STT hallucinations and mistranscriptions** (e.g., "McDonuts" → "McDonalds") - **Removes repetitive or run-on thoughts** that would not be helpful to readers - **Follows inferred instructions** to omit certain clauses (e.g., "wait .. scratch that from the note") ### What It Preserves - **All important content** and meaning - **Original speaker's voice** and intent - **Factual accuracy** and details - **Natural flow** of conversation ## Design Principles 1. **Light Touch Editing**: Minimal intervention while maximizing clarity 2. **Content Preservation**: Never removes or alters important information 3. **Deterministic Guidance**: Consistent, predictable results 4. **Purpose Agnostic**: Works across all content domains 5. **Modular Foundation**: Ready for specialized prompt layering ## Extended Ecosystem This basic cleanup prompt is part of a larger ecosystem: - **Hundreds of specialized prompts** available in shared libraries - **Domain-specific transformations** for various use cases - **Concatenation-ready design** for complex workflows - **Shared on Hugging Face** and other platforms ## System Prompt The tool uses a carefully crafted system prompt (Version 3, September 2025) that balances cleanup effectiveness with content preservation. View the complete prompt using the "Show System Prompt" feature in the interface. ## Created By **[Daniel Rosehill](https://danielrosehill.com)** - Specializing in AI-powered text processing and speech-to-text optimization workflows.