File size: 3,856 Bytes
b260a01 9a8a0bf b260a01 9a8a0bf b260a01 9dd53d3 b260a01 9a8a0bf 3dc0b3d 9a8a0bf |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 |
---
title: Basic STT Transcript Cleanup
emoji: 🎤
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
short_description: Clean up speech-to-text transcripts with AI
---
# Basic STT Transcript Cleanup Tool (Version 3)
A foundational speech-to-text transcript remediation tool that provides purpose-agnostic text cleanup instructions. This is the **daily workhorse** for cleaning up raw speech-to-text transcripts that naturally contain undesirable material.
## Purpose & Philosophy
This tool implements **Version 3** of the Basic Speech-to-Text Cleanup prompt - a carefully crafted system prompt that provides sufficiently deterministic guidance without overstepping into actual content editing. The challenge in developing this prompt was ensuring it cleans up technical artifacts of speech-to-text conversion while preserving the authentic voice and intent of the original speaker.
## Foundational Design
This basic cleanup prompt serves as a **foundation layer** that can be combined with specialized text transformation prompts:
- **Standalone Use**: Perfect for general transcript cleanup
- **Modular Design**: Can be concatenated with purpose-specific prompts from extensive libraries
- **Purpose-Agnostic**: Works across all content types and domains
- **Extensible**: Hundreds of specialized transformation prompts can be layered on top
## Features
- **AI-Powered Cleanup**: Uses OpenAI's GPT models with a refined system prompt
- **BYOK (Bring Your Own Key)**: Secure - uses your own OpenAI API key
- **Copy to Clipboard**: Easy copying of cleaned text
- **Re-run Capability**: Quickly re-process the same text
- **System Prompt Viewer**: Transparent - see exactly how the AI processes your text
- **Deterministic Processing**: Consistent, predictable cleanup results
## How to Use
1. **Enter API Key**: Provide your OpenAI API key (required for processing)
2. **Paste Transcript**: Add your raw speech-to-text transcript
3. **Process**: Click "Clean Up Transcript" to apply remediation
4. **Copy Results**: Use the cleaned output or re-run if needed
## What It Does
The tool applies these **foundational improvements** to your transcripts:
### Core Remediations
- **Removes filler words** (like "um")
- **Adds punctuation, sentence structure, and paragraph spacing**
- **Fixes obvious STT hallucinations and mistranscriptions** (e.g., "McDonuts" → "McDonalds")
- **Removes repetitive or run-on thoughts** that would not be helpful to readers
- **Follows inferred instructions** to omit certain clauses (e.g., "wait .. scratch that from the note")
### What It Preserves
- **All important content** and meaning
- **Original speaker's voice** and intent
- **Factual accuracy** and details
- **Natural flow** of conversation
## Design Principles
1. **Light Touch Editing**: Minimal intervention while maximizing clarity
2. **Content Preservation**: Never removes or alters important information
3. **Deterministic Guidance**: Consistent, predictable results
4. **Purpose Agnostic**: Works across all content domains
5. **Modular Foundation**: Ready for specialized prompt layering
## Extended Ecosystem
This basic cleanup prompt is part of a larger ecosystem:
- **Hundreds of specialized prompts** available in shared libraries
- **Domain-specific transformations** for various use cases
- **Concatenation-ready design** for complex workflows
- **Shared on Hugging Face** and other platforms
## System Prompt
The tool uses a carefully crafted system prompt (Version 3, September 2025) that balances cleanup effectiveness with content preservation. View the complete prompt using the "Show System Prompt" feature in the interface.
## Created By
**[Daniel Rosehill](https://danielrosehill.com)** - Specializing in AI-powered text processing and speech-to-text optimization workflows.
|