|
|
--- |
|
|
title: Basic STT Transcript Cleanup |
|
|
emoji: π€ |
|
|
colorFrom: blue |
|
|
colorTo: green |
|
|
sdk: gradio |
|
|
sdk_version: 4.44.0 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
short_description: Clean up speech-to-text transcripts with AI |
|
|
--- |
|
|
|
|
|
# Basic STT Transcript Cleanup Tool (Version 3) |
|
|
|
|
|
A foundational speech-to-text transcript remediation tool that provides purpose-agnostic text cleanup instructions. This is the **daily workhorse** for cleaning up raw speech-to-text transcripts that naturally contain undesirable material. |
|
|
|
|
|
## Purpose & Philosophy |
|
|
|
|
|
This tool implements **Version 3** of the Basic Speech-to-Text Cleanup prompt - a carefully crafted system prompt that provides sufficiently deterministic guidance without overstepping into actual content editing. The challenge in developing this prompt was ensuring it cleans up technical artifacts of speech-to-text conversion while preserving the authentic voice and intent of the original speaker. |
|
|
|
|
|
## Foundational Design |
|
|
|
|
|
This basic cleanup prompt serves as a **foundation layer** that can be combined with specialized text transformation prompts: |
|
|
|
|
|
- **Standalone Use**: Perfect for general transcript cleanup |
|
|
- **Modular Design**: Can be concatenated with purpose-specific prompts from extensive libraries |
|
|
- **Purpose-Agnostic**: Works across all content types and domains |
|
|
- **Extensible**: Hundreds of specialized transformation prompts can be layered on top |
|
|
|
|
|
## Features |
|
|
|
|
|
- **AI-Powered Cleanup**: Uses OpenAI's GPT models with a refined system prompt |
|
|
- **BYOK (Bring Your Own Key)**: Secure - uses your own OpenAI API key |
|
|
- **Copy to Clipboard**: Easy copying of cleaned text |
|
|
- **Re-run Capability**: Quickly re-process the same text |
|
|
- **System Prompt Viewer**: Transparent - see exactly how the AI processes your text |
|
|
- **Deterministic Processing**: Consistent, predictable cleanup results |
|
|
|
|
|
## How to Use |
|
|
|
|
|
1. **Enter API Key**: Provide your OpenAI API key (required for processing) |
|
|
2. **Paste Transcript**: Add your raw speech-to-text transcript |
|
|
3. **Process**: Click "Clean Up Transcript" to apply remediation |
|
|
4. **Copy Results**: Use the cleaned output or re-run if needed |
|
|
|
|
|
## What It Does |
|
|
|
|
|
The tool applies these **foundational improvements** to your transcripts: |
|
|
|
|
|
### Core Remediations |
|
|
- **Removes filler words** (like "um") |
|
|
- **Adds punctuation, sentence structure, and paragraph spacing** |
|
|
- **Fixes obvious STT hallucinations and mistranscriptions** (e.g., "McDonuts" β "McDonalds") |
|
|
- **Removes repetitive or run-on thoughts** that would not be helpful to readers |
|
|
- **Follows inferred instructions** to omit certain clauses (e.g., "wait .. scratch that from the note") |
|
|
|
|
|
### What It Preserves |
|
|
- **All important content** and meaning |
|
|
- **Original speaker's voice** and intent |
|
|
- **Factual accuracy** and details |
|
|
- **Natural flow** of conversation |
|
|
|
|
|
## Design Principles |
|
|
|
|
|
1. **Light Touch Editing**: Minimal intervention while maximizing clarity |
|
|
2. **Content Preservation**: Never removes or alters important information |
|
|
3. **Deterministic Guidance**: Consistent, predictable results |
|
|
4. **Purpose Agnostic**: Works across all content domains |
|
|
5. **Modular Foundation**: Ready for specialized prompt layering |
|
|
|
|
|
## Extended Ecosystem |
|
|
|
|
|
This basic cleanup prompt is part of a larger ecosystem: |
|
|
- **Hundreds of specialized prompts** available in shared libraries |
|
|
- **Domain-specific transformations** for various use cases |
|
|
- **Concatenation-ready design** for complex workflows |
|
|
- **Shared on Hugging Face** and other platforms |
|
|
|
|
|
## System Prompt |
|
|
|
|
|
The tool uses a carefully crafted system prompt (Version 3, September 2025) that balances cleanup effectiveness with content preservation. View the complete prompt using the "Show System Prompt" feature in the interface. |
|
|
|
|
|
## Created By |
|
|
|
|
|
**[Daniel Rosehill](https://danielrosehill.com)** - Specializing in AI-powered text processing and speech-to-text optimization workflows. |
|
|
|