File size: 3,856 Bytes
b260a01
 
9a8a0bf
 
 
b260a01
9a8a0bf
b260a01
 
9dd53d3
b260a01
 
9a8a0bf
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3dc0b3d
 
 
 
 
9a8a0bf
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
---
title: Basic STT Transcript Cleanup
emoji: 🎤
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
short_description: Clean up speech-to-text transcripts with AI
---

# Basic STT Transcript Cleanup Tool (Version 3)

A foundational speech-to-text transcript remediation tool that provides purpose-agnostic text cleanup instructions. This is the **daily workhorse** for cleaning up raw speech-to-text transcripts that naturally contain undesirable material.

## Purpose & Philosophy

This tool implements **Version 3** of the Basic Speech-to-Text Cleanup prompt - a carefully crafted system prompt that provides sufficiently deterministic guidance without overstepping into actual content editing. The challenge in developing this prompt was ensuring it cleans up technical artifacts of speech-to-text conversion while preserving the authentic voice and intent of the original speaker.

## Foundational Design

This basic cleanup prompt serves as a **foundation layer** that can be combined with specialized text transformation prompts:

- **Standalone Use**: Perfect for general transcript cleanup
- **Modular Design**: Can be concatenated with purpose-specific prompts from extensive libraries
- **Purpose-Agnostic**: Works across all content types and domains
- **Extensible**: Hundreds of specialized transformation prompts can be layered on top

## Features

- **AI-Powered Cleanup**: Uses OpenAI's GPT models with a refined system prompt
- **BYOK (Bring Your Own Key)**: Secure - uses your own OpenAI API key
- **Copy to Clipboard**: Easy copying of cleaned text
- **Re-run Capability**: Quickly re-process the same text
- **System Prompt Viewer**: Transparent - see exactly how the AI processes your text
- **Deterministic Processing**: Consistent, predictable cleanup results

## How to Use

1. **Enter API Key**: Provide your OpenAI API key (required for processing)
2. **Paste Transcript**: Add your raw speech-to-text transcript
3. **Process**: Click "Clean Up Transcript" to apply remediation
4. **Copy Results**: Use the cleaned output or re-run if needed

## What It Does

The tool applies these **foundational improvements** to your transcripts:

### Core Remediations
- **Removes filler words** (like "um")
- **Adds punctuation, sentence structure, and paragraph spacing**
- **Fixes obvious STT hallucinations and mistranscriptions** (e.g., "McDonuts" → "McDonalds")
- **Removes repetitive or run-on thoughts** that would not be helpful to readers
- **Follows inferred instructions** to omit certain clauses (e.g., "wait .. scratch that from the note")

### What It Preserves
- **All important content** and meaning
- **Original speaker's voice** and intent
- **Factual accuracy** and details
- **Natural flow** of conversation

## Design Principles

1. **Light Touch Editing**: Minimal intervention while maximizing clarity
2. **Content Preservation**: Never removes or alters important information
3. **Deterministic Guidance**: Consistent, predictable results
4. **Purpose Agnostic**: Works across all content domains
5. **Modular Foundation**: Ready for specialized prompt layering

## Extended Ecosystem

This basic cleanup prompt is part of a larger ecosystem:
- **Hundreds of specialized prompts** available in shared libraries
- **Domain-specific transformations** for various use cases
- **Concatenation-ready design** for complex workflows
- **Shared on Hugging Face** and other platforms

## System Prompt

The tool uses a carefully crafted system prompt (Version 3, September 2025) that balances cleanup effectiveness with content preservation. View the complete prompt using the "Show System Prompt" feature in the interface.

## Created By

**[Daniel Rosehill](https://danielrosehill.com)** - Specializing in AI-powered text processing and speech-to-text optimization workflows.