TranscriptWriting / STORYTELLING_QUICK_START.md
jmisak's picture
Upload 57 files
52d0298 verified

A newer version of the Gradio SDK is available: 6.2.0

Upgrade

Market Research Storytelling - Quick Start Guide

What's New?

Your TranscriptorAI now generates professional market research reports that tell compelling, data-driven stories for business clients.


Key Changes in 3 Bullets

  1. Reports now sound like consulting firms, not research papers - business language, "So What?" orientation, clear action items
  2. Participant quotes are automatically extracted and integrated - brings findings to life with authentic human voice
  3. Professional visual elements - stat callouts, insight boxes, quote highlights, color-coded priority recommendations

Before & After Example

BEFORE (Academic Style):

Summary of Findings

This analysis includes 12 HCP transcripts with an average quality score of 0.82.

Strong Consensus Findings:
- 10 out of 12 participants (83%) mentioned reimbursement challenges

Majority Findings:
- 8 out of 12 participants (67%) discussed efficacy concerns

AFTER (Market Research Style):

Executive Summary

THE HEADLINE: Prior authorization delays are creating a 6-month sales cycle gap
and pushing HCPs toward competitor products with faster approvals.

KEY TAKEAWAYS:
β€’ Reimbursement Barrier: 10 of 12 HCPs (83%) cite prior authorization as their
  #1 prescribing barrier β†’ Your sales team needs patient assistance resources
  during the 4-6 week approval window β†’ Launch patient bridge program (IMMEDIATE)

  As one oncologist noted: "By the time insurance approves, the patient's
  cancer has often progressed to the point where we need more aggressive options."

β€’ Competitive Threat: 7 of 12 HCPs (58%) mention switching to Competitor X
  specifically due to their co-pay card program β†’ Market share at risk without
  similar offering β†’ Evaluate co-pay assistance program (WITHIN 60 DAYS)

How To Use

Option 1: Standard Analysis (App Tab 1)

  1. Upload transcripts as usual
  2. Select interviewee type (HCP/Patient/Other)
  3. Click "Analyze Transcripts"
  4. NEW: Reports now automatically include quotes and business-focused language
  5. Download CSV and PDF as before

What's Different:

  • Summary text now has "THE HEADLINE" and business implications
  • PDF has visual callout boxes for key stats
  • Quotes are woven into findings

Option 2: Narrative Report (App Tab 2)

  1. First run analysis in Tab 1
  2. Go to "Narrative Report" tab
  3. Upload the CSV from step 1
  4. Select report style:
    • Executive: Concise, C-level focused (best for stakeholder presentations)
    • Detailed: Comprehensive analysis (best for product/marketing teams)
    • Presentation: Slide-ready format (best for sales enablement)
  5. Click "Generate Narrative Report"
  6. Download PDF, Word, or HTML

What's Different:

  • Reports follow management consulting structure
  • 5-8 participant quotes integrated throughout
  • Visual elements: stat callouts, quote boxes, priority recommendations
  • Actionable recommendations with timelines (IMMEDIATE/30d/90d)

Report Structure (New Format)

πŸ“„ EXECUTIVE SUMMARY
   └─ THE HEADLINE: One sentence, most important finding
   └─ KEY TAKEAWAYS: 3-4 bullets (finding β†’ implication β†’ action)

πŸ“„ RESEARCH CONTEXT
   └─ Who we spoke with, data quality

πŸ“„ KEY INSIGHTS (3-5 sections)
   └─ Each finding with:
      β€’ Specific numbers and percentages
      β€’ Business implication ("why this matters")
      β€’ Supporting quote from participant
      β€’ Connection to competitive landscape

πŸ“„ MARKET OPPORTUNITIES & BARRIERS
   └─ Unmet needs (with frequency)
   └─ Competitive vulnerabilities
   └─ White space opportunities

πŸ“„ PARTICIPANT PERSPECTIVES
   └─ Points of consensus (80%+ agreement)
   └─ Areas of divergence (where opinions split)
   └─ Notable outliers and why they matter

πŸ“„ STRATEGIC RECOMMENDATIONS
   β”œβ”€ πŸ”΄ IMMEDIATE: Launch patient bridge program
   β”œβ”€ 🟠 WITHIN 30 DAYS: Develop early follow-up protocol
   └─ 🟑 WITHIN 90 DAYS: Evaluate co-pay assistance program

Visual Elements (Automatically Added to PDFs)

1. Key Stat Callouts

Large, bold numbers with context - perfect for opening the report

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                                 β”‚
β”‚              12                 β”‚
β”‚   HCPs Interviewed              β”‚
β”‚ In-depth qualitative research   β”‚
β”‚                                 β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

2. Quote Boxes

Participant voice highlighted and styled

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ "By the time insurance approves,    β”‚
β”‚  the disease has often progressed." β”‚
β”‚                                      β”‚
β”‚            β€” Oncologist, Transcript 3β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

3. Recommendation Boxes

Color-coded by priority for quick scanning

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚IMMEDIATβ”‚ Launch patient bridge program    β”‚
β”‚   E    β”‚ Address the 4-6 week prior auth  β”‚
β”‚ (RED)  β”‚ gap identified by 83% of HCPs    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Writing Style Rules (What the LLM Now Follows)

DO:

  • βœ… Lead with impact: "THE HEADLINE: [most important finding]"
  • βœ… Active voice: "HCPs prefer..." not "It was found that..."
  • βœ… Specific numbers: "8 out of 12 (67%)" not "most"
  • βœ… Business implications: Every finding β†’ "What this means for you"
  • βœ… Participant quotes: 5-8 per report, naturally integrated
  • βœ… Prioritized actions: IMMEDIATE vs. 30 days vs. 90 days
  • βœ… Skimmable: Key points visible in headers and first sentences

DON'T:

  • ❌ Vague language: "many", "most", "some", "often"
  • ❌ Academic style: "Findings indicate that..."
  • ❌ Data without context: Numbers need interpretation
  • ❌ Generic recommendations: "Consider exploring options"
  • ❌ Passive voice: "It was observed that..."

Quote Extraction (How It Works Behind the Scenes)

Automatic Process:

  1. System scans all transcripts after analysis
  2. Identifies quotes using patterns:
    • Direct quotes: "quoted text"
    • Speaker labels: HCP: statement
    • Narrative references: As one patient noted, "quote"
  3. Filters out greetings, administrative text, too short/long
  4. Scores each quote for storytelling impact (0.0 to 1.0):
    • Higher scores for: emotional language, specific details, numbers, comparisons
    • Lower scores for: generic phrases ("it depends", "maybe")
  5. Categorizes by theme (e.g., prescribing, barriers, symptoms, QoL)
  6. Selects top 10-15 quotes for inclusion in reports
  7. LLM weaves quotes into narrative naturally

You don't need to do anything - it happens automatically!


Report Styles Explained

Executive Style

Best for: C-suite, investors, board presentations Characteristics:

  • Concise (1000-1200 words)
  • ROI focus
  • Strategic recommendations
  • Minimal methodology detail
  • 3-5 key insights maximum

Detailed Style

Best for: Product managers, marketing teams, researchers Characteristics:

  • Comprehensive (1400-1600 words)
  • Full analysis depth
  • All supporting data included
  • 4-6 key insights
  • Detailed methodology notes

Presentation Style

Best for: Sales teams, field force, client briefings Characteristics:

  • Slide-ready format (1200-1400 words)
  • Talking points emphasized
  • Visual elements maximized
  • Key messages highlighted
  • Quote-heavy for impact

Troubleshooting

"Reports still sound too academic"

Fix: Make sure you're using the narrative report tab (Tab 2) with report style selected. The basic analysis (Tab 1) is improved but not as dramatically transformed.

"Not seeing participant quotes in my report"

Check:

  1. Do your transcripts have speaker labels or quotation marks?
  2. Are quotes at least 30 characters long?
  3. Check console output for "[Quotes] Extracted X quotes" message
  4. Try different transcripts to verify quote extraction is working

"Visual elements not showing in PDF"

Try:

  1. Update reportlab: pip install --upgrade reportlab
  2. Check that all imports succeeded (no errors on startup)
  3. Try generating HTML version instead (always works)

"Recommendations are all labeled 'MEDIUM'"

Reason: LLM needs clearer priority signals in the data Fix: In your analysis instructions (Tab 1), mention specific urgency or timing requirements


Tips for Best Results

1. Provide Business Context in Analysis Instructions

Instead of:

"Analyze these HCP interviews"

Try:

"Analyze these interviews focused on understanding barriers to prescribing. Our client needs to know what's blocking sales and what to prioritize for Q1."

2. Use the Right Report Style

  • Busy executive who'll spend 5 minutes? β†’ Executive style
  • Team doing deep dive? β†’ Detailed style
  • Preparing talking points for field team? β†’ Presentation style

3. Review Quote Quality

Check the console output after analysis:

[Quotes] Extracted 47 quotes, top impact score: 0.87
  • 20-50 quotes extracted is typical for 10-12 transcripts
  • Top scores above 0.70 indicate high-quality quotes
  • If top score < 0.50, transcripts may lack substantive quotes

4. Customize for Client Industry

In analysis instructions, mention:

  • Client's industry (pharma, medical device, payer, etc.)
  • Competitive landscape
  • Specific business questions they need answered

Examples of Good vs. Poor Quotes

βœ… HIGH IMPACT (Score: 0.85)

"By the time insurance approves, the patient's cancer has often progressed
to the point where we need to consider more aggressive options."

Why: Specific, emotional, causal reasoning, medical detail

βœ… MEDIUM IMPACT (Score: 0.65)

"I've switched three patients to Competitor X this month because
of their co-pay assistance program."

Why: Specific numbers, comparative, action-oriented

❌ LOW IMPACT (Score: 0.30)

"It depends on the situation."

Why: Generic, vague, no detail

❌ FILTERED OUT (Not included)

"Thank you, that's interesting."

Why: Administrative, non-substantive


Need Help?

Documentation:

  • Full details: MARKET_RESEARCH_ENHANCEMENTS.md
  • This guide: STORYTELLING_QUICK_START.md

Code Reference:

  • Quote extraction logic: quote_extractor.py
  • Narrative prompts: story_writer.py lines 10-100
  • Visual elements: narrative_report_generator.py lines 19-255

Common Questions:

  • "Can I disable quotes?" β†’ Yes, they're optional. Edit app.py line 242 to skip extraction.
  • "Can I adjust quote scoring?" β†’ Yes, edit score_quote_impact() in quote_extractor.py
  • "Can I change visual colors?" β†’ Yes, edit hex codes in narrative_report_generator.py

Quick Wins Checklist

  • Run a test analysis with 3-5 transcripts
  • Review the "THE HEADLINE" in the output
  • Check console for "[Quotes] Extracted X quotes" confirmation
  • Generate a narrative report in all 3 styles (executive, detailed, presentation)
  • Compare PDFs to see visual elements (stat callouts, quote boxes, recommendation boxes)
  • Share with one internal stakeholder for feedback
  • Run full production analysis with client transcripts

Ready to create compelling client deliverables! πŸš€

Your reports now tell stories that drive business decisions.