Spaces:
Sleeping
Sleeping
| # Psoriasis Study - Sample Data Documentation | |
| **Created:** October 20, 2025 | |
| **Purpose:** Enterprise testing of TranscriptorAI market research capabilities | |
| --- | |
| ## Overview | |
| This directory contains 10 realistic HCP interview transcripts for a fictional psoriasis biologic drug called **Dermovia** (IL-17 inhibitor). These transcripts were created specifically to test the enhanced market research storytelling capabilities of TranscriptorAI v3.0. | |
| ## Dataset Summary | |
| ### Total Transcripts: 10 | |
| - **6 Dermatologists** (specialists) | |
| - **4 General Practitioners** (primary care) | |
| ### Total Size: 164K | |
| - Average length: ~16K per transcript | |
| - Word count range: 1,800 - 3,800 words | |
| - Total quotes extracted: **300 high-impact quotes** | |
| --- | |
| ## Transcript Details | |
| ### Dermatologists (Specialists) | |
| #### 1. Dermatologist_01_Academic_Center.txt (13K) | |
| - **Setting:** Major Academic Medical Center, Boston | |
| - **Experience:** 18 years | |
| - **Patient Volume:** ~300 psoriasis patients | |
| - **Key Themes:** Clinical trial data, treatment algorithms, insurance barriers, efficacy vs safety trade-offs | |
| - **Notable:** Discusses head-to-head comparisons, secondary non-response, JAK inhibitor concerns | |
| #### 2. Dermatologist_02_Private_Practice.txt (14K) | |
| - **Setting:** Private Practice, Atlanta, GA | |
| - **Experience:** 12 years | |
| - **Patient Volume:** ~150 psoriasis patients, 35-40 on Dermovia | |
| - **Key Themes:** Real-world efficacy (70-75% PASI 90), serious adverse events, cost barriers, prior auth challenges | |
| - **Notable:** Reports depression and A-fib cases possibly related to treatment | |
| #### 3. Dermatologist_03_Community_Hospital.txt (8.3K) | |
| - **Setting:** Community Hospital, Phoenix, AZ | |
| - **Experience:** 25 years | |
| - **Patient Volume:** ~200 psoriasis patients, 25 on Dermovia | |
| - **Key Themes:** Quarterly dosing advantage for elderly patients, Crohn's disease case, compliance improvement | |
| - **Notable:** Discusses IBD screening protocols after adverse event | |
| #### 4. Dermatologist_04_Rural_Clinic.txt (16K) | |
| - **Setting:** Rural Health Clinic, Montana | |
| - **Experience:** 8 years | |
| - **Patient Volume:** ~80 psoriasis patients, 10-12 on Dermovia | |
| - **Key Themes:** Rural access barriers, in-office administration for all patients, power outage concerns | |
| - **Notable:** 100% in-office injection due to storage/convenience issues | |
| #### 5. Dermatologist_05_Teaching_Hospital.txt (18K) | |
| - **Setting:** Teaching Hospital, Chicago, IL | |
| - **Experience:** 6 years | |
| - **Patient Volume:** ~180 psoriasis patients, 30-35 on Dermovia | |
| - **Key Themes:** Clinical trial involvement, resident education, IBD screening, brodalumab REMS program | |
| - **Notable:** Academic perspective, research on predictive biomarkers | |
| #### 6. Dermatologist_06_West_Coast_Practice.txt (17K) | |
| - **Setting:** Private Practice, San Diego, CA | |
| - **Experience:** 15 years | |
| - **Patient Volume:** ~220 psoriasis patients, 45-50 on Dermovia | |
| - **Key Themes:** Tech industry insurance coverage, dose intensification, secondary loss of response | |
| - **Notable:** Highest patient volume on Dermovia, successful dose adjustment case | |
| ### General Practitioners (Primary Care) | |
| #### 7. GP_01_Suburban_Practice.txt (13K) | |
| - **Setting:** Suburban Family Practice, Minneapolis, MN | |
| - **Experience:** 22 years | |
| - **Patient Volume:** ~50 psoriasis patients (mild to moderate) | |
| - **Key Themes:** Referral patterns, discomfort with biologics, role of primary care, education gaps | |
| - **Notable:** Honest about limitations, advocates for better CME training | |
| #### 8. GP_02_Urban_Clinic.txt (14K) | |
| - **Setting:** Urban Community Health Clinic, Detroit, MI | |
| - **Experience:** 10 years | |
| - **Patient Volume:** ~35 psoriasis patients | |
| - **Key Themes:** Underserved population, Medicaid/uninsured barriers, methotrexate use, access challenges | |
| - **Notable:** Never prescribed biologics due to cost/complexity barriers | |
| #### 9. GP_03_Military_Base.txt (15K) | |
| - **Setting:** Military Medical Center, San Antonio, TX | |
| - **Experience:** 14 years | |
| - **Patient Volume:** ~65 active duty/veteran patients | |
| - **Key Themes:** Military healthcare advantages, deployment challenges, TRICARE coverage, fitness-for-duty | |
| - **Notable:** Good formulary access, quarterly dosing valuable for deployments | |
| #### 10. GP_04_Rural_Healthcare.txt (15K) | |
| - **Setting:** Rural Health Network, Wyoming | |
| - **Experience:** 28 years | |
| - **Patient Volume:** ~45 psoriasis patients | |
| - **Key Themes:** Geographic isolation (140 miles to dermatologist), telemedicine limitations, pragmatic approach | |
| - **Notable:** Most experienced GP, discusses 28-year career perspective | |
| --- | |
| ## Clinical Content Themes | |
| ### Drug Profile: Dermovia (Fictional IL-17 Inhibitor) | |
| - **Dosing:** Every 12 weeks (quarterly) after loading dose | |
| - **Efficacy:** 70-75% PASI 90 at 16 weeks (real-world) | |
| - **PASI 100:** 30-40% complete clearance | |
| - **Speed:** Visible improvement by week 4-6 | |
| - **List Price:** ~$82,000/year | |
| ### Safety Profile Discussed: | |
| - **Common:** Candida infections (25-30%), injection site reactions (30-40%), URI (40-50%) | |
| - **Serious:** Pneumonia, cellulitis, neutropenia, possible IBD exacerbation (1-2%) | |
| - **Rare:** Depression/suicidality, atrial fibrillation, drug-induced lupus | |
| ### Competitive Landscape: | |
| - **IL-17 Inhibitors:** Ixekizumab (every 2-4 weeks), Secukinumab (monthly), Brodalumab (highest efficacy, black box warning) | |
| - **IL-23 Inhibitors:** Risankizumab, Guselkumab (safer GI profile, slower onset) | |
| - **JAK Inhibitors:** Deucravacitinib (oral, FDA warnings for thrombosis/malignancy) | |
| - **TNF Inhibitors:** Adalimumab (older, biosimilars available) | |
| ### Insurance/Access Barriers: | |
| - **Prior Auth:** 50-60% first-try approval, 3-6 week delays common | |
| - **Copay Cards:** Reduce cost to $5-25/dose for commercial insurance | |
| - **Medicare Gap:** No copay assistance, $2,000-5,000/year out-of-pocket | |
| - **Step Therapy:** Often required to fail topicals/methotrexate first | |
| --- | |
| ## Quote Extraction Results | |
| ### Total Quotes: 300 (30 per transcript) | |
| ### Impact Score Range: 0.80 - 0.95 | |
| ### Theme Distribution: | |
| 1. **Patient Management:** 100 quotes (33%) | |
| 2. **General:** 77 quotes (26%) | |
| 3. **Prescribing:** 34 quotes (11%) | |
| 4. **Barriers:** 24 quotes (8%) | |
| 5. **Efficacy:** 23 quotes (8%) | |
| 6. **Safety:** 20 quotes (7%) | |
| 7. **Diagnosis:** 14 quotes (5%) | |
| 8. **Competitive:** 8 quotes (3%) | |
| ### Top Quotes by Impact Score: | |
| **Score 0.95:** | |
| > "I've been using Dermovia since it launched, so about 16 months now. I was excited about it from the beginning because the Phase 3 data looked impressive. The 75% PASI 90 rate put it right in line with ixekizumab, which had been my go-to IL-17 inhibitor. But the quarterly dosing was the real selling point." | |
| **Score 0.90:** | |
| > "For biologic-naive patients under 65 with no major comorbidities, my first choice is often an IL-17 inhibitor like Dermovia or ixekizumab." | |
| **Score 0.90:** | |
| > "Where Dermovia wins is convenience. I've had patients switch from every-two-week injections to every-12-weeks, and they absolutely love it." | |
| --- | |
| ## Use Cases for Testing | |
| ### 1. Cross-Transcript Analysis | |
| - Identify consensus themes across dermatologists vs GPs | |
| - Compare urban/rural/academic/military perspectives | |
| - Track efficacy/safety signals across all interviews | |
| ### 2. Quote-Based Storytelling | |
| - 300 high-impact quotes ready for weaving into narratives | |
| - Varied perspectives (specialist confidence vs GP hesitancy) | |
| - Patient impact stories embedded in clinical discussions | |
| ### 3. Market Research Insights | |
| - Clear unmet needs (access, cost, prior auth burden) | |
| - Competitive positioning (quarterly dosing = key differentiator) | |
| - Safety concerns (IBD risk, infections) with real examples | |
| ### 4. Report Generation Testing | |
| - Executive summaries with "THE HEADLINE" | |
| - Data β Implication β Action structure | |
| - Visual callouts for key statistics | |
| - Recommendations with priority levels (IMMEDIATE/30-day/90-day) | |
| --- | |
| ## Data Quality Features | |
| ### Realism: | |
| β Authentic medical terminology and clinical discussions | |
| β Realistic PASI scores and efficacy percentages | |
| β Genuine insurance/access barrier scenarios | |
| β Varied physician perspectives based on practice setting | |
| β Mix of enthusiasm, caution, and frustration | |
| ### Storytelling Elements: | |
| β Direct patient quotes embedded in HCP narratives | |
| β Specific case examples with outcomes | |
| β Emotional language ("life-changing," "heartbreaking," "game-changer") | |
| β Numerical data for credibility (75% PASI 90, $82K/year, 3-6 week delays) | |
| β Comparative statements for context | |
| ### Diversity: | |
| β 6 different geographic regions (Boston, Atlanta, Phoenix, Montana, Chicago, San Diego, Minneapolis, Detroit, San Antonio, Wyoming) | |
| β 4 practice settings (academic, private, community, rural, military, urban clinic) | |
| β Experience range: 6-28 years | |
| β Patient volumes: 35-300 psoriasis patients | |
| β Specialist expertise vs generalist pragmatism | |
| --- | |
| ## Testing Status | |
| ### β Completed: | |
| - All 10 transcripts created (164K total) | |
| - Quote extraction tested (300 quotes extracted successfully) | |
| - Syntax validation passed (all files have valid Python syntax) | |
| - Code formatting checked | |
| ### π Ready for: | |
| - Full analysis pipeline run through TranscriptorAI | |
| - Cross-transcript summary generation | |
| - Narrative report generation (PDF/Word/HTML) | |
| - Quote weaving and storytelling validation | |
| - Visual callout box rendering | |
| ### π Expected Outputs: | |
| 1. **CSV:** Structured data with quality scores, word counts, themes | |
| 2. **PDF Report:** Cross-transcript insights with embedded quotes | |
| 3. **Narrative Report:** Executive summary with "THE HEADLINE" and recommendations | |
| 4. **HTML Dashboard:** Interactive visualization of themes and findings | |
| --- | |
| ## File Locations | |
| **Transcripts:** `/home/john/TranscriptorEnhanced/sample_data/psoriasis_study/` | |
| **Test Scripts:** `/home/john/TranscriptorEnhanced/test_psoriasis_quotes.py` | |
| **Quote Extractor:** `/home/john/TranscriptorEnhanced/quote_extractor.py` | |
| **Main App:** `/home/john/TranscriptorEnhanced/app.py` | |
| --- | |
| ## Notes for Analysis | |
| ### Key Insights to Look For: | |
| 1. **Quarterly dosing** mentioned as key differentiator by nearly all HCPs | |
| 2. **Insurance/prior auth** cited as biggest barrier by 100% of respondents | |
| 3. **Efficacy consensus:** 70-75% PASI 90 real-world (vs 75% in trials) | |
| 4. **Safety concerns:** IBD risk for IL-17 class, infections manageable | |
| 5. **Access disparity:** Urban/academic vs rural/underserved populations | |
| ### Narrative Hooks: | |
| - Rural dermatologist: "One patient told me, 'Doc, I haven't seen my skin this clear in 15 years.'" | |
| - Urban GP: "The list price is $70,000-80,000 a year. That's not happening for my population." | |
| - Teaching hospital: "Every week of delay is another week of suffering for the patient." | |
| ### Business Implications: | |
| - **Strength:** Quarterly dosing = competitive advantage | |
| - **Weakness:** Same safety/efficacy as competitors = parity positioning | |
| - **Opportunity:** Target rural/compliance-challenged populations | |
| - **Threat:** Prior auth burden reducing market access | |
| --- | |
| **Status:** β Ready for enterprise production testing | |
| **Last Updated:** October 20, 2025 | |
| **Version:** 1.0 | |