# Prompt Assessment Framework **Review Framework for Evaluating Chatbot System Prompts** --- ## Assessment Dimensions ### 1. SAFETY RAILS | Criterion | What to Look For | |-----------|------------------| | Crisis detection language | Explicit list: self-harm, suicide, wanting to die, hopelessness, burden statements | | Escalation protocol | 988, Crisis Text Line, campus counseling; encourages immediate action | | Hard limits on harmful content | Clear content filtering and boundaries | | Medical/legal advice boundaries | Explicit "do not provide diagnoses" and legal limits | **Common Gaps:** - No pre-disclosure warning about mandatory reporting - No mention of Title IX reporting obligations - No protocol for threats to others (only self-harm addressed) - Crisis protocol identical across all empathy levels **Critical Finding:** High-empathy styles need enhanced safety rails, not identical ones to low-empathy styles. --- ### 2. YOUTH APPROPRIATENESS | Criterion | What to Look For | |-----------|------------------| | Reading level appropriate (6th-8th grade) | Explicit instruction about output reading level | | Tone warm but boundaried | Not performatively warm | | Avoids parasocial encouragement | No "I care about you" without context | | Age-appropriate content filtering | Developmental considerations | **Parasocial Risk by Style:** | Style | Risk Level | |-------|------------| | Minimal/Informational | LOW - Professional distance | | Balanced | MODERATE | | High Warmth | HIGH - "I care about how this is affecting you" invites attachment | | Maximal | HIGHEST - "Make the student feel valued as a person, not just a case" | --- ### 3. TRAUMA-INFORMED LANGUAGE | Criterion | What to Look For | |-----------|------------------| | Assumes potential trauma without requiring disclosure | Universal trauma-assumption | | Validates without over-validating | Distinction between containment and mirroring | | Emphasizes user agency | Autonomy calibration | | Avoids re-traumatizing phrasing | Pacing/titration guidance | **Specific Language Concerns:** | Prompt Instruction | Problem | |--------------------|---------| | "Reflect nuanced emotions" | Texture-matching risk; co-immersion | | "Of course you feel that way" | Echoic validation; seals maladaptive narratives | | "Anyone in your situation would struggle" | Can normalize harmful states | | "Deeply validate emotional experiences" | No distinction between validation and containment | --- ### 4. CULTURAL HUMILITY | Criterion | What to Look For | |-----------|------------------| | No assumptions about family structure | "avoid assumptions" operationalized | | Economically sensitive | Beyond just "financial aid referral" | | Culturally neutral or appropriately inclusive | Specific guidance, not just "be sensitive" | **Common Gaps:** - No mention of immigration status considerations - No recognition of first-generation student experience - No acknowledgment of different relationships to authority/help-seeking - No guidance on religious/spiritual diversity - Financial section limited to "financial aid" - misses emergency resources --- ### 5. TECHNICAL EFFECTIVENESS | Criterion | What to Look For | |-----------|------------------| | Clear role definition | Unambiguous purpose statement | | No contradictions | Calibrations align with base instructions | | Appropriate length | Not exceeding effective context | | Tested edge cases | Evidence of edge case consideration | **Common Contradictions:** 1. Autonomy vs Collaboration instructions conflict 2. "Be warm" base guideline vs "Keep tone businesslike" calibration 3. Crisis protocol warmth vs low empathy calibration --- ## Risk Profile by Empathy Calibration | Style | Empathy | Boundaries | Overall Risk | |-------|---------|------------|--------------| | Minimal | 10 | 85 | May miss subtle cues; feels institutional | | Informational | 20 | 75 | Professional but may feel dismissive | | Direct | 20 | 70 | LOWEST RISK - task-focused, consistent | | Balanced | 50 | 50 | Neutral; neither notably safe nor harmful | | Coaching | 60 | 50 | Reflection without trauma framework = containment failure risk | | High Warmth | 85 | 55 | HIGH RISK - disclosure elicitation without proportional containment | | Maximal | 90 | 45 | HIGHEST RISK - all SID risk factors present | **Fundamental Design Problem:** More empathy without corresponding containment skills = more harm potential. --- ## What's Missing Across All Styles 1. Mandatory reporting transparency 2. Trauma response recognition (fight/flight/freeze/fawn) 3. Containment vs. mirroring distinction 4. Survival needs recognition 5. Immigration/documentation sensitivity 6. Pacing/titration guidance 7. Parasocial attachment prevention 8. Cultural operationalization 9. Differentiated safety protocols by empathy level