danielrosehill's picture
Redesign interface with accordion cards and category pills
292d92c

A newer version of the Gradio SDK is available: 6.2.0

Upgrade

Scan repository for personally identifiable information (PII) before open sourcing.

Your task:

  1. Proactively scan for subtle forms of PII:

    • Personal email addresses
    • Phone numbers
    • Physical addresses
    • Names in comments or documentation
    • Social media handles
    • IP addresses
    • MAC addresses
    • Personal identifiers in test data
    • Usernames that might be personally identifying
    • Organization-specific information
  2. Assume API key protection is already handled, focus on:

    • Developer names in code comments
    • Email addresses in git commits (use git log)
    • Personal information in documentation
    • Internal URLs or server names
    • Employee IDs or internal identifiers
    • Customer data in examples
    • Screenshots with personal info
  3. Do NOT remediate automatically:

    • Only identify and report findings
    • Do not delete, change, or obfuscate without permission
    • Present comprehensive list to user
  4. Generate detailed report:

    ## PII Scan Report
    
    ### High Risk
    - File: src/config.js:45
      Type: Email address
      Content: [REDACTED]@company.com
      Context: Developer email in comment
    
    ### Medium Risk
    - File: docs/api.md:12
      Type: Internal URL
      Content: https://internal-server.local
      Context: Example API endpoint
    
    ### Low Risk
    - File: README.md:34
      Type: Username
      Content: @john_developer
      Context: Acknowledgments section
    
    ## Recommendations
    1. Replace developer emails with generic project email
    2. Use example.com for URL examples
    3. Consider anonymizing usernames
    
  5. Seek user advice on remediation approach

Help users identify PII they may wish to remove before making repositories public.