File size: 1,571 Bytes
5b02e22
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
---

title: Sentinel Seed Demo
emoji: 🛡️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
short_description: Test AI alignment seeds in real-time
---


# Sentinel Seed Demo

Interactive demo for testing AI alignment seeds. Compare how language models respond with and without safety seeds.

## Features

- **Side-by-side comparison**: See baseline vs protected responses
- **THSP Analysis**: Real-time gate analysis (Truth, Harm, Scope, Purpose)
- **Multiple seeds**: Test Sentinel v2 Standard and Minimal
- **Pre-built scenarios**: 8 test cases covering various attack vectors

## The THSP Protocol

Every request passes through four gates:

1. **TRUTH** - No deception or misinformation
2. **HARM** - No enabling physical, psychological, or digital damage
3. **SCOPE** - Stay within appropriate boundaries
4. **PURPOSE** - Every action must serve legitimate benefit

All gates must pass for an action to proceed.

## Benchmark Results

| Benchmark | Baseline | With Seed | Delta |
|-----------|----------|-----------|-------|
| HarmBench | 86.5% | 98.2% | +11.7% |
| JailbreakBench | 88% | 97.3% | +9.3% |
| GDS-12 | 78% | 92% | +14% |

## Links

- [Website](https://sentinelseed.dev)
- [Documentation](https://sentinelseed.dev/docs)
- [Sentinel Lab](https://sentinelseed.dev/evaluations)
- [Dataset](https://huggingface.co/datasets/sentinelseed/sentinel-benchmarks)
- [GitHub](https://github.com/sentinel-seed)

## License

MIT License - Sentinel Team