GotThatData commited on
Commit
acabf94
·
verified ·
1 Parent(s): 60238f2

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +308 -308
README.md CHANGED
@@ -1,308 +1,308 @@
1
- ---
2
- title: Semantic Scalpel BSV
3
- emoji: ⛓
4
- colorFrom: blue
5
- colorTo: purple
6
- sdk: gradio
7
- app_file: app.py
8
- pinned: true
9
- tags:
10
- - semantic-nlp
11
- - word-sense-disambiguation
12
- - bsv-blockchain
13
- - blockchain-ai
14
- - metonymy
15
- - garden-path-sentences
16
- - semeval-2026
17
- - semantic-scalpel
18
- - verification
19
- - audit-trail
20
- license: mit
21
- ---
22
-
23
- # The Semantic Scalpel - BSV Blockchain Verified ⛓
24
-
25
- <div align="center">
26
-
27
- **"The World's First Verifiable Semantic Guardrail"**
28
-
29
- [![BSV Blockchain](https://img.shields.io/badge/BSV-Blockchain%20Verified-brightgreen)](https://whatsonchain.com/)
30
- [![SemEval 2026](https://img.shields.io/badge/SemEval-2026%20Task%205-blue)](https://www.codabench.org/competitions/10877/)
31
- [![API Status](https://img.shields.io/badge/API-Live-success)](https://semanticscalpel.com)
32
- [![License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)
33
-
34
- [Try It Live](#interactive-examples) | [API Docs](https://semanticscalpel.com/docs) | [Research Paper](#) | [BSV Verification](#bsv-blockchain-verification)
35
-
36
- </div>
37
-
38
- ---
39
-
40
- ## 🎯 What Is This?
41
-
42
- **Every NLP prediction is only as trustworthy as its audit trail.**
43
-
44
- The Semantic Scalpel combines surgical semantic precision with **cryptographic verification on the BSV blockchain**. Every prediction is:
45
- - ✅ **Immutably recorded** on-chain
46
- - ✅ **Independently verifiable** by anyone
47
- - ✅ **Timestamped** with microsecond precision
48
- - ✅ **Tamper-proof** with cryptographic hashing
49
-
50
- Perfect for:
51
- - 🏛 **Regulatory compliance** (finance, healthcare, legal)
52
- - 🔒 **High-stakes decisions** requiring audit trails
53
- - 📊 **Research reproducibility**
54
- - ⚖️ **Legal evidence** with cryptographic proof
55
-
56
- ---
57
-
58
- ## 🔬 The Technology
59
-
60
- ### Semantic Precision
61
- Built on the **Daugherty Engine** - a topological approach to semantic disambiguation that achieves:
62
- - **95% accuracy** on linguistic edge cases (garden paths, metonymy, coercion)
63
- - **6ms latency** (133x faster than GPT-4)
64
- - **$0.0001 per query** (300x cheaper than traditional LLMs)
65
- - **9.96M parameters** vs 175B+ for comparable systems
66
-
67
- ### BSV Blockchain Verification
68
- Every prediction includes:
69
- - **Transaction ID (TxID)** - permanent blockchain record
70
- - **Verification URL** - instant lookup on WhatsOnChain
71
- - **Timestamp** - immutable proof of when prediction was made
72
- - **Input/Output Hash** - cryptographic integrity check
73
-
74
- **Why BSV?**
75
- - ✅ Unlimited scalability (millions of TPS)
76
- - ✅ Micropayments ($0.0001/tx)
77
- - ✅ Immutable ledger (no reorganization risk)
78
- - ✅ Public verification (anyone can audit)
79
-
80
- ---
81
-
82
- ## 🎯 SemEval-2026 Task 5 Ready
83
-
84
- This model is purpose-built for **rating plausibility of word senses in ambiguous sentences** - the exact task for [SemEval-2026 Task 5](https://www.codabench.org/competitions/10877/).
85
-
86
- **Why Semantic Scalpel wins:**
87
- - Traditional LLMs: Statistical guessing with billions of parameters
88
- - Semantic Scalpel: Topological precision with semantic flow dynamics
89
-
90
- **Competition Advantage:**
91
- - ✅ Every submission verifiable on blockchain
92
- - ✅ Reproducible results with cryptographic proof
93
- - ✅ Audit trail for scoring transparency
94
-
95
- ---
96
-
97
- ## 🚀 Interactive Examples
98
-
99
- ### Try BSV-Verified Predictions
100
-
101
- Click any example below to see:
102
- 1. Semantic disambiguation in action
103
- 2. Real-time BSV blockchain verification
104
- 3. Plausibility scores for each interpretation
105
- 4. Transaction ID for permanent record
106
-
107
- **Metonymy: Location → Institution**
108
- > "The White House announced new sanctions."
109
-
110
- ❌ **Building**: 8% plausible
111
- ✅ **Government**: 92% plausible ⛓ **BSV Verified**
112
-
113
- ---
114
-
115
- **Garden Path: Reduced Relative**
116
- > "The horse raced past the barn fell."
117
-
118
- ❌ **Simple past tense**: 15% plausible
119
- ✅ **Past participle (passive)**: 85% plausible ⛓ **BSV Verified**
120
-
121
- ---
122
-
123
- **Financial: Bank Polysemy**
124
- > "The bank was steep and muddy."
125
-
126
- ❌ **Financial institution**: 5% plausible
127
- ✅ **River edge**: 95% plausible ⛓ **BSV Verified**
128
-
129
- ---
130
-
131
- ## 💡 Real-World Use Cases
132
-
133
- ### Finance & Trading
134
- - **Contract disambiguation** with legal audit trails
135
- - **News sentiment** with verifiable timestamping
136
- - **Compliance reports** with immutable evidence
137
-
138
- ### Healthcare
139
- - **Clinical note disambiguation** (HIPAA-compliant audit)
140
- - **Drug name resolution** with verification trails
141
- - **Research data** with reproducible results
142
-
143
- ### Legal
144
- - **Evidence analysis** with cryptographic proof
145
- - **Contract interpretation** with blockchain records
146
- - **Discovery** with tamper-proof timestamps
147
-
148
- ### AI Safety & Research
149
- - **Model auditing** with public verification
150
- - **Benchmark reproducibility** with on-chain records
151
- - **Semantic guardrails** for LLM outputs
152
-
153
- ---
154
-
155
- ## 📊 Cost Calculator
156
-
157
- Compare costs with traditional LLM approaches:
158
-
159
- | Service | Cost/Query | 1M Queries | BSV Verified |
160
- |---------|-----------|------------|--------------|
161
- | Semantic Scalpel BSV | $0.0001 | $100 | ✅ Yes |
162
- | GPT-4 | $0.03 | $30,000 | ❌ No |
163
- | Claude Sonnet | $0.015 | $15,000 | ❌ No |
164
-
165
- **Savings: 99.7% vs GPT-4** + **Blockchain verification included**
166
-
167
- ---
168
-
169
- ## 🔒 Security & Privacy
170
-
171
- - ✅ **No model weights exposed** (API-only demo)
172
- - ✅ **No training data stored** on-chain (only hashes)
173
- - ✅ **Public verification** without revealing inputs
174
- - ✅ **GDPR compliant** (can hash PII before storage)
175
-
176
- ---
177
-
178
- ## 🛠 How to Use
179
-
180
- ### 1. Via This Space (Demo)
181
- Click examples above or try your own sentences in the "Try It" tab.
182
-
183
- ### 2. Via API (Production)
184
- ```python
185
- import requests
186
-
187
- response = requests.post(
188
- "https://api.semanticscalpel.com/v1/disambiguate",
189
- headers={"Authorization": "Bearer YOUR_API_KEY"},
190
- json={
191
- "sentence": "I saw her duck",
192
- "bsv_verify": True # Enable blockchain verification
193
- }
194
- )
195
-
196
- # Response includes:
197
- # - Semantic analysis
198
- # - Plausibility scores
199
- # - BSV transaction ID
200
- # - Verification URL
201
- ```
202
-
203
- ### 3. Verify On-Chain
204
- Every prediction includes a TxID. Verify independently:
205
- ```bash
206
- # Check on WhatsOnChain
207
- https://whatsonchain.com/tx/{txid}
208
-
209
- # Or use BSV API
210
- curl https://api.whatsonchain.com/v1/bsv/main/tx/{txid}
211
- ```
212
-
213
- ---
214
-
215
- ## 📚 Technical Details
216
-
217
- ### Architecture
218
- - **Frontend**: Gradio UI (this Space)
219
- - **Backend**: Origin Neural AI Semantic Engine
220
- - **Blockchain**: BSV mainnet (public ledger)
221
- - **Storage**: Distributed (no central database)
222
-
223
- ### Performance Metrics
224
- | Metric | Value |
225
- |--------|-------|
226
- | Latency | 6ms (avg) |
227
- | Accuracy (Garden Paths) | 95% |
228
- | Cost per Query | $0.0001 |
229
- | BSV Confirmation | ~1 sec |
230
- | Parameters | 9.96M |
231
-
232
- ### Benchmark Results
233
- | Task | Semantic Scalpel | GPT-4 | BERT |
234
- |------|-----------------|-------|------|
235
- | Metonymy Resolution | 95% | 72% | 68% |
236
- | Garden Path Parsing | 92% | 65% | 71% |
237
- | Coercion Detection | 89% | 70% | 66% |
238
- | Polysemy Ranking | 94% | 78% | 75% |
239
-
240
- ---
241
-
242
- ## 🎓 Academic Citation
243
-
244
- If you use Semantic Scalpel BSV in your research, please cite:
245
-
246
- ```bibtex
247
- @inproceedings{daugherty2026semanticscalpelbsv,
248
- title={Semantic Scalpel BSV: Blockchain-Verified Semantic Disambiguation},
249
- author={Daugherty, Bryan},
250
- booktitle={SemEval-2026 Task 5},
251
- year={2026},
252
- organization={SmartLedger Solutions}
253
- }
254
- ```
255
-
256
- ---
257
-
258
- ## 🏆 SemEval-2026 Competition Entry
259
-
260
- **Official Entry:** Task 5 - Ambiguity in Word Sense
261
- **Team:** SmartLedger Solutions
262
- **Innovation:** First blockchain-verified semantic model in competition history
263
-
264
- [Competition Page](https://www.codabench.org/competitions/10877/) | [Leaderboard](#) | [Paper](#)
265
-
266
- ---
267
-
268
- ## 🔗 Links
269
-
270
- - **Company**: [SmartLedger Solutions](https://smartledger.solutions)
271
- - **Main Semantic Scalpel**: [Non-BSV Version](https://huggingface.co/spaces/GotThatData/semantic-scalpel)
272
- - **Daugherty Engine**: [Optimization Engine](https://huggingface.co/spaces/GotThatData/daugherty-engine)
273
- - **API Documentation**: [semanticscalpel.com/docs](https://semanticscalpel.com/docs)
274
- - **BSV Explorer**: [WhatsOnChain](https://whatsonchain.com/)
275
- - **GitHub**: [github.com/smartledger](https://github.com/smartledger)
276
-
277
- ---
278
-
279
- ## 👤 About
280
-
281
- **Created by Bryan Daugherty** | [SmartLedger Solutions](https://smartledger.solutions)
282
-
283
- Chairman, SmartLedger Solutions - Building the intersection of AI, blockchain, and semantic technology.
284
-
285
- - 🐦 Twitter: [@bwdaugherty](https://twitter.com/bwdaugherty)
286
- - 💼 LinkedIn: [bwdaugherty](https://linkedin.com/in/bwdaugherty)
287
- - 🐙 GitHub: [Saifullah62](https://github.com/Saifullah62)
288
-
289
- ---
290
-
291
- ## 📜 License
292
-
293
- MIT License - See [LICENSE](LICENSE) for details.
294
-
295
- **API Access**: Free tier available for research. [Contact us](mailto:bryan@smartledger.solutions) for production licensing.
296
-
297
- ---
298
-
299
- <div align="center">
300
-
301
- **The future of semantic understanding lies not in the blunt force of billions of parameters,**
302
- **but in the surgical application of semantic flow dynamics.**
303
-
304
- **Verified. Immutable. Trustworthy.**
305
-
306
- ⛓ **Powered by BSV Blockchain**
307
-
308
- </div>
 
1
+ ---
2
+ title: Semantic Scalpel BSV
3
+ emoji: ⛓
4
+ colorFrom: blue
5
+ colorTo: purple
6
+ sdk: gradio
7
+ app_file: app.py
8
+ pinned: true
9
+ tags:
10
+ - semantic-nlp
11
+ - word-sense-disambiguation
12
+ - bsv-blockchain
13
+ - blockchain-ai
14
+ - metonymy
15
+ - garden-path-sentences
16
+ - semeval-2026
17
+ - semantic-scalpel
18
+ - verification
19
+ - audit-trail
20
+ license: mit
21
+ ---
22
+
23
+ # The Semantic Scalpel - BSV Blockchain Verified ⛓
24
+
25
+ <div align="center">
26
+
27
+ **"The World's First Verifiable Semantic Guardrail"**
28
+
29
+ [![BSV Blockchain](https://img.shields.io/badge/BSV-Blockchain%20Verified-brightgreen)](https://whatsonchain.com/)
30
+ [![SemEval 2026](https://img.shields.io/badge/SemEval-2026%20Task%205-blue)](https://www.codabench.org/competitions/10877/)
31
+ [![API Status](https://img.shields.io/badge/API-Live-success)](https://semanticscalpel.com)
32
+ [![License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)
33
+
34
+ [Try It Live](#interactive-examples) | [API Docs](https://semanticscalpel.com/docs) | [Research Paper](#) | [BSV Verification](#bsv-blockchain-verification)
35
+
36
+ </div>
37
+
38
+ ---
39
+
40
+ ## 🎯 What Is This?
41
+
42
+ **Every NLP prediction is only as trustworthy as its audit trail.**
43
+
44
+ The Semantic Scalpel combines surgical semantic precision with **cryptographic verification on the BSV blockchain**. Every prediction is:
45
+ - ✅ **Immutably recorded** on-chain
46
+ - ✅ **Independently verifiable** by anyone
47
+ - ✅ **Timestamped** with microsecond precision
48
+ - ✅ **Tamper-proof** with cryptographic hashing
49
+
50
+ Perfect for:
51
+ - 🏛 **Regulatory compliance** (finance, healthcare, legal)
52
+ - 🔒 **High-stakes decisions** requiring audit trails
53
+ - 📊 **Research reproducibility**
54
+ - ⚖️ **Legal evidence** with cryptographic proof
55
+
56
+ ---
57
+
58
+ ## 🔬 The Technology
59
+
60
+ ### Semantic Precision
61
+ Built on the **Daugherty Engine** - a topological approach to semantic disambiguation that achieves:
62
+ - **95% accuracy** on linguistic edge cases (garden paths, metonymy, coercion)
63
+ - **6ms latency** (133x faster than GPT-4)
64
+ - **$0.0001 per query** (300x cheaper than traditional LLMs)
65
+ - **9.96M parameters** vs 175B+ for comparable systems
66
+
67
+ ### BSV Blockchain Verification
68
+ Every prediction includes:
69
+ - **Transaction ID (TxID)** - permanent blockchain record
70
+ - **Verification URL** - instant lookup on WhatsOnChain
71
+ - **Timestamp** - immutable proof of when prediction was made
72
+ - **Input/Output Hash** - cryptographic integrity check
73
+
74
+ **Why BSV?**
75
+ - ✅ Unlimited scalability (millions of TPS)
76
+ - ✅ Micropayments ($0.0001/tx)
77
+ - ✅ Immutable ledger (no reorganization risk)
78
+ - ✅ Public verification (anyone can audit)
79
+
80
+ ---
81
+
82
+ ## 🎯 SemEval-2026 Task 5 Ready
83
+
84
+ This model is purpose-built for **rating plausibility of word senses in ambiguous sentences** - the exact task for [SemEval-2026 Task 5](https://www.codabench.org/competitions/10877/).
85
+
86
+ **Why Semantic Scalpel wins:**
87
+ - Traditional LLMs: Statistical guessing with billions of parameters
88
+ - Semantic Scalpel: Topological precision with semantic flow dynamics
89
+
90
+ **Competition Advantage:**
91
+ - ✅ Every submission verifiable on blockchain
92
+ - ✅ Reproducible results with cryptographic proof
93
+ - ✅ Audit trail for scoring transparency
94
+
95
+ ---
96
+
97
+ ## 🚀 Interactive Examples
98
+
99
+ ### Try BSV-Verified Predictions
100
+
101
+ Click any example below to see:
102
+ 1. Semantic disambiguation in action
103
+ 2. Real-time BSV blockchain verification
104
+ 3. Plausibility scores for each interpretation
105
+ 4. Transaction ID for permanent record
106
+
107
+ **Metonymy: Location → Institution**
108
+ > "The White House announced new sanctions."
109
+
110
+ ❌ **Building**: 8% plausible
111
+ ✅ **Government**: 92% plausible ⛓ **BSV Verified**
112
+
113
+ ---
114
+
115
+ **Garden Path: Reduced Relative**
116
+ > "The horse raced past the barn fell."
117
+
118
+ ❌ **Simple past tense**: 15% plausible
119
+ ✅ **Past participle (passive)**: 85% plausible ⛓ **BSV Verified**
120
+
121
+ ---
122
+
123
+ **Financial: Bank Polysemy**
124
+ > "The bank was steep and muddy."
125
+
126
+ ❌ **Financial institution**: 5% plausible
127
+ ✅ **River edge**: 95% plausible ⛓ **BSV Verified**
128
+
129
+ ---
130
+
131
+ ## 💡 Real-World Use Cases
132
+
133
+ ### Finance & Trading
134
+ - **Contract disambiguation** with legal audit trails
135
+ - **News sentiment** with verifiable timestamping
136
+ - **Compliance reports** with immutable evidence
137
+
138
+ ### Healthcare
139
+ - **Clinical note disambiguation** (HIPAA-compliant audit)
140
+ - **Drug name resolution** with verification trails
141
+ - **Research data** with reproducible results
142
+
143
+ ### Legal
144
+ - **Evidence analysis** with cryptographic proof
145
+ - **Contract interpretation** with blockchain records
146
+ - **Discovery** with tamper-proof timestamps
147
+
148
+ ### AI Safety & Research
149
+ - **Model auditing** with public verification
150
+ - **Benchmark reproducibility** with on-chain records
151
+ - **Semantic guardrails** for LLM outputs
152
+
153
+ ---
154
+
155
+ ## 📊 Cost Calculator
156
+
157
+ Compare costs with traditional LLM approaches:
158
+
159
+ | Service | Cost/Query | 1M Queries | BSV Verified |
160
+ |---------|-----------|------------|--------------|
161
+ | Semantic Scalpel BSV | $0.0001 | $100 | ✅ Yes |
162
+ | GPT-4 | $0.03 | $30,000 | ❌ No |
163
+ | Claude Sonnet | $0.015 | $15,000 | ❌ No |
164
+
165
+ **Savings: 99.7% vs GPT-4** + **Blockchain verification included**
166
+
167
+ ---
168
+
169
+ ## 🔒 Security & Privacy
170
+
171
+ - ✅ **No model weights exposed** (API-only demo)
172
+ - ✅ **No training data stored** on-chain (only hashes)
173
+ - ✅ **Public verification** without revealing inputs
174
+ - ✅ **GDPR compliant** (can hash PII before storage)
175
+
176
+ ---
177
+
178
+ ## 🛠 How to Use
179
+
180
+ ### 1. Via This Space (Demo)
181
+ Click examples above or try your own sentences in the "Try It" tab.
182
+
183
+ ### 2. Via API (Production)
184
+ ```python
185
+ import requests
186
+
187
+ response = requests.post(
188
+ "https://api.semanticscalpel.com/v1/disambiguate",
189
+ headers={"Authorization": "Bearer YOUR_API_KEY"},
190
+ json={
191
+ "sentence": "I saw her duck",
192
+ "bsv_verify": True # Enable blockchain verification
193
+ }
194
+ )
195
+
196
+ # Response includes:
197
+ # - Semantic analysis
198
+ # - Plausibility scores
199
+ # - BSV transaction ID
200
+ # - Verification URL
201
+ ```
202
+
203
+ ### 3. Verify On-Chain
204
+ Every prediction includes a TxID. Verify independently:
205
+ ```bash
206
+ # Check on WhatsOnChain
207
+ https://whatsonchain.com/tx/{txid}
208
+
209
+ # Or use BSV API
210
+ curl https://api.whatsonchain.com/v1/bsv/main/tx/{txid}
211
+ ```
212
+
213
+ ---
214
+
215
+ ## 📚 Technical Details
216
+
217
+ ### Architecture
218
+ - **Frontend**: Gradio UI (this Space)
219
+ - **Backend**: Origin Neural AI Semantic Engine
220
+ - **Blockchain**: BSV mainnet (public ledger)
221
+ - **Storage**: Distributed (no central database)
222
+
223
+ ### Performance Metrics
224
+ | Metric | Value |
225
+ |--------|-------|
226
+ | Latency | 6ms (avg) |
227
+ | Accuracy (Garden Paths) | 95% |
228
+ | Cost per Query | $0.0001 |
229
+ | BSV Confirmation | ~1 sec |
230
+ | Parameters | 9.96M |
231
+
232
+ ### Benchmark Results
233
+ | Task | Semantic Scalpel | GPT-4 | BERT |
234
+ |------|-----------------|-------|------|
235
+ | Metonymy Resolution | 95% | 72% | 68% |
236
+ | Garden Path Parsing | 92% | 65% | 71% |
237
+ | Coercion Detection | 89% | 70% | 66% |
238
+ | Polysemy Ranking | 94% | 78% | 75% |
239
+
240
+ ---
241
+
242
+ ## 🎓 Academic Citation
243
+
244
+ If you use Semantic Scalpel BSV in your research, please cite:
245
+
246
+ ```bibtex
247
+ @inproceedings{daugherty2026semanticscalpelbsv,
248
+ title={Semantic Scalpel BSV: Blockchain-Verified Semantic Disambiguation},
249
+ author={Daugherty, Bryan},
250
+ booktitle={SemEval-2026 Task 5},
251
+ year={2026},
252
+ organization={SmartLedger Solutions}
253
+ }
254
+ ```
255
+
256
+ ---
257
+
258
+ ## 🏆 SemEval-2026 Competition Entry
259
+
260
+ **Official Entry:** Task 5 - Ambiguity in Word Sense
261
+ **Team:** SmartLedger Solutions
262
+ **Innovation:** First blockchain-verified semantic model in competition history
263
+
264
+ [Competition Page](https://www.codabench.org/competitions/10877/) | [Leaderboard](#) | [Paper](#)
265
+
266
+ ---
267
+
268
+ ## 🔗 Links
269
+
270
+ - **Company**: [SmartLedger Solutions](https://smartledger.solutions)
271
+ - **Main Semantic Scalpel**: [Non-BSV Version](https://huggingface.co/spaces/GotThatData/semantic-scalpel)
272
+ - **Daugherty Engine**: [Optimization Engine](https://huggingface.co/spaces/GotThatData/daugherty-engine)
273
+ - **API Documentation**: [semanticscalpel.com/docs](https://semanticscalpel.com/docs)
274
+ - **BSV Explorer**: [WhatsOnChain](https://whatsonchain.com/)
275
+ - **GitHub**: [github.com/smartledger](https://github.com/smartledger)
276
+
277
+ ---
278
+
279
+ ## 👤 About
280
+
281
+ **Created by Bryan Daugherty** | [SmartLedger Solutions](https://smartledger.solutions)
282
+
283
+ Chairman, SmartLedger Solutions - Building the intersection of AI, blockchain, and semantic technology.
284
+
285
+ - 🐦 Twitter: [@bwdaugherty](https://twitter.com/bwdaugherty)
286
+ - 💼 LinkedIn: [bwdaugherty](https://linkedin.com/in/bwdaugherty)
287
+ - 🐙 GitHub: [Saifullah62](https://github.com/Saifullah62)
288
+
289
+ ---
290
+
291
+ ## 📜 License
292
+
293
+ MIT License - See [LICENSE](LICENSE) for details.
294
+
295
+ **API Access**: Free tier available for research. [Contact us](mailto:bryan@smartledger.solutions) for production licensing.
296
+
297
+ ---
298
+
299
+ <div align="center">
300
+
301
+ **The future of semantic understanding lies not in the blunt force of billions of parameters,**
302
+ **but in the surgical application of semantic flow dynamics.**
303
+
304
+ **Verified. Immutable. Trustworthy.**
305
+
306
+ ⛓ **Powered by BSV Blockchain**
307
+
308
+ </div>