Spaces:
Sleeping
Sleeping
readme updated
Browse files
README.md
CHANGED
|
@@ -13,10 +13,18 @@ short_description: Behavioral clustering engine for Web3 wallets
|
|
| 13 |
|
| 14 |
Unsupervised machine learning project to segment cryptocurrency wallets into behavioral personas (e.g., "Whales", "NFT Flippers", "Dormant") based on on-chain transaction data.
|
| 15 |
|
| 16 |
-
##
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
|
| 21 |
## Key Features
|
| 22 |
- **Robust Preprocessing:** Handles extreme data skewness (common in financial data) using **Yeo-Johnson Power Transformation**.
|
|
|
|
| 13 |
|
| 14 |
Unsupervised machine learning project to segment cryptocurrency wallets into behavioral personas (e.g., "Whales", "NFT Flippers", "Dormant") based on on-chain transaction data.
|
| 15 |
|
| 16 |
+
## ❓ The Problem
|
| 17 |
+
In the Web3 ecosystem, users are anonymous by default. A wallet address (`0x123...`) gives no indication of whether the user is a high-value institution, a retail trader, a bot, or an NFT collector.
|
| 18 |
+
* **Marketing is blind:** Projects cannot target specific users effectively.
|
| 19 |
+
* **Risk is opaque:** Protocols cannot easily distinguish between organic users and sybil attackers.
|
| 20 |
+
* **Data is noisy:** Raw transaction logs are massive and unreadable without advanced processing.
|
| 21 |
+
|
| 22 |
+
## 💡 The Solution: Cluster Protocol
|
| 23 |
+
**Cluster Protocol** is an AI-powered engine that "fingerprints" wallets based on their behavior, not their identity.
|
| 24 |
+
1. **Ingest:** Pulls raw on-chain data (Gas spent, NFT volume, DEX trades, etc.) via Dune Analytics.
|
| 25 |
+
2. **Process:** Normalizes skewed financial data using **Yeo-Johnson Power Transformations**.
|
| 26 |
+
3. **Cluster:** Uses **K-Means Clustering** to mathematically group similar wallets.
|
| 27 |
+
4. **Label:** Assigns a human-readable persona (e.g., "Active Retail", "High-Frequency Bot") with a confidence score.
|
| 28 |
|
| 29 |
## Key Features
|
| 30 |
- **Robust Preprocessing:** Handles extreme data skewness (common in financial data) using **Yeo-Johnson Power Transformation**.
|