nice-bill commited on
Commit
1228690
·
1 Parent(s): 32fcfa6

readme updated

Browse files
Files changed (1) hide show
  1. README.md +12 -4
README.md CHANGED
@@ -13,10 +13,18 @@ short_description: Behavioral clustering engine for Web3 wallets
13
 
14
  Unsupervised machine learning project to segment cryptocurrency wallets into behavioral personas (e.g., "Whales", "NFT Flippers", "Dormant") based on on-chain transaction data.
15
 
16
- ## Project Overview
17
- - **Goal:** Identify distinct user behaviors in Web3 to enable personalized targeting or risk analysis.
18
- - **Data Source:** Dune Analytics (Ethereum transaction history).
19
- - **Technique:** Unsupervised Clustering (K-Means).
 
 
 
 
 
 
 
 
20
 
21
  ## Key Features
22
  - **Robust Preprocessing:** Handles extreme data skewness (common in financial data) using **Yeo-Johnson Power Transformation**.
 
13
 
14
  Unsupervised machine learning project to segment cryptocurrency wallets into behavioral personas (e.g., "Whales", "NFT Flippers", "Dormant") based on on-chain transaction data.
15
 
16
+ ## The Problem
17
+ In the Web3 ecosystem, users are anonymous by default. A wallet address (`0x123...`) gives no indication of whether the user is a high-value institution, a retail trader, a bot, or an NFT collector.
18
+ * **Marketing is blind:** Projects cannot target specific users effectively.
19
+ * **Risk is opaque:** Protocols cannot easily distinguish between organic users and sybil attackers.
20
+ * **Data is noisy:** Raw transaction logs are massive and unreadable without advanced processing.
21
+
22
+ ## 💡 The Solution: Cluster Protocol
23
+ **Cluster Protocol** is an AI-powered engine that "fingerprints" wallets based on their behavior, not their identity.
24
+ 1. **Ingest:** Pulls raw on-chain data (Gas spent, NFT volume, DEX trades, etc.) via Dune Analytics.
25
+ 2. **Process:** Normalizes skewed financial data using **Yeo-Johnson Power Transformations**.
26
+ 3. **Cluster:** Uses **K-Means Clustering** to mathematically group similar wallets.
27
+ 4. **Label:** Assigns a human-readable persona (e.g., "Active Retail", "High-Frequency Bot") with a confidence score.
28
 
29
  ## Key Features
30
  - **Robust Preprocessing:** Handles extreme data skewness (common in financial data) using **Yeo-Johnson Power Transformation**.