leoguinan commited on
Commit
680a754
·
verified ·
1 Parent(s): 7ee6cb3

Upload model card

Browse files
Files changed (1) hide show
  1. README.md +148 -0
README.md ADDED
@@ -0,0 +1,148 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ library_name: founder-game-classifier
6
+ tags:
7
+ - text-classification
8
+ - founders
9
+ - content-analysis
10
+ - sentence-transformers
11
+ datasets:
12
+ - custom
13
+ metrics:
14
+ - accuracy
15
+ pipeline_tag: text-classification
16
+ ---
17
+
18
+ # Founder Game Classifier
19
+
20
+ A trained classifier that identifies which of **6 founder games** a piece of content belongs to.
21
+
22
+ ## Model Description
23
+
24
+ This model classifies text content into one of six "founder games" - patterns of communication and content creation common among founders, creators, and thought leaders.
25
+
26
+ ### The 6 Games
27
+
28
+ | Game | Name | Description |
29
+ |------|------|-------------|
30
+ | **G1** | Identity/Canon | Recruiting into identity, lineage, belonging, status, canon formation |
31
+ | **G2** | Ideas/Play Mining | Extracting reusable plays, tactics, heuristics; "do this / steal this" |
32
+ | **G3** | Models/Understanding | Building mental models, frameworks, mechanisms, explanations |
33
+ | **G4** | Performance/Competition | Winning, dominance, execution, metrics, endurance, zero-sum edges |
34
+ | **G5** | Meaning/Therapy | Healing, values, emotional processing, personal transformation |
35
+ | **G6** | Network/Coordination | Community building, protocols, collaboration, collective action |
36
+
37
+ ## Usage
38
+
39
+ ### Installation
40
+
41
+ ```bash
42
+ pip install founder-game-classifier
43
+ ```
44
+
45
+ ### Basic Usage
46
+
47
+ ```python
48
+ from founder_game_classifier import GameClassifier
49
+
50
+ # Load the model (downloads from Hub on first use)
51
+ classifier = GameClassifier.from_pretrained("leoguinan/founder-game-classifier")
52
+
53
+ # Classify a single text
54
+ result = classifier.predict("Here's a tactic you can steal for your next launch...")
55
+
56
+ print(result["primary_game"]) # "G2"
57
+ print(result["confidence"]) # 0.72
58
+ print(result["probabilities"]) # {"G1": 0.05, "G2": 0.72, "G3": 0.10, ...}
59
+ ```
60
+
61
+ ### Batch Classification
62
+
63
+ ```python
64
+ texts = [
65
+ "Here's the mental model I use for thinking about systems...",
66
+ "Join our community of builders who are changing the world...",
67
+ "I tried 47 different tactics. Here's what actually worked...",
68
+ ]
69
+
70
+ results = classifier.predict_batch(texts)
71
+
72
+ for text, result in zip(texts, results):
73
+ print(f"{result['primary_game']}: {text[:50]}...")
74
+ ```
75
+
76
+ ### Get Aggregate Signature
77
+
78
+ Useful for analyzing a corpus of content:
79
+
80
+ ```python
81
+ texts = load_my_blog_posts() # List of strings
82
+ signature = classifier.get_game_signature(texts)
83
+
84
+ print(signature)
85
+ # {'G1': 0.05, 'G2': 0.42, 'G3': 0.18, 'G4': 0.20, 'G5': 0.08, 'G6': 0.07}
86
+ ```
87
+
88
+ ## Model Architecture
89
+
90
+ - **Embedding Model**: `all-MiniLM-L6-v2` (384 dimensions)
91
+ - **Classifier**: Logistic Regression (sklearn)
92
+ - **Manifold System**: Mahalanobis distance to game centroids (optional)
93
+
94
+ ## Training Data
95
+
96
+ The model was trained on labeled founder content spanning:
97
+ - Podcast transcripts
98
+ - Blog posts
99
+ - Twitter threads
100
+ - Newsletter content
101
+
102
+ Training used a multi-stage pipeline:
103
+ 1. Text chunking and span extraction
104
+ 2. LLM-assisted labeling with human verification
105
+ 3. Embedding generation
106
+ 4. Classifier training with cross-validation
107
+
108
+ ## Performance
109
+
110
+ Validated on held-out test set:
111
+
112
+ | Metric | Score |
113
+ |--------|-------|
114
+ | Accuracy | 0.78 |
115
+ | Macro F1 | 0.74 |
116
+ | Top-2 Accuracy | 0.91 |
117
+
118
+ The model performs best on clear examples of each game and may show lower confidence on boundary cases or mixed content.
119
+
120
+ ## Limitations
121
+
122
+ - Trained primarily on English content from tech/startup domain
123
+ - May not generalize well to non-business contexts
124
+ - Short texts (<50 words) may have lower accuracy
125
+ - Cultural and domain biases from training data
126
+
127
+ ## Citation
128
+
129
+ ```bibtex
130
+ @misc{guinan2024foundergameclassifier,
131
+ title={Founder Game Classifier},
132
+ author={Leo Guinan},
133
+ year={2024},
134
+ publisher={Hugging Face},
135
+ url={https://huggingface.co/leoguinan/founder-game-classifier}
136
+ }
137
+ ```
138
+
139
+ ## License
140
+
141
+ MIT License - free for commercial and non-commercial use.
142
+
143
+ ## Files
144
+
145
+ - `classifier.pkl` - Trained LogisticRegression model (19KB)
146
+ - `label_encoder.pkl` - Label encoder for game classes (375B)
147
+ - `metadata.json` - Model metadata and configuration (143B)
148
+ - `game_manifolds.json` - Manifold centroids and covariances for geometric analysis (29MB)