amaan784 commited on
Commit
c6bbb61
·
verified ·
1 Parent(s): 815b6de

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base model: mistralai/Mistral-7B-Instruct-v0.3
4
+ tags:
5
+ - mistral
6
+ - lora
7
+ - behavioral-ai
8
+ - posthog
9
+ - hackathon
10
+ - mistral-worldwide-2026
11
+ library name: peft
12
+ ---
13
+
14
+ # Agentic World — Behavioral Digital Twin LoRA Adapters
15
+
16
+ Fine-tuned LoRA adapters for Mistral 7B Instruct v0.3, trained on real user behavior
17
+ from PostHog session recordings. Each adapter represents a distinct behavioral demographic
18
+ (e.g., Frustrated Clicker, Cautious Explorer, Engaged User, Speedster).
19
+
20
+ ## How it works
21
+
22
+ 1. Real user sessions recorded via PostHog (clicks, scrolls, inputs, navigation)
23
+ 2. Sessions parsed, described via Mistral, embedded, and clustered via K-Means
24
+ 3. Per-cluster training data generated as (page state → next action) pairs
25
+ 4. LoRA fine-tuned on A100 80GB with bf16, Flash Attention 2, packing
26
+
27
+ ## Training Details
28
+
29
+ - **Base model:** mistralai/Mistral-7B-Instruct-v0.3
30
+ - **Method:** LoRA (rank=32, alpha=64, all projection layers)
31
+ - **Framework:** HuggingFace TRL SFTTrainer + PEFT
32
+ - **Hardware:** NVIDIA A100 80GB on Brev
33
+ - **Epochs:** 5
34
+ - **Tracked in:** [W&B Project](https://wandb.ai/amaan784-columbia-university/agentic-world)
35
+
36
+ ## Clusters
37
+
38
+ Each subdirectory contains a LoRA adapter for one demographic:
39
+
40
+ | Cluster | Label | Description |
41
+ |---------|-------|-------------|
42
+ | 0 | Speedster | Fast, decisive navigation with minimal hesitation |
43
+ | 1 | Scanner | Quick browsing, skimming content |
44
+ | 2 | Erratic Clicker | Unpredictable click patterns |
45
+ | 3 | Cautious Explorer | Slow, careful reading and interaction |
46
+ | 4 | Frustrated Clicker | Repeated clicks, signs of confusion |
47
+
48
+ ## Usage
49
+ ```python
50
+ from peft import PeftModel
51
+ from transformers import AutoModelForCausalLM, AutoTokenizer
52
+
53
+ base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
54
+ model = PeftModel.from_pretrained(base, "amaan784/agentic-world-behavioral", subfolder="cluster_0_lora")
55
+ tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
56
+ ```
57
+
58
+ ## Links
59
+
60
+ - [GitHub: PosthogAgent](https://github.com/exploring-curiosity/PosthogAgent)
61
+ - [W&B Training Runs](https://wandb.ai/amaan784-columbia-university/agentic-world)