amaan784
/

agentic-world-behavioral

+---
+license: apache-2.0
+base model: mistralai/Mistral-7B-Instruct-v0.3
+tags:
+  - mistral
+  - lora
+  - behavioral-ai
+  - posthog
+  - hackathon
+  - mistral-worldwide-2026
+library name: peft
+---
+# Agentic World — Behavioral Digital Twin LoRA Adapters
+Fine-tuned LoRA adapters for Mistral 7B Instruct v0.3, trained on real user behavior
+from PostHog session recordings. Each adapter represents a distinct behavioral demographic
+(e.g., Frustrated Clicker, Cautious Explorer, Engaged User, Speedster).
+## How it works
+1. Real user sessions recorded via PostHog (clicks, scrolls, inputs, navigation)
+2. Sessions parsed, described via Mistral, embedded, and clustered via K-Means
+3. Per-cluster training data generated as (page state → next action) pairs
+4. LoRA fine-tuned on A100 80GB with bf16, Flash Attention 2, packing
+## Training Details
+- **Base model:** mistralai/Mistral-7B-Instruct-v0.3
+- **Method:** LoRA (rank=32, alpha=64, all projection layers)
+- **Framework:** HuggingFace TRL SFTTrainer + PEFT
+- **Hardware:** NVIDIA A100 80GB on Brev
+- **Epochs:** 5
+- **Tracked in:** [W&B Project](https://wandb.ai/amaan784-columbia-university/agentic-world)
+## Clusters
+Each subdirectory contains a LoRA adapter for one demographic:
+| Cluster | Label | Description |
+|---------|-------|-------------|
+| 0 | Speedster | Fast, decisive navigation with minimal hesitation |
+| 1 | Scanner | Quick browsing, skimming content |
+| 2 | Erratic Clicker | Unpredictable click patterns |
+| 3 | Cautious Explorer | Slow, careful reading and interaction |
+| 4 | Frustrated Clicker | Repeated clicks, signs of confusion |
+## Usage
+```python
+from peft import PeftModel
+from transformers import AutoModelForCausalLM, AutoTokenizer
+base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
+model = PeftModel.from_pretrained(base, "amaan784/agentic-world-behavioral", subfolder="cluster_0_lora")
+tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
+```
+## Links
+- [GitHub: PosthogAgent](https://github.com/exploring-curiosity/PosthogAgent)
+- [W&B Training Runs](https://wandb.ai/amaan784-columbia-university/agentic-world)