amaan784
/

agentic-world-behavioral

mistral-worldwide-2026

Model card Files Files and versions

agentic-world-behavioral / README.md

amaan784's picture

Upload README.md with huggingface_hub

c6bbb61 verified 2 days ago

|

history blame contribute delete

2.17 kB

	---
	license: apache-2.0
	base model: mistralai/Mistral-7B-Instruct-v0.3
	tags:
	- mistral
	- lora
	- behavioral-ai
	- posthog
	- hackathon
	- mistral-worldwide-2026
	library name: peft
	---

	# Agentic World — Behavioral Digital Twin LoRA Adapters

	Fine-tuned LoRA adapters for Mistral 7B Instruct v0.3, trained on real user behavior
	from PostHog session recordings. Each adapter represents a distinct behavioral demographic
	(e.g., Frustrated Clicker, Cautious Explorer, Engaged User, Speedster).

	## How it works

	1. Real user sessions recorded via PostHog (clicks, scrolls, inputs, navigation)
	2. Sessions parsed, described via Mistral, embedded, and clustered via K-Means
	3. Per-cluster training data generated as (page state → next action) pairs
	4. LoRA fine-tuned on A100 80GB with bf16, Flash Attention 2, packing

	## Training Details

	- Base model: mistralai/Mistral-7B-Instruct-v0.3
	- Method: LoRA (rank=32, alpha=64, all projection layers)
	- Framework: HuggingFace TRL SFTTrainer + PEFT
	- Hardware: NVIDIA A100 80GB on Brev
	- Epochs: 5
	- Tracked in: [W&B Project](https://wandb.ai/amaan784-columbia-university/agentic-world)

	## Clusters

	Each subdirectory contains a LoRA adapter for one demographic:

	\| Cluster \| Label \| Description \|
	\|---------\|-------\|-------------\|
	\| 0 \| Speedster \| Fast, decisive navigation with minimal hesitation \|
	\| 1 \| Scanner \| Quick browsing, skimming content \|
	\| 2 \| Erratic Clicker \| Unpredictable click patterns \|
	\| 3 \| Cautious Explorer \| Slow, careful reading and interaction \|
	\| 4 \| Frustrated Clicker \| Repeated clicks, signs of confusion \|

	## Usage
	```python
	from peft import PeftModel
	from transformers import AutoModelForCausalLM, AutoTokenizer

	base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
	model = PeftModel.from_pretrained(base, "amaan784/agentic-world-behavioral", subfolder="cluster_0_lora")
	tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
	```

	## Links

	- [GitHub: PosthogAgent](https://github.com/exploring-curiosity/PosthogAgent)
	- [W&B Training Runs](https://wandb.ai/amaan784-columbia-university/agentic-world)