Spaces:

MusoraProductDepartment
/

AI_Message_Generator

Sleeping

App Files Files Community

AI_Message_Generator / README.md

Danialebrat

Updating python version

6e08a38 2 months ago

preview code

raw

history blame contribute delete

36.9 kB

	---
	title: AI Messaging System
	emoji: 🎶
	colorFrom: blue
	colorTo: gray
	sdk: streamlit
	sdk_version: 1.46.0
	python_version: 3.9
	app_file: app.py
	---

	# AI Messaging System - Visualization Tool

	An interactive Streamlit-based visualization and experimentation tool for the AI Messaging System v2. This tool enables non-technical users to generate, visualize, analyze, and improve personalized message campaigns with integrated A/B testing capabilities and comprehensive historical tracking.

	## 🎯 Purpose

	- Generate personalized messages with various configurations
	- Run A/B tests with parallel processing and side-by-side comparison
	- Visualize and analyze messages in real-time across all campaign stages
	- Provide detailed feedback with message header/body tracking
	- Track improvements and trends across historical experiments
	- Cloud-native architecture with Snowflake integration
	- Ready for deployment on HuggingFace Spaces

	---

	## 🏗️ Architecture Overview

	### Core Philosophy: In-Memory + Cloud Persistence

	The system operates on a hybrid architecture:
	- In-Memory Operations: All active experiments run in `session_state` (no local files)
	- Cloud Persistence: Data stored in Snowflake for long-term analytics
	- On-Demand Loading: Historical data loaded from Snowflake when needed
	- One-Click Storage: Results persisted with a single button click

	### Why This Architecture?

	1. HuggingFace Ready: No local file dependencies
	2. Fast Operations: In-memory processing for real-time feedback
	3. Scalable: Snowflake handles unlimited historical data
	4. Clean Separation: Current experiments vs. historical data
	5. Versioned Configs: Automatic configuration versioning in Snowflake

	---

	## 📁 Directory Structure

	```
	visualization/
	├── app.py # Main entry point with authentication & brand selection
	├── pages/ # Multi-page Streamlit application
	│ ├── 1_Campaign_Builder.py # Campaign configuration & generation (with A/B testing)
	│ ├── 2_Message_Viewer.py # Message browsing & feedback (A/B aware)
	│ ├── 4_Analytics.py # Performance metrics for CURRENT experiment
	│ └── 5_Historical_Analytics.py # Historical experiments from Snowflake
	├── utils/ # Utility modules
	│ ├── __init__.py
	│ ├── auth.py # Authentication logic
	│ ├── config_manager.py # Configuration loading from Snowflake
	│ ├── db_manager.py # Snowflake database operations (NEW)
	│ ├── experiment_runner.py # Parallel experiment execution (NEW)
	│ ├── session_feedback_manager.py # In-memory feedback management (NEW)
	│ └── theme.py # Brand-specific theming
	├── data/ # Local data storage (configs cached here)
	│ └── UI_users/ # Pre-loaded user lists (100 users per brand)
	│ ├── drumeo_users.csv
	│ ├── pianote_users.csv
	│ ├── guitareo_users.csv
	│ └── singeo_users.csv
	├── requirements.txt # Python dependencies
	├── README.md # This file
	├── IMPLEMENTATION_COMPLETE.md # Refactoring details & progress
	└── ARCHITECTURE_REFACTOR_GUIDE.md # Technical refactoring guide
	```

	## 🗄️ Snowflake Database Schema

	### Tables

	1. MESSAGING_SYSTEM_V2.UI.CONFIGS
	```sql
	CONFIG_NAME VARCHAR -- Configuration identifier
	CONFIG_FILE VARIANT -- JSON configuration
	CONFIG_VERSION INTEGER -- Auto-incrementing version
	BRAND VARCHAR -- Brand name (drumeo, pianote, etc.)
	CREATED_AT TIMESTAMP -- Creation timestamp
	```

	2. MESSAGING_SYSTEM_V2.UI.EXPERIMENT_METADATA
	```sql
	EXPERIMENT_ID VARCHAR -- Unique experiment identifier
	CONFIG_NAME VARCHAR -- Configuration used
	BRAND VARCHAR -- Brand name
	CAMPAIGN_NAME VARCHAR -- Campaign identifier
	STAGE INTEGER -- Stage number (1-11)
	LLM_MODEL VARCHAR -- Model used (gpt-4o-mini, gemini-2.5-flash-lite, etc.)
	TOTAL_MESSAGES INTEGER -- Messages generated in this stage
	TOTAL_USERS INTEGER -- Unique users in this stage
	PLATFORM VARCHAR -- Platform (push, email, etc.)
	PERSONALIZATION BOOLEAN -- Personalization enabled
	INVOLVE_RECSYS BOOLEAN -- Recommendations enabled
	RECSYS_CONTENTS ARRAY -- Recommendation types
	SEGMENT_INFO VARCHAR -- Segment description
	CAMPAIGN_INSTRUCTIONS VARCHAR -- Campaign-wide instructions
	PER_MESSAGE_INSTRUCTIONS VARCHAR -- Stage-specific instructions
	START_TIME TIMESTAMP -- Experiment start time
	END_TIME TIMESTAMP -- Experiment end time (optional)
	```

	3. MESSAGING_SYSTEM_V2.UI.FEEDBACKS
	```sql
	EXPERIMENT_ID VARCHAR -- Links to EXPERIMENT_METADATA
	USER_ID INTEGER -- User who received the message
	STAGE INTEGER -- Stage number
	FEEDBACK_TYPE VARCHAR -- 'reject' (only type currently)
	REJECTION_REASON VARCHAR -- Reason category key
	REJECTION_TEXT VARCHAR -- Custom text explanation
	MESSAGE_HEADER VARCHAR -- Full message header
	MESSAGE_BODY VARCHAR -- Full message body
	CAMPAIGN_NAME VARCHAR -- Campaign identifier
	BRAND VARCHAR -- Brand name
	CONFIG_NAME VARCHAR -- Configuration used
	TIMESTAMP TIMESTAMP -- Feedback submission time
	```

	---

	## 🚀 Getting Started

	### Prerequisites

	- Python 3.9+
	- Snowflake account and credentials
	- OpenAI/Google AI API keys
	- AI Messaging System v2 installed in parent directory
	- Plotly for data visualization

	### Installation

	```bash
	# Install dependencies
	pip install -r requirements.txt

	# Set up environment variables (.env file)
	SNOWFLAKE_USER=your_user
	SNOWFLAKE_PASSWORD=your_password
	SNOWFLAKE_ACCOUNT=your_account
	SNOWFLAKE_ROLE=your_role
	SNOWFLAKE_DATABASE=MESSAGING_SYSTEM_V2
	SNOWFLAKE_WAREHOUSE=your_warehouse
	SNOWFLAKE_SCHEMA=UI

	# LLM API keys
	OPENAI_API_KEY=your_key
	GOOGLE_API_KEY=your_key
	```

	### Running the Application

	```bash
	# From the visualization directory
	cd visualization
	streamlit run app.py
	```

	### First-Time Setup

	1. Ensure `.env` file exists with Snowflake credentials
	2. Verify user CSV files exist in `data/UI_users/` for each brand
	3. Create Snowflake tables using schema above (or let system auto-create)
	4. Upload initial configurations to Snowflake (optional - can create in UI)
	5. Login with authorized email and access token

	---

	## 📊 Complete Data Flow

	### Session State Architecture

	All active data lives in Streamlit's `session_state`:

	```python
	# Single Experiment Mode
	st.session_state.ui_log_data # DataFrame: Generated messages
	st.session_state.current_experiment_id # String: Experiment identifier
	st.session_state.current_experiment_metadata # List[Dict]: Metadata per stage
	st.session_state.current_feedbacks # List[Dict]: Feedback records

	# AB Testing Mode
	st.session_state.ui_log_data_a # DataFrame: Experiment A messages
	st.session_state.ui_log_data_b # DataFrame: Experiment B messages
	st.session_state.experiment_a_id # String: Experiment A identifier
	st.session_state.experiment_b_id # String: Experiment B identifier
	st.session_state.experiment_a_metadata # List[Dict]: A's metadata
	st.session_state.experiment_b_metadata # List[Dict]: B's metadata
	st.session_state.feedbacks_a # List[Dict]: A's feedback
	st.session_state.feedbacks_b # List[Dict]: B's feedback

	# Configuration
	st.session_state.campaign_config # Dict: Single mode config
	st.session_state.campaign_config_a # Dict: AB mode config A
	st.session_state.campaign_config_b # Dict: AB mode config B
	st.session_state.configs_cache # Dict: Cached configs from Snowflake
	```

	### End-to-End Workflow

	```
	┌─────────────────────────────────────────────────────────────────┐
	│ 1. CAMPAIGN BUILDER │
	├─────────────────────────────────────────────────────────────────┤
	│ • Load configs from Snowflake (cached in session_state) │
	│ • User selects/modifies configuration │
	│ • Sample users from CSV files │
	│ • Generate messages (stored in session_state.ui_log_data) │
	│ • Track metadata (session_state.current_experiment_metadata) │
	└─────────────────────────────────────────────────────────────────┘
	↓
	┌─────────────────────────────────────────────────────────────────┐
	│ 2. MESSAGE VIEWER │
	├─────────────────────────────────────────────────────────────────┤
	│ • Load messages from session_state.ui_log_data │
	│ • Display in user-centric or stage-centric views │
	│ • User provides feedback (reject messages) │
	│ • Feedback stored in session_state.current_feedbacks │
	│ • [BUTTON] Store Results to Snowflake │
	│ → Writes metadata to EXPERIMENT_METADATA table │
	│ → Writes feedback to FEEDBACKS table │
	└─────────────────────────────────────────────────────────────────┘
	↓
	┌─────────────────────────────────────────────────────────────────┐
	│ 3. ANALYTICS │
	├─────────────────────────────────────────────────────────────────┤
	│ • Load current experiment from session_state │
	│ • Calculate metrics using SessionFeedbackManager │
	│ • Show overall performance, stage analysis, rejection reasons │
	│ • Support AB testing side-by-side comparison │
	└─────────────────────────────────────────────────────────────────┘
	↓
	┌─────────────────────────────────────────────────────────────────┐
	│ 4. HISTORICAL ANALYTICS │
	├─────────────────────────────────────────────────────────────────┤
	│ • [BUTTON] Load Historical Data from Snowflake │
	│ • Query EXPERIMENT_METADATA + FEEDBACKS tables │
	│ • Calculate aggregate metrics across all experiments │
	│ • Show trends: rejection rates over time │
	│ • Compare configurations: which performs best │
	│ • Filter by date range │
	└─────────────────────────────────────────────────────────────────┘
	```

	---

	## 📄 Pages Overview

	### Page 0: Home & Authentication (app.py)

	Purpose: Login, brand selection, config loading, and navigation hub

	Features:
	- Email and token-based authentication
	- Brand selection (Drumeo, Pianote, Guitareo, Singeo)
	- Config loading from Snowflake on startup
	- Brand-specific theming applied throughout app
	- Navigation guide and quick start instructions
	- Current experiment status overview

	Technical Details:
	- Session-based authentication
	- Brand selection persists via `st.session_state.selected_brand`
	- Configs cached in `st.session_state.configs_cache` on first load
	- Dynamic theming using `utils/theme.py`
	- Loads environment variables from `.env` file

	Key Functions:
	```python
	def load_configs_from_snowflake(brand):
	"""Load all configs for brand from Snowflake, cache in session_state."""
	session = create_snowflake_session()
	config_manager = ConfigManager(session)
	configs = config_manager.load_configs_from_snowflake(brand)
	st.session_state.configs_cache = configs
	session.close()
	```

	---

	### Page 1: Campaign Builder

	Purpose: Create and run message generation campaigns with integrated A/B testing

	Architecture: In-memory experiment execution with parallel AB testing

	#### Key Features

	Configuration Management:
	- Loads configs from cached `session_state.configs_cache`
	- Real-time config editing in UI
	- Save configs to Snowflake with auto-versioning
	- Quick save or save-as-new options

	A/B Testing Toggle:
	- Single Experiment Mode (default): One campaign
	- A/B Testing Mode: Two parallel experiments for comparison

	User Sampling:
	- Random sampling from brand-specific CSV files
	- 1-25 users selectable
	- Same users used for both AB experiments (fair comparison)

	Parallel Execution:
	- Uses `ExperimentRunner.run_ab_test_parallel()`
	- Threading for simultaneous A/B generation
	- Separate Snowflake sessions per experiment (no conflicts)
	- Console logging for thread progress
	- Results stored directly in session_state

	Technical Details:

	Single Mode Generation:
	```python
	# ExperimentRunner handles all generation logic
	runner = ExperimentRunner(brand=brand, system_config=config)

	success, ui_log_data, metadata = runner.run_single_experiment(
	config=campaign_config,
	sampled_users_df=users_df,
	experiment_id=experiment_id,
	create_session_func=create_snowflake_session,
	progress_container=st.container()
	)

	# Store in session_state
	st.session_state.ui_log_data = ui_log_data
	st.session_state.current_experiment_metadata = metadata
	st.session_state.current_experiment_id = experiment_id
	```

	AB Mode Generation:
	```python
	# Parallel execution in threads
	results = runner.run_ab_test_parallel(
	config_a=campaign_config_a,
	config_b=campaign_config_b,
	sampled_users_df=users_df,
	experiment_a_id=experiment_a_id,
	experiment_b_id=experiment_b_id,
	create_session_func=create_snowflake_session
	)

	# Store both results
	st.session_state.ui_log_data_a = results['a']['ui_log_data']
	st.session_state.ui_log_data_b = results['b']['ui_log_data']
	st.session_state.experiment_a_metadata = results['a']['metadata']
	st.session_state.experiment_b_metadata = results['b']['metadata']
	```

	Configuration Saving to Snowflake:
	```python
	# Save with auto-versioning
	session = create_snowflake_session()
	db_manager = UIDatabaseManager(session)
	success = db_manager.save_config(
	config_name=config_name,
	config_data=campaign_config,
	brand=brand
	)
	session.close()
	```

	Thread-Safe Progress Handling:
	- Main thread: Full Streamlit UI updates
	- Worker threads: Console logging only
	- Dummy progress bars prevent Streamlit errors in threads

	---

	### Page 2: Message Viewer

	Purpose: Browse, search, and evaluate generated messages with A/B testing awareness

	Architecture: Loads from session_state, feedback stored in-memory

	#### Key Features

	Automatic AB Detection:
	```python
	def detect_ab_testing_mode():
	"""Detect AB mode from session_state."""
	return (
	'ui_log_data_a' in st.session_state and
	'ui_log_data_b' in st.session_state and
	st.session_state.ui_log_data_a is not None and
	st.session_state.ui_log_data_b is not None
	)
	```

	Message Loading:
	```python
	# Single mode
	def get_single_experiment_data():
	if 'ui_log_data' in st.session_state:
	return st.session_state.ui_log_data
	return None

	# AB mode - loads both dataframes
	messages_a_df = st.session_state.ui_log_data_a
	messages_b_df = st.session_state.ui_log_data_b
	```

	Feedback System:
	- In-memory storage using `SessionFeedbackManager`
	- Rejection categories: Poor header, Poor body, Grammar, Emoji, Recommendation issues, Similar to previous, etc.
	- Stores full message header and body with feedback
	- Undo rejection capability

	```python
	# Add feedback
	SessionFeedbackManager.add_feedback(
	experiment_id=experiment_id,
	user_id=user_id,
	stage=stage,
	feedback_type="reject",
	rejection_reason="poor_header",
	rejection_text="Too generic",
	message_header=header,
	message_body=body,
	campaign_name=campaign_name,
	brand=brand,
	config_name=config_name,
	feedback_list_key="current_feedbacks" # or "feedbacks_a", "feedbacks_b"
	)
	```

	Store Results to Snowflake (CRITICAL FEATURE):

	Located after message viewing section. Button appears prominently.

	```python
	if st.button("💾 Store Results to Snowflake"):
	session = create_snowflake_session()
	db_manager = UIDatabaseManager(session)

	# Single mode
	if not ab_mode:
	# Store metadata
	for meta in st.session_state.current_experiment_metadata:
	db_manager.store_experiment_metadata(meta)

	# Store feedback
	for feedback in st.session_state.current_feedbacks:
	db_manager.store_feedback(feedback)

	# AB mode
	else:
	# Store both experiments
	for meta in st.session_state.experiment_a_metadata:
	db_manager.store_experiment_metadata(meta)
	for meta in st.session_state.experiment_b_metadata:
	db_manager.store_experiment_metadata(meta)

	for feedback in st.session_state.feedbacks_a:
	db_manager.store_feedback(feedback)
	for feedback in st.session_state.feedbacks_b:
	db_manager.store_feedback(feedback)

	session.close()
	st.success("✅ Results stored to Snowflake successfully!")
	st.balloons()
	```

	View Modes:
	- User-Centric: All stages for each user
	- Stage-Centric: All users for each stage
	- Filters: Stage selection, keyword search, pagination

	---

	### Page 3: Analytics Dashboard

	Purpose: Visualize performance metrics for CURRENT experiment only

	Architecture: Loads from session_state, uses SessionFeedbackManager

	#### Key Features

	Data Loading:
	```python
	# Single mode
	def get_single_experiment_data():
	return st.session_state.ui_log_data

	# AB mode
	detect_ab_testing_mode() # Returns True if AB data exists
	messages_a_df = st.session_state.ui_log_data_a
	messages_b_df = st.session_state.ui_log_data_b
	```

	Feedback Stats Calculation:
	```python
	# Single mode
	feedback_stats = SessionFeedbackManager.get_feedback_stats(
	experiment_id=experiment_id,
	total_messages=len(messages_df),
	feedback_list_key="current_feedbacks"
	)

	# AB mode
	feedback_stats_a = SessionFeedbackManager.get_feedback_stats(
	experiment_a_id,
	total_messages=len(messages_a_df),
	feedback_list_key="feedbacks_a"
	)
	feedback_stats_b = SessionFeedbackManager.get_feedback_stats(
	experiment_b_id,
	total_messages=len(messages_b_df),
	feedback_list_key="feedbacks_b"
	)
	```

	Metrics Displayed:
	- Overall: Total messages, rejection rate, feedback count
	- Stage-by-Stage: Performance breakdown per stage
	- Rejection Reasons: Pie charts and bar charts
	- AB Comparison: Side-by-side metrics with winner determination

	Export Options:
	- Export current messages to CSV
	- Export current feedback to CSV
	- Export analytics summary to CSV

	Important: Analytics page shows ONLY the current in-memory experiment. For historical data, use Historical Analytics.

	---

	### Page 4: Historical Analytics

	Purpose: Track all past experiments and analyze trends from Snowflake

	Architecture: Button-triggered Snowflake queries

	#### Key Features

	Load Button:
	```python
	if st.button("📊 Load Historical Data from Snowflake"):
	session = create_snowflake_session()
	db_manager = UIDatabaseManager(session)

	# Load experiment summary with JOIN
	experiments_df = db_manager.get_experiment_summary(brand=brand)

	st.session_state['historical_experiments'] = experiments_df
	st.session_state['historical_data_loaded'] = True

	session.close()
	```

	SQL Query Example:
	```sql
	SELECT
	m.EXPERIMENT_ID,
	m.CONFIG_NAME,
	m.CAMPAIGN_NAME,
	m.BRAND,
	MIN(m.START_TIME) as start_time,
	SUM(m.TOTAL_MESSAGES) as total_messages,
	MAX(m.TOTAL_USERS) as total_users,
	COUNT(DISTINCT m.STAGE) as total_stages,
	COUNT(f.FEEDBACK_TYPE) as total_rejects,
	(COUNT(f.FEEDBACK_TYPE) * 100.0 / NULLIF(SUM(m.TOTAL_MESSAGES), 0)) as rejection_rate
	FROM MESSAGING_SYSTEM_V2.UI.EXPERIMENT_METADATA m
	LEFT JOIN MESSAGING_SYSTEM_V2.UI.FEEDBACKS f
	ON m.EXPERIMENT_ID = f.EXPERIMENT_ID
	WHERE m.BRAND = :brand
	GROUP BY m.EXPERIMENT_ID, m.CONFIG_NAME, m.CAMPAIGN_NAME, m.BRAND
	ORDER BY start_time DESC
	```

	Visualizations:
	- Experiments summary table
	- Rejection rate trend over time (line chart)
	- Performance comparison by configuration (bar chart)
	- Best/worst performing configs

	Filters:
	- Date range filtering
	- Automatic refresh button

	Export:
	- Export summary to CSV
	- Note: Detailed feedback export coming soon (use SQL queries for now)

	---

	## 🔧 Utility Modules

	### utils/db_manager.py - UIDatabaseManager

	Purpose: All Snowflake database operations

	Key Methods:
	```python
	class UIDatabaseManager:
	def __init__(self, session: Session):
	"""Initialize with Snowflake session."""

	def save_config(self, config_name, config_data, brand):
	"""Save config with auto-versioning."""

	def load_config(self, config_name, brand, version=None):
	"""Load specific config version."""

	def store_experiment_metadata(self, metadata: dict):
	"""Insert metadata record."""

	def store_feedback(self, feedback: dict):
	"""Insert feedback record."""

	def get_experiment_summary(self, brand=None, start_date=None, end_date=None):
	"""Get aggregated experiment metrics with JOIN."""

	def close(self):
	"""Close Snowflake session."""
	```

	Usage Pattern:
	```python
	# Always use context-like pattern
	session = create_snowflake_session()
	db_manager = UIDatabaseManager(session)

	try:
	# Do operations
	db_manager.save_config(...)
	db_manager.store_feedback(...)
	finally:
	db_manager.close() # or session.close()
	```

	---

	### utils/config_manager.py - ConfigManager

	Purpose: Configuration loading and caching

	Key Methods:
	```python
	class ConfigManager:
	def __init__(self, session: Session):
	"""Initialize with Snowflake session."""

	def load_configs_from_snowflake(self, brand: str) -> Dict:
	"""Load all configs for brand, returns {name: config_data}."""

	def get_latest_version(self, config_name: str, brand: str) -> int:
	"""Get latest version number."""
	```

	Caching Strategy:
	- Configs loaded once on app startup
	- Cached in `st.session_state.configs_cache`
	- Format: `{"config_name": {...config_data...}, ...}`
	- No re-querying Snowflake during session

	---

	### utils/experiment_runner.py - ExperimentRunner

	Purpose: Execute experiments with proper session management

	Key Methods:
	```python
	class ExperimentRunner:
	def run_single_experiment(
	self, config, sampled_users_df, experiment_id,
	create_session_func, progress_container
	):
	"""Run one experiment, all stages sequentially."""

	def run_ab_test_parallel(
	self, config_a, config_b, sampled_users_df,
	experiment_a_id, experiment_b_id, create_session_func
	):
	"""Run two experiments in parallel threads."""
	```

	Thread-Safe Design:
	- Each thread gets own Snowflake session
	- Progress updates handled safely:
	- Main thread: Full Streamlit UI
	- Worker threads: Console logging with dummy UI objects
	- No Streamlit context errors

	Implementation:
	```python
	# Thread function
	def run_experiment(exp_key, config, exp_id):
	try:
	success, data, metadata = self.run_single_experiment(
	config=config,
	sampled_users_df=users_df,
	experiment_id=exp_id,
	create_session_func=create_session_func,
	progress_container=None # None = threaded mode
	)
	results[exp_key] = {'success': success, 'ui_log_data': data, 'metadata': metadata}
	except Exception as e:
	results[exp_key] = {'success': False, 'error': str(e)}

	# Start threads
	thread_a = threading.Thread(target=run_experiment, args=('a', config_a, exp_a_id))
	thread_b = threading.Thread(target=run_experiment, args=('b', config_b, exp_b_id))
	thread_a.start()
	thread_b.start()
	thread_a.join()
	thread_b.join()
	```

	---

	### utils/session_feedback_manager.py - SessionFeedbackManager

	Purpose: In-memory feedback management

	Static Methods (no instance needed):
	```python
	@staticmethod
	def add_feedback(experiment_id, user_id, stage, feedback_type,
	rejection_reason, rejection_text, message_header,
	message_body, campaign_name, brand, config_name,
	feedback_list_key):
	"""Add feedback to session_state list."""

	@staticmethod
	def get_feedback_stats(experiment_id, total_messages, feedback_list_key):
	"""Calculate aggregate stats from feedback list."""

	@staticmethod
	def get_stage_feedback_stats(experiment_id, messages_df, feedback_list_key):
	"""Calculate per-stage stats."""

	@staticmethod
	def get_rejection_reason_label(reason_key):
	"""Map reason key to display label."""
	```

	Feedback List Keys:
	- `"current_feedbacks"` - Single mode
	- `"feedbacks_a"` - AB mode experiment A
	- `"feedbacks_b"` - AB mode experiment B

	Usage:
	```python
	# Add feedback
	SessionFeedbackManager.add_feedback(
	experiment_id="drumeo_20260114_1234",
	user_id=12345,
	stage=1,
	feedback_type="reject",
	rejection_reason="poor_header",
	rejection_text="Too generic",
	message_header="Your next lesson 👇",
	message_body="Check it out...",
	campaign_name="Re-engagement",
	brand="drumeo",
	config_name="drumeo_re_engagement_test",
	feedback_list_key="current_feedbacks"
	)

	# Get stats
	stats = SessionFeedbackManager.get_feedback_stats(
	experiment_id="drumeo_20260114_1234",
	total_messages=100,
	feedback_list_key="current_feedbacks"
	)
	# Returns: {'total_feedback': 10, 'total_rejects': 10,
	# 'reject_rate': 10.0, 'rejection_reasons': {...}}
	```

	---

	## 🎨 Brand Theming

	Theming automatically adjusts based on selected brand:

	\| Brand \| Primary Color \| Sidebar BG \| Accent \| Emoji \|
	\|----------\|---------------\|------------\|----------\|-------\|
	\| Base \| Gold \| Dark Gold \| Gold \| 🎵 \|
	\| Drumeo \| Light Blue \| Dark Blue \| Blue \| 🥁 \|
	\| Pianote \| Light Red \| Dark Red \| Red \| 🎹 \|
	\| Guitareo \| Light Green \| Dark Green \| Green \| 🎸 \|
	\| Singeo \| Light Purple \| Dark Purple\| Purple \| 🎤 \|

	Implementation: `utils/theme.py`

	```python
	def get_brand_theme(brand):
	"""Returns theme dictionary."""

	def apply_theme(brand):
	"""Applies CSS via st.markdown."""

	def get_brand_emoji(brand):
	"""Returns brand emoji."""
	```

	---

	## 🔍 Key Implementation Details

	### Configuration File Structure

	```json
	{
	"brand": "drumeo",
	"campaign_type": "re_engagement",
	"campaign_name": "UI-Test-Campaign",
	"campaign_instructions": "Keep messages encouraging and motivational.",
	"1": {
	"stage": 1,
	"model": "gemini-2.5-flash-lite",
	"personalization": true,
	"involve_recsys_result": true,
	"recsys_contents": ["workout", "course", "quick_tips"],
	"specific_content_id": null,
	"segment_info": "Students inactive for 3+ days",
	"instructions": "",
	"sample_examples": "Header: Your next lesson 👇\nMessage: Check it out!",
	"identifier_column": "user_id",
	"platform": "push"
	},
	"2": {
	"stage": 2,
	"model": "gpt-4o-mini",
	"personalization": true,
	"involve_recsys_result": true,
	"recsys_contents": ["song"],
	"specific_content_id": 12345,
	"segment_info": "Students inactive for 7+ days",
	"instructions": "Focus on easy songs",
	"sample_examples": "Header: Let's jam! 🎸\nMessage: Try this song!",
	"identifier_column": "user_id",
	"platform": "push"
	}
	}
	```

	### Rejection Reason Categories

	```python
	REJECTION_REASONS = {
	"poor_header": "Poor Header",
	"poor_body": "Poor Body/Content",
	"grammar_issues": "Grammar Issues",
	"emoji_problems": "Emoji Problems",
	"recommendation_issues": "Recommendation Issues",
	"wrong_information": "Wrong/Inaccurate Information",
	"tone_issues": "Tone Issues",
	"similarity": "Similar To Previous Header/Messages",
	"other": "Other"
	}
	```

	### Environment Variables Required

	```bash
	# Snowflake
	SNOWFLAKE_USER=your_user
	SNOWFLAKE_PASSWORD=your_password
	SNOWFLAKE_ACCOUNT=your_account
	SNOWFLAKE_ROLE=your_role
	SNOWFLAKE_DATABASE=MESSAGING_SYSTEM_V2
	SNOWFLAKE_WAREHOUSE=your_warehouse
	SNOWFLAKE_SCHEMA=UI

	# LLM APIs
	OPENAI_API_KEY=sk-...
	GOOGLE_API_KEY=AIza...
	```

	---

	## 🛠️ Development Guide

	### Adding a New Page

	1. Create `pages/{number}_{Name}.py`
	2. Follow standard structure:
	```python
	import streamlit as st
	import os
	from pathlib import Path
	from dotenv import load_dotenv

	# Load .env
	env_path = Path(__file__).parent.parent.parent / '.env'
	if env_path.exists():
	load_dotenv(env_path)

	from utils.auth import check_authentication
	from utils.theme import apply_theme, get_brand_emoji

	st.set_page_config(page_title="New Page", page_icon="🆕", layout="wide")

	if not check_authentication():
	st.error("🔒 Please login first")
	st.stop()

	if "selected_brand" not in st.session_state:
	st.error("⚠️ Please select a brand first")
	st.stop()

	brand = st.session_state.selected_brand
	apply_theme(brand)

	# Your content here
	```
	3. Update `app.py` navigation
	4. Test with all brands

	### Adding a New Feedback Category

	1. Update `utils/session_feedback_manager.py`:
	```python
	REJECTION_REASONS = {
	# ... existing
	"new_reason": "New Reason Label"
	}
	```
	2. Automatically appears in UI

	### Adding a New Brand

	1. Create `data/UI_users/{brand}_users.csv` (100 users, must have `USER_ID` column)
	2. Add to `utils/theme.py`:
	```python
	BRAND_THEMES["newbrand"] = {
	"primary": "#COLOR",
	"accent": "#COLOR",
	"sidebar_bg": "#DARK_COLOR",
	"text": "#FFFFFF"
	}
	BRAND_EMOJIS["newbrand"] = "🆕"
	```
	3. Update `app.py`:
	```python
	brands = ["drumeo", "pianote", "guitareo", "singeo", "newbrand"]
	brand_labels["newbrand"] = "🆕 New Brand"
	```
	4. Create default config in Snowflake or via UI

	### Debugging Tips

	Common Issues:

	1. "No messages found" in Message Viewer:
	- Check `st.session_state.ui_log_data` exists
	- Verify generation completed in Campaign Builder
	- Look for errors in terminal

	2. Snowflake connection errors:
	- Verify `.env` file exists and is loaded
	- Check credentials are correct
	- Test connection: `create_snowflake_session()`

	3. AB test AttributeError:
	- Fixed in latest version
	- Ensure ExperimentRunner uses thread-safe progress handling

	4. Config save errors with quotes:
	- Fixed: Now uses `write_pandas()` instead of raw SQL
	- Handles JSON with apostrophes correctly

	5. Feedback not in Analytics:
	- Check `st.session_state.current_feedbacks` has data
	- Verify correct feedback_list_key used
	- Check experiment_id matches

	Debugging Code:
	```python
	# Debug session state
	with st.expander("Debug Info"):
	st.write("Session State Keys:", list(st.session_state.keys()))
	if 'ui_log_data' in st.session_state:
	st.write("Messages shape:", st.session_state.ui_log_data.shape)
	if 'current_feedbacks' in st.session_state:
	st.write("Feedback count:", len(st.session_state.current_feedbacks))
	```

	---

	## 🎯 Key Design Decisions

	### 1. In-Memory + Cloud Hybrid
	Decision: Use session_state for active data, Snowflake for persistence
	Rationale:
	- Fast in-memory operations
	- No local file dependencies (HuggingFace ready)
	- Scalable historical storage
	- Clean separation: current vs. historical

	### 2. One-Click Storage
	Decision: Single "Store Results" button to persist everything
	Rationale:
	- Simple user workflow
	- Explicit persistence action
	- User controls when data is saved
	- No auto-save surprises

	### 3. Config Caching
	Decision: Load all configs once, cache in session_state
	Rationale:
	- Reduces Snowflake queries
	- Faster config switching
	- Session-scoped cache (fresh on page load)
	- No stale data issues

	### 4. Thread-Safe AB Testing
	Decision: Separate Snowflake sessions per thread, console logging
	Rationale:
	- Prevents session conflicts
	- Streamlit UI only in main thread
	- Clean error handling
	- Production-ready parallel execution

	### 5. Versioned Configurations
	Decision: Auto-increment version on every config save
	Rationale:
	- Full audit trail
	- Can rollback to previous versions
	- Supports experimentation
	- No data loss

	### 6. Button-Triggered Historical Loading
	Decision: Historical Analytics loads on button click, not auto
	Rationale:
	- User controls when to query Snowflake
	- Avoids unnecessary queries
	- Faster page load
	- Clear user action

	### 7. SessionFeedbackManager Static Methods
	Decision: All methods static, no instance needed
	Rationale:
	- Simpler API
	- Works directly with session_state
	- No state to manage
	- Cleaner code

	---

	## 🚀 Deployment Guide

	### HuggingFace Spaces Deployment

	Requirements:
	- No local file dependencies ✅
	- Environment variables for secrets ✅
	- Snowflake connectivity ✅
	- CSV files in repo (data/UI_users/) ✅

	Steps:
	1. Push code to GitHub/HuggingFace repo
	2. Include `data/UI_users/` CSV files
	3. Set environment variables in Space settings:
	- All SNOWFLAKE_* variables
	- All API keys
	4. Run: `streamlit run app.py`
	5. Verify Snowflake connection works

	Files to Exclude:
	- `.env` (use Space secrets instead)
	- Local cache directories
	- Test data

	---

	## 📞 Support & Resources

	Contact:
	- Technical Support: danial@musora.com

	Related Documentation:
	- Main System: `ai_messaging_system_v2/README.md`
	- UI Mode Guide: `ai_messaging_system_v2/UI_MODE_GUIDE.md`
	- Implementation Details: `visualization/IMPLEMENTATION_COMPLETE.md`
	- Refactoring Guide: `visualization/ARCHITECTURE_REFACTOR_GUIDE.md`

	Useful Links:
	- Streamlit Documentation: https://docs.streamlit.io
	- Snowflake Python Connector: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector
	- Plotly Charts: https://plotly.com/python/

	---

	## 📊 System Status

	Completion: 100% ✅

	Completed Components:
	1. ✅ Database Layer (db_manager.py)
	2. ✅ Config Manager (config_manager.py)
	3. ✅ Session Feedback Manager (session_feedback_manager.py)
	4. ✅ Experiment Runner (experiment_runner.py)
	5. ✅ App.py - Authentication & config loading
	6. ✅ Campaign Builder - Generation & AB testing
	7. ✅ Message Viewer - Viewing & feedback
	8. ✅ Analytics - Current experiment metrics
	9. ✅ Historical Analytics - Snowflake integration

	Recent Fixes:
	- ✅ Configuration save error (JSON escaping) - Fixed with `write_pandas()`
	- ✅ AB testing `AttributeError: enter` - Fixed with thread-safe design
	- ✅ Historical Analytics Snowflake connection - Fixed to use `.env`

	Ready For:
	- ✅ Production use
	- ✅ HuggingFace deployment
	- ✅ End-to-end testing
	- ✅ Team onboarding

	---

	Built with ❤️ for the Musora team

	Last Updated: 2026-01-14
	Version: 2.0 (Refactored Architecture)