Spaces:

SpiralyzeLLC
/

ABTestPredictor

Runtime error

App Files Files Community

ABTestPredictor / README.md

nitish-spz

Mapping of grouped metadata

5b49b49 3 months ago

preview code

raw

history blame contribute delete

3.71 kB

	---
	title: Multimodal A/B Test Predictor
	emoji: 🚀
	colorFrom: blue
	colorTo: purple
	sdk: gradio
	sdk_version: "4.44.0"
	app_file: app.py
	pinned: false
	---

	# 🚀 Multimodal A/B Test Predictor

	## Overview
	Advanced A/B testing outcome predictor using multimodal AI analysis combining:
	- 🖼️ Image Analysis: Visual features from control & variant images
	- 📝 OCR Text Extraction: Automatically extracts and analyzes text from images
	- 📊 Categorical Features: Business context provided via API (industry, page type, etc.)
	- 🎯 Confidence Scores: Based on training data statistics and historical accuracy

	## 🎯 Direct Input Architecture

	### Image + Categorical Data
	- Accepts control and variant images directly via API
	- Requires categorical inputs: Business Model, Customer Type, Conversion Type, Industry, Page Type
	- Fast, efficient predictions without external API dependencies
	- All processing happens locally on GPU

	## 🎯 Features

	### Direct Prediction with Categorical Data
	- Upload control & variant images
	- Provide categorical business context data
	- Fast prediction with comprehensive confidence analysis

	### Enhanced Results
	- Winner Prediction: Variant vs Control with probability
	- Model Confidence: Accuracy percentage from training data
	- Training Data Count: Number of samples model trained on for this category
	- Historical Win/Loss: Real A/B test outcome statistics for this category
	- Confidence Source: Industry + Page Type combination used for scoring

	## 🔧 Setup

	### Model Files
	- `model/multimodal_gated_model_2.7_GGG.pth`: Enhanced multimodal model (789MB)
	- `model/multimodal_cat_mappings_GGG.json`: Category mappings

	## 🚀 Technical Architecture

	### Model: SupervisedSiameseMultimodal (GGG Enhanced)
	- Vision: ViT (Vision Transformer) for image features
	- Text: DistilBERT for OCR text processing
	- Fusion: Gated fusion with directional features
	- Categories: Embedding layers for categorical features
	- Architecture: BatchNorm + Fusion Block + Enhanced Prediction Head

	### Confidence Scoring
	- Based on Industry + Page Type combinations
	- Uses holdout statistics with average 160 samples per combination
	- Much more reliable than full 5-feature combinations

	## 📊 Performance
	- Multimodal Analysis: Images + Text + Categories
	- GPU Accelerated: Fast predictions (2-4 seconds average)
	- High Accuracy: Enhanced GGG architecture with real training data
	- No External Dependencies: All processing done locally

	## 🎯 Use Cases
	- A/B Test Prediction: Predict winners before running tests with provided context
	- Batch Processing: Process multiple tests efficiently from CSV
	- Confidence Assessment: Understand prediction reliability based on historical data
	- API Integration: Easy integration with external systems

	## 📡 API Usage

	### Quick Start
	```python
	from gradio_client import Client

	client = Client("SpiralyzeLLC/ABTestPredictor")
	result = client.predict(
	"control.jpg", "variant.jpg",
	"SaaS", "B2B", "High-Intent Lead Gen",
	"B2B Software & Tech", "Awareness & Discovery",
	api_name="/predict_with_categorical_data"
	)
	```

	### 📚 Documentation

	- [QUICK_START_GUIDE.md](QUICK_START_GUIDE.md) - Get started in 3 steps
	- [API_DOCUMENTATION.md](API_DOCUMENTATION.md) - Complete API reference with examples in Python, JavaScript, and cURL
	- [API_USAGE_UPDATED.md](API_USAGE_UPDATED.md) - Technical details and migration notes

	### 🌐 Web Interface

	Use the interactive interface: https://huggingface.co/spaces/SpiralyzeLLC/ABTestPredictor

	---

	Built with ❤️ using Gradio, PyTorch, Transformers, and Hugging Face.