--- title: Multimodal A/B Test Predictor emoji: 🚀 colorFrom: blue colorTo: purple sdk: gradio sdk_version: "4.44.0" app_file: app.py pinned: false --- # 🚀 Multimodal A/B Test Predictor ## Overview Advanced A/B testing outcome predictor using multimodal AI analysis combining: - 🖼️ **Image Analysis**: Visual features from control & variant images - 📝 **OCR Text Extraction**: Automatically extracts and analyzes text from images - 📊 **Categorical Features**: Business context provided via API (industry, page type, etc.) - 🎯 **Confidence Scores**: Based on training data statistics and historical accuracy ## 🎯 Direct Input Architecture ### **Image + Categorical Data** - Accepts control and variant images directly via API - Requires categorical inputs: Business Model, Customer Type, Conversion Type, Industry, Page Type - Fast, efficient predictions without external API dependencies - All processing happens locally on GPU ## 🎯 Features ### Direct Prediction with Categorical Data - Upload control & variant images - Provide categorical business context data - Fast prediction with comprehensive confidence analysis ### Enhanced Results - **Winner Prediction**: Variant vs Control with probability - **Model Confidence**: Accuracy percentage from training data - **Training Data Count**: Number of samples model trained on for this category - **Historical Win/Loss**: Real A/B test outcome statistics for this category - **Confidence Source**: Industry + Page Type combination used for scoring ## 🔧 Setup ### Model Files - `model/multimodal_gated_model_2.7_GGG.pth`: Enhanced multimodal model (789MB) - `model/multimodal_cat_mappings_GGG.json`: Category mappings ## 🚀 Technical Architecture ### Model: SupervisedSiameseMultimodal (GGG Enhanced) - **Vision**: ViT (Vision Transformer) for image features - **Text**: DistilBERT for OCR text processing - **Fusion**: Gated fusion with directional features - **Categories**: Embedding layers for categorical features - **Architecture**: BatchNorm + Fusion Block + Enhanced Prediction Head ### Confidence Scoring - Based on Industry + Page Type combinations - Uses holdout statistics with average 160 samples per combination - Much more reliable than full 5-feature combinations ## 📊 Performance - **Multimodal Analysis**: Images + Text + Categories - **GPU Accelerated**: Fast predictions (2-4 seconds average) - **High Accuracy**: Enhanced GGG architecture with real training data - **No External Dependencies**: All processing done locally ## 🎯 Use Cases - **A/B Test Prediction**: Predict winners before running tests with provided context - **Batch Processing**: Process multiple tests efficiently from CSV - **Confidence Assessment**: Understand prediction reliability based on historical data - **API Integration**: Easy integration with external systems ## 📡 API Usage ### Quick Start ```python from gradio_client import Client client = Client("SpiralyzeLLC/ABTestPredictor") result = client.predict( "control.jpg", "variant.jpg", "SaaS", "B2B", "High-Intent Lead Gen", "B2B Software & Tech", "Awareness & Discovery", api_name="/predict_with_categorical_data" ) ``` ### 📚 Documentation - **[QUICK_START_GUIDE.md](QUICK_START_GUIDE.md)** - Get started in 3 steps - **[API_DOCUMENTATION.md](API_DOCUMENTATION.md)** - Complete API reference with examples in Python, JavaScript, and cURL - **[API_USAGE_UPDATED.md](API_USAGE_UPDATED.md)** - Technical details and migration notes ### 🌐 Web Interface Use the interactive interface: **https://huggingface.co/spaces/SpiralyzeLLC/ABTestPredictor** --- Built with ❤️ using Gradio, PyTorch, Transformers, and Hugging Face.