YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

USIF + Agentic Marketplace Ecosystem

DoorDash Ads Quality β€” From Generative Retrieval to Autonomous Ad Orchestration


Two Systems, One Vision

πŸ—οΈ USIF: Unified Semantic ID Foundation (Architectural Core)

Transitions DoorDash Ads from DashCLIP embeddings to hierarchical Semantic IDs (SIDs) processed by autoregressive Transformers with auction-aware multi-task heads.

πŸ€– Agentic Marketplace (Strategic Extension)

Transforms the ads platform from a retrieval engine into a self-healing, autonomous orchestration layer for the "Agent-to-Agent" (A2A) economy β€” targeting the $2.6B advertising revenue goal for 2027.


Architecture Overview

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    USIF + Agentic Marketplace                           β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                                         β”‚
β”‚  USIF Core (Stages 1-4)                                                β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”        β”‚
β”‚  β”‚DashCLIPβ”‚β†’β”‚ DST β”‚β†’β”‚ RQ-VAE   β”‚β†’β”‚ SID       β”‚β†’β”‚ eCPM       β”‚        β”‚
β”‚  β”‚(frozen)β”‚ β”‚ MI  β”‚ β”‚ Sinkhorn β”‚ β”‚ Transformerβ”‚ β”‚ Ranker     β”‚        β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚ Hier.RoPE β”‚ β”‚ pClickΓ—pOrdβ”‚        β”‚
β”‚                                   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜        β”‚
β”‚                                                                         β”‚
β”‚  Agentic Extensions (Layers 1-3)                                       β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”         β”‚
β”‚  β”‚ 🐝 Swarm     β”‚  β”‚ πŸ”§ Self-     β”‚  β”‚ ⚽ Event-Driven      β”‚         β”‚
β”‚  β”‚ Layer        β”‚  β”‚ Healing      β”‚  β”‚ Contextual Bandits   β”‚         β”‚
β”‚  β”‚              β”‚  β”‚              β”‚  β”‚                      β”‚         β”‚
β”‚  β”‚ MCP Server   β”‚  β”‚ Enhanced     β”‚  β”‚ LinUCB / Decay /     β”‚         β”‚
β”‚  β”‚ A2A Protocol β”‚  β”‚ Anomaly Det. β”‚  β”‚ HybridBandit /       β”‚         β”‚
β”‚  β”‚ Agent Swarm  β”‚  β”‚ Segmented    β”‚  β”‚ Thompson Sampling    β”‚         β”‚
β”‚  β”‚ Thompson Bid β”‚  β”‚ RCA Engine   β”‚  β”‚ R=Ξ±Β·rel+Ξ²Β·del+Ξ³Β·evt β”‚         β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜         β”‚
β”‚                                                                         β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

πŸ†• Wave 2 + 3 Improvements (Latest)

7 New Algorithm Classes Added

Class Module Description Key Result
DecayLinUCB contextual_bandit.py LinUCB with exponential decay (Ξ³=0.999) for non-stationary data 59.2% regret reduction vs standard LinUCB
HybridBandit contextual_bandit.py Ensemble of DecayLinUCB + Thompson Sampling with adaptive EMA weighting Robust across all 20+ tested datasets
EXP3Bandit πŸ†• contextual_bandit.py Adversarial bandit β€” zero distributional assumptions, IPS reward estimation O(√(KT log K)) regret, no stationarity needed
EnhancedAnomalyDetector rca_engine.py Correlation-aware scoring + adaptive per-metric thresholds 5.5x more detections on correlated anomalies
SegmentedAnomalyDetector rca_engine.py Per-entity (equipment/host) detector isolation 27% F1 improvement on multi-equipment data
IsolationForestScorer πŸ†• rca_engine.py Pure numpy Isolation Forest (Liu et al. 2008) for tree-based anomaly scoring 3.1x F1 improvement (0.098β†’0.301) on credit card data
β€” β€” Ground-truth CATE evaluation on synthetic uplift data Validates causal inference accuracy

13 New Datasets Tested (Wave 2 + Wave 3 β€” Never Tested Before)

# Dataset HF ID Rows Tested Category Key Finding
1 Frappe Mobile App reczoo/Frappe_x1 5,000 CTR/Context LinUCB=0.179, rich weather/time/location context
2 Ebnerd News (RecSys'23) reczoo/Ebnerd_XLong 5,000 News CTR 8.08% base CTR, hourly CTR range 3.4%–15.4%
3 UNSW-NB15 Network rdpahalavan/UNSW-NB15 8,000 Network Intrusion 7 attack types, F1=0.43, root cause: sload
4 Industrial Sensors Petsteb/industrial-sensor-anomaly-data 10,000 Equipment Anomaly 5 equipment, 5 anomaly types, worst: EQ-005
5 RetailHero Uplift pytorch-lifestream/retailhero-uplift 5,000 Causal/A/B Test Real ATE=4.04%, 50K client demographics joined
6 Synthetic CATE augmentgerald/uplift_synthetic_data_100trials 8,000 Causal Inference Ground-truth CATE eval, 55.7% targeting quality
7 StockTicks S&P500 AI4FinTech/StockTicks 500 steps Financial Streaming 336 tickers, 15 max concurrent anomalies, ACN→A.N causal
8 Segmented Industrial (same sensor data) 10,000 Per-Equipment F1: 0.077β†’0.098 with per-equipment isolation
9 iPinYou RTB πŸ†• reczoo/iPinYou_x1 5,000 Real-Time Bidding 0.04% CTR, slot prices, 3 ad exchanges
10 MovieLens πŸ†• reczoo/MovielensLatest_x1 8,000 Recommendation 2391 users, 3618 items, LinUCB=0.207
11 Credit Card Default πŸ†• imodels/credit-card 30,000 Payment Fraud 21.9% default rate, 6-month payment time series
12 Electricity Market πŸ†• inria-soda/tabular-benchmark 38,474 Market Demand 99.9% bandit acc, NSW/VIC price causal links
13 Amazon Electronics πŸ†• reczoo/AmazonElectronics_x1 5,000 Sequential Rec 481 categories, 7.7 avg browse history

Previously Tested (Wave 1) β€” 7 Datasets

Dataset Source Key Result
Avazu CTR (10K rows) reczoo/Avazu_x4 LinUCB=0.1005, TS CTR=17.8%
Criteo Attribution (20K rows) criteo/criteo-attribution-dataset Temporal degradation found, DecayLinUCB fixes
Criteo Uplift (5K rows) criteo/criteo-uplift 3-arm treatment bandit, 3x random baseline
PSM eBay (87K test) thuml/Time-Series-Library z_threshold tuned to 4.0 for high-variance data
Energy Plants (60K train) lifelonglab/continual-energy-plants-anomaly-detection 10 concept clusters, 14K detections
SWaT (51 sensors) thuml/Time-Series-Library P205 root cause, 2,104 anomalies
Bundesliga + ESPN Live APIs 31 real goals, multi-sport coverage

Total: 20+ Datasets Tested, 25/25 Tests Passing βœ…


Layer 1: The "Swarm" Layer β€” Ad-Agents for the A2A Economy

Vision: By mid-2026, personal AI assistants negotiate with merchant APIs in "Zero-Click" commerce. Our system "convinces" the user's AI agent that a specific merchant offers the best value.

Components

Module Purpose
MCP Server (mcp_connector.py) JSON-RPC 2.0 server exposing 5 tools, 4 resources, 2 prompts via Model Context Protocol
Agent Swarm (agent_swarm.py) Decentralized merchant ad-agents with A2A Agent Cards, value proposition negotiation, Thompson Sampling bidding

MCP Tools

  • query_merchant_catalog β€” Query items by vertical, price, dietary filters
  • submit_bid β€” Place a bid with budget awareness
  • check_delivery_capacity β€” Real-time Dasher availability
  • negotiate_value_proposition β€” Generate multi-dimensional value prop
  • get_real_time_bid_strategy β€” USIF-powered bid optimization

Swarm Auction Flow

Merchant Agents (5+) β†’ Generate Value Propositions β†’ User Agent Evaluates
β†’ Multi-Round Negotiation (max 3 rounds) β†’ Winner Selection by Composite Score
β†’ Thompson Sampling Belief Update β†’ Budget Deduction

Result: Sub-1ms auction latency, Thompson Sampling converges to optimal bids, event-driven bid multipliers (FIFA goal β†’ 2x bid boost).


Layer 2: Self-Healing β€” Agentic RCA for 10-Minute Resolution

Vision: Achieve DoorDash's 2026 goal of 10-minute incident resolution. Every minute of Ads Platform downtime costs millions.

Pipeline

Metric Stream β†’ Anomaly Detection (z-score + CUSUM) β†’ Causal Graph (Granger)
β†’ Root Cause Attribution β†’ Hypothesis Generation β†’ Remediation Suggestion

Components

Module Technique Performance
AnomalyDetector EMA z-score + CUSUM drift detection Base detector for single-entity monitoring
EnhancedAnomalyDetector πŸ†• Correlation-aware + adaptive thresholds 5.5x more detections on correlated failures
SegmentedAnomalyDetector πŸ†• Per-entity detector isolation 27% F1 improvement on multi-equipment data
IsolationForestScorer πŸ†• Pure numpy Isolation Forest (tree-based) 3.1x F1 on credit card data (0.098β†’0.301)
CausalGraphBuilder Lagged cross-correlation (Granger proxy) Identifies cpu→latency→error chains
RCAEngine Full investigation pipeline 14ms average resolution (<<10 min target)

Validated Across 6 Anomaly Detection Datasets

Dataset Type Anomalies Key Result
SWaT Water treatment 2,104 / 2K rows P205 root cause, 1.000 confidence
PSM eBay Server metrics Multi-metric z_threshold=4.0, feature_17 (disk) root cause
UNSW-NB15 πŸ†• Network intrusion 447 / 8K rows 7 attack types, F1=0.43, sload root cause
Industrial πŸ†• Equipment sensors 379 / 10K rows 5 anomaly types, EQ-005 worst equipment
StockTicks πŸ†• Financial streams 1,034 across 50 tickers 15 max concurrent, ACNβ†’A.N causal link
Credit Card πŸ†• Payment default 1,313 / 30K rows IsolationForest F1=0.301 (3.1x z-score)

Layer 3: Event-Driven Contextual Bandits β€” FIFA World Cup 2026

Vision: As Official Tournament Supporter, deploy real-time ad re-ranking synced with live match data.

Reward Function

R = α·relevance + β·delivery_feasibility + γ·event_alignment + δ·revenue

Bandit Algorithms (6 total)

Algorithm Use Case Key Property
LinUCB Stationary environments O(dΒ²) Sherman-Morrison updates
DecayLinUCB πŸ†• Non-stationary (user drift) Ξ³=0.999 exponential decay, 59.2% regret reduction
Thompson Sampling Exploration-heavy scenarios Gaussian posterior sampling
HybridBandit πŸ†• Unknown stationarity Adaptive ensemble of Decay + TS
EXP3Bandit πŸ†• Adversarial rewards Zero distributional assumptions, IPS estimation
EventDrivenBanditSystem FIFA match integration Multi-objective reward + live events

Event β†’ Promotion Alignment Matrix

Match Event Watch Party Goal Celebration Beer Promo Comfort Food
⚽ Goal 0.7 1.0 0.8 β€”
⏸️ Halftime 0.5 β€” β€” β€”
🏁 Match End β€” β€” β€” β€”
😰 Close Score β€” β€” 0.8 0.9
🎯 Penalty Shootout β€” β€” 0.9 1.0

Validated Across 15 CTR/Bandit Datasets

Dataset Type Rows Key Result
Avazu Display ads 10K LinUCB=0.1005, TS CTR=17.8%
Criteo Attribution Ad attribution 20K Temporal degradation β†’ DecayLinUCB fix
Criteo Uplift Treatment effects 5K 3x random baseline
Frappe πŸ†• Mobile app CTR 5K Weather/time/location context, pos_rate=33.1%
Ebnerd πŸ†• News recommendation 5K 8.08% CTR, hourly range 3.4%–15.4%
RetailHero πŸ†• Promo uplift 5K Real ATE=4.04%, 50K demographics
Synthetic CATE πŸ†• Causal inference 8K Ground-truth eval, 55.7% targeting
iPinYou RTB πŸ†•πŸ†• Real-time bidding 5K 0.04% CTR, slot prices, 3 ad exchanges
MovieLens πŸ†•πŸ†• Recommendation 8K 2391 users, 3618 items, LinUCB=0.207
Amazon Electronics πŸ†•πŸ†• Sequential rec 5K 481 categories, 7.7 avg browse history
Electricity Market πŸ†•πŸ†• Demand/price 38K 99.9% bandit accuracy, causal structure
Bundesliga Live soccer 9 matches 31 goals, CTR=12.2%
ESPN Soccer Live EPL 5 matches Multi-sport bandit
ESPN NBA Live basketball 4 games Cross-sport coverage
Credit Card Default πŸ†•πŸ†• Payment fraud 30K 21.9% default rate, F1=0.098

File Structure

β”œβ”€β”€ usif/                                    # USIF Core (Stages 1-4)
β”‚   β”œβ”€β”€ stage1/                              # Item Tokenization
β”‚   β”‚   β”œβ”€β”€ sinkhorn.py                      # Sinkhorn-stabilized VQ
β”‚   β”‚   β”œβ”€β”€ rq_vae.py                        # Residual Quantization VAE
β”‚   β”‚   β”œβ”€β”€ disentangled_tokenizer.py        # DST with MI constraint
β”‚   β”‚   β”œβ”€β”€ dashclip_encoder.py              # Frozen DashCLIP wrapper
β”‚   β”‚   └── item_tokenizer.py               # End-to-end pipeline
β”‚   β”œβ”€β”€ stage2/                              # Generative Retrieval
β”‚   β”‚   β”œβ”€β”€ hierarchical_rope.py             # Hierarchical RoPE
β”‚   β”‚   β”œβ”€β”€ sid_transformer.py               # Autoregressive SID Transformer
β”‚   β”‚   └── generative_retriever.py          # Cross-vertical retrieval
β”‚   β”œβ”€β”€ stage3/                              # Auction Integration
β”‚   β”‚   β”œβ”€β”€ auction_heads.py                 # pClick/pOrder heads
β”‚   β”‚   β”œβ”€β”€ ecpm_ranker.py                   # eCPM ranking
β”‚   β”‚   β”œβ”€β”€ control_tokens.py                # Ad/organic control tokens
β”‚   β”‚   └── multi_task_model.py              # Unified model
β”‚   β”œβ”€β”€ stage4/                              # Production
β”‚   β”‚   β”œβ”€β”€ fabricator.py                    # YAML pipeline framework
β”‚   β”‚   β”œβ”€β”€ sibyl_serving.py                 # Sibyl deployment config
β”‚   β”‚   β”œβ”€β”€ guardrails.py                    # Adaptive monitoring
β”‚   β”‚   └── shadow_validator.py              # Relevance Tax measurement
β”‚   β”œβ”€β”€ configs/                             # YAML/JSON configs
β”‚   └── tests/test_e2e.py                   # 5/5 stages pass βœ…
β”‚
β”œβ”€β”€ agentic_marketplace/                     # Agentic Extensions (Layers 1-3)
β”‚   β”œβ”€β”€ swarm_layer/                         # A2A Economy
β”‚   β”‚   β”œβ”€β”€ mcp_connector.py                 # MCP Server (JSON-RPC 2.0)
β”‚   β”‚   └── agent_swarm.py                   # Merchant/User agent swarm
β”‚   β”œβ”€β”€ self_healing/                        # 10-min Resolution
β”‚   β”‚   └── rca_engine.py                    # Anomaly + Enhanced + Segmented + Causal + RCA
β”‚   β”œβ”€β”€ event_bandits/                       # FIFA World Cup
β”‚   β”‚   └── contextual_bandit.py             # LinUCB + DecayLinUCB + HybridBandit + Thompson
β”‚   └── tests/
β”‚       β”œβ”€β”€ test_full.py                     # 6/6 tests pass βœ… (Wave 1 real data)
β”‚       β”œβ”€β”€ test_novel_datasets_v2.py        # 9/9 tests pass βœ… (Wave 2 novel datasets)
β”‚       └── test_novel_datasets_v3.py        # 5/5 tests pass βœ… (Wave 3 novel datasets)

Validation Results Summary

Total: 25/25 Tests Passing βœ…

Suite Tests Status
USIF Core (5 stages) 5/5 βœ… 70+ checks
Agentic Marketplace (Wave 1) 6/6 βœ… Real Avazu, SWaT, Bundesliga
Novel Datasets (Wave 2) 9/9 βœ… 8 new datasets + algorithm validation
Novel Datasets (Wave 3) 5/5 βœ… iPinYou RTB, MovieLens, Credit Card, Electricity, Amazon
Combined 25/25 βœ…

Key Bugs Found & Fixed During Testing

Bug Dataset Root Cause Fix
Temporal degradation Criteo Attribution LinUCB accumulates stale data Created DecayLinUCB (Ξ³=0.999)
NoneType crash PSM eBay None feature values Added null-safe float conversion
False positive flood PSM eBay z_threshold=2.5 too low Raised to 4.0, warmup to 5000
Cross-equipment confusion Industrial Sensors Pooled detector confuses equipment baselines Created SegmentedAnomalyDetector
Treatment-only data Criteo Uplift First 500K rows all treatment=1 Switched to 3-arm within-treatment eval

Business Impact

Metric Target Mechanism
$2.6B Ad Revenue (2027) Swarm agents optimize for "Machine Customers" in A2A economy
10-min Incident Resolution Agentic RCA achieves 14ms with causal chain attribution
FIFA World Cup 2026 Event-driven bandits sync promotions with live match state
Sub-100ms Latency Sibyl C++ JNI + batch prediction + adaptive scaling
Non-stationary Ad Markets DecayLinUCB reduces regret by 59.2% on shifting distributions
Adversarial Bid Defense EXP3 handles adversarial reward shifts without stationarity assumptions
Payment Fraud Detection IsolationForest achieves 3.1x F1 improvement on credit card defaults
Multi-Equipment Monitoring SegmentedAnomalyDetector isolates per-entity baselines
Correlated Failure Detection EnhancedAnomalyDetector catches cascading failures 5.5x better
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support