--- model_name: AIDE-Chip-Surrogates license: cc-by-nc-sa-4.0 library_name: xgboost pipeline_tag: tabular-regression tags: - computer-architecture - gem5 - cache - surrogate-model - explainable-ai - shap - monotonic-constraints - systems-ml datasets: - uralstech/AIDE-Chip-15K-gem5-Sims --- # AIDE Chip Surrogates This is a collection of physics-aware, monotonicity-constrained XGBoost models that replace expensive gem5 cache simulations during design-space exploration. Each model predicts either IPC or L2 miss rate for a specific workload, using only cache configuration parameters as input. The models are interpretable via SHAP and enforce microarchitectural monotonicity where physically justified. This model release accompanies the paper: > Udayshankar Ravikumar . Fast, Explainable Surrogate Models for gem5 Cache Design Space Exploration. Authorea. January 14, 2026. > ## Model Architecture * Algorithm: XGBoost Regressor * Targets: * IPC * L2 miss rate * Features: * Log₂ cache sizes & associativities * Set-count proxies * Cache hierarchy ratios * Constraints: * Monotonic constraints encoding cache physics * Selective relaxation for latency-sensitive workloads ## Available Models | Workload | IPC Model | L2 Miss Model | | ---------- | --------- | ------------- | | crc32 | ✔ | ✔ | | dijkstra | ✔ | ✔ | | fft | ✔ | ✔ | | matrix_mul | ✔ | ✔ | | qsort | ✔ | ✔ | | sha | ✔ | ✔ | Total models: **12** ## Performance * Test set accuracy: **R² ≈ 0.999** * OOD validation: * 26 unseen cache configurations * **~817× critical-path speedup** * Low absolute error even when R² is unstable ## Explainability Each model is uploaded with its SHAP summary plot. They confirm: * Cache sizes dominate IPC & miss behavior * Associativity effects are workload-dependent * Learned relationships align with microarchitectural intuition ## Intended Use * Architecture research * Design-space exploration * Educational use * Explainable systems ML **Not for commercial deployment** without separate licensing. ## Limitations * Single-core, single-thread models * Cache hierarchy only (no pipeline, prefetcher, or multicore effects) * Accuracy depends on training coverage; extreme OOD configs are flagged ## Patent Notice The models uploaded here implement techniques described in an accompanying research paper. The author has filed a pending patent application that may cover broader design-space exploration workflows beyond these specific model implementations. The open-source license (CC BY-NC-SA 4.0) governs use of these models. This notice is informational only.