uralstech
/

AIDE-Chip-Surrogates

Tabular Regression

computer-architecture

surrogate-model

monotonic-constraints

Model card Files Files and versions

AIDE-Chip-Surrogates / README.md

uralstech's picture

Update README.md

6a9a002 verified 16 days ago

|

history blame contribute delete

2.84 kB

	---
	model_name: AIDE-Chip-Surrogates
	license: cc-by-nc-sa-4.0
	library_name: xgboost
	pipeline_tag: tabular-regression
	tags:
	- computer-architecture
	- gem5
	- cache
	- surrogate-model
	- explainable-ai
	- shap
	- monotonic-constraints
	- systems-ml
	datasets:
	- uralstech/AIDE-Chip-15K-gem5-Sims
	---

	# AIDE Chip Surrogates

	This is a collection of physics-aware, monotonicity-constrained XGBoost models that replace expensive gem5 cache simulations during design-space exploration.

	Each model predicts either IPC or L2 miss rate for a specific workload, using only cache configuration parameters as input. The models are interpretable via SHAP and enforce microarchitectural monotonicity where physically justified.

	This model release accompanies the paper:

	> Udayshankar Ravikumar . Fast, Explainable Surrogate Models for gem5 Cache Design Space Exploration. Authorea. January 14, 2026.
	> <https://doi.org/10.22541/au.176843174.46109183/v1>

	## Model Architecture

	* Algorithm: XGBoost Regressor
	* Targets:
	* IPC
	* L2 miss rate
	* Features:
	* Log₂ cache sizes & associativities
	* Set-count proxies
	* Cache hierarchy ratios
	* Constraints:
	* Monotonic constraints encoding cache physics
	* Selective relaxation for latency-sensitive workloads

	## Available Models

	\| Workload \| IPC Model \| L2 Miss Model \|
	\| ---------- \| --------- \| ------------- \|
	\| crc32 \| ✔ \| ✔ \|
	\| dijkstra \| ✔ \| ✔ \|
	\| fft \| ✔ \| ✔ \|
	\| matrix_mul \| ✔ \| ✔ \|
	\| qsort \| ✔ \| ✔ \|
	\| sha \| ✔ \| ✔ \|

	Total models: 12

	## Performance

	* Test set accuracy: R² ≈ 0.999
	* OOD validation:
	* 26 unseen cache configurations
	* ~817× critical-path speedup
	* Low absolute error even when R² is unstable

	## Explainability

	Each model is uploaded with its SHAP summary plot. They confirm:

	* Cache sizes dominate IPC & miss behavior
	* Associativity effects are workload-dependent
	* Learned relationships align with microarchitectural intuition

	## Intended Use

	* Architecture research
	* Design-space exploration
	* Educational use
	* Explainable systems ML

	Not for commercial deployment without separate licensing.

	## Limitations

	* Single-core, single-thread models
	* Cache hierarchy only (no pipeline, prefetcher, or multicore effects)
	* Accuracy depends on training coverage; extreme OOD configs are flagged

	## Patent Notice

	The models uploaded here implement techniques described in an accompanying research paper.
	The author has filed a pending patent application that may cover broader
	design-space exploration workflows beyond these specific model implementations.

	The open-source license (CC BY-NC-SA 4.0) governs use of these models.
	This notice is informational only.