Upload Introduction.md
Introduction.md CHANGED (+25 -5)
|
@@ -9,15 +9,37 @@ Conformal prediction is a technique for quantifying such uncertainties for AI sy
|
|
| 9 |
|
| 10 |
---
|
| 11 |
|
| 12 |
-
|
| 13 |
|
| 14 |
### 1. Prediction Regions
|
| 15 |
|
| 16 |
-
Prediction regions
|
| 17 |
|
| 18 |
### 2. Validity
|
| 19 |
|
| 20 |
-
|
| 21 |
|
| 22 |
### 3. Inductive Conformal Prediction
|
| 23 |
|
|
@@ -36,5 +58,3 @@ Inductive Conformal Prediction is characterized by its adaptability to the data
|
|
| 36 |
- validity score $s_i$ by counting the number of times the true value $y_i$ falls within the prediction region $R_i$ over repeated experiments.
|
| 37 |
- p-value $p_i$ by dividing the validity score $s_i$ by the number of repeated experiments.
|
| 38 |
- prediction region $R_i$ using the top $k$ errors and the p-value $p_i$.
|
| 39 |
-
|
| 40 |
-
|
| 9 |
|
| 10 |
---
|
| 11 |
|
| 12 |
+
# Theory
|
| 13 |
|
| 14 |
### 1. Prediction Regions
|
| 15 |
|
| 16 |
+
Prediction regions in conformal prediction give a set of plausible values for the prediction rather than a single point. For a regression task, the region is an interval, often called a prediction interval. Let's denote it as $[a, b]$, where $a$ and $b$ are the lower and upper bounds, respectively. The significance level (the tolerated error rate) is denoted by $\alpha$, so the confidence level is $(1 - \alpha)$. The prediction region is constructed so that it contains the true value with probability at least $(1 - \alpha)$.
|
| 17 |
+
|
| 18 |
+
Mathematically, the prediction region $[a, b]$ built around a point prediction $\hat{y}$ must satisfy:
|
| 19 |
+
|
| 20 |
+
$$ P(a \leq y \leq b) \geq 1 - \alpha $$
|
| 21 |
+
|
| 22 |
+
This ensures that the true value $y$ falls within the predicted interval with probability at least $(1 - \alpha)$.
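
One common way to obtain such an interval is split (inductive) conformal regression, which is an assumption here since the text above does not fix a construction. The sketch takes absolute residuals from held-out calibration data and uses their $\lceil (n + 1)(1 - \alpha) \rceil$-th smallest value as the half-width. The function name `conformal_interval` and the toy residuals are hypothetical.

```python
import math

def conformal_interval(y_hat, cal_residuals, alpha):
    """Return (a, b) around y_hat from absolute calibration residuals.

    Assumes the split-conformal recipe: the half-width is the
    ceil((n + 1) * (1 - alpha))-th smallest calibration residual.
    """
    n = len(cal_residuals)
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        # Too few calibration points for this alpha: only the
        # unbounded interval keeps the coverage guarantee.
        return (-math.inf, math.inf)
    q = sorted(cal_residuals)[rank - 1]
    return (y_hat - q, y_hat + q)

# Hypothetical absolute residuals |y_i - y_hat_i| on nine calibration points.
residuals = [0.1, 0.4, 0.2, 0.9, 0.3, 0.5, 0.7, 0.6, 0.8]
a, b = conformal_interval(y_hat=10.0, cal_residuals=residuals, alpha=0.2)
print(a, b)  # the half-width is the 8th smallest residual, 0.8
```

A smaller $\alpha$ pushes the rank toward the largest residuals, so the interval widens as the required coverage grows.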
|
| 23 |
+
|
| 24 |
+
For a classification task, the prediction region is the set of classes whose scores clear a certain threshold. The threshold is determined by $\alpha$. Mathematically, the predicted set of classes $\hat{C}$ must satisfy:
|
| 25 |
+
|
| 26 |
+
$$ P(y \in \hat{C}) \geq 1 - \alpha $$
|
| 27 |
+
|
| 28 |
+
This ensures that the true class $y$ falls within the predicted set of classes with probability at least $(1 - \alpha)$.
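
As an illustration of how $\alpha$ can set the threshold, here is a minimal sketch that assumes the nonconformity score is one minus the probability the model assigns to a class. That choice is common, but it is not stated in the text above. The names `conformal_threshold` and `prediction_set`, the calibration scores, and the class probabilities are all hypothetical.

```python
import math

def conformal_threshold(cal_scores, alpha):
    """ceil((n + 1) * (1 - alpha))-th smallest calibration score."""
    n = len(cal_scores)
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf  # not enough calibration data: keep every class
    return sorted(cal_scores)[rank - 1]

def prediction_set(class_probs, threshold):
    """Keep every class whose nonconformity 1 - p is within the threshold."""
    return {c for c, p in class_probs.items() if 1.0 - p <= threshold}

# Hypothetical calibration scores: 1 - (probability given to the true class).
cal_scores = [0.05, 0.10, 0.20, 0.30, 0.15, 0.55, 0.25, 0.35, 0.80]
threshold = conformal_threshold(cal_scores, alpha=0.2)

# A test point where the model hesitates between two classes.
c_hat = prediction_set({"cat": 0.50, "dog": 0.46, "fox": 0.04}, threshold)
print(threshold, sorted(c_hat))
```

When the model is unsure, several classes clear the threshold and the set grows, which is how the region expresses uncertainty.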
|
| 29 |
|
| 30 |
### 2. Validity
|
| 31 |
|
| 32 |
+
Validity is the defining guarantee of a conformal predictor. It ensures that, over repeated experiments, the true value falls within the predicted region at least a fraction $(1 - \alpha)$ of the time. Mathematically, for a predicted interval $[a, b]$ and a true outcome $y$, the validity condition is expressed as:
|
| 33 |
+
|
| 34 |
+
$$ P(y \in [a, b]) \geq 1 - \alpha $$
|
| 35 |
+
|
| 36 |
+
This means that the probability of the true value $y$ lying within the predicted interval $[a, b]$ is greater than or equal to $(1 - \alpha)$.
|
| 37 |
+
|
| 38 |
+
For a classification task, the validity condition is expressed as:
|
| 39 |
+
|
| 40 |
+
$$ P(y \in \hat{C}) \geq 1 - \alpha $$
|
| 41 |
+
|
| 42 |
+
This means that the probability of the true class $y$ lying within the predicted set of classes $\hat{C}$ is greater than or equal to $(1 - \alpha)$.
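
Because validity is a statement about long-run frequency, it can be checked empirically by counting how often held-out true values land inside their regions. The sketch below does this on simulated data. The setup is an assumption made for illustration: a model that always predicts 0, standard normal targets, and a split-conformal half-width. The helper `empirical_coverage` is hypothetical.

```python
import math
import random

def empirical_coverage(intervals, y_true):
    """Fraction of true values that fall inside their interval [a, b]."""
    hits = sum(a <= y <= b for (a, b), y in zip(intervals, y_true))
    return hits / len(y_true)

rng = random.Random(0)  # fixed seed so the run is repeatable
alpha = 0.1

# Hypothetical setup: the model predicts 0 and the truth is standard
# normal noise, so each absolute residual is just |y|.
cal_residuals = sorted(abs(rng.gauss(0.0, 1.0)) for _ in range(999))
rank = math.ceil((len(cal_residuals) + 1) * (1 - alpha))
q = cal_residuals[rank - 1]

y_test = [rng.gauss(0.0, 1.0) for _ in range(5000)]
intervals = [(-q, q)] * len(y_test)  # same half-width around the prediction 0
coverage = empirical_coverage(intervals, y_test)
print(f"target {1 - alpha:.2f}, observed {coverage:.3f}")
```

The observed frequency should sit close to $(1 - \alpha)$. It fluctuates from run to run because the guarantee holds on average over calibration and test draws, not for one fixed calibration set.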
|
| 43 |
|
| 44 |
### 3. Inductive Conformal Prediction
|
| 45 |
|
|
|
|
| 58 |
- validity score $s_i$ by counting the number of times the true value $y_i$ falls within the prediction region $R_i$ over repeated experiments.
|
| 59 |
- p-value $p_i$ by dividing the validity score $s_i$ by the number of repeated experiments.
|
| 60 |
- prediction region $R_i$ using the top $k$ errors and the p-value $p_i$.