diff --git "a/data_profile_report.html" "b/data_profile_report.html"
new file mode 100644--- /dev/null
+++ "b/data_profile_report.html"
@@ -0,0 +1,393 @@
+
📊 Detailed Data ReportBrought to you by YData
| Number of variables | 19 |
|---|
| Number of observations | 6607 |
|---|
| Missing cells | 157 |
|---|
| Missing cells (%) | 0.1% |
|---|
| Duplicate rows | 0 |
|---|
| Duplicate rows (%) | 0.0% |
|---|
| Total size in memory | 5.0 MiB |
|---|
| Average record size in memory | 799.7 B |
|---|
| Numeric | 6 |
|---|
| Categorical | 10 |
|---|
| Boolean | 3 |
|---|
| Analysis started | 2025-11-09 13:03:17.274384 |
|---|
| Analysis finished | 2025-11-09 13:03:38.720780 |
|---|
| Duration | 21.45 seconds |
|---|
| Software version | ydata-profiling vv4.17.0 |
|---|
| Download configuration | config.json |
|---|
| Distinct | 41 |
|---|
| Distinct (%) | 0.6% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 19.975329 |
|---|
| Minimum | 1 |
|---|
| Maximum | 44 |
|---|
| Zeros | 0 |
|---|
| Zeros (%) | 0.0% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 1 |
|---|
| 5-th percentile | 10 |
|---|
| Q1 | 16 |
|---|
| median | 20 |
|---|
| Q3 | 24 |
|---|
| 95-th percentile | 30 |
|---|
| Maximum | 44 |
|---|
| Range | 43 |
|---|
| Interquartile range (IQR) | 8 |
|---|
| Standard deviation | 5.9905943 |
|---|
| Coefficient of variation (CV) | 0.29989966 |
|---|
| Kurtosis | 0.017770627 |
|---|
| Mean | 19.975329 |
|---|
| Median Absolute Deviation (MAD) | 4 |
|---|
| Skewness | 0.013498909 |
|---|
| Sum | 131977 |
|---|
| Variance | 35.887221 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=41)
| Value | Count | Frequency (%) |
| 20 | 465 | 7.0% |
| 19 | 441 | 6.7% |
| 21 | 431 | 6.5% |
| 23 | 411 | 6.2% |
| 22 | 402 | 6.1% |
| 18 | 401 | 6.1% |
| 17 | 381 | 5.8% |
| 24 | 357 | 5.4% |
| 16 | 351 | 5.3% |
| 15 | 315 | 4.8% |
| Other values (31) | 2652 | 40.1% |
| Value | Count | Frequency (%) |
| 1 | 3 | < 0.1% |
| 2 | 6 | 0.1% |
| 3 | 12 | 0.2% |
| 4 | 17 | 0.3% |
| 5 | 21 | 0.3% |
| 6 | 17 | 0.3% |
| 7 | 51 | 0.8% |
| 8 | 58 | 0.9% |
| 9 | 86 | 1.3% |
| 10 | 94 | 1.4% |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 39 | 7 | 0.1% |
| 38 | 7 | 0.1% |
| 37 | 6 | 0.1% |
| 36 | 11 | 0.2% |
| 35 | 20 | 0.3% |
| 34 | 29 | 0.4% |
| 33 | 40 | 0.6% |
| 32 | 54 | 0.8% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 399.0 KiB |
|---|
| Medium | 3362 |
|---|
| High | 1908 |
|---|
| Low | 1337 |
|---|
| Max length | 6 |
|---|
| Median length | 6 |
|---|
| Mean length | 4.8153474 |
|---|
| Min length | 3 |
|---|
| Total characters | 31815 |
|---|
| Distinct characters | 12 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Low |
|---|
| 2nd row | Low |
|---|
| 3rd row | Medium |
|---|
| 4th row | Low |
|---|
| 5th row | Medium |
|---|
| Value | Count | Frequency (%) |
| Medium | 3362 | 50.9% |
| High | 1908 | 28.9% |
| Low | 1337 | 20.2% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| medium | 3362 | 50.9% |
| high | 1908 | 28.9% |
| low | 1337 | 20.2% |
| Value | Count | Frequency (%) |
| i | 5270 | 16.6% |
| M | 3362 | 10.6% |
| e | 3362 | 10.6% |
| d | 3362 | 10.6% |
| u | 3362 | 10.6% |
| m | 3362 | 10.6% |
| H | 1908 | 6.0% |
| g | 1908 | 6.0% |
| h | 1908 | 6.0% |
| L | 1337 | 4.2% |
| Other values (2) | 2674 | 8.4% |
| Value | Count | Frequency (%) |
| (unknown) | 31815 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5270 | 16.6% |
| M | 3362 | 10.6% |
| e | 3362 | 10.6% |
| d | 3362 | 10.6% |
| u | 3362 | 10.6% |
| m | 3362 | 10.6% |
| H | 1908 | 6.0% |
| g | 1908 | 6.0% |
| h | 1908 | 6.0% |
| L | 1337 | 4.2% |
| Other values (2) | 2674 | 8.4% |
| Value | Count | Frequency (%) |
| (unknown) | 31815 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5270 | 16.6% |
| M | 3362 | 10.6% |
| e | 3362 | 10.6% |
| d | 3362 | 10.6% |
| u | 3362 | 10.6% |
| m | 3362 | 10.6% |
| H | 1908 | 6.0% |
| g | 1908 | 6.0% |
| h | 1908 | 6.0% |
| L | 1337 | 4.2% |
| Other values (2) | 2674 | 8.4% |
| Value | Count | Frequency (%) |
| (unknown) | 31815 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5270 | 16.6% |
| M | 3362 | 10.6% |
| e | 3362 | 10.6% |
| d | 3362 | 10.6% |
| u | 3362 | 10.6% |
| m | 3362 | 10.6% |
| H | 1908 | 6.0% |
| g | 1908 | 6.0% |
| h | 1908 | 6.0% |
| L | 1337 | 4.2% |
| Other values (2) | 2674 | 8.4% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 398.9 KiB |
|---|
| Medium | 3319 |
|---|
| High | 1975 |
|---|
| Low | 1313 |
|---|
| Max length | 6 |
|---|
| Median length | 6 |
|---|
| Mean length | 4.8059634 |
|---|
| Min length | 3 |
|---|
| Total characters | 31753 |
|---|
| Distinct characters | 12 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | High |
|---|
| 2nd row | Medium |
|---|
| 3rd row | Medium |
|---|
| 4th row | Medium |
|---|
| 5th row | Medium |
|---|
| Value | Count | Frequency (%) |
| Medium | 3319 | 50.2% |
| High | 1975 | 29.9% |
| Low | 1313 | 19.9% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| medium | 3319 | 50.2% |
| high | 1975 | 29.9% |
| low | 1313 | 19.9% |
| Value | Count | Frequency (%) |
| i | 5294 | 16.7% |
| M | 3319 | 10.5% |
| e | 3319 | 10.5% |
| d | 3319 | 10.5% |
| u | 3319 | 10.5% |
| m | 3319 | 10.5% |
| H | 1975 | 6.2% |
| g | 1975 | 6.2% |
| h | 1975 | 6.2% |
| L | 1313 | 4.1% |
| Other values (2) | 2626 | 8.3% |
| Value | Count | Frequency (%) |
| (unknown) | 31753 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5294 | 16.7% |
| M | 3319 | 10.5% |
| e | 3319 | 10.5% |
| d | 3319 | 10.5% |
| u | 3319 | 10.5% |
| m | 3319 | 10.5% |
| H | 1975 | 6.2% |
| g | 1975 | 6.2% |
| h | 1975 | 6.2% |
| L | 1313 | 4.1% |
| Other values (2) | 2626 | 8.3% |
| Value | Count | Frequency (%) |
| (unknown) | 31753 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5294 | 16.7% |
| M | 3319 | 10.5% |
| e | 3319 | 10.5% |
| d | 3319 | 10.5% |
| u | 3319 | 10.5% |
| m | 3319 | 10.5% |
| H | 1975 | 6.2% |
| g | 1975 | 6.2% |
| h | 1975 | 6.2% |
| L | 1313 | 4.1% |
| Other values (2) | 2626 | 8.3% |
| Value | Count | Frequency (%) |
| (unknown) | 31753 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5294 | 16.7% |
| M | 3319 | 10.5% |
| e | 3319 | 10.5% |
| d | 3319 | 10.5% |
| u | 3319 | 10.5% |
| m | 3319 | 10.5% |
| H | 1975 | 6.2% |
| g | 1975 | 6.2% |
| h | 1975 | 6.2% |
| L | 1313 | 4.1% |
| Other values (2) | 2626 | 8.3% |
| Distinct | 2 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 6.6 KiB |
|---|
| Value | Count | Frequency (%) |
| True | 3938 | 59.6% |
| False | 2669 | 40.4% |
| Distinct | 7 |
|---|
| Distinct (%) | 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 7.0290601 |
|---|
| Minimum | 4 |
|---|
| Maximum | 10 |
|---|
| Zeros | 0 |
|---|
| Zeros (%) | 0.0% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 4 |
|---|
| 5-th percentile | 5 |
|---|
| Q1 | 6 |
|---|
| median | 7 |
|---|
| Q3 | 8 |
|---|
| 95-th percentile | 9 |
|---|
| Maximum | 10 |
|---|
| Range | 6 |
|---|
| Interquartile range (IQR) | 2 |
|---|
| Standard deviation | 1.4681202 |
|---|
| Coefficient of variation (CV) | 0.20886437 |
|---|
| Kurtosis | -0.50369743 |
|---|
| Mean | 7.0290601 |
|---|
| Median Absolute Deviation (MAD) | 1 |
|---|
| Skewness | -0.023805437 |
|---|
| Sum | 46441 |
|---|
| Variance | 2.155377 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 7 | 1741 | 26.4% |
| 8 | 1399 | 21.2% |
| 6 | 1376 | 20.8% |
| 9 | 775 | 11.7% |
| 5 | 695 | 10.5% |
| 10 | 312 | 4.7% |
| 4 | 309 | 4.7% |
| Value | Count | Frequency (%) |
| 4 | 309 | 4.7% |
| 5 | 695 | 10.5% |
| 6 | 1376 | 20.8% |
| 7 | 1741 | 26.4% |
| 8 | 1399 | 21.2% |
| 9 | 775 | 11.7% |
| 10 | 312 | 4.7% |
| Value | Count | Frequency (%) |
| 10 | 312 | 4.7% |
| 9 | 775 | 11.7% |
| 8 | 1399 | 21.2% |
| 7 | 1741 | 26.4% |
| 6 | 1376 | 20.8% |
| 5 | 695 | 10.5% |
| 4 | 309 | 4.7% |
| Distinct | 51 |
|---|
| Distinct (%) | 0.8% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 75.070531 |
|---|
| Minimum | 50 |
|---|
| Maximum | 100 |
|---|
| Zeros | 0 |
|---|
| Zeros (%) | 0.0% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 50 |
|---|
| 5-th percentile | 53 |
|---|
| Q1 | 63 |
|---|
| median | 75 |
|---|
| Q3 | 88 |
|---|
| 95-th percentile | 97 |
|---|
| Maximum | 100 |
|---|
| Range | 50 |
|---|
| Interquartile range (IQR) | 25 |
|---|
| Standard deviation | 14.399784 |
|---|
| Coefficient of variation (CV) | 0.19181674 |
|---|
| Kurtosis | -1.1910804 |
|---|
| Mean | 75.070531 |
|---|
| Median Absolute Deviation (MAD) | 12 |
|---|
| Skewness | -0.0037365338 |
|---|
| Sum | 495991 |
|---|
| Variance | 207.35379 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 66 | 165 | 2.5% |
| 94 | 155 | 2.3% |
| 96 | 153 | 2.3% |
| 85 | 150 | 2.3% |
| 71 | 146 | 2.2% |
| 53 | 144 | 2.2% |
| 59 | 142 | 2.1% |
| 82 | 141 | 2.1% |
| 76 | 140 | 2.1% |
| 88 | 139 | 2.1% |
| Other values (41) | 5132 | 77.7% |
| Value | Count | Frequency (%) |
| 50 | 62 | 0.9% |
| 51 | 126 | 1.9% |
| 52 | 136 | 2.1% |
| 53 | 144 | 2.2% |
| 54 | 136 | 2.1% |
| 55 | 124 | 1.9% |
| 56 | 120 | 1.8% |
| 57 | 125 | 1.9% |
| 58 | 120 | 1.8% |
| 59 | 142 | 2.1% |
| Value | Count | Frequency (%) |
| 100 | 69 | 1.0% |
| 99 | 125 | 1.9% |
| 98 | 131 | 2.0% |
| 97 | 115 | 1.7% |
| 96 | 153 | 2.3% |
| 95 | 129 | 2.0% |
| 94 | 155 | 2.3% |
| 93 | 129 | 2.0% |
| 92 | 126 | 1.9% |
| 91 | 136 | 2.1% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 374.3 KiB |
|---|
| Max length | 1 |
|---|
| Median length | 1 |
|---|
| Mean length | 1 |
|---|
| Min length | 1 |
|---|
| Total characters | 6607 |
|---|
| Distinct characters | 3 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | 1 |
|---|
| 2nd row | 1 |
|---|
| 3rd row | 2 |
|---|
| 4th row | 2 |
|---|
| 5th row | 2 |
|---|
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
| Value | Count | Frequency (%) |
| (unknown) | 6607 | 100.0% |
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
| Value | Count | Frequency (%) |
| (unknown) | 6607 | 100.0% |
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
| Value | Count | Frequency (%) |
| (unknown) | 6607 | 100.0% |
| Value | Count | Frequency (%) |
| 2 | 3351 | 50.7% |
| 1 | 1937 | 29.3% |
| 0 | 1319 | 20.0% |
| Distinct | 2 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 6.6 KiB |
|---|
| Value | Count | Frequency (%) |
| True | 6108 | 92.4% |
| False | 499 | 7.6% |
| Distinct | 9 |
|---|
| Distinct (%) | 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 1.4937188 |
|---|
| Minimum | 0 |
|---|
| Maximum | 8 |
|---|
| Zeros | 1513 |
|---|
| Zeros (%) | 22.9% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 0 |
|---|
| 5-th percentile | 0 |
|---|
| Q1 | 1 |
|---|
| median | 1 |
|---|
| Q3 | 2 |
|---|
| 95-th percentile | 4 |
|---|
| Maximum | 8 |
|---|
| Range | 8 |
|---|
| Interquartile range (IQR) | 1 |
|---|
| Standard deviation | 1.2305704 |
|---|
| Coefficient of variation (CV) | 0.82383005 |
|---|
| Kurtosis | 0.64371832 |
|---|
| Mean | 1.4937188 |
|---|
| Median Absolute Deviation (MAD) | 1 |
|---|
| Skewness | 0.8155296 |
|---|
| Sum | 9869 |
|---|
| Variance | 1.5143036 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) |
| 1 | 2179 | 33.0% |
| 2 | 1649 | 25.0% |
| 0 | 1513 | 22.9% |
| 3 | 836 | 12.7% |
| 4 | 301 | 4.6% |
| 5 | 103 | 1.6% |
| 6 | 18 | 0.3% |
| 7 | 7 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1513 | 22.9% |
| 1 | 2179 | 33.0% |
| 2 | 1649 | 25.0% |
| 3 | 836 | 12.7% |
| 4 | 301 | 4.6% |
| 5 | 103 | 1.6% |
| 6 | 18 | 0.3% |
| 7 | 7 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 7 | 0.1% |
| 6 | 18 | 0.3% |
| 5 | 103 | 1.6% |
| 4 | 301 | 4.6% |
| 3 | 836 | 12.7% |
| 2 | 1649 | 25.0% |
| 1 | 2179 | 33.0% |
| 0 | 1513 | 22.9% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 396.3 KiB |
|---|
| Low | 2672 |
|---|
| Medium | 2666 |
|---|
| High | 1269 |
|---|
| Max length | 6 |
|---|
| Median length | 4 |
|---|
| Mean length | 4.4026033 |
|---|
| Min length | 3 |
|---|
| Total characters | 29088 |
|---|
| Distinct characters | 12 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Low |
|---|
| 2nd row | Medium |
|---|
| 3rd row | Medium |
|---|
| 4th row | Medium |
|---|
| 5th row | Medium |
|---|
| Value | Count | Frequency (%) |
| Low | 2672 | 40.4% |
| Medium | 2666 | 40.4% |
| High | 1269 | 19.2% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| low | 2672 | 40.4% |
| medium | 2666 | 40.4% |
| high | 1269 | 19.2% |
| Value | Count | Frequency (%) |
| i | 3935 | 13.5% |
| L | 2672 | 9.2% |
| o | 2672 | 9.2% |
| w | 2672 | 9.2% |
| M | 2666 | 9.2% |
| e | 2666 | 9.2% |
| d | 2666 | 9.2% |
| u | 2666 | 9.2% |
| m | 2666 | 9.2% |
| H | 1269 | 4.4% |
| Other values (2) | 2538 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 29088 | 100.0% |
| Value | Count | Frequency (%) |
| i | 3935 | 13.5% |
| L | 2672 | 9.2% |
| o | 2672 | 9.2% |
| w | 2672 | 9.2% |
| M | 2666 | 9.2% |
| e | 2666 | 9.2% |
| d | 2666 | 9.2% |
| u | 2666 | 9.2% |
| m | 2666 | 9.2% |
| H | 1269 | 4.4% |
| Other values (2) | 2538 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 29088 | 100.0% |
| Value | Count | Frequency (%) |
| i | 3935 | 13.5% |
| L | 2672 | 9.2% |
| o | 2672 | 9.2% |
| w | 2672 | 9.2% |
| M | 2666 | 9.2% |
| e | 2666 | 9.2% |
| d | 2666 | 9.2% |
| u | 2666 | 9.2% |
| m | 2666 | 9.2% |
| H | 1269 | 4.4% |
| Other values (2) | 2538 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 29088 | 100.0% |
| Value | Count | Frequency (%) |
| i | 3935 | 13.5% |
| L | 2672 | 9.2% |
| o | 2672 | 9.2% |
| w | 2672 | 9.2% |
| M | 2666 | 9.2% |
| e | 2666 | 9.2% |
| d | 2666 | 9.2% |
| u | 2666 | 9.2% |
| m | 2666 | 9.2% |
| H | 1269 | 4.4% |
| Other values (2) | 2538 | 8.7% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 400.9 KiB |
|---|
| Medium | 4003 |
|---|
| High | 1947 |
|---|
| Low | 657 |
|---|
| Max length | 6 |
|---|
| Median length | 6 |
|---|
| Mean length | 5.1123051 |
|---|
| Min length | 3 |
|---|
| Total characters | 33777 |
|---|
| Distinct characters | 12 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Medium |
|---|
| 2nd row | Medium |
|---|
| 3rd row | Medium |
|---|
| 4th row | Medium |
|---|
| 5th row | High |
|---|
| Value | Count | Frequency (%) |
| Medium | 4003 | 60.6% |
| High | 1947 | 29.5% |
| Low | 657 | 9.9% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| medium | 4003 | 60.6% |
| high | 1947 | 29.5% |
| low | 657 | 9.9% |
| Value | Count | Frequency (%) |
| i | 5950 | 17.6% |
| M | 4003 | 11.9% |
| e | 4003 | 11.9% |
| d | 4003 | 11.9% |
| u | 4003 | 11.9% |
| m | 4003 | 11.9% |
| H | 1947 | 5.8% |
| g | 1947 | 5.8% |
| h | 1947 | 5.8% |
| L | 657 | 1.9% |
| Other values (2) | 1314 | 3.9% |
| Value | Count | Frequency (%) |
| (unknown) | 33777 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5950 | 17.6% |
| M | 4003 | 11.9% |
| e | 4003 | 11.9% |
| d | 4003 | 11.9% |
| u | 4003 | 11.9% |
| m | 4003 | 11.9% |
| H | 1947 | 5.8% |
| g | 1947 | 5.8% |
| h | 1947 | 5.8% |
| L | 657 | 1.9% |
| Other values (2) | 1314 | 3.9% |
| Value | Count | Frequency (%) |
| (unknown) | 33777 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5950 | 17.6% |
| M | 4003 | 11.9% |
| e | 4003 | 11.9% |
| d | 4003 | 11.9% |
| u | 4003 | 11.9% |
| m | 4003 | 11.9% |
| H | 1947 | 5.8% |
| g | 1947 | 5.8% |
| h | 1947 | 5.8% |
| L | 657 | 1.9% |
| Other values (2) | 1314 | 3.9% |
| Value | Count | Frequency (%) |
| (unknown) | 33777 | 100.0% |
| Value | Count | Frequency (%) |
| i | 5950 | 17.6% |
| M | 4003 | 11.9% |
| e | 4003 | 11.9% |
| d | 4003 | 11.9% |
| u | 4003 | 11.9% |
| m | 4003 | 11.9% |
| H | 1947 | 5.8% |
| g | 1947 | 5.8% |
| h | 1947 | 5.8% |
| L | 657 | 1.9% |
| Other values (2) | 1314 | 3.9% |
| Distinct | 2 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 408.6 KiB |
|---|
| Max length | 7 |
|---|
| Median length | 6 |
|---|
| Mean length | 6.3040714 |
|---|
| Min length | 6 |
|---|
| Total characters | 41651 |
|---|
| Distinct characters | 11 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Public |
|---|
| 2nd row | Public |
|---|
| 3rd row | Public |
|---|
| 4th row | Public |
|---|
| 5th row | Public |
|---|
| Value | Count | Frequency (%) |
| Public | 4598 | 69.6% |
| Private | 2009 | 30.4% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| public | 4598 | 69.6% |
| private | 2009 | 30.4% |
| Value | Count | Frequency (%) |
| P | 6607 | 15.9% |
| i | 6607 | 15.9% |
| u | 4598 | 11.0% |
| b | 4598 | 11.0% |
| l | 4598 | 11.0% |
| c | 4598 | 11.0% |
| r | 2009 | 4.8% |
| v | 2009 | 4.8% |
| a | 2009 | 4.8% |
| t | 2009 | 4.8% |
| Value | Count | Frequency (%) |
| (unknown) | 41651 | 100.0% |
| Value | Count | Frequency (%) |
| P | 6607 | 15.9% |
| i | 6607 | 15.9% |
| u | 4598 | 11.0% |
| b | 4598 | 11.0% |
| l | 4598 | 11.0% |
| c | 4598 | 11.0% |
| r | 2009 | 4.8% |
| v | 2009 | 4.8% |
| a | 2009 | 4.8% |
| t | 2009 | 4.8% |
| Value | Count | Frequency (%) |
| (unknown) | 41651 | 100.0% |
| Value | Count | Frequency (%) |
| P | 6607 | 15.9% |
| i | 6607 | 15.9% |
| u | 4598 | 11.0% |
| b | 4598 | 11.0% |
| l | 4598 | 11.0% |
| c | 4598 | 11.0% |
| r | 2009 | 4.8% |
| v | 2009 | 4.8% |
| a | 2009 | 4.8% |
| t | 2009 | 4.8% |
| Value | Count | Frequency (%) |
| (unknown) | 41651 | 100.0% |
| Value | Count | Frequency (%) |
| P | 6607 | 15.9% |
| i | 6607 | 15.9% |
| u | 4598 | 11.0% |
| b | 4598 | 11.0% |
| l | 4598 | 11.0% |
| c | 4598 | 11.0% |
| r | 2009 | 4.8% |
| v | 2009 | 4.8% |
| a | 2009 | 4.8% |
| t | 2009 | 4.8% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 417.0 KiB |
|---|
| Positive | 2638 |
|---|
| Neutral | 2592 |
|---|
| Negative | 1377 |
|---|
| Max length | 8 |
|---|
| Median length | 8 |
|---|
| Mean length | 7.6076888 |
|---|
| Min length | 7 |
|---|
| Total characters | 50264 |
|---|
| Distinct characters | 13 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Positive |
|---|
| 2nd row | Negative |
|---|
| 3rd row | Neutral |
|---|
| 4th row | Negative |
|---|
| 5th row | Neutral |
|---|
| Value | Count | Frequency (%) |
| Positive | 2638 | 39.9% |
| Neutral | 2592 | 39.2% |
| Negative | 1377 | 20.8% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| positive | 2638 | 39.9% |
| neutral | 2592 | 39.2% |
| negative | 1377 | 20.8% |
| Value | Count | Frequency (%) |
| e | 7984 | 15.9% |
| i | 6653 | 13.2% |
| t | 6607 | 13.1% |
| v | 4015 | 8.0% |
| N | 3969 | 7.9% |
| a | 3969 | 7.9% |
| P | 2638 | 5.2% |
| o | 2638 | 5.2% |
| s | 2638 | 5.2% |
| u | 2592 | 5.2% |
| Other values (3) | 6561 | 13.1% |
| Value | Count | Frequency (%) |
| (unknown) | 50264 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7984 | 15.9% |
| i | 6653 | 13.2% |
| t | 6607 | 13.1% |
| v | 4015 | 8.0% |
| N | 3969 | 7.9% |
| a | 3969 | 7.9% |
| P | 2638 | 5.2% |
| o | 2638 | 5.2% |
| s | 2638 | 5.2% |
| u | 2592 | 5.2% |
| Other values (3) | 6561 | 13.1% |
| Value | Count | Frequency (%) |
| (unknown) | 50264 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7984 | 15.9% |
| i | 6653 | 13.2% |
| t | 6607 | 13.1% |
| v | 4015 | 8.0% |
| N | 3969 | 7.9% |
| a | 3969 | 7.9% |
| P | 2638 | 5.2% |
| o | 2638 | 5.2% |
| s | 2638 | 5.2% |
| u | 2592 | 5.2% |
| Other values (3) | 6561 | 13.1% |
| Value | Count | Frequency (%) |
| (unknown) | 50264 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7984 | 15.9% |
| i | 6653 | 13.2% |
| t | 6607 | 13.1% |
| v | 4015 | 8.0% |
| N | 3969 | 7.9% |
| a | 3969 | 7.9% |
| P | 2638 | 5.2% |
| o | 2638 | 5.2% |
| s | 2638 | 5.2% |
| u | 2592 | 5.2% |
| Other values (3) | 6561 | 13.1% |
| Distinct | 7 |
|---|
| Distinct (%) | 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 2.9676101 |
|---|
| Minimum | 0 |
|---|
| Maximum | 6 |
|---|
| Zeros | 46 |
|---|
| Zeros (%) | 0.7% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 0 |
|---|
| 5-th percentile | 1 |
|---|
| Q1 | 2 |
|---|
| median | 3 |
|---|
| Q3 | 4 |
|---|
| 95-th percentile | 5 |
|---|
| Maximum | 6 |
|---|
| Range | 6 |
|---|
| Interquartile range (IQR) | 2 |
|---|
| Standard deviation | 1.0312311 |
|---|
| Coefficient of variation (CV) | 0.34749548 |
|---|
| Kurtosis | -0.05943887 |
|---|
| Mean | 2.9676101 |
|---|
| Median Absolute Deviation (MAD) | 1 |
|---|
| Skewness | -0.031364712 |
|---|
| Sum | 19607 |
|---|
| Variance | 1.0634376 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 3 | 2545 | 38.5% |
| 2 | 1627 | 24.6% |
| 4 | 1575 | 23.8% |
| 1 | 421 | 6.4% |
| 5 | 361 | 5.5% |
| 0 | 46 | 0.7% |
| 6 | 32 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.7% |
| 1 | 421 | 6.4% |
| 2 | 1627 | 24.6% |
| 3 | 2545 | 38.5% |
| 4 | 1575 | 23.8% |
| 5 | 361 | 5.5% |
| 6 | 32 | 0.5% |
| Value | Count | Frequency (%) |
| 6 | 32 | 0.5% |
| 5 | 361 | 5.5% |
| 4 | 1575 | 23.8% |
| 3 | 2545 | 38.5% |
| 2 | 1627 | 24.6% |
| 1 | 421 | 6.4% |
| 0 | 46 | 0.7% |
| Distinct | 2 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 6.6 KiB |
|---|
| Value | Count | Frequency (%) |
| False | 5912 | 89.5% |
| True | 695 | 10.5% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 90 |
|---|
| Missing (%) | 1.4% |
|---|
| Memory size | 431.3 KiB |
|---|
| High School | 3223 |
|---|
| College | 1989 |
|---|
| Postgraduate | 1305 |
|---|
| Max length | 12 |
|---|
| Median length | 11 |
|---|
| Mean length | 9.9794384 |
|---|
| Min length | 7 |
|---|
| Total characters | 65036 |
|---|
| Distinct characters | 18 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | High School |
|---|
| 2nd row | College |
|---|
| 3rd row | Postgraduate |
|---|
| 4th row | High School |
|---|
| 5th row | College |
|---|
| Value | Count | Frequency (%) |
| High School | 3223 | 48.8% |
| College | 1989 | 30.1% |
| Postgraduate | 1305 | 19.8% |
| (Missing) | 90 | 1.4% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| high | 3223 | 33.1% |
| school | 3223 | 33.1% |
| college | 1989 | 20.4% |
| postgraduate | 1305 | 13.4% |
| Value | Count | Frequency (%) |
| o | 9740 | 15.0% |
| l | 7201 | 11.1% |
| g | 6517 | 10.0% |
| h | 6446 | 9.9% |
| e | 5283 | 8.1% |
| H | 3223 | 5.0% |
| 3223 | 5.0% |
| S | 3223 | 5.0% |
| c | 3223 | 5.0% |
| i | 3223 | 5.0% |
| Other values (8) | 13734 | 21.1% |
| Value | Count | Frequency (%) |
| (unknown) | 65036 | 100.0% |
| Value | Count | Frequency (%) |
| o | 9740 | 15.0% |
| l | 7201 | 11.1% |
| g | 6517 | 10.0% |
| h | 6446 | 9.9% |
| e | 5283 | 8.1% |
| H | 3223 | 5.0% |
| 3223 | 5.0% |
| S | 3223 | 5.0% |
| c | 3223 | 5.0% |
| i | 3223 | 5.0% |
| Other values (8) | 13734 | 21.1% |
| Value | Count | Frequency (%) |
| (unknown) | 65036 | 100.0% |
| Value | Count | Frequency (%) |
| o | 9740 | 15.0% |
| l | 7201 | 11.1% |
| g | 6517 | 10.0% |
| h | 6446 | 9.9% |
| e | 5283 | 8.1% |
| H | 3223 | 5.0% |
| 3223 | 5.0% |
| S | 3223 | 5.0% |
| c | 3223 | 5.0% |
| i | 3223 | 5.0% |
| Other values (8) | 13734 | 21.1% |
| Value | Count | Frequency (%) |
| (unknown) | 65036 | 100.0% |
| Value | Count | Frequency (%) |
| o | 9740 | 15.0% |
| l | 7201 | 11.1% |
| g | 6517 | 10.0% |
| h | 6446 | 9.9% |
| e | 5283 | 8.1% |
| H | 3223 | 5.0% |
| 3223 | 5.0% |
| S | 3223 | 5.0% |
| c | 3223 | 5.0% |
| i | 3223 | 5.0% |
| Other values (8) | 13734 | 21.1% |
| Distinct | 3 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 67 |
|---|
| Missing (%) | 1.0% |
|---|
| Memory size | 400.5 KiB |
|---|
| Near | 3884 |
|---|
| Moderate | 1998 |
|---|
| Far | 658 |
|---|
| Max length | 8 |
|---|
| Median length | 4 |
|---|
| Mean length | 5.1214067 |
|---|
| Min length | 3 |
|---|
| Total characters | 33494 |
|---|
| Distinct characters | 9 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Near |
|---|
| 2nd row | Moderate |
|---|
| 3rd row | Near |
|---|
| 4th row | Moderate |
|---|
| 5th row | Near |
|---|
| Value | Count | Frequency (%) |
| Near | 3884 | 58.8% |
| Moderate | 1998 | 30.2% |
| Far | 658 | 10.0% |
| (Missing) | 67 | 1.0% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| near | 3884 | 59.4% |
| moderate | 1998 | 30.6% |
| far | 658 | 10.1% |
| Value | Count | Frequency (%) |
| e | 7880 | 23.5% |
| a | 6540 | 19.5% |
| r | 6540 | 19.5% |
| N | 3884 | 11.6% |
| M | 1998 | 6.0% |
| o | 1998 | 6.0% |
| d | 1998 | 6.0% |
| t | 1998 | 6.0% |
| F | 658 | 2.0% |
| Value | Count | Frequency (%) |
| (unknown) | 33494 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7880 | 23.5% |
| a | 6540 | 19.5% |
| r | 6540 | 19.5% |
| N | 3884 | 11.6% |
| M | 1998 | 6.0% |
| o | 1998 | 6.0% |
| d | 1998 | 6.0% |
| t | 1998 | 6.0% |
| F | 658 | 2.0% |
| Value | Count | Frequency (%) |
| (unknown) | 33494 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7880 | 23.5% |
| a | 6540 | 19.5% |
| r | 6540 | 19.5% |
| N | 3884 | 11.6% |
| M | 1998 | 6.0% |
| o | 1998 | 6.0% |
| d | 1998 | 6.0% |
| t | 1998 | 6.0% |
| F | 658 | 2.0% |
| Value | Count | Frequency (%) |
| (unknown) | 33494 | 100.0% |
| Value | Count | Frequency (%) |
| e | 7880 | 23.5% |
| a | 6540 | 19.5% |
| r | 6540 | 19.5% |
| N | 3884 | 11.6% |
| M | 1998 | 6.0% |
| o | 1998 | 6.0% |
| d | 1998 | 6.0% |
| t | 1998 | 6.0% |
| F | 658 | 2.0% |
| Distinct | 2 |
|---|
| Distinct (%) | < 0.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 399.2 KiB |
|---|
| Max length | 6 |
|---|
| Median length | 4 |
|---|
| Mean length | 4.8454669 |
|---|
| Min length | 4 |
|---|
| Total characters | 32014 |
|---|
| Distinct characters | 6 |
|---|
| Distinct categories | 1 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
| 1st row | Male |
|---|
| 2nd row | Female |
|---|
| 3rd row | Male |
|---|
| 4th row | Male |
|---|
| 5th row | Female |
|---|
| Value | Count | Frequency (%) |
| Male | 3814 | 57.7% |
| Female | 2793 | 42.3% |
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| male | 3814 | 57.7% |
| female | 2793 | 42.3% |
| Value | Count | Frequency (%) |
| e | 9400 | 29.4% |
| a | 6607 | 20.6% |
| l | 6607 | 20.6% |
| M | 3814 | 11.9% |
| F | 2793 | 8.7% |
| m | 2793 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 32014 | 100.0% |
| Value | Count | Frequency (%) |
| e | 9400 | 29.4% |
| a | 6607 | 20.6% |
| l | 6607 | 20.6% |
| M | 3814 | 11.9% |
| F | 2793 | 8.7% |
| m | 2793 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 32014 | 100.0% |
| Value | Count | Frequency (%) |
| e | 9400 | 29.4% |
| a | 6607 | 20.6% |
| l | 6607 | 20.6% |
| M | 3814 | 11.9% |
| F | 2793 | 8.7% |
| m | 2793 | 8.7% |
| Value | Count | Frequency (%) |
| (unknown) | 32014 | 100.0% |
| Value | Count | Frequency (%) |
| e | 9400 | 29.4% |
| a | 6607 | 20.6% |
| l | 6607 | 20.6% |
| M | 3814 | 11.9% |
| F | 2793 | 8.7% |
| m | 2793 | 8.7% |
| Distinct | 45 |
|---|
| Distinct (%) | 0.7% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Infinite | 0 |
|---|
| Infinite (%) | 0.0% |
|---|
| Mean | 67.235659 |
|---|
| Minimum | 55 |
|---|
| Maximum | 101 |
|---|
| Zeros | 0 |
|---|
| Zeros (%) | 0.0% |
|---|
| Negative | 0 |
|---|
| Negative (%) | 0.0% |
|---|
| Memory size | 51.7 KiB |
|---|
| Minimum | 55 |
|---|
| 5-th percentile | 62 |
|---|
| Q1 | 65 |
|---|
| median | 67 |
|---|
| Q3 | 69 |
|---|
| 95-th percentile | 73 |
|---|
| Maximum | 101 |
|---|
| Range | 46 |
|---|
| Interquartile range (IQR) | 4 |
|---|
| Standard deviation | 3.8904558 |
|---|
| Coefficient of variation (CV) | 0.057862983 |
|---|
| Kurtosis | 10.575423 |
|---|
| Mean | 67.235659 |
|---|
| Median Absolute Deviation (MAD) | 2 |
|---|
| Skewness | 1.6448083 |
|---|
| Sum | 444226 |
|---|
| Variance | 15.135646 |
|---|
| Monotonicity | Not monotonic |
|---|
Histogram with fixed size bins (bins=45)
| Value | Count | Frequency (%) |
| 68 | 759 | 11.5% |
| 66 | 751 | 11.4% |
| 67 | 717 | 10.9% |
| 65 | 679 | 10.3% |
| 69 | 624 | 9.4% |
| 70 | 542 | 8.2% |
| 64 | 501 | 7.6% |
| 71 | 408 | 6.2% |
| 63 | 371 | 5.6% |
| 72 | 304 | 4.6% |
| Other values (35) | 951 | 14.4% |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 4 | 0.1% |
| 58 | 22 | 0.3% |
| 59 | 40 | 0.6% |
| 60 | 77 | 1.2% |
| 61 | 171 | 2.6% |
| 62 | 264 | 4.0% |
| 63 | 371 | 5.6% |
| 64 | 501 | 7.6% |
| Value | Count | Frequency (%) |
| 101 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 99 | 2 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 3 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
| 94 | 4 | 0.1% |
| 93 | 2 | < 0.1% |
| 92 | 2 | < 0.1% |
| access_to_resources | distance_from_home | exam_score | extracurricular_activities | family_income | gender | hours_studied | internet_access | learning_disabilities | motivation_level | parental_education_level | parental_involvement | peer_influence | physical_activity | previous_scores | school_type | sleep_hours | teacher_quality | tutoring_sessions |
|---|
| access_to_resources | 1.000 | 0.006 | 0.131 | 0.006 | 0.010 | 0.000 | 0.033 | 0.000 | 0.000 | 0.000 | 0.000 | 0.021 | 0.000 | 0.013 | 0.018 | 0.026 | 0.006 | 0.004 | 0.001 |
|---|
| distance_from_home | 0.006 | 1.000 | 0.080 | 0.019 | 0.007 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.007 | 0.018 | 0.000 | 0.000 | 0.000 | 0.000 |
|---|
| exam_score | 0.131 | 0.080 | 1.000 | 0.064 | 0.061 | 0.000 | 0.481 | 0.049 | 0.120 | 0.072 | 0.078 | 0.118 | 0.069 | 0.029 | 0.192 | 0.000 | -0.008 | 0.062 | 0.164 |
|---|
| extracurricular_activities | 0.006 | 0.019 | 0.064 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 0.029 | 0.000 | 0.027 | 0.000 | 0.000 | 0.012 | 0.011 |
|---|
| family_income | 0.010 | 0.007 | 0.061 | 0.000 | 1.000 | 0.000 | 0.019 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.027 | 0.009 | 0.000 | 0.000 | 0.009 | 0.000 |
|---|
| gender | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.008 | 0.008 | 0.000 | 0.000 | 0.018 | 0.008 | 0.000 | 0.028 | 0.000 | 0.011 | 0.000 | 0.000 |
|---|
| hours_studied | 0.033 | 0.000 | 0.481 | 0.000 | 0.019 | 0.000 | 1.000 | 0.000 | 0.033 | 0.000 | 0.015 | 0.023 | 0.029 | -0.003 | 0.024 | 0.000 | 0.011 | 0.000 | -0.013 |
|---|
| internet_access | 0.000 | 0.000 | 0.049 | 0.000 | 0.005 | 0.008 | 0.000 | 1.000 | 0.000 | 0.011 | 0.018 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.016 | 0.000 | 0.018 |
|---|
| learning_disabilities | 0.000 | 0.000 | 0.120 | 0.000 | 0.000 | 0.008 | 0.033 | 0.000 | 1.000 | 0.008 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.005 | 0.000 | 0.016 |
|---|
| motivation_level | 0.000 | 0.000 | 0.072 | 0.000 | 0.000 | 0.000 | 0.000 | 0.011 | 0.008 | 1.000 | 0.000 | 0.005 | 0.007 | 0.021 | 0.009 | 0.000 | 0.000 | 0.026 | 0.009 |
|---|
| parental_education_level | 0.000 | 0.000 | 0.078 | 0.000 | 0.000 | 0.000 | 0.015 | 0.018 | 0.000 | 0.000 | 1.000 | 0.000 | 0.016 | 0.006 | 0.000 | 0.018 | 0.000 | 0.000 | 0.000 |
|---|
| parental_involvement | 0.021 | 0.000 | 0.118 | 0.018 | 0.000 | 0.018 | 0.023 | 0.000 | 0.000 | 0.005 | 0.000 | 1.000 | 0.010 | 0.016 | 0.000 | 0.013 | 0.000 | 0.000 | 0.019 |
|---|
| peer_influence | 0.000 | 0.000 | 0.069 | 0.029 | 0.000 | 0.008 | 0.029 | 0.000 | 0.000 | 0.007 | 0.016 | 0.010 | 1.000 | 0.000 | 0.010 | 0.000 | 0.016 | 0.000 | 0.000 |
|---|
| physical_activity | 0.013 | 0.007 | 0.029 | 0.000 | 0.027 | 0.000 | -0.003 | 0.000 | 0.000 | 0.021 | 0.006 | 0.016 | 0.000 | 1.000 | -0.008 | 0.000 | 0.001 | 0.018 | 0.008 |
|---|
| previous_scores | 0.018 | 0.018 | 0.192 | 0.027 | 0.009 | 0.028 | 0.024 | 0.000 | 0.000 | 0.009 | 0.000 | 0.000 | 0.010 | -0.008 | 1.000 | 0.000 | -0.022 | 0.031 | -0.018 |
|---|
| school_type | 0.026 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 0.013 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
|---|
| sleep_hours | 0.006 | 0.000 | -0.008 | 0.000 | 0.000 | 0.011 | 0.011 | 0.016 | 0.005 | 0.000 | 0.000 | 0.000 | 0.016 | 0.001 | -0.022 | 0.000 | 1.000 | 0.018 | -0.006 |
|---|
| teacher_quality | 0.004 | 0.000 | 0.062 | 0.012 | 0.009 | 0.000 | 0.000 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 0.018 | 0.031 | 0.000 | 0.018 | 1.000 | 0.012 |
|---|
| tutoring_sessions | 0.001 | 0.000 | 0.164 | 0.011 | 0.000 | 0.000 | -0.013 | 0.018 | 0.016 | 0.009 | 0.000 | 0.019 | 0.000 | 0.008 | -0.018 | 0.000 | -0.006 | 0.012 | 1.000 |
|---|
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
| hours_studied | parental_involvement | access_to_resources | extracurricular_activities | sleep_hours | previous_scores | motivation_level | internet_access | tutoring_sessions | family_income | teacher_quality | school_type | peer_influence | physical_activity | learning_disabilities | parental_education_level | distance_from_home | gender | exam_score |
|---|
| 0 | 23 | Low | High | No | 7 | 73 | 1 | Yes | 0 | Low | Medium | Public | Positive | 3 | No | High School | Near | Male | 67 |
|---|
| 1 | 19 | Low | Medium | No | 8 | 59 | 1 | Yes | 2 | Medium | Medium | Public | Negative | 4 | No | College | Moderate | Female | 61 |
|---|
| 2 | 24 | Medium | Medium | Yes | 7 | 91 | 2 | Yes | 2 | Medium | Medium | Public | Neutral | 4 | No | Postgraduate | Near | Male | 74 |
|---|
| 3 | 29 | Low | Medium | Yes | 8 | 98 | 2 | Yes | 1 | Medium | Medium | Public | Negative | 4 | No | High School | Moderate | Male | 71 |
|---|
| 4 | 19 | Medium | Medium | Yes | 6 | 65 | 2 | Yes | 3 | Medium | High | Public | Neutral | 4 | No | College | Near | Female | 70 |
|---|
| 5 | 19 | Medium | Medium | Yes | 8 | 89 | 2 | Yes | 3 | Medium | Medium | Public | Positive | 3 | No | Postgraduate | Near | Male | 71 |
|---|
| 6 | 29 | Medium | Low | Yes | 7 | 68 | 1 | Yes | 1 | Low | Medium | Private | Neutral | 2 | No | High School | Moderate | Male | 67 |
|---|
| 7 | 25 | Low | High | Yes | 6 | 50 | 2 | Yes | 1 | High | High | Public | Negative | 2 | No | High School | Far | Male | 66 |
|---|
| 8 | 17 | Medium | High | No | 6 | 80 | 0 | Yes | 0 | Medium | Low | Private | Neutral | 1 | No | College | Near | Male | 69 |
|---|
| 9 | 23 | Medium | Medium | Yes | 8 | 71 | 2 | Yes | 0 | High | High | Public | Positive | 5 | No | High School | Moderate | Male | 72 |
|---|
| hours_studied | parental_involvement | access_to_resources | extracurricular_activities | sleep_hours | previous_scores | motivation_level | internet_access | tutoring_sessions | family_income | teacher_quality | school_type | peer_influence | physical_activity | learning_disabilities | parental_education_level | distance_from_home | gender | exam_score |
|---|
| 6597 | 16 | High | Medium | Yes | 6 | 72 | 0 | Yes | 0 | High | High | Public | Negative | 2 | No | Postgraduate | Near | Female | 70 |
|---|
| 6598 | 9 | Low | Medium | Yes | 6 | 64 | 2 | Yes | 1 | High | Medium | Public | Neutral | 2 | No | High School | Near | Female | 64 |
|---|
| 6599 | 30 | Medium | Low | No | 5 | 52 | 1 | No | 3 | High | Medium | Private | Neutral | 2 | No | Postgraduate | Moderate | Female | 70 |
|---|
| 6600 | 12 | Medium | Low | Yes | 4 | 54 | 2 | Yes | 2 | Medium | High | Private | Neutral | 3 | No | High School | Near | Female | 67 |
|---|
| 6601 | 20 | Medium | Low | No | 6 | 51 | 1 | Yes | 2 | Medium | Medium | Public | Neutral | 4 | No | High School | Moderate | Female | 65 |
|---|
| 6602 | 25 | High | Medium | No | 7 | 76 | 2 | Yes | 1 | High | Medium | Public | Positive | 2 | No | High School | Near | Female | 68 |
|---|
| 6603 | 23 | High | Medium | No | 8 | 81 | 2 | Yes | 3 | Low | High | Public | Positive | 2 | No | High School | Near | Female | 69 |
|---|
| 6604 | 20 | Medium | Low | Yes | 6 | 65 | 1 | Yes | 3 | Low | Medium | Public | Negative | 2 | No | Postgraduate | Near | Female | 68 |
|---|
| 6605 | 10 | High | High | Yes | 6 | 91 | 0 | Yes | 2 | Low | Medium | Private | Positive | 3 | No | High School | Far | Female | 68 |
|---|
| 6606 | 15 | Medium | Low | Yes | 9 | 94 | 2 | Yes | 0 | Medium | Medium | Public | Positive | 4 | No | Postgraduate | Near | Male | 64 |
|---|
\ No newline at end of file