Overview
Brought to you by YData
Dataset statistics
Number of variables | 16 |
---|---|
Number of observations | 920 |
Missing cells | 1759 |
Missing cells (%) | 11.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 445.8 KiB |
Average record size in memory | 496.1 B |
Variable types
Numeric | 6 |
---|---|
Categorical | 8 |
Boolean | 2 |
dataset is highly overall correlated with id | High correlation |
id is highly overall correlated with dataset | High correlation |
trestbps has 59 (6.4%) missing values | Missing |
chol has 30 (3.3%) missing values | Missing |
fbs has 90 (9.8%) missing values | Missing |
thalch has 55 (6.0%) missing values | Missing |
exang has 55 (6.0%) missing values | Missing |
oldpeak has 62 (6.7%) missing values | Missing |
slope has 309 (33.6%) missing values | Missing |
ca has 611 (66.4%) missing values | Missing |
thal has 486 (52.8%) missing values | Missing |
id is uniformly distributed | Uniform |
id has unique values | Unique |
chol has 172 (18.7%) zeros | Zeros |
oldpeak has 370 (40.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-29 20:12:38.173456 |
---|---|
Analysis finished | 2024-11-29 20:12:50.804639 |
Duration | 12.63 seconds |
Software version | ydata-profiling vv4.12.0 |
Download configuration | config.json |
Variables
id
Real number (ℝ)
High correlation  Uniform  Unique 
Distinct | 920 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 460.5 |
Minimum | 1 |
---|---|
Maximum | 920 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 46.95 |
Q1 | 230.75 |
median | 460.5 |
Q3 | 690.25 |
95-th percentile | 874.05 |
Maximum | 920 |
Range | 919 |
Interquartile range (IQR) | 459.5 |
Descriptive statistics
Standard deviation | 265.72542 |
---|---|
Coefficient of variation (CV) | 0.57703675 |
Kurtosis | -1.2 |
Mean | 460.5 |
Median Absolute Deviation (MAD) | 230 |
Skewness | 0 |
Sum | 423660 |
Variance | 70610 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
605 | 1 | 0.1% |
607 | 1 | 0.1% |
608 | 1 | 0.1% |
609 | 1 | 0.1% |
610 | 1 | 0.1% |
611 | 1 | 0.1% |
612 | 1 | 0.1% |
613 | 1 | 0.1% |
614 | 1 | 0.1% |
Other values (910) | 910 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
920 | 1 | |
919 | 1 | |
918 | 1 | |
917 | 1 | |
916 | 1 | |
915 | 1 | |
914 | 1 | |
913 | 1 | |
912 | 1 | |
911 | 1 |
age
Real number (ℝ)
Distinct | 50 |
---|---|
Distinct (%) | 5.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 53.51087 |
Minimum | 28 |
---|---|
Maximum | 77 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | 28 |
---|---|
5-th percentile | 37 |
Q1 | 47 |
median | 54 |
Q3 | 60 |
95-th percentile | 68 |
Maximum | 77 |
Range | 49 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 9.4246852 |
---|---|
Coefficient of variation (CV) | 0.17612656 |
Kurtosis | -0.38292982 |
Mean | 53.51087 |
Median Absolute Deviation (MAD) | 6.5 |
Skewness | -0.19599386 |
Sum | 49230 |
Variance | 88.824691 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
54 | 51 | 5.5% |
58 | 43 | 4.7% |
55 | 41 | 4.5% |
56 | 38 | 4.1% |
57 | 38 | 4.1% |
52 | 36 | 3.9% |
62 | 35 | 3.8% |
51 | 35 | 3.8% |
59 | 35 | 3.8% |
53 | 33 | 3.6% |
Other values (40) | 535 |
Value | Count | Frequency (%) |
28 | 1 | 0.1% |
29 | 3 | 0.3% |
30 | 1 | 0.1% |
31 | 2 | 0.2% |
32 | 5 | |
33 | 2 | 0.2% |
34 | 7 | |
35 | 11 | |
36 | 6 | |
37 | 11 |
Value | Count | Frequency (%) |
77 | 2 | 0.2% |
76 | 2 | 0.2% |
75 | 3 | 0.3% |
74 | 7 | |
73 | 1 | 0.1% |
72 | 4 | 0.4% |
71 | 5 | 0.5% |
70 | 7 | |
69 | 13 | |
68 | 10 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.4217391 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Male |
---|---|
2nd row | Male |
3rd row | Male |
4th row | Male |
5th row | Female |
Common Values
Value | Count | Frequency (%) |
Male | 726 | |
Female | 194 | 21.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
male | 726 | |
female | 194 | 21.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 1114 | |
a | 920 | |
l | 920 | |
M | 726 | |
F | 194 | 4.8% |
m | 194 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4068 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 1114 | |
a | 920 | |
l | 920 | |
M | 726 | |
F | 194 | 4.8% |
m | 194 | 4.8% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4068 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 1114 | |
a | 920 | |
l | 920 | |
M | 726 | |
F | 194 | 4.8% |
m | 194 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4068 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 1114 | |
a | 920 | |
l | 920 | |
M | 726 | |
F | 194 | 4.8% |
m | 194 | 4.8% |
dataset
Categorical
High correlation 
Distinct | 4 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 59.9 KiB |
Cleveland | |
---|---|
Hungary | |
VA Long Beach | |
Switzerland |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 9.5 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Cleveland |
---|---|
2nd row | Cleveland |
3rd row | Cleveland |
4th row | Cleveland |
5th row | Cleveland |
Common Values
Value | Count | Frequency (%) |
Cleveland | 304 | |
Hungary | 293 | |
VA Long Beach | 200 | |
Switzerland | 123 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
cleveland | 304 | |
hungary | 293 | |
va | 200 | |
long | 200 | |
beach | 200 | |
switzerland | 123 |
Most occurring characters
Value | Count | Frequency (%) |
e | 931 | 10.7% |
a | 920 | 10.5% |
n | 920 | 10.5% |
l | 731 | 8.4% |
g | 493 | 5.6% |
d | 427 | 4.9% |
r | 416 | 4.8% |
400 | 4.6% | |
C | 304 | 3.5% |
v | 304 | 3.5% |
Other values (15) | 2894 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 8740 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 931 | 10.7% |
a | 920 | 10.5% |
n | 920 | 10.5% |
l | 731 | 8.4% |
g | 493 | 5.6% |
d | 427 | 4.9% |
r | 416 | 4.8% |
400 | 4.6% | |
C | 304 | 3.5% |
v | 304 | 3.5% |
Other values (15) | 2894 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 8740 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 931 | 10.7% |
a | 920 | 10.5% |
n | 920 | 10.5% |
l | 731 | 8.4% |
g | 493 | 5.6% |
d | 427 | 4.9% |
r | 416 | 4.8% |
400 | 4.6% | |
C | 304 | 3.5% |
v | 304 | 3.5% |
Other values (15) | 2894 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 8740 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 931 | 10.7% |
a | 920 | 10.5% |
n | 920 | 10.5% |
l | 731 | 8.4% |
g | 493 | 5.6% |
d | 427 | 4.9% |
r | 416 | 4.8% |
400 | 4.6% | |
C | 304 | 3.5% |
v | 304 | 3.5% |
Other values (15) | 2894 |
cp
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.5 KiB |
asymptomatic | |
---|---|
non-anginal | |
atypical angina | |
typical angina | 46 |
Length
Max length | 15 |
---|---|
Median length | 12 |
Mean length | 12.445652 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | typical angina |
---|---|
2nd row | asymptomatic |
3rd row | asymptomatic |
4th row | non-anginal |
5th row | atypical angina |
Common Values
Value | Count | Frequency (%) |
asymptomatic | 496 | |
non-anginal | 204 | |
atypical angina | 174 | 18.9% |
typical angina | 46 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
asymptomatic | 496 | |
angina | 220 | |
non-anginal | 204 | |
atypical | 174 | 15.3% |
typical | 46 | 4.0% |
Most occurring characters
Value | Count | Frequency (%) |
a | 2234 | |
n | 1256 | |
t | 1212 | |
i | 1140 | |
m | 992 | |
y | 716 | 6.3% |
p | 716 | 6.3% |
c | 716 | 6.3% |
o | 700 | 6.1% |
s | 496 | 4.3% |
Other values (4) | 1272 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 11450 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
a | 2234 | |
n | 1256 | |
t | 1212 | |
i | 1140 | |
m | 992 | |
y | 716 | 6.3% |
p | 716 | 6.3% |
c | 716 | 6.3% |
o | 700 | 6.1% |
s | 496 | 4.3% |
Other values (4) | 1272 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 11450 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
a | 2234 | |
n | 1256 | |
t | 1212 | |
i | 1140 | |
m | 992 | |
y | 716 | 6.3% |
p | 716 | 6.3% |
c | 716 | 6.3% |
o | 700 | 6.1% |
s | 496 | 4.3% |
Other values (4) | 1272 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 11450 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
a | 2234 | |
n | 1256 | |
t | 1212 | |
i | 1140 | |
m | 992 | |
y | 716 | 6.3% |
p | 716 | 6.3% |
c | 716 | 6.3% |
o | 700 | 6.1% |
s | 496 | 4.3% |
Other values (4) | 1272 |
trestbps
Real number (ℝ)
Missing 
Distinct | 61 |
---|---|
Distinct (%) | 7.1% |
Missing | 59 |
Missing (%) | 6.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 132.1324 |
Minimum | 0 |
---|---|
Maximum | 200 |
Zeros | 1 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 105 |
Q1 | 120 |
median | 130 |
Q3 | 140 |
95-th percentile | 160 |
Maximum | 200 |
Range | 200 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 19.06607 |
---|---|
Coefficient of variation (CV) | 0.14429518 |
Kurtosis | 2.9586644 |
Mean | 132.1324 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.21333447 |
Sum | 113766 |
Variance | 363.51501 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
120 | 131 | |
130 | 115 | |
140 | 102 | 11.1% |
110 | 59 | 6.4% |
150 | 56 | 6.1% |
160 | 50 | 5.4% |
125 | 29 | 3.2% |
115 | 19 | 2.1% |
135 | 18 | 2.0% |
128 | 17 | 1.8% |
Other values (51) | 265 | |
(Missing) | 59 | 6.4% |
Value | Count | Frequency (%) |
0 | 1 | 0.1% |
80 | 1 | 0.1% |
92 | 1 | 0.1% |
94 | 2 | 0.2% |
95 | 6 | 0.7% |
96 | 1 | 0.1% |
98 | 1 | 0.1% |
100 | 15 | |
101 | 1 | 0.1% |
102 | 3 | 0.3% |
Value | Count | Frequency (%) |
200 | 4 | 0.4% |
192 | 1 | 0.1% |
190 | 2 | 0.2% |
185 | 1 | 0.1% |
180 | 12 | |
178 | 3 | 0.3% |
174 | 1 | 0.1% |
172 | 2 | 0.2% |
170 | 14 | |
165 | 2 | 0.2% |
chol
Real number (ℝ)
Missing  Zeros 
Distinct | 217 |
---|---|
Distinct (%) | 24.4% |
Missing | 30 |
Missing (%) | 3.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 199.13034 |
Minimum | 0 |
---|---|
Maximum | 603 |
Zeros | 172 |
Zeros (%) | 18.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 175 |
median | 223 |
Q3 | 268 |
95-th percentile | 334.1 |
Maximum | 603 |
Range | 603 |
Interquartile range (IQR) | 93 |
Descriptive statistics
Standard deviation | 110.78081 |
---|---|
Coefficient of variation (CV) | 0.55632312 |
Kurtosis | 0.062272688 |
Mean | 199.13034 |
Median Absolute Deviation (MAD) | 46 |
Skewness | -0.61383609 |
Sum | 177226 |
Variance | 12272.388 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 172 | 18.7% |
220 | 10 | 1.1% |
254 | 10 | 1.1% |
211 | 9 | 1.0% |
223 | 9 | 1.0% |
204 | 9 | 1.0% |
230 | 9 | 1.0% |
216 | 9 | 1.0% |
219 | 9 | 1.0% |
240 | 8 | 0.9% |
Other values (207) | 636 | |
(Missing) | 30 | 3.3% |
Value | Count | Frequency (%) |
0 | 172 | |
85 | 1 | 0.1% |
100 | 2 | 0.2% |
117 | 1 | 0.1% |
126 | 1 | 0.1% |
129 | 1 | 0.1% |
131 | 1 | 0.1% |
132 | 1 | 0.1% |
139 | 1 | 0.1% |
141 | 1 | 0.1% |
Value | Count | Frequency (%) |
603 | 1 | |
564 | 1 | |
529 | 1 | |
518 | 1 | |
491 | 1 | |
468 | 1 | |
466 | 1 | |
458 | 1 | |
417 | 1 | |
412 | 1 |
fbs
Boolean
Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 90 |
Missing (%) | 9.8% |
Memory size | 29.4 KiB |
False | |
---|---|
True | |
(Missing) |
Value | Count | Frequency (%) |
False | 692 | |
True | 138 | 15.0% |
(Missing) | 90 | 9.8% |
restecg
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 2 |
Missing (%) | 0.2% |
Memory size | 59.9 KiB |
normal | |
---|---|
lv hypertrophy | |
st-t abnormality |
Length
Max length | 16 |
---|---|
Median length | 6 |
Mean length | 9.5882353 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | lv hypertrophy |
---|---|
2nd row | lv hypertrophy |
3rd row | lv hypertrophy |
4th row | normal |
5th row | lv hypertrophy |
Common Values
Value | Count | Frequency (%) |
normal | 551 | |
lv hypertrophy | 188 | 20.4% |
st-t abnormality | 179 | 19.5% |
(Missing) | 2 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
normal | 551 | |
lv | 188 | 14.6% |
hypertrophy | 188 | 14.6% |
st-t | 179 | 13.9% |
abnormality | 179 | 13.9% |
Most occurring characters
Value | Count | Frequency (%) |
r | 1106 | |
l | 918 | |
o | 918 | |
a | 909 | |
n | 730 | |
m | 730 | |
t | 725 | |
y | 555 | 6.3% |
p | 376 | 4.3% |
h | 376 | 4.3% |
Other values (7) | 1459 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 8802 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
r | 1106 | |
l | 918 | |
o | 918 | |
a | 909 | |
n | 730 | |
m | 730 | |
t | 725 | |
y | 555 | 6.3% |
p | 376 | 4.3% |
h | 376 | 4.3% |
Other values (7) | 1459 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 8802 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
r | 1106 | |
l | 918 | |
o | 918 | |
a | 909 | |
n | 730 | |
m | 730 | |
t | 725 | |
y | 555 | 6.3% |
p | 376 | 4.3% |
h | 376 | 4.3% |
Other values (7) | 1459 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 8802 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
r | 1106 | |
l | 918 | |
o | 918 | |
a | 909 | |
n | 730 | |
m | 730 | |
t | 725 | |
y | 555 | 6.3% |
p | 376 | 4.3% |
h | 376 | 4.3% |
Other values (7) | 1459 |
thalch
Real number (ℝ)
Missing 
Distinct | 119 |
---|---|
Distinct (%) | 13.8% |
Missing | 55 |
Missing (%) | 6.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 137.54566 |
Minimum | 60 |
---|---|
Maximum | 202 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | 60 |
---|---|
5-th percentile | 95 |
Q1 | 120 |
median | 140 |
Q3 | 157 |
95-th percentile | 178 |
Maximum | 202 |
Range | 142 |
Interquartile range (IQR) | 37 |
Descriptive statistics
Standard deviation | 25.926276 |
---|---|
Coefficient of variation (CV) | 0.18849214 |
Kurtosis | -0.47972463 |
Mean | 137.54566 |
Median Absolute Deviation (MAD) | 20 |
Skewness | -0.21111858 |
Sum | 118977 |
Variance | 672.17181 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
150 | 43 | 4.7% |
140 | 41 | 4.5% |
120 | 35 | 3.8% |
130 | 30 | 3.3% |
160 | 26 | 2.8% |
110 | 21 | 2.3% |
170 | 20 | 2.2% |
125 | 20 | 2.2% |
122 | 16 | 1.7% |
145 | 14 | 1.5% |
Other values (109) | 599 | |
(Missing) | 55 | 6.0% |
Value | Count | Frequency (%) |
60 | 1 | |
63 | 1 | |
67 | 1 | |
69 | 1 | |
70 | 1 | |
71 | 1 | |
72 | 2 | |
73 | 1 | |
77 | 1 | |
78 | 1 |
Value | Count | Frequency (%) |
202 | 1 | 0.1% |
195 | 1 | 0.1% |
194 | 1 | 0.1% |
192 | 1 | 0.1% |
190 | 2 | |
188 | 2 | |
187 | 1 | 0.1% |
186 | 2 | |
185 | 4 | |
184 | 4 |
exang
Boolean
Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 55 |
Missing (%) | 6.0% |
Memory size | 30.2 KiB |
False | |
---|---|
True | |
(Missing) |
Value | Count | Frequency (%) |
False | 528 | |
True | 337 | |
(Missing) | 55 | 6.0% |
oldpeak
Real number (ℝ)
Missing  Zeros 
Distinct | 53 |
---|---|
Distinct (%) | 6.2% |
Missing | 62 |
Missing (%) | 6.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.87878788 |
Minimum | -2.6 |
---|---|
Maximum | 6.2 |
Zeros | 370 |
Zeros (%) | 40.2% |
Negative | 12 |
Negative (%) | 1.3% |
Memory size | 7.3 KiB |
Quantile statistics
Minimum | -2.6 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0.5 |
Q3 | 1.5 |
95-th percentile | 3 |
Maximum | 6.2 |
Range | 8.8 |
Interquartile range (IQR) | 1.5 |
Descriptive statistics
Standard deviation | 1.0912262 |
---|---|
Coefficient of variation (CV) | 1.2417402 |
Kurtosis | 1.1270692 |
Mean | 0.87878788 |
Median Absolute Deviation (MAD) | 0.5 |
Skewness | 1.0414266 |
Sum | 754 |
Variance | 1.1907747 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 370 | |
1 | 83 | 9.0% |
2 | 76 | 8.3% |
1.5 | 48 | 5.2% |
3 | 28 | 3.0% |
0.5 | 19 | 2.1% |
1.2 | 17 | 1.8% |
2.5 | 16 | 1.7% |
0.8 | 15 | 1.6% |
1.4 | 15 | 1.6% |
Other values (43) | 171 | |
(Missing) | 62 | 6.7% |
Value | Count | Frequency (%) |
-2.6 | 1 | |
-2 | 1 | |
-1.5 | 1 | |
-1.1 | 1 | |
-1 | 2 | |
-0.9 | 1 | |
-0.8 | 1 | |
-0.7 | 1 | |
-0.5 | 2 | |
-0.1 | 1 |
Value | Count | Frequency (%) |
6.2 | 1 | 0.1% |
5.6 | 1 | 0.1% |
5 | 1 | 0.1% |
4.4 | 1 | 0.1% |
4.2 | 2 | 0.2% |
4 | 8 | |
3.8 | 1 | 0.1% |
3.7 | 1 | 0.1% |
3.6 | 4 | |
3.5 | 2 | 0.2% |
slope
Categorical
Missing 
Distinct | 3 |
---|---|
Distinct (%) | 0.5% |
Missing | 309 |
Missing (%) | 33.6% |
Memory size | 54.8 KiB |
flat | |
---|---|
upsloping | |
downsloping |
Length
Max length | 11 |
---|---|
Median length | 4 |
Mean length | 6.3829787 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | downsloping |
---|---|
2nd row | flat |
3rd row | flat |
4th row | downsloping |
5th row | upsloping |
Common Values
Value | Count | Frequency (%) |
flat | 345 | |
upsloping | 203 | |
downsloping | 63 | 6.8% |
(Missing) | 309 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
flat | 345 | |
upsloping | 203 | |
downsloping | 63 | 10.3% |
Most occurring characters
Value | Count | Frequency (%) |
l | 611 | |
p | 469 | |
f | 345 | |
a | 345 | |
t | 345 | |
o | 329 | |
n | 329 | |
s | 266 | |
i | 266 | |
g | 266 | |
Other values (3) | 329 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3900 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
l | 611 | |
p | 469 | |
f | 345 | |
a | 345 | |
t | 345 | |
o | 329 | |
n | 329 | |
s | 266 | |
i | 266 | |
g | 266 | |
Other values (3) | 329 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3900 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
l | 611 | |
p | 469 | |
f | 345 | |
a | 345 | |
t | 345 | |
o | 329 | |
n | 329 | |
s | 266 | |
i | 266 | |
g | 266 | |
Other values (3) | 329 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3900 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
l | 611 | |
p | 469 | |
f | 345 | |
a | 345 | |
t | 345 | |
o | 329 | |
n | 329 | |
s | 266 | |
i | 266 | |
g | 266 | |
Other values (3) | 329 |
ca
Categorical
Missing 
Distinct | 4 |
---|---|
Distinct (%) | 1.3% |
Missing | 611 |
Missing (%) | 66.4% |
Memory size | 51.6 KiB |
0.0 | |
---|---|
1.0 | |
2.0 | |
3.0 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0.0 |
---|---|
2nd row | 3.0 |
3rd row | 2.0 |
4th row | 0.0 |
5th row | 0.0 |
Common Values
Value | Count | Frequency (%) |
0.0 | 181 | 19.7% |
1.0 | 67 | 7.3% |
2.0 | 41 | 4.5% |
3.0 | 20 | 2.2% |
(Missing) | 611 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0.0 | 181 | |
1.0 | 67 | 21.7% |
2.0 | 41 | 13.3% |
3.0 | 20 | 6.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 490 | |
. | 309 | |
1 | 67 | 7.2% |
2 | 41 | 4.4% |
3 | 20 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 927 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 490 | |
. | 309 | |
1 | 67 | 7.2% |
2 | 41 | 4.4% |
3 | 20 | 2.2% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 927 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 490 | |
. | 309 | |
1 | 67 | 7.2% |
2 | 41 | 4.4% |
3 | 20 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 927 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 490 | |
. | 309 | |
1 | 67 | 7.2% |
2 | 41 | 4.4% |
3 | 20 | 2.2% |
thal
Categorical
Missing 
Distinct | 3 |
---|---|
Distinct (%) | 0.7% |
Missing | 486 |
Missing (%) | 52.8% |
Memory size | 55.7 KiB |
normal | |
---|---|
reversable defect | |
fixed defect |
Length
Max length | 17 |
---|---|
Median length | 12 |
Mean length | 11.502304 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | fixed defect |
---|---|
2nd row | normal |
3rd row | reversable defect |
4th row | normal |
5th row | normal |
Common Values
Value | Count | Frequency (%) |
normal | 196 | |
reversable defect | 192 | 20.9% |
fixed defect | 46 | 5.0% |
(Missing) | 486 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
defect | 238 | |
normal | 196 | |
reversable | 192 | |
fixed | 46 | 6.8% |
Most occurring characters
Value | Count | Frequency (%) |
e | 1098 | |
r | 580 | |
a | 388 | 7.8% |
l | 388 | 7.8% |
f | 284 | 5.7% |
d | 284 | 5.7% |
t | 238 | 4.8% |
c | 238 | 4.8% |
238 | 4.8% | |
n | 196 | 3.9% |
Other values (7) | 1060 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 4992 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 1098 | |
r | 580 | |
a | 388 | 7.8% |
l | 388 | 7.8% |
f | 284 | 5.7% |
d | 284 | 5.7% |
t | 238 | 4.8% |
c | 238 | 4.8% |
238 | 4.8% | |
n | 196 | 3.9% |
Other values (7) | 1060 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 4992 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 1098 | |
r | 580 | |
a | 388 | 7.8% |
l | 388 | 7.8% |
f | 284 | 5.7% |
d | 284 | 5.7% |
t | 238 | 4.8% |
c | 238 | 4.8% |
238 | 4.8% | |
n | 196 | 3.9% |
Other values (7) | 1060 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 4992 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 1098 | |
r | 580 | |
a | 388 | 7.8% |
l | 388 | 7.8% |
f | 284 | 5.7% |
d | 284 | 5.7% |
t | 238 | 4.8% |
c | 238 | 4.8% |
238 | 4.8% | |
n | 196 | 3.9% |
Other values (7) | 1060 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 920 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 920 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 920 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
0 | 411 | |
1 | 265 | |
2 | 109 | 11.8% |
3 | 107 | 11.6% |
4 | 28 | 3.0% |
Interactions
Correlations
age | ca | chol | cp | dataset | exang | fbs | id | num | oldpeak | restecg | sex | slope | thal | thalch | trestbps | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
age | 1.000 | 0.218 | -0.037 | 0.149 | 0.257 | 0.185 | 0.219 | 0.238 | 0.162 | 0.288 | 0.162 | 0.000 | 0.120 | 0.130 | -0.348 | 0.259 |
ca | 0.218 | 1.000 | 0.067 | 0.147 | 0.086 | 0.170 | 0.126 | 0.047 | 0.330 | 0.175 | 0.078 | 0.089 | 0.078 | 0.159 | 0.121 | 0.036 |
chol | -0.037 | 0.067 | 1.000 | 0.122 | 0.494 | 0.097 | 0.072 | -0.308 | 0.174 | 0.048 | 0.153 | 0.208 | 0.062 | 0.145 | 0.175 | 0.104 |
cp | 0.149 | 0.147 | 0.122 | 1.000 | 0.204 | 0.446 | 0.065 | 0.260 | 0.307 | 0.198 | 0.087 | 0.195 | 0.185 | 0.255 | 0.221 | 0.044 |
dataset | 0.257 | 0.086 | 0.494 | 0.204 | 1.000 | 0.248 | 0.283 | 0.896 | 0.298 | 0.259 | 0.438 | 0.291 | 0.294 | 0.252 | 0.253 | 0.098 |
exang | 0.185 | 0.170 | 0.097 | 0.446 | 0.248 | 1.000 | 0.000 | 0.357 | 0.463 | 0.441 | 0.077 | 0.175 | 0.343 | 0.337 | 0.390 | 0.143 |
fbs | 0.219 | 0.126 | 0.072 | 0.065 | 0.283 | 0.000 | 1.000 | 0.283 | 0.158 | 0.027 | 0.167 | 0.078 | 0.092 | 0.131 | 0.000 | 0.163 |
id | 0.238 | 0.047 | -0.308 | 0.260 | 0.896 | 0.357 | 0.283 | 1.000 | 0.338 | 0.050 | 0.439 | 0.339 | 0.304 | 0.275 | -0.474 | 0.057 |
num | 0.162 | 0.330 | 0.174 | 0.307 | 0.298 | 0.463 | 0.158 | 0.338 | 1.000 | 0.266 | 0.131 | 0.302 | 0.281 | 0.350 | 0.210 | 0.081 |
oldpeak | 0.288 | 0.175 | 0.048 | 0.198 | 0.259 | 0.441 | 0.027 | 0.050 | 0.266 | 1.000 | 0.115 | 0.118 | 0.361 | 0.185 | -0.188 | 0.161 |
restecg | 0.162 | 0.078 | 0.153 | 0.087 | 0.438 | 0.077 | 0.167 | 0.439 | 0.131 | 0.115 | 1.000 | 0.057 | 0.066 | 0.152 | 0.116 | 0.078 |
sex | 0.000 | 0.089 | 0.208 | 0.195 | 0.291 | 0.175 | 0.078 | 0.339 | 0.302 | 0.118 | 0.057 | 1.000 | 0.111 | 0.375 | 0.169 | 0.000 |
slope | 0.120 | 0.078 | 0.062 | 0.185 | 0.294 | 0.343 | 0.092 | 0.304 | 0.281 | 0.361 | 0.066 | 0.111 | 1.000 | 0.225 | 0.296 | 0.087 |
thal | 0.130 | 0.159 | 0.145 | 0.255 | 0.252 | 0.337 | 0.131 | 0.275 | 0.350 | 0.185 | 0.152 | 0.375 | 0.225 | 1.000 | 0.284 | 0.030 |
thalch | -0.348 | 0.121 | 0.175 | 0.221 | 0.253 | 0.390 | 0.000 | -0.474 | 0.210 | -0.188 | 0.116 | 0.169 | 0.296 | 0.284 | 1.000 | -0.090 |
trestbps | 0.259 | 0.036 | 0.104 | 0.044 | 0.098 | 0.143 | 0.163 | 0.057 | 0.081 | 0.161 | 0.078 | 0.000 | 0.087 | 0.030 | -0.090 | 1.000 |
Missing values
Sample
id | age | sex | dataset | cp | trestbps | chol | fbs | restecg | thalch | exang | oldpeak | slope | ca | thal | num | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 63 | Male | Cleveland | typical angina | 145.0 | 233.0 | True | lv hypertrophy | 150.0 | False | 2.3 | downsloping | 0.0 | fixed defect | 0 |
1 | 2 | 67 | Male | Cleveland | asymptomatic | 160.0 | 286.0 | False | lv hypertrophy | 108.0 | True | 1.5 | flat | 3.0 | normal | 2 |
2 | 3 | 67 | Male | Cleveland | asymptomatic | 120.0 | 229.0 | False | lv hypertrophy | 129.0 | True | 2.6 | flat | 2.0 | reversable defect | 1 |
3 | 4 | 37 | Male | Cleveland | non-anginal | 130.0 | 250.0 | False | normal | 187.0 | False | 3.5 | downsloping | 0.0 | normal | 0 |
4 | 5 | 41 | Female | Cleveland | atypical angina | 130.0 | 204.0 | False | lv hypertrophy | 172.0 | False | 1.4 | upsloping | 0.0 | normal | 0 |
5 | 6 | 56 | Male | Cleveland | atypical angina | 120.0 | 236.0 | False | normal | 178.0 | False | 0.8 | upsloping | 0.0 | normal | 0 |
6 | 7 | 62 | Female | Cleveland | asymptomatic | 140.0 | 268.0 | False | lv hypertrophy | 160.0 | False | 3.6 | downsloping | 2.0 | normal | 3 |
7 | 8 | 57 | Female | Cleveland | asymptomatic | 120.0 | 354.0 | False | normal | 163.0 | True | 0.6 | upsloping | 0.0 | normal | 0 |
8 | 9 | 63 | Male | Cleveland | asymptomatic | 130.0 | 254.0 | False | lv hypertrophy | 147.0 | False | 1.4 | flat | 1.0 | reversable defect | 2 |
9 | 10 | 53 | Male | Cleveland | asymptomatic | 140.0 | 203.0 | True | lv hypertrophy | 155.0 | True | 3.1 | downsloping | 0.0 | reversable defect | 1 |
id | age | sex | dataset | cp | trestbps | chol | fbs | restecg | thalch | exang | oldpeak | slope | ca | thal | num | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
910 | 911 | 51 | Female | VA Long Beach | asymptomatic | 114.0 | 258.0 | True | lv hypertrophy | 96.0 | False | 1.0 | upsloping | NaN | NaN | 0 |
911 | 912 | 62 | Male | VA Long Beach | asymptomatic | 160.0 | 254.0 | True | st-t abnormality | 108.0 | True | 3.0 | flat | NaN | NaN | 4 |
912 | 913 | 53 | Male | VA Long Beach | asymptomatic | 144.0 | 300.0 | True | st-t abnormality | 128.0 | True | 1.5 | flat | NaN | NaN | 3 |
913 | 914 | 62 | Male | VA Long Beach | asymptomatic | 158.0 | 170.0 | False | st-t abnormality | 138.0 | True | 0.0 | NaN | NaN | NaN | 1 |
914 | 915 | 46 | Male | VA Long Beach | asymptomatic | 134.0 | 310.0 | False | normal | 126.0 | False | 0.0 | NaN | NaN | normal | 2 |
915 | 916 | 54 | Female | VA Long Beach | asymptomatic | 127.0 | 333.0 | True | st-t abnormality | 154.0 | False | 0.0 | NaN | NaN | NaN | 1 |
916 | 917 | 62 | Male | VA Long Beach | typical angina | NaN | 139.0 | False | st-t abnormality | NaN | NaN | NaN | NaN | NaN | NaN | 0 |
917 | 918 | 55 | Male | VA Long Beach | asymptomatic | 122.0 | 223.0 | True | st-t abnormality | 100.0 | False | 0.0 | NaN | NaN | fixed defect | 2 |
918 | 919 | 58 | Male | VA Long Beach | asymptomatic | NaN | 385.0 | True | lv hypertrophy | NaN | NaN | NaN | NaN | NaN | NaN | 0 |
919 | 920 | 62 | Male | VA Long Beach | atypical angina | 120.0 | 254.0 | False | lv hypertrophy | 93.0 | True | 0.0 | NaN | NaN | NaN | 1 |