Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 418 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 24.9 KiB |
Average record size in memory | 61.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 6 |
Embarked_C is highly overall correlated with Embarked_S | High correlation |
Embarked_S is highly overall correlated with Embarked_C | High correlation |
Sex_female is highly overall correlated with Sex_male | High correlation |
Sex_male is highly overall correlated with Sex_female | High correlation |
PassengerId is uniformly distributed | Uniform |
PassengerId has unique values | Unique |
SibSp has 283 (67.7%) zeros | Zeros |
Parch has 324 (77.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-06-20 12:18:42.520446 |
---|---|
Analysis finished | 2023-06-20 12:18:45.662900 |
Duration | 3.14 seconds |
Software version | ydata-profiling vv4.2.0 |
Download configuration | config.json |
PassengerId
Real number (ℝ)
UNIFORM
  UNIQUE
 
Distinct | 418 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1100.5 |
Minimum | 892 |
---|---|
Maximum | 1309 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 892 |
---|---|
5-th percentile | 912.85 |
Q1 | 996.25 |
median | 1100.5 |
Q3 | 1204.75 |
95-th percentile | 1288.15 |
Maximum | 1309 |
Range | 417 |
Interquartile range (IQR) | 208.5 |
Descriptive statistics
Standard deviation | 120.81046 |
---|---|
Coefficient of variation (CV) | 0.10977779 |
Kurtosis | -1.2 |
Mean | 1100.5 |
Median Absolute Deviation (MAD) | 104.5 |
Skewness | 0 |
Sum | 460009 |
Variance | 14595.167 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1100 | 1 | 0.2% |
910 | 1 | 0.2% |
968 | 1 | 0.2% |
971 | 1 | 0.2% |
972 | 1 | 0.2% |
975 | 1 | 0.2% |
977 | 1 | 0.2% |
979 | 1 | 0.2% |
990 | 1 | 0.2% |
980 | 1 | 0.2% |
Other values (408) | 408 |
Value | Count | Frequency (%) |
892 | 1 | |
893 | 1 | |
894 | 1 | |
895 | 1 | |
896 | 1 | |
897 | 1 | |
898 | 1 | |
899 | 1 | |
900 | 1 | |
901 | 1 |
Value | Count | Frequency (%) |
1309 | 1 | |
1308 | 1 | |
1307 | 1 | |
1306 | 1 | |
1305 | 1 | |
1304 | 1 | |
1303 | 1 | |
1302 | 1 | |
1301 | 1 | |
1300 | 1 |
Pclass
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
3 | |
---|---|
1 | |
2 |
Common Values
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Most occurring characters
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 218 | |
1 | 107 | |
2 | 93 |
Age
Real number (ℝ)
Distinct | 85 |
---|---|
Distinct (%) | 20.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.423005 |
Minimum | 0.17 |
---|---|
Maximum | 76 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 0.17 |
---|---|
5-th percentile | 10 |
Q1 | 23 |
median | 25 |
Q3 | 36.375 |
95-th percentile | 55 |
Maximum | 76 |
Range | 75.83 |
Interquartile range (IQR) | 13.375 |
Descriptive statistics
Standard deviation | 12.963036 |
---|---|
Coefficient of variation (CV) | 0.44057485 |
Kurtosis | 0.67088016 |
Mean | 29.423005 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.664077 |
Sum | 12298.816 |
Variance | 168.0403 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24.52510417 | 50 | 12.0% |
23.0734 | 22 | 5.3% |
21 | 17 | 4.1% |
24 | 17 | 4.1% |
22 | 16 | 3.8% |
30 | 15 | 3.6% |
18 | 13 | 3.1% |
27 | 12 | 2.9% |
26 | 12 | 2.9% |
25 | 11 | 2.6% |
Other values (75) | 233 |
Value | Count | Frequency (%) |
0.17 | 1 | 0.2% |
0.33 | 1 | 0.2% |
0.75 | 1 | 0.2% |
0.83 | 1 | 0.2% |
0.92 | 1 | 0.2% |
1 | 3 | |
2 | 2 | |
3 | 1 | 0.2% |
5 | 1 | 0.2% |
6 | 3 |
Value | Count | Frequency (%) |
76 | 1 | 0.2% |
67 | 1 | 0.2% |
64 | 3 | |
63 | 2 | |
62 | 1 | 0.2% |
61 | 2 | |
60.5 | 1 | 0.2% |
60 | 3 | |
59 | 1 | 0.2% |
58 | 1 | 0.2% |
SibSp
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.44736842 |
Minimum | 0 |
---|---|
Maximum | 8 |
Zeros | 283 |
Zeros (%) | 67.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 8 |
Range | 8 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.89675956 |
---|---|
Coefficient of variation (CV) | 2.0045214 |
Kurtosis | 26.498712 |
Mean | 0.44736842 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.1683366 |
Sum | 187 |
Variance | 0.80417771 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 283 | |
1 | 110 | 26.3% |
2 | 14 | 3.3% |
3 | 4 | 1.0% |
4 | 4 | 1.0% |
8 | 2 | 0.5% |
5 | 1 | 0.2% |
Value | Count | Frequency (%) |
0 | 283 | |
1 | 110 | 26.3% |
2 | 14 | 3.3% |
3 | 4 | 1.0% |
4 | 4 | 1.0% |
5 | 1 | 0.2% |
8 | 2 | 0.5% |
Value | Count | Frequency (%) |
8 | 2 | 0.5% |
5 | 1 | 0.2% |
4 | 4 | 1.0% |
3 | 4 | 1.0% |
2 | 14 | 3.3% |
1 | 110 | 26.3% |
0 | 283 |
Parch
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.3923445 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 324 |
Zeros (%) | 77.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.98142888 |
---|---|
Coefficient of variation (CV) | 2.5014468 |
Kurtosis | 31.412513 |
Mean | 0.3923445 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.6544617 |
Sum | 164 |
Variance | 0.96320264 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 324 | |
1 | 52 | 12.4% |
2 | 33 | 7.9% |
3 | 3 | 0.7% |
4 | 2 | 0.5% |
9 | 2 | 0.5% |
6 | 1 | 0.2% |
5 | 1 | 0.2% |
Value | Count | Frequency (%) |
0 | 324 | |
1 | 52 | 12.4% |
2 | 33 | 7.9% |
3 | 3 | 0.7% |
4 | 2 | 0.5% |
5 | 1 | 0.2% |
6 | 1 | 0.2% |
9 | 2 | 0.5% |
Value | Count | Frequency (%) |
9 | 2 | 0.5% |
6 | 1 | 0.2% |
5 | 1 | 0.2% |
4 | 2 | 0.5% |
3 | 3 | 0.7% |
2 | 33 | 7.9% |
1 | 52 | 12.4% |
0 | 324 |
Fare
Real number (ℝ)
Distinct | 170 |
---|---|
Distinct (%) | 40.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 35.627188 |
Minimum | 0 |
---|---|
Maximum | 512.3292 |
Zeros | 2 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 7.2292 |
Q1 | 7.8958 |
median | 14.4542 |
Q3 | 31.5 |
95-th percentile | 151.55 |
Maximum | 512.3292 |
Range | 512.3292 |
Interquartile range (IQR) | 23.6042 |
Descriptive statistics
Standard deviation | 55.8405 |
---|---|
Coefficient of variation (CV) | 1.5673564 |
Kurtosis | 17.971266 |
Mean | 35.627188 |
Median Absolute Deviation (MAD) | 6.85 |
Skewness | 3.6915998 |
Sum | 14892.165 |
Variance | 3118.1615 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.75 | 21 | 5.0% |
26 | 19 | 4.5% |
13 | 17 | 4.1% |
8.05 | 17 | 4.1% |
7.8958 | 11 | 2.6% |
10.5 | 11 | 2.6% |
7.775 | 10 | 2.4% |
7.2292 | 9 | 2.2% |
7.225 | 9 | 2.2% |
7.8542 | 8 | 1.9% |
Other values (160) | 286 |
Value | Count | Frequency (%) |
0 | 2 | 0.5% |
3.1708 | 1 | 0.2% |
6.4375 | 2 | 0.5% |
6.4958 | 1 | 0.2% |
6.95 | 1 | 0.2% |
7 | 2 | 0.5% |
7.05 | 2 | 0.5% |
7.225 | 9 | |
7.2292 | 9 | |
7.25 | 5 |
Value | Count | Frequency (%) |
512.3292 | 1 | 0.2% |
263 | 2 | 0.5% |
262.375 | 5 | |
247.5208 | 1 | 0.2% |
227.525 | 1 | 0.2% |
221.7792 | 3 | |
211.5 | 4 | |
211.3375 | 1 | 0.2% |
164.8667 | 2 | 0.5% |
151.55 | 2 | 0.5% |
Embarked_C
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
0 | |
---|---|
1 |
Common Values
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 316 | |
1 | 102 | 24.4% |
Embarked_Q
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
0 | |
---|---|
1 |
Common Values
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 372 | |
1 | 46 | 11.0% |
Embarked_S
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
1 | |
---|---|
0 |
Common Values
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 270 | |
0 | 148 |
Sex_female
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
0 | |
---|---|
1 |
Common Values
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 266 | |
1 | 152 |
Sex_male
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 6.5 KiB |
1 | |
---|---|
0 |
Common Values
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 418 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 418 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 418 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 266 | |
0 | 152 |
PassengerId | Age | SibSp | Parch | Fare | Pclass | Embarked_C | Embarked_Q | Embarked_S | Sex_female | Sex_male | |
---|---|---|---|---|---|---|---|---|---|---|---|
PassengerId | 1.000 | -0.032 | -0.010 | 0.051 | 0.019 | 0.054 | 0.000 | 0.134 | 0.000 | 0.000 | 0.000 |
Age | -0.032 | 1.000 | -0.025 | -0.122 | 0.309 | 0.392 | 0.133 | 0.185 | 0.033 | 0.045 | 0.045 |
SibSp | -0.010 | -0.025 | 1.000 | 0.412 | 0.439 | 0.113 | 0.093 | 0.130 | 0.000 | 0.136 | 0.136 |
Parch | 0.051 | -0.122 | 0.412 | 1.000 | 0.377 | 0.000 | 0.123 | 0.113 | 0.086 | 0.213 | 0.213 |
Fare | 0.019 | 0.309 | 0.439 | 0.377 | 1.000 | 0.475 | 0.348 | 0.092 | 0.214 | 0.155 | 0.155 |
Pclass | 0.054 | 0.392 | 0.113 | 0.000 | 0.475 | 1.000 | 0.378 | 0.254 | 0.259 | 0.106 | 0.106 |
Embarked_C | 0.000 | 0.133 | 0.093 | 0.123 | 0.348 | 0.378 | 1.000 | 0.185 | 0.761 | 0.000 | 0.000 |
Embarked_Q | 0.134 | 0.185 | 0.130 | 0.113 | 0.092 | 0.254 | 0.185 | 1.000 | 0.465 | 0.096 | 0.096 |
Embarked_S | 0.000 | 0.033 | 0.000 | 0.086 | 0.214 | 0.259 | 0.761 | 0.465 | 1.000 | 0.088 | 0.088 |
Sex_female | 0.000 | 0.045 | 0.136 | 0.213 | 0.155 | 0.106 | 0.000 | 0.096 | 0.088 | 1.000 | 0.995 |
Sex_male | 0.000 | 0.045 | 0.136 | 0.213 | 0.155 | 0.106 | 0.000 | 0.096 | 0.088 | 0.995 | 1.000 |
PassengerId | Pclass | Age | SibSp | Parch | Fare | Embarked_C | Embarked_Q | Embarked_S | Sex_female | Sex_male | |
---|---|---|---|---|---|---|---|---|---|---|---|
208 | 1100 | 1 | 33.00 | 0 | 0 | 27.7208 | 1 | 0 | 0 | 1 | 0 |
350 | 1242 | 1 | 45.00 | 0 | 1 | 63.3583 | 1 | 0 | 0 | 1 | 0 |
122 | 1014 | 1 | 35.00 | 1 | 0 | 57.7500 | 1 | 0 | 0 | 1 | 0 |
343 | 1235 | 1 | 58.00 | 0 | 1 | 512.3292 | 1 | 0 | 0 | 1 | 0 |
131 | 1023 | 1 | 53.00 | 0 | 0 | 28.5000 | 1 | 0 | 0 | 0 | 1 |
335 | 1227 | 1 | 30.00 | 0 | 0 | 26.0000 | 0 | 0 | 1 | 0 | 1 |
141 | 1033 | 1 | 33.00 | 0 | 0 | 151.5500 | 0 | 0 | 1 | 1 | 0 |
118 | 1010 | 1 | 36.00 | 0 | 0 | 75.2417 | 1 | 0 | 0 | 0 | 1 |
142 | 1034 | 1 | 61.00 | 1 | 3 | 262.3750 | 1 | 0 | 0 | 0 | 1 |
146 | 1038 | 1 | 40.52 | 0 | 0 | 51.8625 | 0 | 0 | 1 | 0 | 1 |
PassengerId | Pclass | Age | SibSp | Parch | Fare | Embarked_C | Embarked_Q | Embarked_S | Sex_female | Sex_male | |
---|---|---|---|---|---|---|---|---|---|---|---|
172 | 1064 | 3 | 23.000000 | 1 | 0 | 13.9000 | 0 | 0 | 1 | 0 | 1 |
171 | 1063 | 3 | 27.000000 | 0 | 0 | 7.2250 | 1 | 0 | 0 | 0 | 1 |
170 | 1062 | 3 | 24.525104 | 0 | 0 | 7.5500 | 0 | 0 | 1 | 0 | 1 |
169 | 1061 | 3 | 22.000000 | 0 | 0 | 8.9625 | 0 | 0 | 1 | 1 | 0 |
167 | 1059 | 3 | 18.000000 | 2 | 2 | 34.3750 | 0 | 0 | 1 | 0 | 1 |
165 | 1057 | 3 | 26.000000 | 1 | 1 | 22.0250 | 0 | 0 | 1 | 1 | 0 |
163 | 1055 | 3 | 24.525104 | 0 | 0 | 7.0000 | 0 | 0 | 1 | 0 | 1 |
161 | 1053 | 3 | 7.000000 | 1 | 1 | 15.2458 | 1 | 0 | 0 | 0 | 1 |
199 | 1091 | 3 | 23.073400 | 0 | 0 | 8.1125 | 0 | 0 | 1 | 1 | 0 |
417 | 1309 | 3 | 24.525104 | 1 | 1 | 22.3583 | 1 | 0 | 0 | 0 | 1 |