Overview

Dataset info

Number of variables 13
Number of observations 506
Total Missing (%) 0.0%
Total size in memory 51.5 KiB
Average record size in memory 104.2 B

Variables types

Numeric 11
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 1
Unsupported 0

Warnings

  • TAX is highly correlated with RAD (ρ = 0.91023) Rejected
  • ZN has 372 / 73.5% zeros Zeros

Variables

AGE
Numeric

Distinct count 356
Unique (%) 70.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 68.575
Minimum 2.9
Maximum 100
Zeros (%) 0.0%

Quantile statistics

Minimum 2.9
5-th percentile 17.725
Q1 45.025
Median 77.5
Q3 94.075
95-th percentile 100
Maximum 100
Range 97.1
Interquartile range 49.05

Descriptive statistics

Standard deviation 28.149
Coef of variation 0.41048
Kurtosis -0.96772
Mean 68.575
MAD 24.611
Skewness -0.59896
Sum 34699
Variance 792.36
Memory size 4.0 KiB
Value Count Frequency (%)  
100.0 43 8.5%
 
96.0 4 0.8%
 
98.2 4 0.8%
 
95.4 4 0.8%
 
97.9 4 0.8%
 
87.9 4 0.8%
 
98.8 4 0.8%
 
94.1 3 0.6%
 
88.0 3 0.6%
 
21.4 3 0.6%
 
Other values (346) 430 85.0%
 

Minimum 5 values

Value Count Frequency (%)  
2.9 1 0.2%
 
6.0 1 0.2%
 
6.2 1 0.2%
 
6.5 1 0.2%
 
6.6 2 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
98.8 4 0.8%
 
98.9 3 0.6%
 
99.1 1 0.2%
 
99.3 1 0.2%
 
100.0 43 8.5%
 

B
Numeric

Distinct count 357
Unique (%) 70.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 356.67
Minimum 0.32
Maximum 396.9
Zeros (%) 0.0%

Quantile statistics

Minimum 0.32
5-th percentile 84.59
Q1 375.38
Median 391.44
Q3 396.23
95-th percentile 396.9
Maximum 396.9
Range 396.58
Interquartile range 20.848

Descriptive statistics

Standard deviation 91.295
Coef of variation 0.25596
Kurtosis 7.2268
Mean 356.67
MAD 54.629
Skewness -2.8904
Sum 180480
Variance 8334.8
Memory size 4.0 KiB
Value Count Frequency (%)  
396.9 121 23.9%
 
395.24 3 0.6%
 
393.74 3 0.6%
 
393.23 2 0.4%
 
394.72 2 0.4%
 
396.21 2 0.4%
 
395.69 2 0.4%
 
396.06 2 0.4%
 
395.63 2 0.4%
 
395.6 2 0.4%
 
Other values (347) 365 72.1%
 

Minimum 5 values

Value Count Frequency (%)  
0.32 1 0.2%
 
2.52 1 0.2%
 
2.6 1 0.2%
 
3.5 1 0.2%
 
3.65 1 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
396.28 1 0.2%
 
396.3 1 0.2%
 
396.33 1 0.2%
 
396.42 1 0.2%
 
396.9 121 23.9%
 

CHAS
Boolean

Distinct count 2
Unique (%) 0.4%
Missing (%) 0.0%
Missing (n) 0
Mean 0.06917
0.0
471
1.0
 
35
Value Count Frequency (%)  
0.0 471 93.1%
 
1.0 35 6.9%
 

CRIM
Numeric

Distinct count 504
Unique (%) 99.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.6135
Minimum 0.00632
Maximum 88.976
Zeros (%) 0.0%

Quantile statistics

Minimum 0.00632
5-th percentile 0.02791
Q1 0.082045
Median 0.25651
Q3 3.6771
95-th percentile 15.789
Maximum 88.976
Range 88.97
Interquartile range 3.595

Descriptive statistics

Standard deviation 8.6015
Coef of variation 2.3804
Kurtosis 37.131
Mean 3.6135
MAD 4.7841
Skewness 5.2231
Sum 1828.4
Variance 73.987
Memory size 4.0 KiB
Value Count Frequency (%)  
14.3337 2 0.4%
 
0.01501 2 0.4%
 
0.08265 1 0.2%
 
0.537 1 0.2%
 
1.35472 1 0.2%
 
0.14103 1 0.2%
 
0.03502 1 0.2%
 
0.03615 1 0.2%
 
0.66351 1 0.2%
 
0.1265 1 0.2%
 
Other values (494) 494 97.6%
 

Minimum 5 values

Value Count Frequency (%)  
0.00632 1 0.2%
 
0.00906 1 0.2%
 
0.01096 1 0.2%
 
0.01301 1 0.2%
 
0.01311 1 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
45.7461 1 0.2%
 
51.1358 1 0.2%
 
67.9208 1 0.2%
 
73.5341 1 0.2%
 
88.9762 1 0.2%
 

DIS
Numeric

Distinct count 412
Unique (%) 81.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.795
Minimum 1.1296
Maximum 12.127
Zeros (%) 0.0%

Quantile statistics

Minimum 1.1296
5-th percentile 1.462
Q1 2.1002
Median 3.2074
Q3 5.1884
95-th percentile 7.8278
Maximum 12.127
Range 10.997
Interquartile range 3.0883

Descriptive statistics

Standard deviation 2.1057
Coef of variation 0.55486
Kurtosis 0.48794
Mean 3.795
MAD 1.7194
Skewness 1.0118
Sum 1920.3
Variance 4.434
Memory size 4.0 KiB
Value Count Frequency (%)  
3.4952 5 1.0%
 
5.2873 4 0.8%
 
5.4007 4 0.8%
 
5.7209 4 0.8%
 
6.8147 4 0.8%
 
3.6519 3 0.6%
 
7.3172 3 0.6%
 
5.4917 3 0.6%
 
7.8278 3 0.6%
 
5.4159 3 0.6%
 
Other values (402) 470 92.9%
 

Minimum 5 values

Value Count Frequency (%)  
1.1296 1 0.2%
 
1.137 1 0.2%
 
1.1691 1 0.2%
 
1.1742 1 0.2%
 
1.1781 1 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
9.2203 2 0.4%
 
9.2229 1 0.2%
 
10.5857 2 0.4%
 
10.7103 2 0.4%
 
12.1265 1 0.2%
 

INDUS
Numeric

Distinct count 76
Unique (%) 15.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 11.137
Minimum 0.46
Maximum 27.74
Zeros (%) 0.0%

Quantile statistics

Minimum 0.46
5-th percentile 2.18
Q1 5.19
Median 9.69
Q3 18.1
95-th percentile 21.89
Maximum 27.74
Range 27.28
Interquartile range 12.91

Descriptive statistics

Standard deviation 6.8604
Coef of variation 0.61601
Kurtosis -1.2335
Mean 11.137
MAD 6.202
Skewness 0.29502
Sum 5635.2
Variance 47.064
Memory size 4.0 KiB
Value Count Frequency (%)  
18.1 132 26.1%
 
19.58 30 5.9%
 
8.14 22 4.3%
 
6.2 18 3.6%
 
21.89 15 3.0%
 
9.9 12 2.4%
 
3.97 12 2.4%
 
8.56 11 2.2%
 
10.59 11 2.2%
 
5.86 10 2.0%
 
Other values (66) 233 46.0%
 

Minimum 5 values

Value Count Frequency (%)  
0.46 1 0.2%
 
0.74 1 0.2%
 
1.21 1 0.2%
 
1.22 1 0.2%
 
1.25 2 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
18.1 132 26.1%
 
19.58 30 5.9%
 
21.89 15 3.0%
 
25.65 7 1.4%
 
27.74 5 1.0%
 

LSTAT
Numeric

Distinct count 455
Unique (%) 89.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 12.653
Minimum 1.73
Maximum 37.97
Zeros (%) 0.0%

Quantile statistics

Minimum 1.73
5-th percentile 3.7075
Q1 6.95
Median 11.36
Q3 16.955
95-th percentile 26.808
Maximum 37.97
Range 36.24
Interquartile range 10.005

Descriptive statistics

Standard deviation 7.1411
Coef of variation 0.56437
Kurtosis 0.49324
Mean 12.653
MAD 5.7153
Skewness 0.90646
Sum 6402.5
Variance 50.995
Memory size 4.0 KiB
Value Count Frequency (%)  
14.1 3 0.6%
 
6.36 3 0.6%
 
18.13 3 0.6%
 
8.05 3 0.6%
 
7.79 3 0.6%
 
9.5 2 0.4%
 
4.59 2 0.4%
 
3.76 2 0.4%
 
17.27 2 0.4%
 
10.11 2 0.4%
 
Other values (445) 481 95.1%
 

Minimum 5 values

Value Count Frequency (%)  
1.73 1 0.2%
 
1.92 1 0.2%
 
1.98 1 0.2%
 
2.47 1 0.2%
 
2.87 1 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
34.37 1 0.2%
 
34.41 1 0.2%
 
34.77 1 0.2%
 
36.98 1 0.2%
 
37.97 1 0.2%
 

NOX
Numeric

Distinct count 81
Unique (%) 16.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.5547
Minimum 0.385
Maximum 0.871
Zeros (%) 0.0%

Quantile statistics

Minimum 0.385
5-th percentile 0.40925
Q1 0.449
Median 0.538
Q3 0.624
95-th percentile 0.74
Maximum 0.871
Range 0.486
Interquartile range 0.175

Descriptive statistics

Standard deviation 0.11588
Coef of variation 0.2089
Kurtosis -0.064667
Mean 0.5547
MAD 0.095695
Skewness 0.72931
Sum 280.68
Variance 0.013428
Memory size 4.0 KiB
Value Count Frequency (%)  
0.538 23 4.5%
 
0.713 18 3.6%
 
0.437 17 3.4%
 
0.871 16 3.2%
 
0.489 15 3.0%
 
0.624 15 3.0%
 
0.693 14 2.8%
 
0.605 14 2.8%
 
0.74 13 2.6%
 
0.544 12 2.4%
 
Other values (71) 349 69.0%
 

Minimum 5 values

Value Count Frequency (%)  
0.385 1 0.2%
 
0.389 1 0.2%
 
0.392 2 0.4%
 
0.394 1 0.2%
 
0.398 2 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
0.713 18 3.6%
 
0.718 6 1.2%
 
0.74 13 2.6%
 
0.77 8 1.6%
 
0.871 16 3.2%
 

PTRATIO
Numeric

Distinct count 46
Unique (%) 9.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 18.456
Minimum 12.6
Maximum 22
Zeros (%) 0.0%

Quantile statistics

Minimum 12.6
5-th percentile 14.7
Q1 17.4
Median 19.05
Q3 20.2
95-th percentile 21
Maximum 22
Range 9.4
Interquartile range 2.8

Descriptive statistics

Standard deviation 2.1649
Coef of variation 0.11731
Kurtosis -0.28509
Mean 18.456
MAD 1.7873
Skewness -0.80232
Sum 9338.5
Variance 4.687
Memory size 4.0 KiB
Value Count Frequency (%)  
20.2 140 27.7%
 
14.7 34 6.7%
 
21.0 27 5.3%
 
17.8 23 4.5%
 
19.2 19 3.8%
 
17.4 18 3.6%
 
18.6 17 3.4%
 
19.1 17 3.4%
 
16.6 16 3.2%
 
18.4 16 3.2%
 
Other values (36) 179 35.4%
 

Minimum 5 values

Value Count Frequency (%)  
12.6 3 0.6%
 
13.0 12 2.4%
 
13.6 1 0.2%
 
14.4 1 0.2%
 
14.7 34 6.7%
 

Maximum 5 values

Value Count Frequency (%)  
20.9 11 2.2%
 
21.0 27 5.3%
 
21.1 1 0.2%
 
21.2 15 3.0%
 
22.0 2 0.4%
 

RAD
Numeric

Distinct count 9
Unique (%) 1.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 9.5494
Minimum 1
Maximum 24
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 2
Q1 4
Median 5
Q3 24
95-th percentile 24
Maximum 24
Range 23
Interquartile range 20

Descriptive statistics

Standard deviation 8.7073
Coef of variation 0.91181
Kurtosis -0.86723
Mean 9.5494
MAD 7.5394
Skewness 1.0048
Sum 4832
Variance 75.816
Memory size 4.0 KiB
Value Count Frequency (%)  
24.0 132 26.1%
 
5.0 115 22.7%
 
4.0 110 21.7%
 
3.0 38 7.5%
 
6.0 26 5.1%
 
8.0 24 4.7%
 
2.0 24 4.7%
 
1.0 20 4.0%
 
7.0 17 3.4%
 

Minimum 5 values

Value Count Frequency (%)  
1.0 20 4.0%
 
2.0 24 4.7%
 
3.0 38 7.5%
 
4.0 110 21.7%
 
5.0 115 22.7%
 

Maximum 5 values

Value Count Frequency (%)  
5.0 115 22.7%
 
6.0 26 5.1%
 
7.0 17 3.4%
 
8.0 24 4.7%
 
24.0 132 26.1%
 

RM
Numeric

Distinct count 446
Unique (%) 88.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 6.2846
Minimum 3.561
Maximum 8.78
Zeros (%) 0.0%

Quantile statistics

Minimum 3.561
5-th percentile 5.314
Q1 5.8855
Median 6.2085
Q3 6.6235
95-th percentile 7.5875
Maximum 8.78
Range 5.219
Interquartile range 0.738

Descriptive statistics

Standard deviation 0.70262
Coef of variation 0.1118
Kurtosis 1.8915
Mean 6.2846
MAD 0.51329
Skewness 0.40361
Sum 3180
Variance 0.49367
Memory size 4.0 KiB
Value Count Frequency (%)  
6.167 3 0.6%
 
6.229 3 0.6%
 
6.127 3 0.6%
 
5.713 3 0.6%
 
6.417 3 0.6%
 
6.405 3 0.6%
 
6.38 2 0.4%
 
5.304 2 0.4%
 
5.983 2 0.4%
 
7.185 2 0.4%
 
Other values (436) 480 94.9%
 

Minimum 5 values

Value Count Frequency (%)  
3.561 1 0.2%
 
3.863 1 0.2%
 
4.138 2 0.4%
 
4.368 1 0.2%
 
4.519 1 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
8.375 1 0.2%
 
8.398 1 0.2%
 
8.704 1 0.2%
 
8.725 1 0.2%
 
8.78 1 0.2%
 

TAX
Highly correlated

This variable is highly correlated with RAD and should be ignored for analysis

Correlation 0.91023

ZN
Numeric

Distinct count 26
Unique (%) 5.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 11.364
Minimum 0
Maximum 100
Zeros (%) 73.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 12.5
95-th percentile 80
Maximum 100
Range 100
Interquartile range 12.5

Descriptive statistics

Standard deviation 23.322
Coef of variation 2.0524
Kurtosis 4.0315
Mean 11.364
MAD 16.709
Skewness 2.2257
Sum 5750
Variance 543.94
Memory size 4.0 KiB
Value Count Frequency (%)  
0.0 372 73.5%
 
20.0 21 4.2%
 
80.0 15 3.0%
 
12.5 10 2.0%
 
22.0 10 2.0%
 
25.0 10 2.0%
 
40.0 7 1.4%
 
45.0 6 1.2%
 
30.0 6 1.2%
 
90.0 5 1.0%
 
Other values (16) 44 8.7%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 372 73.5%
 
12.5 10 2.0%
 
17.5 1 0.2%
 
18.0 1 0.2%
 
20.0 21 4.2%
 

Maximum 5 values

Value Count Frequency (%)  
82.5 2 0.4%
 
85.0 2 0.4%
 
90.0 5 1.0%
 
95.0 4 0.8%
 
100.0 1 0.2%
 

Correlations

Sample

CRIM ZN INDUS CHAS NOX RM AGE DIS RAD TAX PTRATIO B LSTAT
0 0.00632 18.0 2.31 0.0 0.538 6.575 65.2 4.0900 1.0 296.0 15.3 396.90 4.98
1 0.02731 0.0 7.07 0.0 0.469 6.421 78.9 4.9671 2.0 242.0 17.8 396.90 9.14
2 0.02729 0.0 7.07 0.0 0.469 7.185 61.1 4.9671 2.0 242.0 17.8 392.83 4.03
3 0.03237 0.0 2.18 0.0 0.458 6.998 45.8 6.0622 3.0 222.0 18.7 394.63 2.94
4 0.06905 0.0 2.18 0.0 0.458 7.147 54.2 6.0622 3.0 222.0 18.7 396.90 5.33