Skip to content

Latest commit

 

History

History
156 lines (134 loc) · 4.11 KB

stats_4_testing_data.md

File metadata and controls

156 lines (134 loc) · 4.11 KB
  1. "PassengerId"
Type of data:          Number
Contains null values:  False
Unique values:         418
Smallest value:        892
Largest value:         1,309
Sum:                   460,009
Mean:                  1,100.5
Median:                1,100.5
StDev:                 120.81
Most common values:    892 (1x)
                       893 (1x)
                       894 (1x)
                       895 (1x)
                       896 (1x)
  1. "Pclass"
Type of data:          Number
Contains null values:  False
Unique values:         3
Smallest value:        1
Largest value:         3
Sum:                   947
Mean:                  2.266
Median:                3
StDev:                 0.842
Most common values:    3 (218x)
                       1 (107x)
                       2 (93x)
  1. "Name"
Type of data:          Text
Contains null values:  False
Unique values:         418
Longest value:         63 characters
Most common values:    Kelly, Mr. James (1x)
                       Wilkes, Mrs. James (Ellen Needs) (1x)
                       Myles, Mr. Thomas Francis (1x)
                       Wirz, Mr. Albert (1x)
                       Hirvonen, Mrs. Alexander (Helga E Lindqvist) (1x)
  1. "Sex"
Type of data:          Text
Contains null values:  False
Unique values:         2
Longest value:         6 characters
Most common values:    male (266x)
                       female (152x)
  1. "Age"
Type of data:          Number
Contains null values:  True (excluded from calculations)
Unique values:         80
Smallest value:        0.17
Largest value:         76
Sum:                   10,050.5
Mean:                  30.273
Median:                27
StDev:                 14.181
Most common values:    None (86x)
                       21 (17x)
                       24 (17x)
                       22 (16x)
                       30 (15x)
  1. "SibSp"
Type of data:          Number
Contains null values:  False
Unique values:         7
Smallest value:        0
Largest value:         8
Sum:                   187
Mean:                  0.447
Median:                0
StDev:                 0.897
Most common values:    0 (283x)
                       1 (110x)
                       2 (14x)
                       3 (4x)
                       4 (4x)
  1. "Parch"
Type of data:          Number
Contains null values:  False
Unique values:         8
Smallest value:        0
Largest value:         9
Sum:                   164
Mean:                  0.392
Median:                0
StDev:                 0.981
Most common values:    0 (324x)
                       1 (52x)
                       2 (33x)
                       3 (3x)
                       4 (2x)
  1. "Ticket"
Type of data:          Text
Contains null values:  False
Unique values:         363
Longest value:         18 characters
Most common values:    PC 17608 (5x)
                       113503 (4x)
                       CA. 2343 (4x)
                       C.A. 31029 (3x)
                       PC 17483 (3x)
  1. "Fare"
Type of data:          Number
Contains null values:  True (excluded from calculations)
Unique values:         170
Smallest value:        0
Largest value:         512.329
Sum:                   14,856.538
Mean:                  35.627
Median:                14.454
StDev:                 55.908
Most common values:    7.75 (21x)
                       26 (19x)
                       8.05 (17x)
                       13 (17x)
                       7.896 (11x)
  1. "Cabin"
Type of data:          Text
Contains null values:  True (excluded from calculations)
Unique values:         77
Longest value:         15 characters
Most common values:    None (327x)
                       B57 B59 B63 B66 (3x)
                       B45 (2x)
                       C78 (2x)
                       C31 (2x)
  1. "Embarked"
Type of data:          Text
Contains null values:  False
Unique values:         3
Longest value:         1 characters
Most common values:    S (270x)
                       C (102x)
                       Q (46x)

Row count: 418