Import the necessary libraries
Read the data as a data frame
a. Shape of the data
b. Data type of each attribute
c. Checking the presence of missing values
d. 5 point summary of numerical attributes
e. Distribution of ‘bmi’, ‘age’ and ‘charges’ columns
f. Measure of skewness of ‘bmi’, ‘age’ and ‘charges’ columns
g. Checking the presence of outliers in ‘bmi’, ‘age’ and ‘charges’ columns
h. Distribution of categorical columns (include children)
i. Pair plot that includes all the columns of the data frame
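
Items f and g above can be sketched with the standard library alone. This is a minimal illustration, not the notebook's own code. It assumes the population (Fisher-Pearson) skewness formula and the common 1.5 x IQR rule for outliers, neither of which is confirmed by this outline. The helper names and the toy values are hypothetical and are not taken from the data set.

```python
import statistics


def skewness(values):
    """Population (Fisher-Pearson) skewness: the mean of the cubed z-scores."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical toy values standing in for a numeric column such as 'charges'.
toy_charges = [1200.0, 1500.0, 1800.0, 2100.0, 2500.0, 3000.0, 3600.0, 48000.0]

print("skewness:", skewness(toy_charges))
print("outliers:", iqr_outliers(toy_charges))
```

A positive skewness indicates a long right tail, and any value returned by `iqr_outliers` is one a box plot would draw beyond the whiskers under the same rule.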
a. Do charges of people who smoke differ significantly from those of people who don't?
b. Does bmi of males differ significantly from that of females?
c. Is the proportion of smokers significantly different in different genders?
d. Is the distribution of bmi the same across women with no children, one child and two children?
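
Questions a and b compare a numeric column between two groups. The outline does not say which test is used, so the following is only one possible approach: a two-sided permutation test on the difference in means, written with the standard library. The function name, the resample count, the seed and the toy values are all hypothetical.

```python
import random
import statistics


def permutation_test_mean_diff(group_a, group_b, n_resamples=10_000, seed=0):
    """Estimate a two-sided p-value for a difference in group means."""
    rng = random.Random(seed)  # fixed seed so repeated runs agree
    observed = abs(statistics.fmean(group_a) - statistics.fmean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_resamples):
        # Reassign group labels at random by shuffling the pooled values.
        rng.shuffle(pooled)
        diff = abs(statistics.fmean(pooled[:n_a]) - statistics.fmean(pooled[n_a:]))
        if diff >= observed:
            extreme += 1
    # Add-one correction keeps the estimate strictly positive.
    return (extreme + 1) / (n_resamples + 1)


# Hypothetical toy values, not taken from the data set.
toy_smokers = [30000.0, 32000.0, 35000.0, 38000.0, 41000.0, 45000.0]
toy_non_smokers = [4000.0, 5000.0, 6500.0, 7000.0, 8000.0, 9500.0]

p_value = permutation_test_mean_diff(toy_smokers, toy_non_smokers)
print("reject H0 at the 5% level:", p_value < 0.05)
```

The null hypothesis here is that group labels are exchangeable, so a small p-value suggests the group means differ. Questions c and d involve proportions and more than two groups, and would need a different test.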