Exploratory Data Analysis and Visualization on Haberman's Breast Cancer Survival Dataset
The dataset contains cases from a study conducted between 1958 and 1970 at the University of Chicago's Billings Hospital on the survival of patients who had undergone surgery for breast cancer. The dataset can be downloaded from Kaggle: https://www.kaggle.com/gilsousa/habermans-survival-data-set/version/1#haberman.csv (Source: donor Tjen-Sien Lim (limt@stat.wisc.edu); date: March 4, 1999.)
Each record has four attributes:
- Age of patient at time of operation (numerical).
- Patient's year of operation (year minus 1900, so 64 means 1964; numerical).
- Number of positive axillary nodes detected (numerical).
- Survival status (class attribute): 1 = the patient survived 5 years or longer; 2 = the patient died within 5 years.
The first three attributes (age, year of operation, and number of positive axillary nodes) form our features (independent variables), while the fourth, survival status, is our class variable (dependent variable).
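The split into features and class variable described above can be sketched in pandas as follows. This is a minimal sketch, not a definitive loader: the column names, the helper names `load_haberman` and `split_features_target`, and the assumption that the CSV has no header row are all illustrative choices, not something the dataset description specifies.

```python
import pandas as pd

# Column names are assumptions made for illustration; the dataset
# description only lists the four attributes in this order.
COLUMNS = ["age", "operation_year", "axillary_nodes", "survival_status"]
FEATURES = COLUMNS[:3]   # independent variables
TARGET = COLUMNS[3]      # class (dependent) variable


def load_haberman(source):
    """Read the data from a path or file-like object into a DataFrame.

    Assumes four comma-separated columns and no header row. If your
    copy of the file has a header, drop header=None and names=COLUMNS.
    """
    return pd.read_csv(source, header=None, names=COLUMNS)


def split_features_target(df):
    """Return (X, y): the three feature columns and a labeled class column."""
    X = df[FEATURES]
    # Map the numeric class codes to readable labels:
    # 1 = survived 5 years or longer, 2 = died within 5 years.
    y = df[TARGET].map({1: "survived", 2: "died"})
    return X, y
```

With the file saved locally as `haberman.csv` (an assumed filename), calling `df = load_haberman("haberman.csv")` followed by `X, y = split_features_target(df)` gives the feature table and labels, and `df["operation_year"] + 1900` recovers the calendar year of each operation.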