The leading cause of death worldwide is cardiovascular (heart) diseases. In this project, the probability of heart failure was modeled using 918 observations of 11 features linked to heart failure. Thus, the dataset was limited in number of observations and features. The amount of information was limited for a disease that affects millions of people annually, and the dataset was reduced by 20% to create a test set. Thus, the reliability of the models may be less than desired to be applied in practice. A logistic regression model performed the best with an accuracy of 0.87 and a kappa of 0.735, closely followed by a support vector machine model and a random forest. Two MARS and AdaBoost models performed the worst with accuracies of 0.85 and kappas of 0.689. Hence, the recommendation would be to go forward with logistic regression models in the future.
-
Notifications
You must be signed in to change notification settings - Fork 0
bins0000/Heart-Disease-Classification
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Implemented classification models on heart failure dataset to assess the likelihood of an event that can be attributed to cardiovascular disease with R
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published