Cholesterol Prediction Without a Clinical Test

Authors/Contributors: Shimona Narang, Zehui Lin, Esmond Tang, Dylan Mendonca

Description

This repository contains the notebook and presentation for a final project in CHE1147 (Data Mining in Engineering), taken at the University of Toronto in Winter 2020. The goal of the project was to assess whether a person is likely to have a high or low cholesterol level, using features that are either (a) known beforehand (age, gender, etc.) or (b) easily measured at home (weight, arm thickness, etc.). The motivation was to create a model that could be deployed in an app as a survey, easing the financial burden of taking a cholesterol test (the survey could act as a precursor to a lab test).

Files

The repository contains the following files:

Cholesterol Prediction.ipynb: Contains all the code and models used
Business Model Presentation.pdf: Contains the slide deck for the project presentation

Dataset

The project used data from the National Health and Nutrition Examination Survey (2014). Features were extracted from datasets related to diet, demographics, examinations (height, weight, BMI, etc.) and a questionnaire (questions about exercise/eating habits, background, etc.). Supervised machine learning techniques were used to classify cholesterol as high or normal.
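
As a rough illustration of the classification target, the sketch below turns a measured cholesterol value into a high/normal label. The 240 mg/dL cut-off, the function name `label_cholesterol`, and the sample readings are all assumptions made for this example; this README does not state which threshold or which cholesterol measure the notebook uses.

```python
from typing import Optional

# Assumed cut-off in mg/dL, for illustration only; the notebook may use another.
HIGH_CHOLESTEROL_THRESHOLD = 240.0


def label_cholesterol(total_mg_dl: Optional[float],
                      threshold: float = HIGH_CHOLESTEROL_THRESHOLD) -> Optional[int]:
    """Return 1 for high, 0 for normal, or None when the reading is missing."""
    if total_mg_dl is None:
        return None
    return 1 if total_mg_dl >= threshold else 0


# Hypothetical readings, not taken from the survey data.
readings = [182.0, 251.0, None, 240.0]
labels = [label_cholesterol(r) for r in readings]
print(labels)
```

Rows whose label comes back as `None` have no target value, so they would need to be dropped before training a supervised model.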

Workflow

The project is broken down into the following sections:

Feature Selection: Selecting salient features based on personal discretion
Data Pre-processing: Imputing missing values in the dataset
Exploratory Data Analysis (EDA): Exploring the data through visualizations and PCA
Model Development and Fine Tuning: Both linear and non-linear approaches (L1/L2 Logistic Regression, K-Nearest Neighbors, XGBoost, Random Forest, SVMs)
Designing for Deployment: Using feature importance to extract the 10 most important features, then using those features to build the final models. These 10 features could be implemented in an app.
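
Two of the steps above, imputing missing values and keeping the most important features, can be sketched in plain Python. This is a minimal illustration, not the notebook's code: median imputation is an assumed strategy (the README does not say which one was used), and the helper names, column names, and importance scores are hypothetical.

```python
from statistics import median


def impute_median(rows, columns):
    """Fill missing values (None) in each column with that column's median."""
    medians = {}
    for col in columns:
        observed = [row[col] for row in rows if row[col] is not None]
        medians[col] = median(observed)  # raises if a column has no observed values
    return [
        {col: (row[col] if row[col] is not None else medians[col]) for col in columns}
        for row in rows
    ]


def top_k_features(importances, k=10):
    """Return the k feature names with the largest importance scores."""
    ranked = sorted(importances.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:k]]


# Hypothetical toy records; the column names are placeholders.
rows = [
    {"age": 34, "weight_kg": 70.0},
    {"age": None, "weight_kg": 82.5},
    {"age": 58, "weight_kg": None},
]
filled = impute_median(rows, ["age", "weight_kg"])

# Hypothetical importance scores, such as those a fitted tree ensemble reports.
importances = {"age": 0.31, "weight_kg": 0.22, "waist_cm": 0.27, "height_cm": 0.05}
selected = top_k_features(importances, k=2)
print(filled)
print(selected)
```

In the project itself `k` would be 10, and the final models would be retrained on only the selected columns so that an app needs to ask for just those inputs.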

Questions/Contributions

The rest of the team and I are not actively pursuing this project at the moment.
If you'd like to chat about this project, please reach out to me or any of my team members on GitHub.

shimonanarang/Prediction-of-cholesterol-without-lab-test
