Skip to content

jecastrom/lab-cross-validation

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

logo_ironhack_blue 7

Lab | Cross Validation

For this lab, we will build a model on customer churn binary classification problem. You will be using files_for_lab/Customer-Churn.csv file.

Instructions

  1. Apply SMOTE for upsampling the data

    • Use logistic regression to fit the model and compute the accuracy of the model.
    • Use decision tree classifier to fit the model and compute the accuracy of the model.
    • Compare the accuracies of the two models.
  2. Apply TomekLinks for downsampling

    • It is important to remember that it does not make the two classes equal but only removes the points from the majority class that are close to other points in minority class.
    • Use logistic regression to fit the model and compute the accuracy of the model.
    • Use decision tree classifier to fit the model and compute the accuracy of the model.
    • Compare the accuracies of the two models.
    • You can also apply this algorithm one more time and check the how the imbalance in the two classes changed from the last time.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%