Skip to content

Latest commit

 

History

History
49 lines (4 loc) · 1.08 KB

README.md

File metadata and controls

49 lines (4 loc) · 1.08 KB

ANZ-Virtual-Internship Completed two online modules in Exploratory Data Analysis and Predictive Analytics Python libraries used - Pandas, Matplotlib, Scikit-learn

Task 1 - Used Pandas to segregate customer data by each month and used Matplotlib to visualise transaction volume and mean transaction amount each day. Also visualised mean customer balance and mean payment amount by age, with gender means included, for each month in data set.

Task 2 - Used Pandas to evaluate mean customer annual salary and then grouped customer daean. Used Scikit-lta by customer id and mearn for machine learning algorithms in Python. Linear regression model - used card present flag, merchant code, balance, age and amount from grouped data set to predict annual salary, obtained test prediction accuracy of 0.23. Reverting back to original data set, created dummy variables for categorical variables including gender and age. Decision tree classifier and regression models - used modified data set to predict annual salary, obtained test scores of 0.75 and 0.67 respectively.