Learn Data-Science

This repo contains some useful fundamentals and code snippets for learning data science.

Data Links

https://www.kaggle.com/harlfoxem/housesalesprediction/version/1

Climate and Environment related data files

A very good link for learning feature engineering:
https://spark.apache.org/docs/latest/ml-features
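
As a small taste of what feature engineering means in practice, here is a minimal sketch of two common transformations: min-max scaling of a numeric column and one-hot encoding of a categorical column. It is plain Python, not the Spark API. The function names (`min_max_scale`, `one_hot`) and the toy values are made up for illustration and are not taken from the Kaggle dataset.

```python
def min_max_scale(values):
    """Rescale numbers linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map every value to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(labels):
    """Return the sorted categories and one 0/1 vector per label."""
    categories = sorted(set(labels))
    index = {c: i for i, c in enumerate(categories)}
    vectors = [
        [1 if index[label] == j else 0 for j in range(len(categories))]
        for label in labels
    ]
    return categories, vectors


# Hypothetical toy columns, invented for this example.
sqft = [1000, 1500, 2000]
condition = ["good", "fair", "good"]

scaled = min_max_scale(sqft)
categories, encoded = one_hot(condition)

print(scaled)
print(categories)
print(encoded)
```

Scaling puts numeric columns on a comparable range, and one-hot encoding turns text categories into numbers a model can consume. Library implementations handle more edge cases (missing values, unseen categories), so treat this only as a starting point.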