Predict price of diamonds based on input, implements pipelines, hyperparameter optimization and ensemble learning

tejjusbhat/diamond_price_prediction

Diamond Price Prediction using Random Search, SKlearn Pipelines, and Random Forest/XGBoost with Standard Scaler

Overview

This Jupyter Notebook demonstrates a machine learning pipeline for predicting diamond prices. It combines scikit-learn Pipelines, Standard Scaler preprocessing, Random Forest and XGBoost models, and Random Search for hyperparameter optimization. The dataset contains features such as carat weight, cut, color, and clarity, which are commonly associated with diamond pricing.
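The combination described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the synthetic data, the column names, the use of an ordinal encoder for the categorical features, and the hyperparameter ranges are all assumptions chosen for the example. Only the Random Forest variant is shown; an XGBoost regressor could presumably be swapped in as the final pipeline step.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import RandomizedSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OrdinalEncoder, StandardScaler

# Synthetic stand-in for the diamonds data (assumed columns, made-up values).
rng = np.random.default_rng(0)
n = 300
df = pd.DataFrame({
    "carat": rng.uniform(0.2, 2.5, n),
    "cut": rng.choice(["Fair", "Good", "Ideal"], n),
    "color": rng.choice(["D", "G", "J"], n),
    "clarity": rng.choice(["SI1", "VS1", "IF"], n),
})
df["price"] = 4000 * df["carat"] ** 2 + rng.normal(0, 200, n)

X = df.drop(columns="price")
y = df["price"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Scale the numeric column and encode the categorical ones (encoding choice is an assumption).
preprocess = ColumnTransformer([
    ("num", StandardScaler(), ["carat"]),
    ("cat", OrdinalEncoder(), ["cut", "color", "clarity"]),
])

# Preprocessing and model live in one Pipeline, so the search refits both per fold.
pipeline = Pipeline([
    ("preprocess", preprocess),
    ("model", RandomForestRegressor(random_state=0)),
])

# Illustrative search space; the "model__" prefix targets the pipeline step named "model".
param_distributions = {
    "model__n_estimators": [50, 100, 200],
    "model__max_depth": [None, 5, 10],
    "model__min_samples_leaf": [1, 2, 4],
}

# Random Search samples n_iter candidate settings instead of trying every combination.
search = RandomizedSearchCV(
    pipeline, param_distributions, n_iter=5, cv=3, random_state=0
)
search.fit(X_train, y_train)

r2 = search.score(X_test, y_test)  # R^2 of the best pipeline on held-out data
print(search.best_params_, round(r2, 3))
```

Keeping the scaler inside the pipeline means it is fit only on the training folds during cross-validation, which avoids leaking information from the validation folds into preprocessing.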

Dependencies

  • Python 3.12
  • Jupyter Notebook
  • Libraries:
    • numpy
    • pandas
    • scikit-learn (imported as sklearn)
    • xgboost

Dataset

The dataset used in this project contains information about various diamonds, including their characteristics and prices. The dataset is not included in this repository due to size limitations, but it can be obtained from source. Make sure to place the dataset file in the same directory as the notebook.
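As a rough sketch of the placement requirement, a loader like the one below fails early with a clear message when the file is missing. The file name `diamonds.csv` and the helper `load_diamonds` are hypothetical; the repository does not state the actual file name, so adjust it to match the file you downloaded.

```python
from pathlib import Path

import pandas as pd


def load_diamonds(filename="diamonds.csv", directory="."):
    """Load the dataset from the notebook's directory (file name is an assumption)."""
    path = Path(directory) / filename
    if not path.exists():
        raise FileNotFoundError(
            f"Expected the dataset at {path.resolve()}; "
            "place the file next to the notebook."
        )
    return pd.read_csv(path)
```

Calling `load_diamonds()` from a notebook cell would then read the file from the current working directory, which is normally the notebook's own directory.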

Usage

To run the notebook:

  1. Clone the repository
  2. Install requirements
  3. Run the notebook

Acknowledgments

  • The dataset used in this project is sourced from source.
  • This project draws inspiration from various tutorials and documentation available online for scikit-learn, XGBoost, and Kaggle.
