This project takes in messages and categories CSV files and trains an ML model to classify each message into the categories it corresponds to. We build a machine learning pipeline to categorize these messages so that they can be routed to the appropriate disaster relief agency. The data has been provided by Figure Eight (https://appen.com/) and is copyright protected.
The project consists of three components:
In a Python script, process_data.py, we wrote a data cleaning pipeline that:
- Loads the messages and categories datasets
- Merges the two datasets
- Cleans the data
- Stores it in a SQLite database
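Below is a minimal sketch of those steps, assuming the Figure Eight column layout (a shared id key plus a single semicolon-delimited categories column) and a table named DisasterResponse; the actual implementation lives in process_data.py:

```python
# Illustrative ETL sketch; the real logic is in data/process_data.py
import pandas as pd
from sqlalchemy import create_engine

def clean_and_store(messages_csv, categories_csv, database_path):
    # Load and merge the two datasets on their shared id column
    messages = pd.read_csv(messages_csv)
    categories = pd.read_csv(categories_csv)
    df = messages.merge(categories, on="id")

    # Split the semicolon-delimited categories string into 36 columns,
    # naming each column from the label prefix (e.g. "related-1" -> "related")
    cats = df["categories"].str.split(";", expand=True)
    cats.columns = [value.split("-")[0] for value in cats.iloc[0]]

    # Keep only the trailing 0/1 flag of each value
    for column in cats:
        cats[column] = cats[column].str[-1].astype(int)

    # Replace the raw column, drop duplicates, and write to SQLite
    df = pd.concat([df.drop(columns=["categories"]), cats], axis=1)
    df = df.drop_duplicates()
    engine = create_engine("sqlite:///" + database_path)
    df.to_sql("DisasterResponse", engine, index=False, if_exists="replace")
```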
In a Python script, train_classifier.py, we wrote a machine learning pipeline that:
- Loads data from the SQLite database
- Splits the dataset into training and test sets
- Builds a text processing and machine learning pipeline
- Trains and tunes a model using GridSearchCV
- Outputs results on the test set
- Exports the final model as a pickle file
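A minimal sketch of that pipeline follows; the table name DisasterResponse, the message column, and the position of the 36 category columns are assumptions, and the actual implementation lives in train_classifier.py:

```python
# Illustrative training sketch; the real logic is in models/train_classifier.py
import pickle
import pandas as pd
from sqlalchemy import create_engine
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import Pipeline

# Load the cleaned data written by the ETL step
engine = create_engine("sqlite:///data/DisasterResponse.db")
df = pd.read_sql_table("DisasterResponse", engine)
X = df["message"]
Y = df.iloc[:, 4:]  # assumes the 36 category columns start at index 4

X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.2)

# Text processing and multi-output classification in one pipeline
pipeline = Pipeline([
    ("vect", CountVectorizer()),
    ("tfidf", TfidfTransformer()),
    ("clf", MultiOutputClassifier(RandomForestClassifier())),
])

# Tune a small grid; the real script may search more parameters
parameters = {"clf__estimator__n_estimators": [50, 100]}
model = GridSearchCV(pipeline, param_grid=parameters, cv=3)
model.fit(X_train, Y_train)
print(model.score(X_test, Y_test))  # report results on the test set

# Export the tuned model as a pickle file
with open("models/classifier.pkl", "wb") as f:
    pickle.dump(model, f)
```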
The web app includes visualizations of the training data, along with a prompt that takes a message from the user and classifies it into one or more of the 36 categories.
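As a rough sketch, the classification route in such a Flask app might look as follows; the endpoint name, table layout, and relative paths are assumptions, and the real logic lives in app/run.py:

```python
# Hypothetical classification route; the real one is in app/run.py
import pickle
import pandas as pd
from flask import Flask, render_template, request
from sqlalchemy import create_engine

app = Flask(__name__)

# Paths are relative to the app directory, where run.py is launched
engine = create_engine("sqlite:///../data/DisasterResponse.db")
df = pd.read_sql_table("DisasterResponse", engine)
category_names = df.columns[4:]  # assumes the 36 categories start at index 4
model = pickle.load(open("../models/classifier.pkl", "rb"))

@app.route("/go")
def go():
    # Classify the user's message into one or more of the 36 categories
    query = request.args.get("query", "")
    labels = model.predict([query])[0]
    results = dict(zip(category_names, labels))
    return render_template("go.html", query=query,
                           classification_result=results)

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=3001)
```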
Files are arranged in the following way:
--app
----run.py <--RUNS THE MAIN APP
----templates
------go.html <--CLASSIFICATION RESULT PAGE OF WEB APP
------master.html <--MAIN PAGE OF WEB APP
--data
----DisasterResponse.db <--DATABASE OF SAVED CLEANED DATA
----disaster_categories.csv <--DATA TO PROCESS
----disaster_messages.csv <--DATA TO PROCESS
----process_data.py <--SCRIPT TO PERFORM ETL PROCESS
--models
----train_classifier.py <--TRAINS AND SAVES THE CLASSIFIER
----classifier.pkl <--SAVED MODEL
Run the following commands in the project's root directory to set up your database and model.
To run the ETL pipeline that cleans the data and stores it in the database:
python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db
The first two arguments are the input data files and the third argument is the SQLite database in which we want to save the cleaned data. The ETL pipeline is in process_data.py. DisasterResponse.db already exists in the data folder, but the above command will still run and replace the file with the same information.
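Something along these lines presumably handles those arguments inside process_data.py (a hypothetical sketch, reusing the clean_and_store helper from the ETL sketch above):

```python
# Hypothetical argument handling for process_data.py
import sys

if __name__ == "__main__":
    if len(sys.argv) == 4:
        messages_csv, categories_csv, database_path = sys.argv[1:]
        clean_and_store(messages_csv, categories_csv, database_path)
    else:
        print("Usage: python process_data.py "
              "MESSAGES_CSV CATEGORIES_CSV DATABASE_PATH")
```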
To run the ML pipeline that trains the classifier and saves it:
python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl
This will use the cleaned data to train the model, improve it with grid search, and save it to a pickle file (classifier.pkl). classifier.pkl already exists in the models folder, but the above command will still run and replace the file with the same information.
Run the following command in the app's directory to run your web app.
python run.py
Go to http://0.0.0.0:3001/ or the URL wherever the app is deployed.
This project uses Python 3.6.3, and the necessary libraries are listed in requirements.txt. The standard-library modules, which are not listed in requirements.txt, are json, operator, pickle, pprint, re, and sys.
Credit goes to Figure Eight for providing the data for this wonderful project, and special thanks to Udacity for this awesome experience :)