Skip to content

Latest commit

 

History

History
105 lines (59 loc) · 6.31 KB

Time Series Forecasting - Daily Weather Prediction (Simple LSTM Model).md

File metadata and controls

105 lines (59 loc) · 6.31 KB

Time Series Forecasting - Daily Weather Prediction using a Simple LSTM Model

The following workflow will demonstrate how to use a CNN to do deep learning in KNIME for image classification of a sign language dataset.

Dataset Link

Daily weather forecast (India): https://www.kaggle.com/sumanthvrao/daily-climate-time-series-data

Workflow Link

Daily Weather Forecast (India) Workflow: https://tinyurl.com/2p9bywnr

Daily Weather Prediction - Deep Learning

image
CSV Data Snippet

All the training data is in a csv format with the following columns:
1.date
2.mean temperature
3.humidity
4.wind speed
5.mean pressure

There are 1462 rows with the dates spanning from 2013-01-01 to 2017-01-01. The testing data has the same columns with dates spanning from 2017-01-01 to 2017-04-24.

Data Pre-Processing

Before anything, the images from the dataset has to be cleaned and processed in a way that the model can accept it.

image

We will first use two CSV Reader nodes in order to read both the training and test csv files from the downloaded dataset. We can then use the Concatenate node to combine both tables together.

image

Inside the Pre Processing Metanode, we will clean the data up before feeding it into the model. We use Sorter to sort the data ascending by date (as time series is usually highly dependent on the date/time, this step is crucial).

image

We then convert the date column - originally a string - to KNIME’s version of date/time using the String to Date&Time node. This node will convert the date column (string) to KNIME’s date and string object. You can configure the date format and type as well as the execution (whether it should fail on error). Follow the configurations on the image.

image

image

Now we can use the column filter node to filter out the unneeded data. In our case we are going to use the Mean Temperature column to train our model, so we are only passing the date and mean temperature column.

image

The timestamp alignment node hecks whether the selected timestamp column is uniformly sampled in the selected time scale. Missing values will be inserted at skipped sampling times. Select Period = Day, Timestamp column = date and tick replace timestamp column.

image

Lagging & Partitioning

image

We are now going to lag and partition the data before feeding it into our model. Using the lag column, we are going to put in the following configurations.

image

This will lag our input column by 1,200 times. We then aggregate all the lagged columns so that they fit into one list column.

image

image

Finally, partition the data into a 80-20 split using TAKE FROM TOP.

LSTM Model

image

Follow the configurations listed below for all 3 layers of the LSTM model.

image

image

image

Training & Post Evaluation

image

Follow the configurations below for the Keras Network Learner.

image

image

image

The Deployment loop component can be taken from the original cited workflow, or made from scratch. Essentially, this component will first convert the Keras network to Tensorflow. Then to execute the network, we start with an input of the same length as the training. We apply our network to predict the next character, delete the first character, and apply the network again to our new sequence and so on.

image

The scorer will use the selected columns to determine the model’s accuracy. Follow the configurations. To see the confusion matrix, right click on the node and select “confusion matrix.”

image

image

This nline plot node will present a visual plot of how accurate the predicted forecast was in comparison with the actual values (mean temperature). To see this plot, just right click on the node and select view line plot.

image

image