Improve the 16-homesale-forecasting.ipynb to give details on how to tune every configuration. #1278

Open — wants to merge 4 commits into base: staging
Conversation

xzdandy
Collaborator

@xzdandy xzdandy commented Oct 12, 2023

Update the notebook with neuralforecast and predictions for every postcode once the math domain error is fixed in #1283.

@xzdandy xzdandy linked an issue Oct 12, 2023 that may be closed by this pull request
2 tasks
@xzdandy xzdandy self-assigned this Oct 12, 2023
@review-notebook-app

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


@xzdandy xzdandy added the AI Engines Features, Bugs, related to AI Engines label Oct 12, 2023
@americast americast self-requested a review October 18, 2023 02:58
@xzdandy xzdandy marked this pull request as ready for review October 23, 2023 05:31
@xzdandy xzdandy added this to the v0.3.9 milestone Oct 23, 2023
@xzdandy
Collaborator Author

xzdandy commented Oct 23, 2023

Hi @americast, please review the updated notebook. Below are several issues I am still facing:

  1. It is not clear how to choose the frequency.
  2. It is not clear how to decide which model / parameters are better. Do we have any measurable / quantitative metrics we can offer after training?
  3. Are the NeuralForecast training time and accuracy tunable? 28 minutes with neuralforecast vs. 21 seconds with statsforecast is a huge gap.
  4. Even though we fixed the math domain error for series with only one data point, there are many 0 outputs, which do not make sense. I am using `WHERE price > 0` to filter them out for now.
  5. The dates predicted under different unique_id values differ a lot: some are in 2017, while others are in 2011. I think this is because the next 3 steps are counted from the latest date in each training series, which can differ. In reality, users likely want predictions for the same point in time.

@jarulraj
Member

@americast While you are fixing some of these issues, we could also discuss the plan for fixing here.

@americast
Member

> @americast While you are fixing some of these issues, we could also discuss the plan for fixing here.

Sure @jarulraj. Thanks @xzdandy for the review!

> Hi @americast, please review the updated notebook. Below are several issues I am still facing:
>
> 1. It is not clear how to choose the frequency.

Yes, it can get a little confusing. I will send a separate PR for the frequency-related discussion.
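In the meantime, as a rough sketch of how a user could sanity-check the frequency of their data before training: `pandas` can often infer it directly from the timestamps. `guess_frequency` below is a hypothetical helper, not part of the notebook; the fallback to the modal gap is an assumption for irregular series.

```python
import pandas as pd

# Hypothetical helper: infer the sampling frequency of a time series
# from its timestamps. pd.infer_freq needs at least 3 evenly spaced
# points; otherwise we fall back to the most common gap.
def guess_frequency(dates):
    idx = pd.DatetimeIndex(sorted(pd.to_datetime(dates)))
    freq = pd.infer_freq(idx)
    if freq is None and len(idx) > 1:
        # Fall back to the modal gap between consecutive timestamps.
        freq = idx.to_series().diff().mode().iloc[0]
    return freq

# Monthly home-sale data (month starts) should come back as "MS".
print(guess_frequency(["2021-01-01", "2021-02-01", "2021-03-01", "2021-04-01"]))
```

A check like this could also power a warning in the notebook when the inferred frequency disagrees with what the user configured.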

> 2. It is not clear how to decide which model / parameters are better. Do we have any measurable / quantitative metrics we can offer after training?

We should add a metric such as normalized RMSE or Interval Score. I will take care of that in #1258.
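For concreteness, here is a minimal sketch of both metrics in plain Python (not the #1258 implementation, just the textbook definitions): RMSE normalized by the range of the actuals so series on different price scales are comparable, and the mean interval score for a (1 − α) prediction interval.

```python
import math

def normalized_rmse(actual, forecast):
    # RMSE divided by the range of the actuals, so different
    # unique_ids (with very different price scales) are comparable.
    rmse = math.sqrt(sum((a - f) ** 2 for a, f in zip(actual, forecast)) / len(actual))
    spread = max(actual) - min(actual)
    return rmse / spread if spread else rmse

def interval_score(actual, lower, upper, alpha=0.05):
    # Mean interval score: interval width plus a 2/alpha penalty
    # for each observation falling outside the interval.
    scores = []
    for a, lo, hi in zip(actual, lower, upper):
        s = hi - lo
        s += (2 / alpha) * (lo - a) if a < lo else 0
        s += (2 / alpha) * (a - hi) if a > hi else 0
        scores.append(s)
    return sum(scores) / len(scores)
```

Lower is better for both, which makes them easy to surface after training as a single "which model won" number.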

> 3. Are the NeuralForecast training time and accuracy tunable? 28 minutes with neuralforecast vs. 21 seconds with statsforecast is a huge gap.

It's not that simple. With larger datasets, statsforecast may well take more time than neuralforecast: neuralforecast's training time grows roughly linearly with the number of unique IDs, while statsforecast's can grow non-linearly with more data.

> 4. Even though we fixed the math domain error for series with only one data point, there are many 0 outputs, which do not make sense. I am using `WHERE price > 0` to filter them out for now.

That's weird; I will check. In any case, forecasting from just one data point doesn't make much sense. Perhaps we should also return a suggestion or warning?
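Until that warning exists, a pre-filter on the training data could both mirror the `WHERE price > 0` workaround and drop series too short to forecast. `filter_trainable` below is a hypothetical pandas sketch, not existing EvaDB behavior; the `min_points=2` threshold is an assumption.

```python
import pandas as pd

# Hypothetical pre-filter: drop zero prices (mirrors WHERE price > 0)
# and then drop any unique_id with too few remaining observations.
def filter_trainable(df, group_col="unique_id", value_col="price", min_points=2):
    df = df[df[value_col] > 0]
    counts = df.groupby(group_col)[value_col].transform("size")
    return df[counts >= min_points]

df = pd.DataFrame({
    "unique_id": ["A", "A", "B", "C", "C"],
    "price": [100, 110, 0, 90, 95],
})
# "B" is dropped (its only row is a zero); "A" and "C" survive.
print(filter_trainable(df))
```

The same predicate could back a warning message listing which IDs were excluded and why.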

> 5. The dates predicted under different unique_id values differ a lot: some are in 2017, while others are in 2011. I think this is because the next 3 steps are counted from the latest date in each training series, which can differ. In reality, users likely want predictions for the same point in time.

This is an interesting problem. Perhaps we can ask the user for the time step (or range) at which they want the forecast and predict at that step.
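One possible shape for that, sketched under assumptions (monthly `"MS"` data whose last observations fall on month starts; `horizons_to` is a hypothetical helper, not an existing API): instead of forecasting a fixed 3 steps past each series' last observation, compute a per-series horizon that lands every series on the same target date.

```python
import pandas as pd

# Hypothetical alignment: give each series its own horizon so that
# all forecasts end at a shared target date, rather than "3 steps
# after whatever this series' last training date happened to be".
def horizons_to(target, last_dates, freq="MS"):
    target = pd.Timestamp(target)
    horizons = {}
    for uid, last in last_dates.items():
        # Number of freq-sized steps from this series' last date to target.
        steps = len(pd.date_range(pd.Timestamp(last), target, freq=freq)) - 1
        horizons[uid] = max(steps, 0)
    return horizons

# A series ending in 2017 needs 3 steps; one ending in 2011 needs 66.
print(horizons_to("2017-06-01", {"A": "2017-03-01", "B": "2011-12-01"}))
```

Stale series like "B" would then need very long horizons, which is itself a useful signal that their forecasts should carry wide intervals or a warning.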

As of now, I am trying to come up with a confidence interval for the forecasts, as well as a metric that would help analyze which method works best. The entire setup could become part of the feedback system. I'll be adding my commits in #1258 and will update this doc with the metrics once that's merged.

@xzdandy xzdandy removed this from the v0.3.9 milestone Nov 19, 2023
Successfully merging this pull request may close these issues.

Explore the Neuralforecast in the 16-homesale-forecasting.ipynb