A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons

Time series forecasting has attracted the attention of the machine learning (ML) community to produce accurate forecasting models that address the limitations of classical methods. A large part of ML research focuses on innovative algorithms, but another important area is transitioning ML to industr...

Full description

Bibliographic Details
Main Author: Eiskowitz, Skylar
Other Authors: Crawley, Edward F.
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/139189
_version_ 1826201511947927552
author Eiskowitz, Skylar
author2 Crawley, Edward F.
author_facet Crawley, Edward F.
Eiskowitz, Skylar
author_sort Eiskowitz, Skylar
collection MIT
description Time series forecasting has attracted the attention of the machine learning (ML) community to produce accurate forecasting models that address the limitations of classical methods. A large part of ML research focuses on innovative algorithms, but another important area is transitioning ML to industry settings. The objective of this Thesis is to apply ML in realistic scenarios by devising methods that make practical, usable forecasts and models. We focus on three areas that contribute to more practical forecasts. First, we improve the problem formulation of multi-step ahead forecasting by including a notion of an offset to create a more customizable forecasting window. A comparative analysis across three datasets shows that at further out horizons, two models that include the notion of an offset consistently outperform their original counterparts. Secondly, we simulate a scenario where an ML model does not have access to the immense amount of training data normally necessary to train deep neural networks. We use transfer learning with a weight sharing algorithm and observe that it improves all seven target model accuracies even after they have accumulated two weeks of their own data. However, the input data to transfer knowledge from must be chosen wisely to avoid negative transfer. Finally, we address the challenge of deploying a safe and robust ML model by outlining key features in a time series forecasting library and compare 13 notable, actively maintained libraries, finding a wide variability in the features included in the libraries. A surprisingly low number of libraries include a benchmarking system, and around half of the libraries provide a pre/post-processing engine that would allow for modular ML models in a pipelining manner for easy deployment.
first_indexed 2024-09-23T11:52:56Z
format Thesis
id mit-1721.1/139189
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T11:52:56Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1391892022-01-15T03:37:52Z A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons Eiskowitz, Skylar Crawley, Edward F. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Time series forecasting has attracted the attention of the machine learning (ML) community to produce accurate forecasting models that address the limitations of classical methods. A large part of ML research focuses on innovative algorithms, but another important area is transitioning ML to industry settings. The objective of this Thesis is to apply ML in realistic scenarios by devising methods that make practical, usable forecasts and models. We focus on three areas that contribute to more practical forecasts. First, we improve the problem formulation of multi-step ahead forecasting by including a notion of an offset to create a more customizable forecasting window. A comparative analysis across three datasets shows that at further out horizons, two models that include the notion of an offset consistently outperform their original counterparts. Secondly, we simulate a scenario where an ML model does not have access to the immense amount of training data normally necessary to train deep neural networks. We use transfer learning with a weight sharing algorithm and observe that it improves all seven target model accuracies even after they have accumulated two weeks of their own data. However, the input data to transfer knowledge from must be chosen wisely to avoid negative transfer. Finally, we address the challenge of deploying a safe and robust ML model by outlining key features in a time series forecasting library and compare 13 notable, actively maintained libraries, finding a wide variability in the features included in the libraries. A surprisingly low number of libraries include a benchmarking system, and around half of the libraries provide a pre/post-processing engine that would allow for modular ML models in a pipelining manner for easy deployment. S.M. 2022-01-14T14:55:39Z 2022-01-14T14:55:39Z 2021-06 2021-06-16T13:26:26.458Z Thesis https://hdl.handle.net/1721.1/139189 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Eiskowitz, Skylar
A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title_full A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title_fullStr A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title_full_unstemmed A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title_short A Machine Learning Approach for Forecasting with Limited Data and for Distant Time Horizons
title_sort machine learning approach for forecasting with limited data and for distant time horizons
url https://hdl.handle.net/1721.1/139189
work_keys_str_mv AT eiskowitzskylar amachinelearningapproachforforecastingwithlimiteddataandfordistanttimehorizons
AT eiskowitzskylar machinelearningapproachforforecastingwithlimiteddataandfordistanttimehorizons