Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company

Driver dwell time is an important challenge the U.S trucking industry faces. High, unplanned dwell times are costly to all stakeholders in the industry as they result in detention costs, declining performance and decreased driver capacity. With the increasing demand for these services, it is impo...

Full description

Bibliographic Details
Main Authors: Benjatanont, Sireethorn, Tantuico, Dylan
Format: Other
Language:en_US
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/1721.1/126379
_version_ 1811084220844474368
author Benjatanont, Sireethorn
Tantuico, Dylan
author_facet Benjatanont, Sireethorn
Tantuico, Dylan
author_sort Benjatanont, Sireethorn
collection MIT
description Driver dwell time is an important challenge the U.S trucking industry faces. High, unplanned dwell times are costly to all stakeholders in the industry as they result in detention costs, declining performance and decreased driver capacity. With the increasing demand for these services, it is important to maximize the driving time of drivers in the industry by minimizing dwell time to free up capacity and provide competitive wages. This project utilizes the data of a third-party logistics company with the goal to understand the factors that influence dwell time, and to construct the model to predict dwell time of a load. In the analysis, linear models, random forest, and gradient boosting methods were explored based on regression and classification approach. Ultimately, the random forest classification model with one-hour bins is the recommended model as it had the highest predictive performance while the one-hour bins was sufficient to meet the business need. Additionally, the analysis concludes that shipper facilities are the most significant driver of dwell time. Hence, understanding and integrating more granular observations on shipper practices within their facilities will allow a third-party logistics company to improve its driver fleet utilization and increase the predictive performance of their dwell time prediction model.
first_indexed 2024-09-23T12:47:12Z
format Other
id mit-1721.1/126379
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T12:47:12Z
publishDate 2020
record_format dspace
spelling mit-1721.1/1263792020-07-25T03:17:06Z Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company Benjatanont, Sireethorn Tantuico, Dylan Data Analytics Transportation Machine Learning Driver dwell time is an important challenge the U.S trucking industry faces. High, unplanned dwell times are costly to all stakeholders in the industry as they result in detention costs, declining performance and decreased driver capacity. With the increasing demand for these services, it is important to maximize the driving time of drivers in the industry by minimizing dwell time to free up capacity and provide competitive wages. This project utilizes the data of a third-party logistics company with the goal to understand the factors that influence dwell time, and to construct the model to predict dwell time of a load. In the analysis, linear models, random forest, and gradient boosting methods were explored based on regression and classification approach. Ultimately, the random forest classification model with one-hour bins is the recommended model as it had the highest predictive performance while the one-hour bins was sufficient to meet the business need. Additionally, the analysis concludes that shipper facilities are the most significant driver of dwell time. Hence, understanding and integrating more granular observations on shipper practices within their facilities will allow a third-party logistics company to improve its driver fleet utilization and increase the predictive performance of their dwell time prediction model. 2020-07-24T18:09:35Z 2020-07-24T18:09:35Z 2020-07-24 Other https://hdl.handle.net/1721.1/126379 en_US application/pdf
spellingShingle Data Analytics
Transportation
Machine Learning
Benjatanont, Sireethorn
Tantuico, Dylan
Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title_full Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title_fullStr Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title_full_unstemmed Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title_short Application of Linear Models, Random Forest, and Gradient Boosting Methods to Identify Key Factors and Predict Truck Dwell Time for a Global 3PL Company
title_sort application of linear models random forest and gradient boosting methods to identify key factors and predict truck dwell time for a global 3pl company
topic Data Analytics
Transportation
Machine Learning
url https://hdl.handle.net/1721.1/126379
work_keys_str_mv AT benjatanontsireethorn applicationoflinearmodelsrandomforestandgradientboostingmethodstoidentifykeyfactorsandpredicttruckdwelltimeforaglobal3plcompany
AT tantuicodylan applicationoflinearmodelsrandomforestandgradientboostingmethodstoidentifykeyfactorsandpredicttruckdwelltimeforaglobal3plcompany