Effects of action masking on deep reinforcement learning for inventory management

Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to...

Full description

Bibliographic Details
Main Author: Goh, Bryan Zheng Ting
Other Authors: Lee Bu Sung, Francis
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/166091
_version_ 1811684452753997824
author Goh, Bryan Zheng Ting
author2 Lee Bu Sung, Francis
author_facet Lee Bu Sung, Francis
Goh, Bryan Zheng Ting
author_sort Goh, Bryan Zheng Ting
collection NTU
description Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to better manage inventory and optimize it by lowering the inventory cost as much as possible. Due to recent advancement in reinforcement learning and the advancement of deep neural network, there has been rising interest in making use of Deep Reinforcement Learning to train an artificial agent that would be able to manage inventory and minimize inventory costs. Through this report, a solution for a single retailer, single item Inventory Management Environment with stochastic demand would be developed using Deep Q-Network (DQN). Moreover, even though there are recent works of using DQN in Inventory Management, not many have investigated the effects of action masking on this problem domain. Thus, this report will attempt to focus on investigating different methods of action masking and analyze their effects on the speed of convergence during the training phase and additional metric such as mean reward, fill rate and service level during the inference phase. Furthermore, this report will also analyze the effects of different demand distribution and whether that will affect the training of a DQN agent.
first_indexed 2024-10-01T04:28:52Z
format Final Year Project (FYP)
id ntu-10356/166091
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:28:52Z
publishDate 2023
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1660912023-04-21T15:37:16Z Effects of action masking on deep reinforcement learning for inventory management Goh, Bryan Zheng Ting Lee Bu Sung, Francis School of Computer Science and Engineering EBSLEE@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Industrial engineering::Supply chain Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to better manage inventory and optimize it by lowering the inventory cost as much as possible. Due to recent advancement in reinforcement learning and the advancement of deep neural network, there has been rising interest in making use of Deep Reinforcement Learning to train an artificial agent that would be able to manage inventory and minimize inventory costs. Through this report, a solution for a single retailer, single item Inventory Management Environment with stochastic demand would be developed using Deep Q-Network (DQN). Moreover, even though there are recent works of using DQN in Inventory Management, not many have investigated the effects of action masking on this problem domain. Thus, this report will attempt to focus on investigating different methods of action masking and analyze their effects on the speed of convergence during the training phase and additional metric such as mean reward, fill rate and service level during the inference phase. Furthermore, this report will also analyze the effects of different demand distribution and whether that will affect the training of a DQN agent. Bachelor of Engineering (Computer Science) 2023-04-21T05:24:15Z 2023-04-21T05:24:15Z 2023 Final Year Project (FYP) Goh, B. Z. T. (2023). Effects of action masking on deep reinforcement learning for inventory management. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166091 https://hdl.handle.net/10356/166091 en application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Industrial engineering::Supply chain
Goh, Bryan Zheng Ting
Effects of action masking on deep reinforcement learning for inventory management
title Effects of action masking on deep reinforcement learning for inventory management
title_full Effects of action masking on deep reinforcement learning for inventory management
title_fullStr Effects of action masking on deep reinforcement learning for inventory management
title_full_unstemmed Effects of action masking on deep reinforcement learning for inventory management
title_short Effects of action masking on deep reinforcement learning for inventory management
title_sort effects of action masking on deep reinforcement learning for inventory management
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Industrial engineering::Supply chain
url https://hdl.handle.net/10356/166091
work_keys_str_mv AT gohbryanzhengting effectsofactionmaskingondeepreinforcementlearningforinventorymanagement