Effects of action masking on deep reinforcement learning for inventory management
Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to...
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project (FYP) |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/166091 |
_version_ | 1811684452753997824 |
---|---|
author | Goh, Bryan Zheng Ting |
author2 | Lee Bu Sung, Francis |
author_facet | Lee Bu Sung, Francis Goh, Bryan Zheng Ting |
author_sort | Goh, Bryan Zheng Ting |
collection | NTU |
description | Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to better manage inventory and optimize it by lowering the inventory cost as much as possible. Due to recent advancement in reinforcement learning and the advancement of deep neural network, there has been rising interest in making use of Deep Reinforcement Learning to train an artificial agent that would be able to manage inventory and minimize inventory costs. Through this report, a solution for a single retailer, single item Inventory Management Environment with stochastic demand would be developed using Deep Q-Network (DQN). Moreover, even though there are recent works of using DQN in Inventory Management, not many have investigated the effects of action masking on this problem domain. Thus, this report will attempt to focus on investigating different methods of action masking and analyze their effects on the speed of convergence during the training phase and additional metric such as mean reward, fill rate and service level during the inference phase. Furthermore, this report will also analyze the effects of different demand distribution and whether that will affect the training of a DQN agent. |
first_indexed | 2024-10-01T04:28:52Z |
format | Final Year Project (FYP) |
id | ntu-10356/166091 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T04:28:52Z |
publishDate | 2023 |
publisher | Nanyang Technological University |
record_format | dspace |
spelling | ntu-10356/1660912023-04-21T15:37:16Z Effects of action masking on deep reinforcement learning for inventory management Goh, Bryan Zheng Ting Lee Bu Sung, Francis School of Computer Science and Engineering EBSLEE@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Industrial engineering::Supply chain Inventory Management has always been a crucial part of Supply Chain Management, and not managing it carefully would lead to unnecessary inventory costs such as lost sales and holding cost. Over the years, many researchers have investigated solutions and systems in the field of operations research to better manage inventory and optimize it by lowering the inventory cost as much as possible. Due to recent advancement in reinforcement learning and the advancement of deep neural network, there has been rising interest in making use of Deep Reinforcement Learning to train an artificial agent that would be able to manage inventory and minimize inventory costs. Through this report, a solution for a single retailer, single item Inventory Management Environment with stochastic demand would be developed using Deep Q-Network (DQN). Moreover, even though there are recent works of using DQN in Inventory Management, not many have investigated the effects of action masking on this problem domain. Thus, this report will attempt to focus on investigating different methods of action masking and analyze their effects on the speed of convergence during the training phase and additional metric such as mean reward, fill rate and service level during the inference phase. Furthermore, this report will also analyze the effects of different demand distribution and whether that will affect the training of a DQN agent. Bachelor of Engineering (Computer Science) 2023-04-21T05:24:15Z 2023-04-21T05:24:15Z 2023 Final Year Project (FYP) Goh, B. Z. T. (2023). Effects of action masking on deep reinforcement learning for inventory management. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166091 https://hdl.handle.net/10356/166091 en application/pdf Nanyang Technological University |
spellingShingle | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Industrial engineering::Supply chain Goh, Bryan Zheng Ting Effects of action masking on deep reinforcement learning for inventory management |
title | Effects of action masking on deep reinforcement learning for inventory management |
title_full | Effects of action masking on deep reinforcement learning for inventory management |
title_fullStr | Effects of action masking on deep reinforcement learning for inventory management |
title_full_unstemmed | Effects of action masking on deep reinforcement learning for inventory management |
title_short | Effects of action masking on deep reinforcement learning for inventory management |
title_sort | effects of action masking on deep reinforcement learning for inventory management |
topic | Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Industrial engineering::Supply chain |
url | https://hdl.handle.net/10356/166091 |
work_keys_str_mv | AT gohbryanzhengting effectsofactionmaskingondeepreinforcementlearningforinventorymanagement |