Reinforcement Learning in the Problem of Synthesis of Majority Schemes

The article presents an approach to the synthesis of combinational-logic circuits using artificial neural networks (ANNs). The presented method is focused on the use of a perspective basis using the majority function (a Boolean function of three arguments that takes the value "true" if at...

Full description

Bibliographic Details
Main Authors: Sergey Gurov, Dmitry Zolotarev, Alexander Samburskiy
Format: Article
Language:Russian
Published: The Fund for Promotion of Internet media, IT education, human development «League Internet Media» 2021-06-01
Series:Современные информационные технологии и IT-образование
Subjects:
Online Access:http://sitito.cs.msu.ru/index.php/SITITO/article/view/740
_version_ 1818355058511708160
author Sergey Gurov
Dmitry Zolotarev
Alexander Samburskiy
author_facet Sergey Gurov
Dmitry Zolotarev
Alexander Samburskiy
author_sort Sergey Gurov
collection DOAJ
description The article presents an approach to the synthesis of combinational-logic circuits using artificial neural networks (ANNs). The presented method is focused on the use of a perspective basis using the majority function (a Boolean function of three arguments that takes the value "true" if at least two of its inputs are true). This choice is based on emerging nanotechnologies, where the majority element is most easily represented. The class of applied ANNs is deep networks with reinforcement. Such networks have been actively studied and applied in recent years. There are examples of their effective use for automatic logic circuits optimization. The original synthesis method proposed in the article with the simplification of circuits that implement the Shannon expansion in all variables of the corresponding function of the logic algebra (FLA). On large schemes, it becomes essential to use some simple but effective techniques for training deep ANN agents with reinforcement. This allows you to distribute calculations into several independent subtasks, each of which is explored and makes performing by agents quicker and easier. Two reinforcement learning algorithms for simplifying schemas are described. They provide a solution to the Exploration-Exploitation conflict, which is the contradiction between exploring the environment to find the optimal episode and using information about the episode considered optimal at the current time. The dependences of the parameters of the synthesized circuits on the number n = 3, ..., 10 of FAL variables and the number of network training episodes are presented.
first_indexed 2024-12-13T19:35:17Z
format Article
id doaj.art-5974178212e14509a16aabf3875b1e61
institution Directory Open Access Journal
issn 2411-1473
language Russian
last_indexed 2024-12-13T19:35:17Z
publishDate 2021-06-01
publisher The Fund for Promotion of Internet media, IT education, human development «League Internet Media»
record_format Article
series Современные информационные технологии и IT-образование
spelling doaj.art-5974178212e14509a16aabf3875b1e612022-12-21T23:33:49ZrusThe Fund for Promotion of Internet media, IT education, human development «League Internet Media»Современные информационные технологии и IT-образование2411-14732021-06-0117229530710.25559/SITITO.17.202102.295-307Reinforcement Learning in the Problem of Synthesis of Majority SchemesSergey Gurov0https://orcid.org/0000-0001-5486-1357Dmitry Zolotarev1https://orcid.org/0000-0002-5349-2147Alexander Samburskiy2https://orcid.org/0000-0003-1188-0229Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Moscow, Russia Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Moscow, Russia Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Moscow, Russia The article presents an approach to the synthesis of combinational-logic circuits using artificial neural networks (ANNs). The presented method is focused on the use of a perspective basis using the majority function (a Boolean function of three arguments that takes the value "true" if at least two of its inputs are true). This choice is based on emerging nanotechnologies, where the majority element is most easily represented. The class of applied ANNs is deep networks with reinforcement. Such networks have been actively studied and applied in recent years. There are examples of their effective use for automatic logic circuits optimization. The original synthesis method proposed in the article with the simplification of circuits that implement the Shannon expansion in all variables of the corresponding function of the logic algebra (FLA). On large schemes, it becomes essential to use some simple but effective techniques for training deep ANN agents with reinforcement. This allows you to distribute calculations into several independent subtasks, each of which is explored and makes performing by agents quicker and easier. Two reinforcement learning algorithms for simplifying schemas are described. They provide a solution to the Exploration-Exploitation conflict, which is the contradiction between exploring the environment to find the optimal episode and using information about the episode considered optimal at the current time. The dependences of the parameters of the synthesized circuits on the number n = 3, ..., 10 of FAL variables and the number of network training episodes are presented.http://sitito.cs.msu.ru/index.php/SITITO/article/view/740logic synthesisartificial neural networksreinforcement learning
spellingShingle Sergey Gurov
Dmitry Zolotarev
Alexander Samburskiy
Reinforcement Learning in the Problem of Synthesis of Majority Schemes
Современные информационные технологии и IT-образование
logic synthesis
artificial neural networks
reinforcement learning
title Reinforcement Learning in the Problem of Synthesis of Majority Schemes
title_full Reinforcement Learning in the Problem of Synthesis of Majority Schemes
title_fullStr Reinforcement Learning in the Problem of Synthesis of Majority Schemes
title_full_unstemmed Reinforcement Learning in the Problem of Synthesis of Majority Schemes
title_short Reinforcement Learning in the Problem of Synthesis of Majority Schemes
title_sort reinforcement learning in the problem of synthesis of majority schemes
topic logic synthesis
artificial neural networks
reinforcement learning
url http://sitito.cs.msu.ru/index.php/SITITO/article/view/740
work_keys_str_mv AT sergeygurov reinforcementlearningintheproblemofsynthesisofmajorityschemes
AT dmitryzolotarev reinforcementlearningintheproblemofsynthesisofmajorityschemes
AT alexandersamburskiy reinforcementlearningintheproblemofsynthesisofmajorityschemes