Reinforcement Learning in the Problem of Synthesis of Majority Schemes
The article presents an approach to the synthesis of combinational logic circuits using artificial neural networks (ANNs). The method targets a promising basis built on the majority function (a Boolean function of three arguments that takes the value "true" if at...
Main Authors: | Sergey Gurov, Dmitry Zolotarev, Alexander Samburskiy |
---|---|
Format: | Article |
Language: | Russian |
Published: | The Fund for Promotion of Internet media, IT education, human development «League Internet Media», 2021-06-01 |
Series: | Современные информационные технологии и IT-образование |
Subjects: | logic synthesis, artificial neural networks, reinforcement learning |
Online Access: | http://sitito.cs.msu.ru/index.php/SITITO/article/view/740 |
_version_ | 1818355058511708160 |
---|---|
author | Sergey Gurov, Dmitry Zolotarev, Alexander Samburskiy |
author_sort | Sergey Gurov |
collection | DOAJ |
description | The article presents an approach to the synthesis of combinational logic circuits using artificial neural networks (ANNs). The method targets a promising basis built on the majority function (a Boolean function of three arguments that takes the value "true" if at least two of its inputs are true). This choice is motivated by emerging nanotechnologies, in which the majority element is the easiest to implement. The ANNs used are deep reinforcement learning networks. Such networks have been actively studied and applied in recent years, and there are examples of their effective use for automatic optimization of logic circuits.
The article proposes an original synthesis method that simplifies circuits implementing the Shannon expansion in all variables of the corresponding function of the algebra of logic (FAL). On large circuits, it becomes essential to use some simple but effective techniques for training deep reinforcement learning agents. These techniques split the computation into several independent subtasks, each of which is explored by agents more quickly and easily.
Two reinforcement learning algorithms for simplifying circuits are described. They address the exploration-exploitation conflict, that is, the contradiction between exploring the environment to find the optimal episode and using information about the episode currently considered optimal. The dependence of the parameters of the synthesized circuits on the number of FAL variables, n = 3, ..., 10, and on the number of network training episodes is presented. |
first_indexed | 2024-12-13T19:35:17Z |
format | Article |
id | doaj.art-5974178212e14509a16aabf3875b1e61 |
institution | Directory Open Access Journal |
issn | 2411-1473 |
language | Russian |
last_indexed | 2024-12-13T19:35:17Z |
publishDate | 2021-06-01 |
publisher | The Fund for Promotion of Internet media, IT education, human development «League Internet Media» |
record_format | Article |
series | Современные информационные технологии и IT-образование |
spelling | doaj.art-5974178212e14509a16aabf3875b1e61; 2022-12-21T23:33:49Z; rus; The Fund for Promotion of Internet media, IT education, human development «League Internet Media»; Современные информационные технологии и IT-образование; ISSN 2411-1473; 2021-06-01; vol. 17, no. 2, pp. 295-307; DOI 10.25559/SITITO.17.202102.295-307; Reinforcement Learning in the Problem of Synthesis of Majority Schemes; Sergey Gurov (https://orcid.org/0000-0001-5486-1357), Dmitry Zolotarev (https://orcid.org/0000-0002-5349-2147), Alexander Samburskiy (https://orcid.org/0000-0003-1188-0229), all at the Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, Moscow, Russia; http://sitito.cs.msu.ru/index.php/SITITO/article/view/740; logic synthesis; artificial neural networks; reinforcement learning |
title | Reinforcement Learning in the Problem of Synthesis of Majority Schemes |
topic | logic synthesis, artificial neural networks, reinforcement learning |
url | http://sitito.cs.msu.ru/index.php/SITITO/article/view/740 |
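
The description above defines the majority function and refers to circuits that implement the Shannon expansion in all variables. As a minimal illustrative sketch (not the authors' code), the Python below builds such an unsimplified circuit using only majority gates, constants, and negation. The names `maj`, `and_`, `or_`, and `shannon_majority` are hypothetical, and the reinforcement learning simplification step described in the record is not shown. It relies on two standard identities: AND(a, b) = MAJ(a, b, 0) and OR(a, b) = MAJ(a, b, 1).

```python
from itertools import product


def maj(a, b, c):
    """Majority of three Boolean inputs: true when at least two are true."""
    return (a and b) or (a and c) or (b and c)


def and_(a, b):
    """AND expressed through the majority gate with a constant 0 input."""
    return maj(a, b, False)


def or_(a, b):
    """OR expressed through the majority gate with a constant 1 input."""
    return maj(a, b, True)


def shannon_majority(f, n):
    """Return an evaluator for the n-variable Boolean function f.

    The evaluator is the full Shannon expansion
        f = x_i * f|x_i=1  OR  (not x_i) * f|x_i=0
    applied to every variable in turn, with AND/OR realized by maj.
    This is the unsimplified starting circuit (assumption: illustrative only).
    """
    def build(fixed):
        if len(fixed) == n:
            value = bool(f(*fixed))      # constant leaf of the expansion
            return lambda xs: value
        i = len(fixed)                   # index of the variable expanded here
        lo = build(fixed + (False,))     # cofactor with x_i = 0
        hi = build(fixed + (True,))      # cofactor with x_i = 1
        return lambda xs: or_(and_(xs[i], hi(xs)), and_(not xs[i], lo(xs)))

    return build(())


if __name__ == "__main__":
    # Example: three-input parity, checked against its majority-based expansion.
    parity = lambda a, b, c: a ^ b ^ c
    circuit = shannon_majority(parity, 3)
    for xs in product([False, True], repeat=3):
        assert circuit(xs) == parity(*xs)
```

The expansion uses a number of gates that grows exponentially with the number of variables, which is why a subsequent simplification step, such as the one the article describes, matters.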