Evaluation of BERT and ChatGPT models in inference, paraphrase and similarity tasks

The purpose of this paper is to study the application of ChatGPT and BERT models in the field of mechanical engineering. In the context of machine learning, the ChatGPT and BERT models can be applied to various natural language processing tasks such as analyzing technical documentation and building...

Bibliographic Details
Main Authors:	Kim Radmir, Kotsenko Anton, Andreev Aleksandr, Bazanova Anastasiia, Aladin Dmitry, Todua David, Marushchenko Aleksei, Varlamov Oleg
Format:	Article
Language:	English
Published:	EDP Sciences 2024-01-01
Series:	E3S Web of Conferences
Online Access:	https://www.e3s-conferences.org/articles/e3sconf/pdf/2024/45/e3sconf_tt21c-2024_03016.pdf

author	Kim Radmir, Kotsenko Anton, Andreev Aleksandr, Bazanova Anastasiia, Aladin Dmitry, Todua David, Marushchenko Aleksei, Varlamov Oleg
author_sort	Kim Radmir
collection	DOAJ
description	This paper studies the application of ChatGPT and BERT models in the field of mechanical engineering. In the context of machine learning, the ChatGPT and BERT models can be applied to various natural language processing tasks such as analyzing technical documentation and building instructions according to a particular version of the documentation, diagnosing malfunctions, or customer service. The paper discusses the fundamental features and origins of the BERT and ChatGPT models, investigates their main architectural features, and identifies their main advantages and disadvantages. The paper analyzes and selects various natural language processing tasks to test the models’ ability to understand natural language in the context of machine learning. The selected criterion tasks are divided into semantic groups to identify the capabilities of the ChatGPT and BERT models in each of three areas: logical inference tasks, paraphrasing tasks, and text similarity tasks. The paper also discusses the concept of operational design, which involves developing inputs that guide the models to produce desired outputs. The paper quantitatively analyzes and compares the performance of BERT-based and ChatGPT-based models. The reasons for the ChatGPT model's bottlenecks in natural language understanding tasks are identified and investigated. Possible improvements to the ChatGPT model's performance using the mivar approach are considered.
first_indexed	2024-04-24T08:07:06Z
format	Article
id	doaj.art-6cc64802f43147bbb85dc83638f1feaf
institution	Directory Open Access Journal
issn	2267-1242
language	English
last_indexed	2024-04-24T08:07:06Z
publishDate	2024-01-01
publisher	EDP Sciences
record_format	Article
series	E3S Web of Conferences
spelling	doaj.art-6cc64802f43147bbb85dc83638f1feaf; 2024-04-17T09:12:18Z; eng; EDP Sciences; E3S Web of Conferences; 2267-1242; 2024-01-01; 515; 03016; 10.1051/e3sconf/202451503016; e3sconf_tt21c-2024_03016; Evaluation of BERT and ChatGPT models in inference, paraphrase and similarity tasks; Kim Radmir, Kotsenko Anton, Andreev Aleksandr, Bazanova Anastasiia, Aladin Dmitry, Todua David, Marushchenko Aleksei, Varlamov Oleg (all Bauman Moscow State Technical University); https://www.e3s-conferences.org/articles/e3sconf/pdf/2024/45/e3sconf_tt21c-2024_03016.pdf
title	Evaluation of BERT and ChatGPT models in inference, paraphrase and similarity tasks
url	https://www.e3s-conferences.org/articles/e3sconf/pdf/2024/45/e3sconf_tt21c-2024_03016.pdf

Similar Items