Automated source code summarization via transformer

Source code summarization is a comprehensible description of a program’s functionality. The code summarization assists developers to understand large portions of source code, thus reducing the time taken to comprehend a program’s capabilities. To automate the code summarization, programs have used R...

Full description

Bibliographic Details
Main Author: Viswen Kumar Mariammalle
Other Authors: Liu Yang
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/153188
_version_ 1826121403468873728
author Viswen Kumar Mariammalle
author2 Liu Yang
author_facet Liu Yang
Viswen Kumar Mariammalle
author_sort Viswen Kumar Mariammalle
collection NTU
description Source code summarization is a comprehensible description of a program’s functionality. The code summarization assists developers to understand large portions of source code, thus reducing the time taken to comprehend a program’s capabilities. To automate the code summarization, programs have used RNN-based neural architecture to create neural network models for this natural language translation. However, the RNN-based neural architecture has two particular limitations which are its disability to process the non-sequential structure of the source codes and missing out on the long-term relationships between code tokens. My proposed approach of using Transformer neural architecture is able to overcome these limitations. Compared against the RNN-based neural network models, the Transformer network model has shown significantly better experimental results of BLEU 1, 2, 3 and 4 scores, ranging between three to seven scores higher, METEOR score of three higher and ROUGE-L score of one higher.
first_indexed 2024-10-01T05:32:04Z
format Final Year Project (FYP)
id ntu-10356/153188
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:32:04Z
publishDate 2021
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1531882021-11-16T05:09:40Z Automated source code summarization via transformer Viswen Kumar Mariammalle Liu Yang School of Computer Science and Engineering yangliu@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing Source code summarization is a comprehensible description of a program’s functionality. The code summarization assists developers to understand large portions of source code, thus reducing the time taken to comprehend a program’s capabilities. To automate the code summarization, programs have used RNN-based neural architecture to create neural network models for this natural language translation. However, the RNN-based neural architecture has two particular limitations which are its disability to process the non-sequential structure of the source codes and missing out on the long-term relationships between code tokens. My proposed approach of using Transformer neural architecture is able to overcome these limitations. Compared against the RNN-based neural network models, the Transformer network model has shown significantly better experimental results of BLEU 1, 2, 3 and 4 scores, ranging between three to seven scores higher, METEOR score of three higher and ROUGE-L score of one higher. Bachelor of Engineering (Computer Science) 2021-11-16T01:11:53Z 2021-11-16T01:11:53Z 2021 Final Year Project (FYP) Viswen Kumar Mariammalle (2021). Automated source code summarization via transformer. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153188 https://hdl.handle.net/10356/153188 en SCSE20-0713 application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Viswen Kumar Mariammalle
Automated source code summarization via transformer
title Automated source code summarization via transformer
title_full Automated source code summarization via transformer
title_fullStr Automated source code summarization via transformer
title_full_unstemmed Automated source code summarization via transformer
title_short Automated source code summarization via transformer
title_sort automated source code summarization via transformer
topic Engineering::Computer science and engineering::Computing methodologies::Document and text processing
url https://hdl.handle.net/10356/153188
work_keys_str_mv AT viswenkumarmariammalle automatedsourcecodesummarizationviatransformer