Machine learning code snippets semantic classification

Program code has recently become a valuable active data source for training various data science models, from code classification to controlled code synthesis. Annotating code snippets plays an essential role in such tasks. This article presents a novel approach that leverages CodeBERT, a powerful tr...

Bibliographic Details
Main Authors: Valeriy Berezovskiy, Anastasia Gorodilova, Ekaterina Trofimova, Andrey Ustyuzhanin
Format: Article
Language: English
Published: PeerJ Inc. 2023-11-01
Series: PeerJ Computer Science
Online Access: https://peerj.com/articles/cs-1654.pdf