Code4ML: a large-scale dataset of annotated Machine Learning code

The use of program code as a data source is increasingly expanding among data scientists. The purpose of the usage varies from the semantic classification of code to the automatic generation of programs. However, the machine learning model application is somewhat limited without annotating the code...

Full description

Bibliographic Details
Main Authors: Anastasia Drozdova, Ekaterina Trofimova, Polina Guseva, Anna Scherbakova, Andrey Ustyuzhanin
Format: Article
Language:English
Published: PeerJ Inc. 2023-02-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1230.pdf