Data cleaning and refinement for code related AI systems

This final-year project will cover data cleaning and refinement to improve the quality of data for various natural language processing (NLP) projects, such as code clone detection and code-to-text conversion. The project will focus on using the PyTorch library to train a masked language model to det...

Bibliographic Details
Main Author: Tay, Arron Hong Yi
Other Authors: Liu Yang
Format: Final Year Project (FYP)
Language: English
Published: Nanyang Technological University 2023
Subjects:
Online Access: https://hdl.handle.net/10356/166197