GitHub repository analysis & prediction

GitHub is a popular hosting service for software projects boasting over 35 million repositories. Many software projects today rely upon reusing existing Open Source projects in the form of a starting reference or as a package dependency. Bad software dependencies may impact a project in the long run...

Full description

Bibliographic Details
Main Author: Li, Shing To
Other Authors: Sourav Saha Bhowmick
Format: Final Year Project (FYP)
Language:English
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10356/66680
Description
Summary:GitHub is a popular hosting service for software projects boasting over 35 million repositories. Many software projects today rely upon reusing existing Open Source projects in the form of a starting reference or as a package dependency. Bad software dependencies may impact a project in the long run. This project aims to use data mining to uncover patterns and discover new knowledge on what makes a repository healthy. To apply the results of this finding, a web application that uses the results of this analysis has been built and provides prediction for a GitHub repository. This web application can be visited at http://gitvital.ddns.net while the server is online.