A dataset for pull-based development research

  title={A dataset for pull-based development research},
  author={Georgios Gousios and Andy Zaidman},
Pull requests form a new method for collaborating in distributed software development. To study the pull request distributed development model, we constructed a dataset of almost 900 projects and 350,000 pull requests, including some of the largest users of pull requests on Github. In this paper, we describe how the project selection was done, we analyze the selected features and present a machine learning tool set for the R statistics environment. 
Highly Cited
This paper has 28 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 16 extracted citations

A Case for Deep Learning in Mining Software Repositories by H . L .

D. Nijessen
View 6 Excerpts
Highly Influenced

Prevalence of Single-Fault Fixes and Its Impact on Fault Localization

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST) • 2017
View 2 Excerpts


Publications referenced by this paper.

Creating a shared understanding of testing culture on a social coding site

2013 35th International Conference on Software Engineering (ICSE) • 2013
View 4 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…