Improving the pull requests review process using learning-to-rank algorithms

Abstract

Collaborative software development platforms (such as GitHub and GitLab) have become increasingly popular as they have attracted thousands of external contributors to contribute to open source projects. The external contributors may submit their contributions via pull requests, which must be reviewed before being integrated into the central repository. During the review process, reviewers provide feedback to contributors, conduct tests and request further modifications before finally accepting or rejecting the contributions. The role of reviewers is key to maintain the effective review process of the project. However, the number of decisions that reviewers can make is far superseded by the increasing number of pull requests submissions. To help reviewers to perform more decisions on pull requests within their limited working time, we propose a learning-to-rank (LtR) approach to recommend pull requests that can be quickly reviewed by reviewers. Different from a binary model for predicting the decisions of pull requests, our ranking approach complements the existing list of pull requests based on their likelihood of being quickly merged or rejected. We use 18 metrics to build LtR models and we use six different LtR algorithms, such as ListNet, RankNet, MART and random forest. We conduct empirical studies on 74 Java projects to compare the performances of the six LtR algorithms. We compare the best performing algorithm against two baselines obtained from previous research regarding pull requests prioritization: the first-in-and-first-out (FIFO) baseline and the small-size-first baseline. We then conduct a survey with GitHub reviewers to understand the perception of code reviewers regarding the usefulness of our approach. We observe that: (1) The random forest LtR algorithm outperforms other five well adapted LtR algorithms to rank quickly merged pull requests. (2) The random forest LtR algorithm performs better than both the FIFO and the small-size-first baselines, which means our LtR approach can help reviewers make more decisions and improve their productivity. (3) The contributor’s social connections and contributor’s experience are the most influential metrics to rank pull requests that can be quickly merged. (4) The GitHub reviewers that participated in our survey acknowledge that our approach complements existing prioritization baselines to help them to prioritize and to review more pull requests.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving the pull requests review process using learning-to-rank algorithms

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering

Lead the way for us

Journal: Empirical Software Engineering	Publication Date: Mar 1, 2019
Citations: 33

Similar Papers

Query-dependent learning to rank for cross-lingual information retrieval
Elham Ghanbari ... Azadeh Shakery
Knowledge and Information Systems | VOL. 59
Elham Ghanbari, et. al.Elham Ghanbari ... Azadeh Shakery
04 Jul 2018
Knowledge and Information Systems | VOL. 59

An empirical comparison of random forest-based and other learning-to-rank algorithms
Muhammad Ibrahim
Pattern Analysis and Applications | VOL. 23
Muhammad IbrahimMuhammad Ibrahim
28 Oct 2019
Pattern Analysis and Applications | VOL. 23

Research on the Classification of High Dimensional Imbalanced Data based on the Optimization of Random Forest Algorithm
Ma Xiaojuan
-
Ma XiaojuanMa Xiaojuan
25 Aug 2018
25 Aug 2018

Comparing Pointwise and Listwise Objective Functions for Random-Forest-Based Learning-to-Rank
Muhammad Ibrahim ... Mark Carman
ACM Transactions on Information Systems | VOL. 34
Muhammad Ibrahim, et. al.Muhammad Ibrahim ... Mark Carman
17 Aug 2016
ACM Transactions on Information Systems | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving the pull requests review process using learning-to-rank algorithms

Abstract

Talk to us

Similar Papers

More From: Empirical Software Engineering