Bug Analysis in Jupyter Notebook Projects: An Empirical Study

Taijara Loiola De Santana,Iftekhar Ahmed,Paulo Anselmo Da Mota Silveira Neto,Eduardo Santana De Almeida

doi:10.1145/3641539

Abstract

Computational notebooks, such as Jupyter, have been widely adopted by data scientists to write code for analyzing and visualizing data. Despite their growing adoption and popularity, few studies were found to understand Jupyter development challenges from the practitioners’ point of view. This paper presents a systematic study of bugs and challenges that Jupyter practitioners face through a large-scale empirical investigation. We mined 14,740 commits from 105 GitHub open-source projects with Jupyter notebook code. Next, we analyzed 30,416 Stack Overflow posts, which gave us insights into bugs that practitioners face when developing Jupyter notebook projects. Next, we conducted nineteen interviews with data scientists to uncover more details about Jupyter bugs and to gain insight into Jupyter developers’ challenges. Finally, to validate the study results and proposed taxonomy, we conducted a survey with 91 data scientists. We also highlight bug categories, their root causes, and the challenges that Jupyter practitioners face.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bug Analysis in Jupyter Notebook Projects: An Empirical Study

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology

Lead the way for us

Journal: ACM Transactions on Software Engineering and Methodology	Publication Date: Jan 22, 2024
Citations: 1

Similar Papers

Representation Learning for Stack Overflow Posts: How Far Are We?
Junda He ... Ivana Clairine Irsan
ACM Transactions on Software Engineering and Methodology | VOL. 33
Junda He, et. al.Junda He ... Ivana Clairine Irsan
15 Mar 2024
ACM Transactions on Software Engineering and Methodology | VOL. 33

Case Study Comparison of Computational Notebook Platforms for Interactive Visual Analytics
Han Liu ... Chris North
-
Han Liu, et. al.Han Liu ... Chris North
01 Oct 2022
01 Oct 2022

EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In Situ Code Search and Recommendation
Xingjun Li ... Chengnian Sun
ACM Transactions on Interactive Intelligent Systems | VOL. 13
Xingjun Li, et. al.Xingjun Li ... Chengnian Sun
09 Mar 2023
ACM Transactions on Interactive Intelligent Systems | VOL. 13

Stack Overflow: A code laundering platform?
Le An ... Giuliano Antoniol
-
Le An, et. al.Le An ... Giuliano Antoniol
01 Feb 2017
01 Feb 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bug Analysis in Jupyter Notebook Projects: An Empirical Study

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Software Engineering and Methodology