WARDER: Towards effective spreadsheet defect detection by validity-based cell cluster refinements

Yicheng Huang,Chang Xu,Yanyan Jiang,Huiyan Wang,Da Li

doi:10.1016/j.jss.2020.110615

Abstract

Nowadays spreadsheets are very popular and being widely used. However, they can be prone to various defects and cause severe consequences when end users poorly maintain them. Our research communities have proposed various techniques for automated detection of spreadsheet defects, but they commonly fall short of effectiveness, either due to their limited scope or relying on strict patterns. In this article, we discuss and improve one state-of-the-art technique, CUSTODES, which exploits spreadsheet cell clustering and defect detection to extend its scope and make its detection patterns adaptive to varying spreadsheet styles. Still, CUSTODES can be prone to problematic clustering when accidentally involving irrelevant cells, leading to a largely reduced detection precision. Regarding this, we present WARDER to refine CUSTODES’s spreadsheet cell clustering based on three extensible validity-based properties. Experimental results show that WARDER could improve the precision by 19.1% on spreadsheet cell clustering, which contributed to a precision improvement of 23.3 ~ 24.3% for spreadsheet defect detection, as compared to CUSTODES (F-measure increased from 0.71 to 0.79 ~ 0.82). WARDER also exhibited satisfactory results on another practical large-scale spreadsheet corpus VEnron2, improving the defect detection precision by 10.7 ~ 21.2% over CUSTODES.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

WARDER: Towards effective spreadsheet defect detection by validity-based cell cluster refinements

Abstract

Talk to us

Similar Papers

More From: The Journal of Systems & Software

Lead the way for us

Journal: The Journal of Systems & Software	Publication Date: Apr 28, 2020
Citations: 5

Similar Papers

Detection of Fabric Defects by Auto-Regressive Spectral Analysis and Support Vector Data Description
H.-G Bu ... X.-B Huang
Textile Research Journal | VOL. 80
H.-G Bu, et. al.H.-G Bu ... X.-B Huang
30 Jul 2009
Textile Research Journal | VOL. 80

WARDER: Refining Cell Clustering for Effective Spreadsheet Defect Detection via Validity Properties
Da Li ... Huiyan Wang
-
Da Li, et. al.Da Li ... Huiyan Wang
01 Jul 2019
01 Jul 2019

A Method for Image Anomaly Detection Based on Distillation and Reconstruction.
Jiaxiang Luo ... Jianzhao Zhang
Sensors (Basel, Switzerland) | VOL. 23
Jiaxiang Luo, et. al.Jiaxiang Luo ... Jianzhao Zhang
20 Nov 2023
Sensors (Basel, Switzerland) | VOL. 23

A Systematic Review of Lithium Battery Defect Detection Techniques and Technologies
Tianyuan Lu ... Chengyu Jin
International Journal of Electric Power and Energy Studies | VOL. 2
Tianyuan Lu, et. al.Tianyuan Lu ... Chengyu Jin
25 Jun 2024
International Journal of Electric Power and Energy Studies | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

WARDER: Towards effective spreadsheet defect detection by validity-based cell cluster refinements

Abstract

Talk to us

Similar Papers

More From: The Journal of Systems & Software