Abstract

In experimental studies on query languages, subjects are required to write queries using different query languages. User query performance is usually measured by query accuracy. There is no clearly defined objective method of applying findings to other queries. This study examines the suitability of using a software metric based on lines of code to estimate user query accuracy. Lines of code have been measured in various ways, such as physical source code lines, logical source code lines or compiled bytes. A method of counting lines of code for database queries is proposed and applied to two query languages. The new method counts Boolean conditions as well as other statements. The relationship between lines of code and user query accuracy was examined with regression models. The results show that lines of code can explain a high percentage of the variance in accuracy, with R2>0.8 for the standard relational model query language SQL, and R2>0.9 for the entity relationship model query language KQL. The common assumption that more lines of code will lead to lower accuracy is only partly validated. The findings show a nonlinear relationship, with a possible recovery in accuracy for queries with many lines of code. The results indicate that lines of code can be usefully applied in the study of query languages.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call