Complex Software Research Articles

In retrospective secondary data analysis studies, researchers often seek waiver of consent from institutional Review Boards (IRB) and minimize risk by utilizing complex software. Yet, little is known about the perspectives of IRB experts on these approaches. To facilitate effective communication about risk mitigation strategies using software, we conducted two studies with IRB experts to co-create appropriate language when describing a software to IRBs. We conducted structured focus groups with IRB experts to solicit ideas on questions regarding benefits, risks, and informational needs. Based on these results, we developed a template IRB application and template responses for a generic study using privacy-enhancing software. We then conducted a three-round Delphi study to refine the template IRB application and the template responses based on expert panel feedback. To facilitate participants' deliberation, we shared the revisions and a summary of participants' feedback during each Delphi round. 11 experts in two focus groups generated 13 ideas on risks, benefits, and informational needs. 17 experts participated in the Delphi study with 13 completing all rounds. Most agreed that privacy-enhancing software will minimize risk, but regardless all secondary data studies have an inherent risk of unexpected disclosures. The majority (84.6%) noted that subjects in retrospective secondary data studies experience no greater risks than the risks experienced in ordinary life in the modern digital society. Hence, all retrospective data-only studies with no contact with subjects would be minimal risk studies. First, we found fundamental disagreements in how some IRB experts view risks in secondary data research. Such disagreements are consequential because they can affect determination outcomes and might suggest IRBs at different institutions might come to different conclusions regarding similar study protocols. Second, the highest ranked risks and benefits of privacy-enhancing software in our study were societal rather than individual. The highest ranked benefits were facilitating more research and promoting responsible data governance practices. The highest ranked risks were risk of invalid results from systematic user error or erroneous algorithms. These societal considerations are typically more characteristic of public health ethics as opposed to the bioethical approach of research ethics, possibly reflecting the difficulty applying a bioethical approach (eg, informed consent) in secondary data studies. Finally, the development of privacy-enhancing technology for secondary data research depends on effective communication and collaboration between the privacy experts and technology developers. Privacy is a complex issue that requires a holistic approach that is best addressed through privacy-by-design principles. Privacy expert participation is important yet often neglected in this design process. This study suggests best practice strategies for engaging the privacy community through co-developing companion documents for software through participatory design to facilitate transparency and communication. In this case study, the final template IRB application and responses we released with the open-source software can be easily adapted by researchers to better communicate with their IRB when using the software. This can help increase responsible data governance practices when many software developers are not research ethics experts.

Read full abstract

Background: Continuous modifications, suboptimal software design practices, and stringent project deadlines contribute to the proliferation of code smells. Detecting and refactoring these code smells are pivotal to maintaining complex and essential software systems. Neglecting them may lead to future software defects, rendering systems challenging to maintain, and eventually obsolete. Supervised machine learning techniques have emerged as valuable tools for classifying code smells without needing expert knowledge or fixed threshold values. Further enhancement of classifier performance can be achieved through effective feature selection techniques and the optimization of hyperparameter values. Aim: Performance measures of multiple machine learning classifiers are improved by fine tuning its hyperparameters using various type of meta-heuristic algorithms including swarm intelligent, physics, math, and bio-based etc. Their performance measures are compared to find the best meta-heuristic algorithm in the context of code smell detection and its impact is evaluated based on statistical tests. Method: This study employs sixteen contemporary and robust meta-heuristic algorithms to optimize the hyperparameters of two machine learning algorithms: Support Vector Machine (SVM) and k-nearest Neighbors (k-NN). The No Free Lunch theorem underscores that the success of an optimization algorithm in one application may not necessarily extend to others. Consequently, a rigorous comparative analysis of these algorithms is undertaken to identify the best-fit solutions for code smell detection. A diverse range of optimization algorithms, encompassing Arithmetic, Jellyfish Search, Flow Direction, Student Psychology Based, Pathfinder, Sine Cosine, Jaya, Crow Search, Dragonfly, Krill Herd, Multi-Verse, Symbiotic Organisms Search, Flower Pollination, Teaching Learning Based, Gravitational Search, and Biogeography-Based Optimization, have been implemented. Results: In the case of optimized SVM, the highest attained accuracy, AUC, and F-measure values are 98.75%, 100%, and 98.57%, respectively. Remarkably, significant increases in accuracy and AUC, reaching 32.22% and 45.11% respectively, are observed. For k-NN, the best accuracy, AUC, and F-measure values are all perfect at 100%, with noteworthy hikes in accuracy and ROC-AUC values, amounting to 43.89% and 40.83%, respectively. Conclusion: Optimized SVM exhibits exceptional performance with the Sine Cosine Optimization algorithm, while k-NN attains its peak performance with the Flower Optimization algorithm. Statistical analysis underscores the substantial impact of employing meta-heuristic algorithms for optimizing machine learning classifiers, enhancing their performance significantly. Optimized SVM excels in detecting the God Class, while optimized k-NN is particularly effective in identifying the Data Class. This innovative fusion automates the tuning process and elevates classifier performance, simultaneously addressing multiple longstanding challenges.

Read full abstract

Complex Software Research Articles

Related Topics

Articles published on Complex Software

An empirical analysis of feature selection techniques for Software Defect Prediction

Edge IoT Prototyping Using Model-Driven Representations: A Use Case for Smart Agriculture.

A Comparative Study of Commit Representations for JIT Vulnerability Prediction

An OTA Upgrade Differential Compression Algorithm Based on Suffix Array Induced Sorting and BsDiff Methods

Case study on communicating with research ethics committees about minimizing risk through software: an application for record linkage in secondary data analysis.

Experimental studies of the control process of the working body of a single-bucket excavator

Discursive Modulation in Open Source Software: How Online Communities Shape Novelty and Complexity

Efficient Rollout of a Dynamic Optimization Algorithm

ЗАКОНОМЕРНОСТИ ПОКАЗАТЕЛЯ ТРАНСПОРТНОГО ЗАТОРА НА НЕКОТОРЫХ ПЕРЕСЕЧЕНИЯХ УЛИЧНО-ДОРОЖНОЙ СЕТИ

IIOT ПЕРЕДАЧА ДАННЫХ С БПЛА В ПРОМЫШЛЕННУЮ СРЕДУ АГРОКОМПЛЕКСОВ В РЕЖИМЕ РЕАЛЬНОГО ВРЕМЕНИ

Hardware and software complex for creating a digital passport of an athlete’smotor stereotype

A modern approach to sports selection of children

Assessing the reliability of the hardware and software complex of fault-tolerant control systems

РАЗРАБОТКА КОНСТРУКЦИИ И ОБОСНОВАНИЕ ПАРАМЕТРОВ РАБОТЫ ПАСТЕРИЗАТОРА С ИНДУКЦИОННЫМ НАГРЕВОМ

Building a Flexible and Resource-Light Monitoring Platform for a WLCG-Tier2

Signal modeling of means of tacit information acquisition using spline functions

Boosting and Comparing Performance of Machine Learning Classifiers with Meta-heuristic Techniques to Detect Code Smell

Optimization of the current spray angle of the nozzles of the adaptive distribution system of a single-support boom sprayer

The concept of architectural reliability of software for ensuring the functioning of request-free measuring stations

ПРАКТИЧЕСКАЯ РЕАЛИЗАЦИЯ ПРИНЦИПОВ ИНТЕЛЛЕКТУАЛЬНОГО УПРАВЛЕНИЯ ТРАНСПОРТНЫМИ ПОТОКАМИ В ГОРОДЕ БЕЛГОРОДЕ

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Complex Software Research Articles

Related Topics

Articles published on Complex Software

An empirical analysis of feature selection techniques for Software Defect Prediction

Edge IoT Prototyping Using Model-Driven Representations: A Use Case for Smart Agriculture.

A Comparative Study of Commit Representations for JIT Vulnerability Prediction

An OTA Upgrade Differential Compression Algorithm Based on Suffix Array Induced Sorting and BsDiff Methods

Case study on communicating with research ethics committees about minimizing risk through software: an application for record linkage in secondary data analysis.

Experimental studies of the control process of the working body of a single-bucket excavator

Discursive Modulation in Open Source Software: How Online Communities Shape Novelty and Complexity

Efficient Rollout of a Dynamic Optimization Algorithm

ЗАКОНОМЕРНОСТИ ПОКАЗАТЕЛЯ ТРАНСПОРТНОГО ЗАТОРА НА НЕКОТОРЫХ ПЕРЕСЕЧЕНИЯХ УЛИЧНО-ДОРОЖНОЙ СЕТИ

IIOT ПЕРЕДАЧА ДАННЫХ С БПЛА В ПРОМЫШЛЕННУЮ СРЕДУ АГРОКОМПЛЕКСОВ В РЕЖИМЕ РЕАЛЬНОГО ВРЕМЕНИ

Hardware and software complex for creating a digital passport of an athlete’smotor stereotype

A modern approach to sports selection of children

Assessing the reliability of the hardware and software complex of fault-tolerant control systems

РАЗРАБОТКА КОНСТРУКЦИИ И ОБОСНОВАНИЕ ПАРАМЕТРОВ РАБОТЫ ПАСТЕРИЗАТОРА С ИНДУКЦИОННЫМ НАГРЕВОМ

Building a Flexible and Resource-Light Monitoring Platform for a WLCG-Tier2

Signal modeling of means of tacit information acquisition using spline functions

Boosting and Comparing Performance of Machine Learning Classifiers with Meta-heuristic Techniques to Detect Code Smell

Optimization of the current spray angle of the nozzles of the adaptive distribution system of a single-support boom sprayer

The concept of architectural reliability of software for ensuring the functioning of request-free measuring stations

ПРАКТИЧЕСКАЯ РЕАЛИЗАЦИЯ ПРИНЦИПОВ ИНТЕЛЛЕКТУАЛЬНОГО УПРАВЛЕНИЯ ТРАНСПОРТНЫМИ ПОТОКАМИ В ГОРОДЕ БЕЛГОРОДЕ