Existing Software Libraries Research Articles

Software engineers construct modern-day software applications by building on existing software libraries and components that they do not necessarily author themselves. Thus, contemporary software applications rely heavily on existing standard and third-party libraries for their execution and behavior. As such, effective runtime analysis of such a software application’s behavior faces new challenges. To perform dynamic analysis of a software application, all transitively dependent external libraries must also be monitored and analyzed at each layer of the software application’s call stack. However, monitoring and analyzing large and often numerous external libraries may prove to be prohibitively expensive. Moreover, an overabundance of library-level analyses may obscure the details of the actual software application’s dynamic behavior. In other words, the extensive use of existing libraries by a software application renders the results of its dynamic analysis both expensive to compute and difficult to understand. We model software component behavior as dynamically observed data and control dependencies between inputs and outputs of a software component. Such data and control dependencies are monitored at a fine-grained instruction level and are collected as dynamic execution traces for software runs. As an approach to address the complexities and expenses associated with analyzing dynamically observable behavior of software components, we summarize and reuse the data and control dependencies between the inputs and outputs of software components. Dynamically monitored data and control dependencies between the inputs and outputs of software components are, upon summarization, called dynamic dependence summaries. Software components equipped with dynamic dependence summaries no longer require exhaustive runtime analysis.
Nonetheless, the reuse of dependence summaries necessitates the abstraction of any concrete runtime information enclosed within the summary, thus potentially causing a loss in the information modeled by the dependence summary. Therefore, the efficiency gains of dynamic analyses that use such summarization may come at the cost of accuracy. As such, we evaluate the potential accuracy loss and the potential performance gain with the use of dynamic dependence summaries. Our results show, on average, a 13× speedup with the use of dynamic dependence summaries, with an accuracy of 90% in a real-world software engineering task.
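
The summarization idea described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the trace format (a list of `(defined, used)` pairs), the function names `summarize` and `merge`, and the sample runs are all assumptions made for illustration. The sketch collapses an instruction-level trace of one component into an output-to-inputs map, then merges the maps from several runs into one reusable summary.

```python
from collections import defaultdict


def summarize(trace, inputs, outputs):
    """Collapse one instruction-level trace into an output -> inputs map.

    trace is a hypothetical format: a list of (defined, used) pairs in
    execution order, where `used` lists the values the instruction read.
    """
    deps = {name: {name} for name in inputs}  # each input depends on itself
    for defined, used in trace:
        reached = set()
        for var in used:
            reached |= deps.get(var, set())  # propagate transitive dependencies
        deps[defined] = reached
    return {out: deps.get(out, set()) for out in outputs}


def merge(summaries):
    """Union per-run summaries into one reusable, over-approximate summary."""
    merged = defaultdict(set)
    for summary in summaries:
        for out, ins in summary.items():
            merged[out] |= ins
    return dict(merged)


# Two hypothetical runs of the same component with inputs a, b, c and output r.
run1 = [("t", ["a"]), ("r", ["t", "b"])]  # r computed from a (via t) and b
run2 = [("r", ["c"])]                     # a different path: r computed from c

s1 = summarize(run1, ["a", "b", "c"], ["r"])
s2 = summarize(run2, ["a", "b", "c"], ["r"])
reusable = merge([s1, s2])
print(s1, s2, reusable)
```

The merged summary reports that `r` may depend on `a`, `b`, and `c`, although no single run exercised all three. This is the kind of accuracy loss the abstract attributes to abstracting away concrete runtime information, traded for not having to trace inside the component again.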

The goal of the Partial Metrics Project is the automatic acquisition of planning knowledge from target code modules in a program library. In the current prototype, the system is given a target code module written in Ada as input, and the result is a sequence of generalized transformations that can be used to design a class of related modules. This is accomplished by embedding techniques from Artificial Intelligence into the traditional structure of a compiler. The compiler performs compilation in reverse, starting with detailed code and producing an abstract description of it. The principal task facing the compiler is to find a decomposition of the target code into a collection of syntactic components that are nearly decomposable. Here, nearly decomposable means that each code segment is nearly independent syntactically from the others. The most independent segments are then the target of the code generalization process. This process can be described as a form of chunking and is implemented here in terms of explanation-based learning. The problem of producing nearly decomposable code components becomes difficult when the target code module is not well structured. The task facing users of the system is to identify, from a library of modules, the well-structured code modules that are suitable for input to the system. In this paper we describe the use of inductive learning techniques, namely variations on Quinlan's ID3 system, that are capable of producing a decision tree that can be used to conceptually distinguish between well-structured and poorly structured code. In order to accomplish that task, a set of high-level concepts used by software engineers to characterize structurally understandable code was identified. Next, each of these concepts was operationalized in terms of code complexity metrics that can be easily calculated during the compilation process.
These metrics are related to various aspects of the program structure including its coupling, cohesion, data structure, control structure, and documentation. Each candidate module was then described in terms of a collection of such metrics. Using a training set of positive and negative examples of well-structured modules, each described in terms of the appointed metrics, a decision tree was produced that was used to recognize other well-structured modules in terms of their metric properties. This approach was applied to modules from existing software libraries in a variety of domains such as database, editor, graphic, window, data processing, FFT and computer vision software. The results achieved by the system were then benchmarked against the performance of experienced programmers in terms of recognizing well-structured code. In a test case involving 120 modules, the system was able to discriminate between poorly structured and well-structured code 99% of the time, as compared to an 80% average for the 52 programmers sampled. The results suggest that such an inductive system can serve as a practical mechanism for effectively identifying reusable code modules in terms of their structural properties.
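
The core step of ID3, choosing the attribute with the highest information gain as the next split, can be sketched in Python as follows. This is a simplified illustration, not the system described in the abstract: the discretized metric values, the four sample modules, and their labels are hypothetical, and the abstract's "variations on ID3" may differ in details not stated here.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Reduction in label entropy obtained by splitting on one attribute."""
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attribute], []).append(label)
    remainder = sum(len(part) / len(labels) * entropy(part)
                    for part in partitions.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels):
    """ID3 picks the attribute with the largest information gain as the split."""
    return max(rows[0].keys(), key=lambda a: information_gain(rows, labels, a))


# Hypothetical modules described by two discretized metrics.
rows = [
    {"coupling": "low", "documentation": "good"},
    {"coupling": "low", "documentation": "poor"},
    {"coupling": "high", "documentation": "good"},
    {"coupling": "high", "documentation": "poor"},
]
labels = ["well", "well", "poor", "poor"]  # well-structured vs. poorly structured

print(best_attribute(rows, labels))
```

In this toy data, `coupling` separates the two classes perfectly while `documentation` carries no information, so `coupling` becomes the root of the tree. A full ID3 implementation would then recurse on each partition until the labels in every leaf agree.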

Articles published on Existing Software Libraries

API-Driven Program Synthesis for Testing Static Typing Implementations

Gonomics: uniting high performance and readability for genomics with Go.

VALIDATION OPTIMIZATION ENVIRONMENT FOR IMPROVED SELECTION OF SOFTWARE COMPONENTS: CONCEPTUAL MODELING AND ARCHITECTURE

UncertainSCI: Uncertainty quantification for computational models in biomedicine and bioengineering

GinJinn2: Object detection and segmentation for ecology and evolution

Open-Source Coprocessor for Integer Multiple Precision Arithmetic

Implementation of Graphic Plugin Loading Platform Based on Python

Cryptography for #MeToo

Dynamic Dependence Summaries

Augmented reality implementation methods in mainstream applications

Open source libraries and frameworks for mass spectrometry based proteomics: A developer's perspective

Comparing and evaluating computer graphics and visualization software

Improving numerical software

QCS: A system for querying, clustering and summarizing documents

APECS – the Atacama pathfinder experiment control system

Extending object database interfaces with fuzziness through aspect-oriented design

Introducing fuzziness in object models and database interfaces through aspects

Interfacing Software Libraries from Nondeterministic Prototypes

Reusability of mathematical software: a contribution

IDENTIFYING REUSABLE SOFTWARE COMPONENTS BY INDUCTION
