Scalability Evaluation Research Articles

ABSTRACT With the growth of online services, databases have shifted from static structures to dynamic stream structures. Earlier data mining techniques have served as decision-making tools, for example in establishing marketing strategies and in DNA analysis. However, areas of recent interest such as sensor networks, robotics, and artificial intelligence require the ability to analyze real-time data more quickly. Landmark window-based frequent pattern mining, one of the stream mining approaches, performs mining operations on parts of a database or on each of its transactions, instead of on all the data. In this paper, we analyze and evaluate two well-known landmark window-based frequent pattern mining algorithms, Lossy counting and hMiner. When Lossy counting mines frequent patterns from a set of new transactions, it performs union operations between the previous and current mining results. hMiner, a state-of-the-art algorithm based on the landmark window model, conducts mining operations whenever a new transaction occurs. Since hMiner extracts frequent patterns as soon as a new transaction is entered, we can obtain the latest mining results reflecting real-time information. For this reason, such algorithms are also called online mining approaches. We evaluate and compare the performance of the primitive algorithm, Lossy counting, and the latest one, hMiner. As criteria for our performance analysis, we first consider the algorithms' total runtime and average processing time per transaction. In addition, to compare the efficiency of their storage structures, we evaluate their maximum memory usage. Lastly, we show how stably the two algorithms mine databases in which the number of items gradually increases.
With respect to mining time and transaction processing, hMiner is faster than Lossy counting. Since hMiner stores candidate frequent patterns in a hash structure, it can access them directly. Lossy counting, in contrast, stores them in a lattice; thus, it has to search multiple nodes in order to access a candidate frequent pattern. On the other hand, hMiner performs worse than Lossy counting in terms of maximum memory usage. hMiner must keep the full information for each candidate frequent pattern in order to store it in a hash bucket, while Lossy counting stores the patterns in reduced form by using the lattice. Since the storage of Lossy counting can share items that occur in multiple patterns, its memory usage is more efficient than that of hMiner. However, hMiner is more efficient than Lossy counting in the scalability evaluation for the following reasons. If the number of items is
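
The counting idea that Lossy counting builds on can be sketched in a few lines. The example below is a simplified illustration, not the implementation evaluated in the abstract above: it counts single items rather than itemsets, keeps its entries in a plain Python dict rather than a lattice, and follows the general bucket-based scheme of lossy counting, in which the stream is cut into buckets of width ceil(1/epsilon) and low-count entries are pruned at each bucket boundary. The names `lossy_count`, `frequent_items`, `epsilon`, and `support` are chosen here for illustration.

```python
import math


def lossy_count(stream, epsilon):
    """Approximate item counts over a stream (single-item simplification)."""
    width = math.ceil(1 / epsilon)      # bucket width
    counts = {}                         # item -> [frequency, max undercount]
    n = 0                               # items seen so far
    for item in stream:
        n += 1
        bucket = math.ceil(n / width)   # id of the current bucket
        if item in counts:
            counts[item][0] += 1
        else:
            # A new entry may have been missed in up to (bucket - 1) buckets.
            counts[item] = [1, bucket - 1]
        if n % width == 0:
            # Bucket boundary: drop entries that cannot be frequent.
            stale = [k for k, (f, d) in counts.items() if f + d <= bucket]
            for key in stale:
                del counts[key]
    return counts, n


def frequent_items(counts, n, support, epsilon):
    """Report items whose stored frequency reaches (support - epsilon) * n."""
    return {k: f for k, (f, d) in counts.items() if f >= (support - epsilon) * n}


stream = ["a", "b", "a", "c", "a", "a", "d", "a", "e", "a"]
counts, n = lossy_count(stream, epsilon=0.2)
print(frequent_items(counts, n, support=0.5, epsilon=0.2))
```

Keeping the entries in a dict gives direct access by key, which loosely mirrors the hash-based access the abstract attributes to hMiner; a lattice or prefix-sharing structure would trade some of that access speed for lower memory use.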

Modern software systems are increasingly configurable. While this has many benefits, it also makes some software engineering tasks, such as software testing, much harder. This is because, in theory, unique errors could be hiding in any configuration, and, therefore, every configuration may need to undergo expensive testing. As this is generally infeasible, developers need cost-effective techniques for selecting which specific configurations they will test. One popular selection approach is combinatorial interaction testing (CIT), where the developer selects a strength t and then computes a covering array (a set of configurations) in which all t-way combinations of configuration option settings appear at least once. In prior work, we demonstrated several limitations of the CIT approach. In particular, we found that a given system's effective configuration space - the minimal set of configurations needed to achieve a specific goal - could comprise only a tiny subset of the system's full configuration space. We also found that the effective configuration space may not be well approximated by t-way covering arrays. Based on these insights, we have developed an algorithm called interaction tree discovery (iTree). iTree is an iterative learning algorithm that efficiently searches for a small set of configurations that closely approximates a system's effective configuration space. On each iteration, iTree tests the system on a small sample of carefully chosen configurations, monitors the system's behaviors, and then applies machine learning techniques to discover which combinations of option settings are potentially responsible for any newly observed behaviors. This information is used in the next iteration to pick a new sample of configurations that are likely to reveal further new behaviors. In prior work, we presented an initial version of iTree and performed an initial evaluation with promising results. This paper presents an improved iTree algorithm in greater detail.
The key improvements are based on our use of composite proto-interactions - a construct that improves iTree's ability to correctly learn key configuration option combinations, which in turn significantly improves iTree's running time, without sacrificing effectiveness. Finally, the paper presents a detailed evaluation of the improved iTree algorithm by comparing the coverage it achieves versus that of covering arrays and randomly generated configuration sets, including a significantly expanded scalability evaluation with the ~1M-LOC MySQL. Our results strongly suggest that the improved iTree algorithm is highly scalable and can identify a high-coverage test set of configurations more effectively than existing methods.
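
The covering-array definition used in CIT (every t-way combination of option settings appears in at least one configuration) can be made concrete with a small checker. This is a minimal sketch for illustration only; it is not part of iTree, and the function name `t_way_coverage` and the option names `a`, `b`, and `c` are hypothetical.

```python
from itertools import combinations, product


def t_way_coverage(configs, domains, t):
    """Fraction of all t-way option-setting combinations that configs cover.

    configs: list of dicts mapping option name -> chosen setting
    domains: dict mapping option name -> list of possible settings
    """
    total = 0
    covered = 0
    for subset in combinations(sorted(domains), t):
        # Every combination of settings for this group of t options.
        needed = set(product(*(domains[o] for o in subset)))
        # The combinations that actually occur in the given configurations.
        seen = {tuple(cfg[o] for o in subset) for cfg in configs}
        total += len(needed)
        covered += len(needed & seen)
    return covered / total


domains = {"a": [0, 1], "b": [0, 1], "c": [0, 1]}
array = [
    {"a": 0, "b": 0, "c": 0},
    {"a": 0, "b": 1, "c": 1},
    {"a": 1, "b": 0, "c": 1},
    {"a": 1, "b": 1, "c": 0},
]
print(t_way_coverage(array, domains, t=2))      # all pairs covered by 4 of 8 configs
print(t_way_coverage(array[:2], domains, t=2))  # a partial set covers fewer pairs
```

A set of configurations is a t-way covering array exactly when this fraction is 1.0. In the example, four of the eight possible configurations already cover every pair of settings.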

Articles published on Scalability Evaluation

Scalable Evaluation of Trajectory Queries over Imprecise Location Data

Analysis and Performance Evaluation of Landmark Window-Based Frequent Pattern Mining Techniques

EVALUATING PAAS SCALABILITY AND IMPROVING PERFORMANCE USING SCALABILITY IMPROVEMENT SYSTEMS

Scalable Evaluation of Polarization Energy and Associated Forces in Polarizable Molecular Dynamics: II. Towards Massively Parallel Computations using Smooth Particle Mesh Ewald

ITree: Efficiently Discovering High-Coverage Configurations Using Interaction Trees

Experimental Evaluation of Scalability and Reliability of a Feedback-Based UPC-Parameters Renegotiation Mechanism

Scalability evaluation of an FPGA-based multi-core architecture with hardware-enforced domain partitioning

QR-tree: An efficient and scalable method for evaluation of continuous range queries

Remote service discovery and binding architecture for soft real-time QoS in indoor location-based service

Evaluation of Scalability and Bandwidth Efficiency of Multipoint to Multipoint Hierarchy for Fast Recovery in MPLS Networks

Indexing Volumetric Shapes with Matching and Packing.

A Scalable Method for Evaluating Trust Management Based on Distributed Online Monitoring Systems

Scalable evaluation of platelet aggregation by the degree of blood migration

A real‐time capable coherent data cache for multicores

Scalability Analysis of KVM-Based Private Cloud For Iaas

Distributed Schemes for Routing Table Management in Next Generation Routers

Facilitating representation and retrieval of structured cases: Principles and toolkit

A systematic literature review of service choreography adaptation

Quality assessment of multidimensional video scalability
