Data Placement Approach Research Articles

Social networks (SNs) have an enormous number of users around the world who share data such as images, audio, and video with their friends using IoT devices. This concept is known as the Social Internet of Things (SIoT). The evolution of edge-cloud computing has enabled the storage of large volumes of data from various sources, and this task demands an efficient storage procedure. For data volumes of this size, data replication across the edge and a geo-distributed cloud service area is well suited to meeting users' expectations of low latency. The major issue is how to store and replicate these large data items optimally and how to allocate requests to the data centers efficiently. For efficient storage of these data, this study uses an edge server, which is part of the cloud server. Thus, the data are distributed and stored for quick access, which reduces response latency. The proposed data placement approach uses a machine learning (ML) algorithm, a support vector machine with a radial basis function kernel (RBF-SVM), to classify the data center in which to store the data of a user and the user's friends from the SIoT devices. The learning algorithm is used to predict the workload of the data stored in the data center, classed as either edge or cloud, depending on the existing time slots. The dynamic data placement is also optimized using the proposed dynamic graph partitioning (GP) method to meet each user's demand for low latency at minimum cost. This keeps SIoT data placement efficient and effective over time. Accordingly, the proposed data placement and replication approach introduces three innovations compared with existing data placement approaches. (i) Rather than storing user data in a single cloud, this study uses the edge server closest to the SIoT devices for faster access and reduced response time. (ii) The RBF-SVM classification algorithm is used to select a storage location for each user, reducing data replication. (iii) Dynamic GP is introduced for data placement with reduced latency and minimum cost to accommodate the dynamic nature of the SN. In simulation, this approach achieves a reduced latency of 130 ms and a lower cost than existing data placement approaches. Therefore, the proposed data placement with ML-based learning on the edge provides promising results in terms of efficiency, effectiveness, and performance, with reduced latency and minimum cost.
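
As a rough illustration of the classification step described in the abstract above, the following Python sketch evaluates a two-class RBF-kernel SVM decision function and maps its sign to an edge or cloud placement. It is not the authors' implementation: the feature pair (request rate, distance to the edge server), the support vectors, the coefficients, and the names `rbf_kernel`, `svm_decision`, `choose_tier`, and `MODEL` are all hypothetical and chosen only to make the example runnable.

```python
import math


def rbf_kernel(x, y, gamma):
    """Radial basis function kernel: K(x, y) = exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def svm_decision(x, support_vectors, dual_coefs, bias, gamma):
    """Kernel SVM decision value f(x) = sum_i coef_i * K(sv_i, x) + bias.

    Each coef_i stands for alpha_i * y_i from a trained model.
    """
    total = sum(
        coef * rbf_kernel(sv, x, gamma)
        for sv, coef in zip(support_vectors, dual_coefs)
    )
    return total + bias


def choose_tier(x, model):
    """Map the sign of the decision value to a placement tier."""
    return "edge" if svm_decision(x, **model) >= 0 else "cloud"


# Hypothetical hand-picked parameters, NOT values from the paper.
# Assumed features: (normalized request rate, normalized distance to edge).
MODEL = {
    "support_vectors": [(0.9, 0.1), (0.1, 0.9)],
    "dual_coefs": [1.0, -1.0],
    "bias": 0.0,
    "gamma": 2.0,
}

if __name__ == "__main__":
    print(choose_tier((0.8, 0.2), MODEL))  # frequently requested, close to edge
    print(choose_tier((0.2, 0.8), MODEL))  # rarely requested, far from edge
```

In practice the support vectors and coefficients would come from training on observed workloads rather than being set by hand; the sketch only shows how an RBF kernel turns distance from known examples into a placement decision.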


We are happy to present this special issue of the scientific journal Scalable Computing: Practice and Experience. For this special issue on Infrastructures and Algorithms for Scalable Computing (Volume 19, No. 3, June 2018), we selected four of the nine submitted papers, all of which went through peer review according to the journal policy. All papers present novel results in the fields of distributed algorithms and infrastructures for scalable computing. The first paper presents a novel approach for efficient data placement that improves the performance of workflow execution in distributed datacenters. Its greedy heuristic algorithm, based on a network flow optimization framework, minimizes the total storage cost, including the effort to move and store data from different source locations and dependencies. The second paper evaluates the significance of different clustering techniques, namely k-means, Hierarchical Agglomerative Clustering, and Markov Clustering, in grouping-aware data placement for data-intensive applications with interest locality. The evaluation in Azure reported that the Markov Clustering-based data placement strategy improves local map execution and reduces execution time compared with Hadoop's Default Data Placement Strategy and the other evaluated clustering techniques. The effect is more pronounced for data-intensive applications that have interest locality. The third paper presents an experimental evaluation of OpenMP thread-mapping strategies in different hardware environments (Intel Xeon Phi coprocessor and hybrid CPU-MIC platforms). The paper shows the choice of thread affinity, number of threads, and execution mode that provides optimal performance of the LU factorization. In the fourth paper, the authors study the amount of memory occupied by sparse matrices split into same-size blocks. The paper considers and statistically evaluates four popular storage formats and combinations of them. The conclusion is that block-based storage formats may significantly reduce the memory footprint of sparse matrices arising from a wide range of application domains. We use this opportunity to thank all contributors to this Special Issue: all authors who submitted the results of their latest research and all reviewers for their valuable comments and suggestions for improvement. We would like to express our special gratitude to the Editor-in-Chief, Professor Dana Petcu, for her constant support during the whole process of this Special Issue.


Articles published on Data Placement Approach

Dynamic data replication and placement strategy in geographically distributed data centers

Optimal Data Placement and Replication Approach for SIoT with Edge

RENDA: Resource and Network Aware Data Placement Algorithm for Periodic Workloads in Cloud

Reliability-aware Garbage Collection for Hybrid HBM-DRAM Memories

Application and Storage-Aware Data Placement and Job Scheduling for Hadoop Clusters

QoS-Aware Data Placement for MapReduce Applications in Geo-Distributed Data Centers

Special Issue on Infrastructures and Algorithms for Scalable Computing

Exact and Heuristic Data Workflow Placement Algorithms for Big Data Computing in Cloud Datacenters

A vibrant data placement approach for map reduce in diverse environments

Data placement approach for scalable online social networks

ExaPlan

Efficient location-aware data placement for data-intensive applications in geo-distributed scientific data centers

Topology-Aware Optimal Data Placement Algorithm for Network Traffic Optimization

SWORD: workload-aware data placement and replica selection for cloud data management systems

A LNS-based data placement strategy for data-intensive e-science applications

A Cloud‐Computing‐Based Data Placement Strategy in High‐Speed Railway

A High-Speed Railway Data Placement Strategy Based on Cloud Computing

Distributed Real Time Architecture for Data Placement in Wireless Sensor Networks

Exploiting sequential access when declustering data over disks and MEMS-based storage

A simulated annealing approach for multimedia data placement
