Multi-node Cluster Research Articles

Objectives: The main objective of this paper is to propose a methodology to develop a Storm analysis model from Raw Rainfall dataset using techniques such as Artificial Neural Network and Min-Max Algorithm. Storm analysis model aims at predicting the occurrence and strength of a storm by analyzing the rainfall data of that region. Methods: In the proposed methodology the raw rainfall dataset is being trained by Artificial Neural network based on the three layers -Input, Hidden, and Output layers. The trained dataset is then summarized into a model which performs the prediction of storm centric characteristics. Neural network training is implemented in Hadoop framework. We obtain a considerable improvement in the total performance of the system by employing Artificial Neural Network. Min-Max algorithm is also used in the system for predicting the intensity of storm. The dataset used for training and prediction consists of daily rainfall data of Cherrapunjee area collected by The Meteorological Department of India. Findings: In the existing system, the raw rainfall dataset is collected and stored in a relational database and then map-reduce based techniques are applied for storm analysis. The major disadvantages associated with this technique are the performance and accuracy rate get reduced with increase in data size. In the proposed methodology as the raw rainfall dataset is being trained by Artificial Neural network the performance and accuracy rate got improved. Also, the training process is done on multi-node hadoop cluster by considering large raw rainfall dataset. With multi-node hadoop cluster there was a large reduction in the total training time. Storm depth of a particular region is calculated by applying MIN-MAX algorithm. This improved the total efficiency of the storm intensity prediction. Applications/Improvement: The performance of the system can be further improved by reducing the training time by adding more nodes while implementing the process in multi node hadoop cluster. Also higher prediction accuracy can be obtained by combining various suitable fuzzy inference models5 with the proposed neural network mode.

Read full abstract

This paper describes a new QR factorization algorithm which is especially designed for massively parallel platforms combining parallel distributed nodes, where a node is a multi-core processor. These platforms represent the present and the foreseeable future of high-performance computing. Our new QR factorization algorithm falls in the category of the tile algorithms which naturally enables good data locality for the sequential kernels executed by the cores (high sequential performance), low number of messages in a parallel distributed setting (small latency term), and fine granularity (high parallelism). Each tile algorithm is uniquely characterized by its sequence of reduction trees. In the context of a cluster of nodes, in order to minimize the number of inter-processor communications (aka, “communication-avoiding”), it is natural to consider hierarchical trees composed of an “inter-node” tree which acts on top of “intra-node” trees. At the intra-node level, we propose a hierarchical tree made of three levels: (0) “TS level” for cache-friendliness, (1) “low-level” for decoupled highly parallel inter-node reductions, (2) “domino level” to efficiently resolve interactions between local reductions and global reductions. Our hierarchical algorithm and its implementation are flexible and modular, and can accommodate several kernel types, different distribution layouts, and a variety of reduction trees at all levels, both inter-node and intra-node. Numerical experiments on a cluster of multi-core nodes (i) confirm that each of the four levels of our hierarchical tree contributes to build up performance and (ii) build insights on how these levels influence performance and interact within each other. Our implementation of the new algorithm with the DAGuE scheduling tool significantly outperforms currently available QR factorization software for all matrix shapes, thereby bringing a new advance in numerical linear algebra for petascale and exascale platforms.

Read full abstract

Multi-node Cluster Research Articles

Related Topics

Articles published on Multi-node Cluster

Scaling machine learning for target prediction in drug discovery using Apache Spark

Storm Analysis with Raw Rainfall Dataset by using Artificial Neural Network and Min-Max Algorithms

HadoopCL2: Motivating the Design of a Distributed, Heterogeneous Programming System With Machine-Learning Applications

A Service Oriented Framework for Analysing Social Network Activities

Efficient Pairwise Document Similarity Computation in Big Datasets

Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach

An application of partition method for solving 3D Stokes equation

English

Comparison of Machine Learning Algorithm on Map Reduction for Performance Improvement in Big Data

Using distributed memory parallel computers and GPU clusters for multidimensional Monte Carlo integration

Applying a new synchronous inversion of seismograms using maximum likelihood method and stochastic refinements to study ultra-thin oil-saturated reservoirs

Cluster synchronization induced by one-node clusters in networks with asymmetric negative couplings

Parallel contributing area calculation with granularity control on massive grid terrain datasets

High-performance computing selection of models of DNA substitution for multicore clusters

Hierarchical QR factorization algorithms for multi-core clusters

Large-Scale Modeling of Epileptic Seizures: Scaling Properties of Two Parallel Neuronal Network Simulation Algorithms

Research on MPI-Based Parallel Max-Min Ant System

Repair Method of Complex Network Based on Matthew Effect

Fast anomaly detection in hyperspectral images with RX method on heterogeneous clusters

On the Scaling Behavior of HDA*

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-node Cluster Research Articles

Related Topics

Articles published on Multi-node Cluster

Scaling machine learning for target prediction in drug discovery using Apache Spark

Storm Analysis with Raw Rainfall Dataset by using Artificial Neural Network and Min-Max Algorithms

HadoopCL2: Motivating the Design of a Distributed, Heterogeneous Programming System With Machine-Learning Applications

A Service Oriented Framework for Analysing Social Network Activities

Efficient Pairwise Document Similarity Computation in Big Datasets

Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach

An application of partition method for solving 3D Stokes equation

English

Comparison of Machine Learning Algorithm on Map Reduction for Performance Improvement in Big Data

Using distributed memory parallel computers and GPU clusters for multidimensional Monte Carlo integration

Applying a new synchronous inversion of seismograms using maximum likelihood method and stochastic refinements to study ultra-thin oil-saturated reservoirs

Cluster synchronization induced by one-node clusters in networks with asymmetric negative couplings

Parallel contributing area calculation with granularity control on massive grid terrain datasets

High-performance computing selection of models of DNA substitution for multicore clusters

Hierarchical QR factorization algorithms for multi-core clusters

Large-Scale Modeling of Epileptic Seizures: Scaling Properties of Two Parallel Neuronal Network Simulation Algorithms

Research on MPI-Based Parallel Max-Min Ant System

Repair Method of Complex Network Based on Matthew Effect

Fast anomaly detection in hyperspectral images with RX method on heterogeneous clusters

On the Scaling Behavior of HDA*