A Distributed Framework for Parallel Data Mining Using HPJava

O Rana, D Fisk

doi:10.1023/a:1009696924527

Abstract

Java has become a language of choice for applications that execute in heterogeneous environments and make use of distributed objects and multithreading. Handling large data sets requires scalable and efficient implementations of data mining approaches, which generally employ computationally intensive algorithms. Conventional Java implementations do not directly support the data structures often encountered in such algorithms, and they also lack repeatability in numerical precision across platforms. This paper describes a distributed framework that employs task and data parallelism and is implemented in high performance Java (HPJava). Issues of interest for data mining algorithms are identified, and possible solutions are discussed for overcoming limitations in the Java Virtual Machine. The framework supports parallelism across workstation clusters, using the Message Passing Interface (MPI) as middleware, and can support different analysis algorithms, wrapped as Java objects and linked to various databases through the Java Database Connectivity (JDBC) interface. Guidelines are provided for implementing parallel and distributed data mining on large data sets, and a proof-of-concept data mining application is analysed using a neural network.
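The idea of analysis algorithms wrapped as Java objects and applied in a data-parallel fashion can be illustrated with a minimal sketch. It is written in plain Java, not HPJava syntax, and it uses a local thread pool as a stand-in for the cluster processes that the framework coordinates through message passing. The names `AnalysisTask`, `SumOfSquares`, and `DataParallelSketch` are hypothetical and do not come from the paper.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical interface: an analysis algorithm wrapped as a Java object.
interface AnalysisTask {
    double analyse(double[] block);      // partial result for one block of data
    double combine(double a, double b);  // merge two partial results
}

// Illustrative algorithm only: sum of squares of the values in a block.
class SumOfSquares implements AnalysisTask {
    public double analyse(double[] block) {
        double sum = 0.0;
        for (double v : block) {
            sum += v * v;
        }
        return sum;
    }

    public double combine(double a, double b) {
        return a + b;
    }
}

class DataParallelSketch {
    // Splits the data into `workers` contiguous blocks (workers >= 1),
    // analyses each block on its own thread, then combines the partial
    // results in block order. Threads are an assumption standing in for
    // cluster nodes.
    static double run(AnalysisTask task, double[] data, int workers) {
        ExecutorService pool = Executors.newFixedThreadPool(workers);
        try {
            List<Future<Double>> partials = new ArrayList<>();
            int chunk = (data.length + workers - 1) / workers;
            for (int w = 0; w < workers; w++) {
                int from = Math.min(w * chunk, data.length);
                int to = Math.min(from + chunk, data.length);
                double[] block = Arrays.copyOfRange(data, from, to);
                Callable<Double> job = () -> task.analyse(block);
                partials.add(pool.submit(job));
            }
            double result = partials.get(0).get();
            for (int i = 1; i < partials.size(); i++) {
                result = task.combine(result, partials.get(i).get());
            }
            return result;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("analysis failed", e);
        } finally {
            pool.shutdown();
        }
    }
}
```

Combining the partial results in a fixed block order keeps the floating-point summation order the same from run to run for a given worker count, which is one small way to avoid introducing further variation in numerical results.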


Journal: BT Technology Journal
Publication Date: Jan 1, 1999
Citations: 10

