Data Mining Meets Grid Computing: Time to Dance?

Alberto Sánchez,Pedro de Miguel,María S. Pérez,Werner Dubitzky,Julio J. Valdés,Jesús Montes

doi:10.1002/9780470699904.ch1

Abstract

A grand challenge problem (Wah, 1993) refers to a computing problem that cannot be solved in a reasonable amount of time with conventional computers. While grand challenge problems canbefoundinmanydomains,scienceapplicationsaretypicallyattheforefrontoftheselargescalecomputingproblems.Fundamentalscientificproblemscurrentlybeingexploredgenerate increasingly complex data, require more realistic simulations of the processes under study and demand greater and more intricate visualizations of the results. These problems often require numerous complex calculations and collaboration among people with multiple disciplines and geographic locations. Examples of scientific grand challenge problems include multi-scale environmentalmodellingandecosystemsimulations,biomedicalimagingandbiomechanics,nuclear power and weapons simulations, fluid dynamics and fundamental computational science (use of computation to attain scientific knowledge) (Butler, 1999; Gomes and Selman, 2005). Many grand challenge problems involve the analysis of very large volumes of data. Data mining (also known as knowledge discovery in databases) (Frawley, Piatetsky-Shapiro and Matheus, 1992) is a well stablished field of computer science concerned with the automated search of large volumes of data for patterns that can be considered knowledge about the data. Dataminingisoftendescribedasderivingknowledgefromtheinputdata.Applyingdatamining to grand challenge problems brings its own computational challenges. One way to address these computational challenges is grid computing (Kesselman and Foster, 1998). ‘Grid’ refers topersistentcomputingenvironmentsthatenablesoftwareapplicationstointegrateprocessors, storage, networks, instruments, applications and other resources that are managed by diverse organizations in widespread locations. This chapter describes how both paradigms ‐ data mining and grid computing ‐ can benefit from each other: data mining techniques can be efficiently deployed in a grid environment and operational grids can be mined for patterns that may help to optimize the effectiveness and efficiency of the grid computing infrastructure. The chapter will also briefly outline the chapters of this volume.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Mining Meets Grid Computing: Time to Dance?

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Guest Editorial Introduction to the Special Section on Mining Biomedical Data
A Tsymbal ... N Bolshakova
IEEE Transactions on Information Technology in Biomedicine | VOL. 10
A Tsymbal, et. al.A Tsymbal ... N Bolshakova
01 Jul 2006
IEEE Transactions on Information Technology in Biomedicine | VOL. 10

Swarm Intelligence for Multi-objective Problems in Data Mining
-
-
--
01 Jan 2009
01 Jan 2009

Application of data mining techniques in pharmacovigilance.
Andrew M Wilson ... Lehana Thabane
British Journal of Clinical Pharmacology | VOL. 57
Andrew M Wilson, et. al.Andrew M Wilson ... Lehana Thabane
30 Sep 2003
British Journal of Clinical Pharmacology | VOL. 57

Parallelism in Knowledge Discovery Techniques
Domenico Talia
-
Domenico TaliaDomenico Talia
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Mining Meets Grid Computing: Time to Dance?

Abstract

Talk to us

Similar Papers