Minersoft

Marios D Dikaiakos,George Pallis,Asterios Katsifodimos

doi:10.1145/2220352.2220354

Abstract

One of the main goals of Cloud and Grid infrastructures is to make their services easily accessible and attractive to end-users. In this article we investigate the problem of supporting keyword-based searching for the discovery of software files that are installed on the nodes of large-scale, federated Grid and Cloud computing infrastructures. We address a number of challenges that arise from the unstructured nature of software and the unavailability of software-related metadata on large-scale networked environments. We present Minersoft, a harvester that visits Grid/Cloud infrastructures, crawls their file systems, identifies and classifies software files, and discovers implicit associations between them. The results of Minersoft harvesting are encoded in a weighted, typed graph, called the Software Graph. A number of information retrieval (IR) algorithms are used to enrich this graph with structural and content associations, to annotate software files with keywords and build inverted indexes to support keyword-based searching for software. Using a real testbed, we present an evaluation study of our approach, using data extracted from production-quality Grid and Cloud computing infrastructures. Experimental results show that Minersoft is a powerful tool for software search and discovery.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Minersoft

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Internet Technology

Lead the way for us

Journal: ACM Transactions on Internet Technology	Publication Date: Jun 1, 2012
Citations: 21

Similar Papers

Combining Families of Information Retrieval Algorithms Using Metalearning
Michael Cornelson ... Ron Karidi
-
Michael Cornelson, et. al.Michael Cornelson ... Ron Karidi
01 Jan 2004
01 Jan 2004

Implementation of Information Retrieval (IR) Algorithm for Cloud Computing: A Comparative Study Between With and Without Mapreduce Mechanism
Riktesh Srivastava
SSRN Electronic Journal | VOL. -
Riktesh SrivastavaRiktesh Srivastava
30 Oct 2013
SSRN Electronic Journal | VOL. -

An antivirus API for Android malware recognition
Rafael Fedler ... Marcel Kulicke
-
Rafael Fedler, et. al.Rafael Fedler ... Marcel Kulicke
01 Oct 2013
01 Oct 2013

NETorium
Kunio Akashi ... Tomoya Inoue
-
Kunio Akashi, et. al.Kunio Akashi ... Tomoya Inoue
30 Nov 2016
30 Nov 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minersoft

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Internet Technology