Abstract

Data streams are a prevalent and growing source of timely data, particularly in the scientific domain. Just as it is common today to read starting conditions such as initial weather conditions, for a scientific simulation from a file, it should be equally as easy to draw starting conditions on-demand from live data streams. But efforts to date to bring streaming data to the grid have lacked generality. In this article we introduce a new model for bringing existing data streams systems onto the grid. The model is predicated on the ability to identify stream systems that meet the criteria of being a "data resource". We establish the criteria in this article, and define a grid service architecture for a data streams resource that leverages standardization efforts in the Global Grid Forum. We discuss key research issues in realizing the data streams model. We are currently developing a prototype of this architecture using our dQUOB system.

Highlights

  • Data streams are a prevalent and growing source of timely data [16]

  • We propose a new model for bringing data stream systems onto the Grid

  • The model we propose for bringing data stream systems onto the Grid is broad so as to encompass a wide range of existing data stream systems for sensor networks and instruments

Read more

Summary

Introduction

Data streams are a prevalent and growing source of timely data [16]. Stream applications are broad: sensor networks monitor traffic flow on US Interstates; NEXRAD Doppler radars continuously generate streamed data for weather forecasting and prediction. The model treats data stream systems as a single unit and gives that system a presence on the grid Access to all such stream systems is through a single, common access protocol. OGSA-DAI defined a general database query access protocol to a grid service that serves a database management system. In order to fully understand the breadth of data stream systems, we survey the literature and categorize the systems into three broad categories: data manipulation systems, stream routing systems, and detection systems Of these categories, two satisfy the property that a global snapshot can be taken by examining stream data alone. The architecture leverages the Global Grid Forum specification for grid service access to database management systems. The remainder of the paper is organized as follows: in Section 2 we define the model of a data stream resource on the grid. Our current effort is to realize this architecture using our dQUOB system [22]

Stream resource model
Categorizing data streams systems
Architecture of stream resource
Grid service interface
Research challenges
Methods supported by port
Related work
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call