Abstract

Consistently, massive volumes of sensory, transactional, and web information are persistently produced as streams, which need to be investigated online as they arrive. The arrival rate of the big data stream may vary after some time. Scheduling plays a key part in big data streaming applications in a big data stream computing environment. In this paper, optimal scheduling is proposed on big data streams to handle the incomplete and delayed information. In this process, big data stream is taken as an input. The input big data stream consists of number of data streams and each data stream consists of number of tasks. Initially, the input big data stream is analyzed and the task is selected by calculating the features such as volatility, Hurst exponent, and distance. Enthalpy value is then computed based on the extracted feature for each data streams and the computed enthalpy value is taken as a feedback ID. Finally, krill herd optimization algorithm is used for the optimal scheduling of tasks based on the generated feedback ID. The results have shown that our proposed model outperformed popular scheduling algorithms in terms of computational time, schedule time, and throughput.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.