Abstract
Processing semantics define the tuple-processing guarantees that a system provides when it recovers from failures: they determine whether particular tuples are reprocessed upon recovery. Processing semantics help maintain both real-time response and correctness of results in stream data processing. The existing literature offers few resources devoted exclusively to processing semantics. This work reviews processing semantics in stream data processing systems and explores the impact of processing guarantees on the performance of stream data processing. We summarize current research directions for guaranteeing processing semantics, drawing on prominent stream processing engines. Furthermore, we compare the throughput of a couple of representative stream processing engines under all possible processing guarantees. Our experiments show that the at-most-once processing guarantee provides the highest throughput of all the guarantees. Finally, we discuss challenges for further research.