An increased use of data driven applications and integrated systems have caused an accelerating expansion in data volumes and increase in the number of digital records, over the past few decades. Exponentially growing data volumes being processed by large-scale distributed data-intensive applications have placed an increasing pressure on the underlying storage services for timely and efficient storage and retrieval of the data. The use of cloud storage is among the best strategies to efficiently store growing volumes of data. However, outsourcing data to public cloud storage leads to the challenge of data confidentiality preservation. Data Confidentiality is among the top challenges associated with cloud storage which have contributed substantially as an inhibitor for cloud computing adoption all over the world and is considered a serious concern, especially in case of big data, where securing data in a timely and accurate manner is an arduous task. Our study aims to contribute in anti-cybercrime by protecting the confidentiality of sensitive growing data. We enunciate an optimized confidentially preserving framework on distributed cloud storage, that works for growing data with time-efficiency and minimum memory usage. Our framework uses a merger of Genetic Algorithm (GA), parallel data distribution, and privacy-aware selective encryption techniques. The experiments and comparative analysis depict that our proposed framework outperforms others under consideration, in terms of execution time, memory usage and network throughput respectively.
Read full abstract