Digital devices and information systems have made data privacy essential. The collected data contains sensitive attributes such as salary, marital status and health history that need to be protected. Such data is exchanged or published to a third party using cloud infrastructure to perform various analyses, conduct research, and make critical decisions. Unauthorized users of the published data may violate privacy, notwithstanding the benefits. Data anonymization is one of the technique for achieving data privacy. Existing techniques consider single sensitive attribute and data is anonymized using generalization or suppression approaches. On observation, it is found that these techniques are less efficient since the collected data contains multiple sensitive attributes when anonymized using the same approaches leads to higher information loss and residue records. In this paper, multiple sensitive attributes are considered and the dataset is anonymized by constructing a semantic hierarchical tree it is further partitioned using the anatomy approach. Later, the partitions are stored in interclouds to achieve better privacy protection. Experiments are conducted to observe and analyze the computational performance, residue records and diversity percentage. The results obtained prove that the proposed technique is efficient when compared to the existing ones.
Read full abstract