Abstract

The rise in healthcare-related illnesses has generated a substantial amount of patient data, making the safeguarding of patient data privacy imperative. Existing privacy protection methods face challenges, including longer execution times, compromised data quality, and increased information loss as data dimensions expand. Effective attribute selection is vital to enhance preservation methods. Our research introduces a privacy-preserving clustering approach that addresses these concerns through two stages: feature selection and anonymization. The first stage selects relevant features using symmetrical uncertainty (SU) and eliminates duplicates with Kendall’s Tau Correlation Coefficient. The Utility Preserved Anonymization (UPA) algorithm is employed in the second phase to achieve privacy preservation. Additionally, our approach reduces data dimensionality to simplify cluster creation for anonymization. Experimental analysis on real-time data demonstrates the strategy’s effectiveness, with outstanding sensitivity (97.85%) and accuracy (95%), efficiently eliminating unnecessary features and simplifying clustering complexity.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.