Abstract
Considering the efficiency and security of healthcare data processing, indiscriminate data collection, annotation, and transmission are unwise. In this article, we propose the normalized double entropy (NDE) method to assess image data quality in the form of metatask. In specific, the probability entropy and distance entropy are both adopted and normalized to evaluate the data quality. The experimental results show the stable ability of the NDE to distinguish good and bad data in terms of information contribution. Furthermore, the model's diagnostic performances driven by selected good and bad data are compared, and a clear gap exists between them under the premise of the same amount of data. Screening 70% of the dataset can achieve almost the same accuracy as that based on all data. This article focuses on healthcare data quality and data redundancy and provides a practical evaluation tool to facilitate the identification and collection of valuable data, which is beneficial to improve efficiency and protect cybersecurity in healthcare systems.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
More From: IEEE Transactions on Industrial Informatics
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.