Abstract

A novel content-based heterogeneous information retrieval framework, particularly well suited to browse medical databases and support new generation computer aided diagnosis (CADx) systems, is presented in this paper. It was designed to retrieve possibly incomplete documents, consisting of several images and semantic information, from a database; more complex data types such as videos can also be included in the framework. The proposed retrieval method relies on image processing, in order to characterize each individual image in a document by their digital content, and information fusion. Once the available images in a query document are characterized, a degree of match, between the query document and each reference document stored in the database, is defined for each attribute (an image feature or a metadata). A Bayesian network is used to recover missing information if need be. Finally, two novel information fusion methods are proposed to combine these degrees of match, in order to rank the reference documents by decreasing relevance for the query. In the first method, the degrees of match are fused by the Bayesian network itself. In the second method, they are fused by the Dezert-Smarandache theory: the second approach lets us model our confidence in each source of information (i.e., each attribute) and take it into account in the fusion process for a better retrieval performance. The proposed methods were applied to two heterogeneous medical databases, a diabetic retinopathy database and a mammography screening database, for computer aided diagnosis. Precisions at five of 0.809 ± 0.158 and 0.821 ± 0.177, respectively, were obtained for these two databases, which is very promising.

Highlights

  • T WO main tasks in Computer Aided Diagnosis (CADx) using medical images are extraction of relevant information from images and combination of the extracted features with other sources of information to automatically or semiautomatically generate a reliable diagnosis

  • This paper presents a generic solution to use digital medical databases for heterogeneous information retrieval, and solve CADx problems using CaseBased Reasoning (CBR) [1]

  • We introduced two methods to include image series and their signatures, with contextual information, in a CBR system

Read more

Summary

INTRODUCTION

T WO main tasks in Computer Aided Diagnosis (CADx) using medical images are extraction of relevant information from images and combination of the extracted features with other sources of information to automatically or semiautomatically generate a reliable diagnosis. A Bayesian network is used to model the relationships between the different attributes (the extracted features of each digital image and each contextual information field): we associate each attribute with a variable in the Bayesian network It lets us compare incomplete documents: the Bayesian network is used to estimate the probability of unknown variables (associated with missing attributes) knowing the value of other variables (associated with available attributes). The first fusion operator is incorporated in the Bayesian network: the computation of the degree of match, with respect to a given attribute, relies on the design of conditional probabilities relating this attribute to the overall degree of match An evolution of this fusion operator that models our confidence in each source of information (i.e. each attribute) is introduced. By transforming the network into a cycle-free hypergraph, and performing inference in this hypergraph, Lauritzen and Spiegelhalter proposed an exact inference algorithm with relatively low complexity [23]; this algorithm was used in the proposed system

Learning a Bayesian Network from Data
Including Images in a Bayesian Network
System Design
Retrieval Process
Description of the Dezert-Smarandache Theory
Link with Bayesian Network based Retrieval
APPLICATION TO MEDICAL IMAGE DATABASES
Training and Test Sets
Objective of the System
Results
Findings
DISCUSSION AND CONCLUSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.