Abstract
We are concerned by the use of Factorial Correspondence Analysis (FCA) for image retrieval. FCA is designed for analysing contingency tables. In Textual Data Analysis (TDA), FCA analyses a contingency table crossing terms/words and documents. For adapting FCA on images, we first define ”visual words” computed from Scalable Invariant Feature Transform (SIFT) descriptors in images and use them for image quantization. At this step, we can build a contingency table crossing ”visual words” as terms/words and images as documents. In spite of its successful applications in information retrieval, FCA suffers from large dimension problem because of the diagonalization of a large matrix. We propose a new algorithm, CABoost, which overcomes this large dimension problem of FCA. The data are sampled by column (word) and a FCA is applied on the sample. After some samplings, we finally combine separated results by a weighting - Principle Component Analysis (PCA). The numerical experiments show that our algorithm performs more rapidly than the classical FCA without losing precision.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.