Abstract

Background: Owing to increased growth in satellite imagery, the development of an architecture that rapidly and efficiently identifies similar images has become crucial. Hadoop has become a de-facto platform for storing large amounts of data. Apache Spark and MapReduce have also become key frameworks for distributed processing of big data. Objective: This paper proposes a novel distributed content-based image retrieval (DCBIR) architecture that leverages the qualities of these engines, which were not utilized in previous studies. Methods: Features of 40 satellite images with sizes greater than 500 MB were indexed, on a 15-node Hadoop cluster with two different databases viz. Neo4J, a graph database, and HBase, a columnar database. Results: Performance and Scalability of both indexing and query phases, along with precision and recall were observed for both databases. Conclusion: Experimental results show that the proposed system can efficiently perform image retrieval on large remote sensing images.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.