Abstract
Objective: To develop a system that will optimize storage, automatically classify and tag images using image content and image context by using classification techniques and construct image archive to enable enhanced reference and inference with a simple query. Methods: The approach uses both image processing and data mining techniques. Multi modal processing of the Human brain has the ability to distinguish the components in an image and understand the content instantly. To automate this processing ability by the Computer, Computer Vision attempts various techniques. Features that describe the objects of interest are extracted by converting the image into Hue, Saturation, Value (HSV) space. Further segmentation, profiling and thresholding techniques are applied. Finally K Nearest Neighbours (KNN) classifier has been used for classifying the images. Hierarchical decision tree has been built by uniquely identifying classes that could be used for tagging. Findings: Two levels of classification have been addressed in this paper. Statistical features are extracted from the H-plane of the HSV color space and with an appropriate threshold obtained from the histogram, images have been classified into indoor and outdoor images at the first level. Further the indoor images are classified using segmentation and profiling, as classes of images with presentation and without presentation. Applications/Improvements: Images can be segmented based on more image content that signify the event by watermarking technique and by extracting more features. These segments could further be used to build the decision trees that will identify distinct classes. The depth of the decision tree will enhance indexing, tagging and improve the efficiency of query based image retrieval.
Highlights
Academic Institutions and Business organizations conduct events to enhance skill and production
Taking advantage of the fact that generally in events multiple shots of the same scene are captured in continuous intervals, similarity check is done between two consecutive image pairs
Each image has been segmented into 3x3 equal segments with padding, each segment pair has been compared with its Structural Similarity Measure (SSIM) index and one of the two similar image is retained and the other is eliminated
Summary
Academic Institutions and Business organizations conduct events to enhance skill and production. Once images are segmented and classified in terms of image content and image context, they could be tagged and placed in an image archive to be retrieved even after a longer period of time through a simple query. This will contribute to the documentation of annual activities, newsletters, and compilation activity of the organization. Content Based Image Retrieval (CBIR)[2] and Region Based Image Retrieval (RBIR) improves the performance of retrieval by extracting the features of an image after segmenting the image into regions.
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have