In this work, one new approach for RSTB-invariant object representation is presented based on the modified Mellin–Fourier Transform (MFT). For this, in the well-known steps of MFT, the logarithm operation in the log-polar transform is replaced by the operation “rising on a power”. As a result, the central part of the processed area is represented by a significantly larger number of points (transform coefficients), which permits us to give a more accurate description of the main part of the object. The symmetrical properties of the complex conjugated transform coefficients were used, and as a result, the number of coefficients participating in the object representation can be halved without deteriorating the quality of the restored image. The invariant representation is particularly suitable when searching for objects in large databases, which comprise different classes of objects. To verify the performance of the algorithm, object search experiments using the K-Nearest Neighbors (KNN) algorithm were performed, which confirmed this idea. As a result of the analysis, it can be concluded that the complexity of the solutions based on the proposed method depends on the applications, and the inclusion of neural networks is suggested. The neural networks have no conflict with the proposed idea and can only support decision making.
Read full abstract