In research on building a one-shot learning neural network without pre-training using mass data, the limitation on the information obtained from a single training sample downgrades the performance of the network. In order to improve performance and take full advantage of the support set, in this study, we design three kinds of shadow nodes and propose a structure-based training method for a correlation-coefficient-based neural network. This training strategy focuses on branches that are not activated or inactivated as expected. In contrast to existing networks that optimize the parameters using back-propagation, the training method proposed in this paper optimizes the structure of the correlation-coefficient-based network by correcting its pixel errors. For the shadow nodes and training process based on this strategy, the intersection over union (IOU) of a detected target increases by 4.83% in the experiments when using the Fashion-Mnist dataset, increases by 4.02% when using the Omniglot dataset, and increases by 3.89% when using the Cifar-10 dataset. The samples in category "7" wrongly classified as "1" decreased by 27.32% when using the Mnist dataset after training. This training strategy, along with shadow nodes, makes the correlation-coefficient-based network a more practical model and enables the network to develop during the accumulation of reliable samples, thus making it more suitable for simple target detection projects that collect samples over time. Moreover, the shadow nodes and training method proposed in this paper supplement the non-gradient-based parameter-gaining strategy. Additionally, it is a new attempt to explore the imitation of a human's ability to learn a new pattern from a low number of references.
Read full abstract