In recent years, convolutional neural networks have demonstrated significant advancements in the domain of computer vision, effectively addressing numerous previously challenging issues. An increasing number of researchers are focusing their investigations on this field, proposing innovative network architectures. However, many existing networks necessitate intricate module designs and a substantial number of parameters to achieve satisfactory fusion outcomes, which poses challenges for lightweight devices with constrained computational resources. To mitigate this concern, the present study introduces a novel methodology that integrates block segmentation with pixel optimization. Specifically, we initially employ graph convolutional networks to execute flexible convolutions on large-scale, irregular regions generated through superpixel clustering, thereby achieving coarse segmentation at the block level. Subsequently, we utilize parallel lightweight convolutional networks to provide pixel-level guidance, ultimately resulting in a more accurate decision map. Furthermore, to leverage the strengths of both networks and facilitate the optimization of feature generation from the graph convolutional network for non-Euclidean data, we meticulously design a superpixel-based graph decoder alongside a pixel-based convolutional neural network extraction block to enhance feature acquisition and propagation. In comparison to numerous state-of-the-art methodologies, our approach demonstrates commendable competitiveness in both qualitative and quantitative analyses, as well as in efficiency evaluations. The code can be downloaded at https://github.com/ouyangbaicai/FusionGCN.
Read full abstract