Abstract

Computer vision obtains object and environment information by simulating human visual senses and borrowing human sensory activity. As one of the main tasks of computer vision, image classification can be used not only for face recognition, traffic scene recognition, image retrieval, and automatic photo categorization but also as a theoretical basis for target detection and image segmentation. In this paper, we use the existing CNN architecture network-ConvNeXt. By adapting and modifying the residual connectivity and convolutional structure of the network, we achieve a balance between classification accuracy and inference speed. These modifications are able to reduce both computation and memory consumption while keeping accuracy largely unchanged, thus better facilitating network lightweighting.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call