Hand-drawn is one of the few visual descriptors that can directly represent visual content, and has significant research in the area of computer vision. Aiming at the problem of sparse features in the realm of hand-drawn image retrieval, hand-drawn images, and the easy deformation of hand-drawn images, this paper proposes a feature extraction method of grid resource sharing collaborative algorithm, which can be obtained utilizing precisely extracted semantic characteristics from hand-drawn images through computer multimedia-aided design Efficient and accurate retrieval results. First, the fundamental framework for obtaining semantic features is algorithm; then the attention model mechanism is the grid resource sharing collaborative introduced in the process of supervised training, and the attention structure block is introduced after the convolutional neural network’s bottom layer. To locate effective semantic features, In order to accomplish high-precision retrieval, the attention structure block combines channel attention structure and spatial attention structure to build the attention structure block. The last feature descriptor is then created by combining various semantic feature levels. The proposed strategy is practical and efficient, as demonstrated by the experimental findings on the comparison database Flickr15k. In addition, in the task of hand-drawn image classification, the proposed attention mechanism greatly improves the classification accuracy.