ABSTRACT Occlusion detection is vital in longer video sequences of real-time scenarios, as deformable objects generally occlude themselves or be occluded by exterior objects eventually. Occlusion can affect accuracy of the entire model significantly, because optical flow may reveal random attributes in occluded as well as neighbourhood areas. Occlusion is a challenging issue in real-world visual object tracking. Several techniques learn imprecise appearance of target while it becomes occluded by another object in the scene. Occlusion is a complicated issue that causes target loss in tracking procedure. Here, long short-term memory fused quantum dilated convolutional neural network (LSTM-QDCNN) is presented for occlusion percentage prediction. Initially, video frames are extracted from considered input video. The extracted frame is pre-processed by adaptive fuzzy filtering, and object localisation is accomplished by employing sparse fuzzy c-means with local optimal oriented pattern feature. Thereafter, occlusion category detection is conducted by deep Q-network. Lastly, occlusion percentage prediction is done by utilising LSTM-QDCNN. Additionally, LSTM-QDCNN is designed by merging deep LSTM (DLSTM) with quantum dilated convolutional neural network (QDCNN). Furthermore LSTM-QDCNN achieved maximum multiple-object tracking precision of 0.892 and minimum tracking distance of 3.714. In addition, LSTM-QDCNN achieved tracking number about 18.