Visual tracking is the capacity to estimate or forecast a target object’s location in each frame of a video after specifying its starting position. Visual tracking is of essential relevance in defence and military operations. The military can use it to improve situational awareness, improve precise targeting, acquire intelligence in real-time, and efficiently respond to a variety of threats and circumstances. In the past, object tracking systems have relied mostly on algorithms based on deep learning techniques and these tracking algorithms are lacking in both accuracy and speed. In this research, an Extreme Learning Machine-based visual tracking system has been proposed that incorporates properties like high accuracy, low training time, and less network computing complexity as compared to existing deep learning-based tracking algorithms. The Haar wavelet transform is utilized in the recommended technique for feature learning, while the extreme learning machine is utilised for classification and recognition. A benchmark dataset object tracking benchmark-2013 has been used to carry out the experiments. The experiment values indicated that the proposed technique has accomplished enhanced performance over another tracking model. Additionally, we tested the proposed method’s accuracy and robustness regarding certain visual characteristics: Illumination variation, occlusion, deformation, out-of-plane rotation, background clutters, and in-plane rotation. The findings of the simulation revealed that the objects in videos have been 84% accurately tracked by the suggested method.