The fundamental job of shape reconstruction in computer vision is crucial for many different applications, including robotics, medical imaging, and autonomous systems. The state-of-the-art methods, difficulties, and potential future directions in the field of shape reconstruction are succinctly outlined in this abstract. Inferring the three-dimensional (3D) geometry of objects or scenes from two-dimensional (2D) photographs or point clouds is the process of shape reconstruction. The conventional approaches relied on depth sensors, structure from motion, and stereo vision. Convolutional neural networks (CNNs) and generative models, in particular, are deep learning techniques that have been used recently to achieve impressive results in shape reconstruction. This abstract addresses important procedures, such as point cloud processing, depth estimation, and multi-view stereo techniques. It emphasizes the benefits and drawbacks of each strategy while highlighting the necessity of combining various modalities for effective rebuilding. To improve reconstruction accuracy, context-aware algorithms and the inclusion of semantic data are also being investigated. Efforts are made to overcome concerns with scalability, ambiguous features, and occlusions in form reconstruction. Deep learning models' computational complexity is also examined, highlighting the necessity for effective algorithms that strike a compromise between accuracy and speed. In order to evaluate the effectiveness of reconstruction algorithms in an objective manner, the significance of benchmark datasets and assessment criteria is also emphasized.