Abstract To improve the accuracy and robustness of visual simultaneous localization and mapping (SLAM) in low-texture environments, this paper proposes a robust and fast stereo vision inertial SLAM pose estimation method that combines point and line features with an inertial measurement unit (IMU). The method tightly couples visual point and line features with IMU constraints, forming a least-squares problem through the error of each constraint term for nonlinear optimization. To address the issues of over-segmentation and time consumption in traditional line segment detection (LSD) algorithms, an improved LSD algorithm is adopted to accelerate line feature extraction. This approach merges nearby line segments based on spatial geometric relationships and filters out invalid segments, improving the time efficiency of the algorithm. Finally, experiments conducted in low-texture environments demonstrate that our algorithm achieves high localization accuracy and robustness.