The identification of branches and bud points is the key to intelligent pruning of dormant grapevine branches and precise positioning of the pruning point on the branch is an important prerequisite for robotic arm pruning. This study takes Cabernet Sauvignon wine grapes as the experimental object and proposes a depth image-based pruning point localization algorithm based on pruning rules. In order to solve the problem of bud recognition in complex backgrounds, this study adopts a detection method that combines semantic segmentation and target detection. Firstly, the semantic segmentation algorithm PSP-net is used to separate the branches and the main stem from the background and the separated image undergoes two kinds of processing: one is to skeletonize it using the Zhang–Suen thinning algorithm and the other is to identify the buds and obtain the center coordinates of the buds using the target-detection method YOLOv5; finally, combining with the depth information of the depth image, we use the coordinates of the buds to determine the location of the pruning point located on the skeleton image. The results show that PSP-net has better results in segmentation performance with mIoU reaching 83.73%. YOLOv5 performs better in target detection with mAP reaching 81.06% and F1 reaching 0.80. The accuracy of this method in determining the location of pruning points reaches 82.35%. It can provide a method for fruit tree pruning robots to determine the location of pruning points.