Wildfires, which are distinguished by their destructive nature and challenging suppression, present a significant threat to ecological environments and socioeconomic systems. In order to address this issue, the development of efficient and accurate fire detection technologies for early warning and timely response is essential. This paper addresses the complexity of forest and mountain fire detection by proposing YOLO-CSQ, a drone-based fire detection method built upon an improved YOLOv8 algorithm. Firstly, we introduce the CBAM attention mechanism, which enhances the model’s multi-scale fire feature extraction capabilities by adaptively adjusting weights in both the channel and spatial dimensions of feature maps, thereby improving detection accuracy. Secondly, we propose an improved ShuffleNetV2 backbone network structure, which significantly reduces the model’s parameter count and computational complexity while maintaining feature extraction capabilities. This results in a more lightweight and efficient model. Thirdly, to address the challenges of varying fire scales and numerous weak emission targets in mountain fires, we propose a Quadrupled-ASFF detection head for weighted feature fusion. This enhances the model’s robustness in detecting targets of different scales. Finally, we introduce the WIoU loss function to replace the traditional CIoU object detection loss function, thereby enhancing the model’s localization accuracy. The experimental results demonstrate that the improved model achieves an mAP@50 of 96.87%, which is superior to the original YOLOV8, YOLOV9, and YOLOV10 by 10.9, 11.66, and 13.33 percentage points, respectively. Moreover, it exhibits significant advantages over other classic algorithms in key evaluation metrics such as precision, recall, and F1 score. These findings validate the effectiveness of the improved model in mountain fire detection scenarios, offering a novel solution for early warning and intelligent monitoring of mountain wildfires.