In this paper, a global probability distribution structure-sparsity filter pruning is proposed to address the problem of difficult deployment of diagnostic models in resource constrained wireless sensor networks (WSNs) for edge fault diagnosis. Firstly, local and global weight distributions are analyzed. A global probability distribution weight sparsity method is proposed to obtain the global sparse boundary and the important features of the model. Secondly, according to the distribution of weight amplitude, the problem of uneven weight distribution is disclosed. A computational acceleration sensing method is proposed to reduce the high computational cost caused by small weight amplitude with low importance estimation. Moreover, a novel layer filter clustering pruning scheme is proposed. The intra-layer filters are classified and redundant filters are removed by using the clustering idea according to the predetermined pruning rate. Thus, the structure-sparsity filter pruning is realized. Compared with other advanced filter pruning methods, the proposed method analyzes the global and local model weight distribution and the influence of weight importance on the computational complexity of the model. The proposed method then performs filter pruning on the importance of overall filters for each network layer rather than individual filters. The accuracy of the proposed method on experimental dataset for pruned ResNet8 model under pruning rate at 0.95 is 99.12% with 3.668K parameters and 3.636MFLOPs. The experimental results elucidate that the proposed method is more suitable for deployment in resource-constrained WSNs for fault diagnosis of rotating machinery. This provides a potential solution for practical engineering applications.