AbstractThe internal defects in rock masses can significantly impact the quality and safety of geotechnical projects. Mechanical waves, as a common nondestructive testing (NDT) method, can reflect the external and internal structures of rock or rock masses. Analyses on the reflected and transmitted waves enable nondestructive identification and assessment of potential defects within rocks. Previous studies mainly focused on the variation of single or limited wave features like main frequency, amplitude and energy between the intact and non‐intact samples. In fact, most information contained in the waveforms is neglected. Techniques of data mining can provide a powerful tool to reveal this information and therefore a more accurate determination of the internal structures. In this study, 995,412 NDT data from 14 types of granite and gypsum samples with different cross‐section shapes and different types of defects are recorded by an ultrasonic wave generation and collection system. This dataset can be used not only as the training data for defect classification in NDT but also as a good reference for conventional NDT analyses. Besides, time‐series data analysis is an opportunity and challenging issue, this dataset holds great potential for broader application in general time‐series classification analysis.