Generating speckle images is crucial for calibrating errors in digital image correlation (DIC) algorithms, alleviating the scarcity of training data for deep learning-based DIC algorithms, and even for the broader application of digital speckle methods. Traditional methods for speckle image synthesis struggle to accurately replicate the brightness variations in the foreground and background of realistic speckle images, and these methods come with high computational costs. Additionally, there needs to be more scientific and systematic evaluation methods for the quality of generated DIC speckle images. Our study presents a high-fidelity method for generating DIC speckle images with cracks based on a conditional diffusion model. We established evaluation metrics for generated DIC speckle images based on multiscale spatial and spectrum domains. We verified them on an open-source dataset of DIC speckle images depicting cracks in stone masonry walls. The conditional diffusion model excels at forward tasks (DIC crack image segmentation) and inverse tasks (DIC speckle image generation), with particular strength in the latter. The proposed spatial and multiscale spectrum domain metrics allow for the comprehensive and accurate evaluation of the alignment between the synthetic and natural speckle images. The conditional diffusion model for DIC speckle image generation and evaluation proposed in this paper has the potential to enhance further the accuracy and robustness of DIC technology in experimental mechanics and engineering applications.