This article addresses the significant challenge posed by the rare occurrence of paper breaks in the pulp-and-paper industry, which, despite their infrequency during production, have considerable economic consequences. We analyze operational data from a paper manufacturing machine, focusing on the rare instances of paper breaks (124 out of 18,398 cases) identified through a quality assurance protocol. This scarcity presents a challenge for machine learning-based predictive models. To overcome this, we introduce a novel data augmentation strategy using Conditional Generative Adversarial Networks (CTGAN) and Synthetic Minority Oversampling Technique (SMOTE), aimed at generating synthetic data that not only reflects the real operational data’s distribution but also improves the predictive models’ performance metrics. We assess the impact of this data augmentation on three machine learning algorithms—Decision Trees (DT), Random Forest (RF), and Logistic Regression (LR)—both before and after augmentation. The use of a CTGAN-enhanced dataset led to notable enhancements in predictive maintenance metrics. Specifically, the ability of the models to detect machine breaks (Class 1) increased significantly: over 30% for Decision Trees, 20% for Random Forest, and nearly 90% for Logistic Regression. This methodological progress offers valuable contributions to the fields of industrial quality control and maintenance scheduling by improving the prediction of rare events in manufacturing processes.