• Reasonable asphalt mixture design can mitigate the cracking problems. • Alligator cracking (AC) and longitudinal (LC) cracking are predicted from asphalt mixture properties. • The effect of three different dimension reduction methods on the performance of ML models is analyzed. • ML algorithms includes Support vector regression, Artificial Neural Networks, Kernel ridge regression, Gradient boosting, Extra-trees, eXtreme Gradient Boosting. • The paper improves the current asphalt mix design by adding the criteria of AC and LC and ML models. The asphalt mix design based on traditional laboratory fatigue cracking tests of asphalt mixture is not reasonable due to the difficulty of simulating the circumstance where asphalt mixture experiences in the pavement structure and under realistic climate and traffic conditions. The study aims to mitigate the cracking problems during the pavement design life, by improving the asphalt mix design process by introducing machine learning (ML) models, which are used to predict alligator cracking (AC) and longitudinal cracking (LC), and their criteria (required ranges). The data containing AC and LC were extracted from the NCHRP 1-37A report and the long-term pavement performance (LTPP) program. A total of 33 input features about climate condition, traffic, pavement structure, and materials properties of each pavement layer were selected. Support Vector Regression (SVR), Artificial Neural Networks (ANN), Kernel ridge regression (KRR), Gradient boosting (GB), Extra-trees, and eXtreme Gradient Boosting (XGBoost) were used. Meanwhile, three dimensionality reduction approaches, including Auto-Encoder (AE), Principal Component Analysis (PCA), and Recursive feature elimination with Random Forest, were combined with the six ML algorithms (18 hybrid models produced) in order to decrease the computation complexity and search for the optimum model. The 24 models (6 basic ML models plus 18 hybrid models) were trained, and their hyperparameters were tuned using Bayesian optimization. Different evaluation criteria R 2 , RMSE, MAE, SMAPE, and SI are calculated to evaluate the performance of these models. The results of the study showed that except for ANN and SVR, the basic ML models can get better for predicting AC and LC when they are combined with PCA or AE. Based on the performance indices, the optimum among these developed models proved to be PCA-ANN for predicting AC (R 2 = 0.84) and LC (R 2 = 0.83). Compared to other pavement layers, the asphalt surface course is the highest contributor to AC and LC. The improved mix design process was implemented to determine the mix proportion for the asphalt surface course of a construction project and Mix-2 with 5.1 % asphalt content was recommended.