Selecting the appropriate training technique is a significant step in utilizing intelligent approaches. It becomes even more important when it comes to critical problems like analyzing the bearing capacity of foundations. This study investigates the feasibility of a capable metaheuristic algorithm, called the water cycle algorithm (WCA), for training a multi-layer perceptron (MLP). The WCA-MLP is applied to a large finite element dataset to predict the settlement. The results of this model are compared with electromagnetic field optimization (EFO) and shuffled complex evolution (SCE) benchmarks. With reference to the obtained Pearson correlation factors (larger than 0.88 in all stages), all employed models are suitable for the mentioned objective. Moreover, it was observed that the training error of the WCA was 5.84 and 3.89% smaller than the EFO and SCE, respectively. Likewise, the accuracy of the WCA-MLP was 1.85 and 2.04% larger in the testing phase. Also, a predictive equation is finally elicited for practical applications in compatible circumstances.