Abstract

Data are the fibers out of which the extraordinary capabilities of machine learning (ML) are woven. But despite having gained widespread attention in civil engineering, ML still faces several overarching issues: small and incomplete training datasets, questionable generalization ability, and lack of physical interpretability. In fact, over-fitting is almost inescapable for models trained with limited data. Furthermore, the “black box” effect sharply restricts the use of ML. To deal with those issues, we here put forward a general integrated ML pipeline that, with the help of transfer learning and synthetic data augmentation, remains not only robust to highly limited data, but also enables engineers to carry transparent and interpretable analysis. To showcase the resulting advantages, the shear capacity of reinforced concrete (RC) beams strengthened with fiber-reinforced cementitious matrix (FRCM) is predicted as a demonstration. An experimental dataset containing only 91 tests is supplemented with data from a synthetic data generator, Synthpop. Six baseline ML algorithms are evaluated to seek out the most suitable model for predicting the shear capacity. It turns out that the GBDT model performs the best among all the algorithms assessed. To further boost its efficacy, a transfer learning algorithm, two-stage TrAdaboost, is modified to enhance the GBDT model which is taken as a base learner via re-weighting the synthetic data points. The resulting emulator is termed TrAGBDT. Thereafter, the SHapley Additive exPlanation (SHAP) approach is utilized which lends itself ideally to deciphering the mechanism of the TrAGBDT. A graphic user interface is also provided along with the pertinent Python-based source codes, thereby allowing users and developers to access the aforesaid emulator without any barrier. The findings of this study indicate that over-fitting is largely balanced out for the TrAGBDT, which greatly improves the extendibility and interpretability of the proposed ML-centric pipeline. Hence it can be a viable solution to promote ML applications in civil engineering practice.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call