The escalating volume of construction activities and resultant waste generation underscores the imperative for developing sophisticated segmentation models to facilitate efficient sorting and recycling processes. This study introduces WasteSAM, an enhanced iteration of the segment anything model (SAM), specifically tailored to address the intricate complexities inherent in construction waste imagery. Drawing upon a comprehensive dataset comprising over 15,000 masks representing five distinct categories of construction materials, WasteSAM exhibits notably superior segmentation capabilities. Quantitative analysis demonstrates significant performance improvements, with WasteSAM outperforming the original SAM model by an average of 23.9% in dice similarity coefficient and 30.0% in normalized surface distance metrics. The integration of stereo-image techniques in refining the training dataset has facilitated WasteSAM in more accurately discerning the three-dimensional structure of waste materials, thereby augmenting the precision of waste classification. Noteworthy is the model’s adeptness in handling intricate textures and patterns across diverse imaging modalities, including varying lighting conditions and complex object interactions. While showing promising results, this study also highlights the need for high-quality, diverse datasets that reflect real-world construction site complexities, rather than merely larger datasets.
Read full abstract