AbstractThis study introduces image‐based classification techniques to identify whether trash screens in urban rivers are blocked. The study obtained 755 images from a CCTV camera surveying a trash screen located on an urban river at Tongwynlais in Cardiff. Manual quality control reduced the dataset to 577 images, labelled as either blocked (80%) or unblocked (20%). The performance of a logistic regression for classification of images was investigated using three different subsets of the labelled images: (1) the original dataset, (2) a balanced but under‐sampled dataset with equal number of blocked and unblocked images, and (3) an augmented dataset with an equal number of blocked and unblocked images using Gaussian noise augmentation to increase the number of unblocked images. Results show that our data‐augmentation method enhanced model accuracy by 8%, successfully classifying images as blocked or unblocked with an accuracy of 88%; by overcoming the bias in the dataset these results also highlight potential solutions to overcome the challenges of operating this methodology across a network of cameras. This enables authorities in both data rich and data scarce regions the ability to take advantage of machine learning to open up the next generation of a distributed, data‐driven flood warning systems, protecting people, infrastructure and the environment.
Read full abstract