Abstract

National Bridge Inventory (NBI) data is regularly collected for 617,000+ national bridges in the U.S. These data, which consist of 100+ fields related to bridges and culverts, have been shown to contain errors. These errors could reduce the effectiveness of the decisions made based on this data, and cause safety issues. For this reason, an anomaly detection platform is developed to identify data anomalies in NBI datasets more effectively than existing rule-based error-check tools can. First, the user provides groups of correlated NBI fields as input to the platform. Then, for each group, it utilizes two tools to detect anomalous data and determine errors. The first tool uses three machine learning algorithms to identify anomalous data points and categorizes them based on their degree of anomaly. The second tool visualizes the distributions of the NBI fields in the group with histograms, scatter plots, and so forth. These plots are used to analyze the data points that are identified from the first tool as anomalies. The results of these two tools, together with expert knowledge about the data fields, are then used to distinguish data errors from outliers. The proposed platform is applied to a state’s NBI dataset that was submitted to the Federal Highway Administration (FHWA) in 2020. For this dataset, two groups of correlated fields are considered. The results showed the platform could effectively pinpoint anomalous values of NBI fields that individually, or in conjunction with other fields, do not follow the patterns that characterize most of the data, prompting the identification of potential inconsistencies and errors.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.