Abstract
Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer.
Highlights
Over the last decades, both the developed and the developing world have seen an increase in the frequency and severity of hydrometeorological disasters and their impacts
We introduce a variation of the cluster-based estimation (CBE) method that we call single variable distribution matching (SVDM), which only uses the most relevant variable
It is apparent from the highlighted numbers in this table that the best performing models in both case studies and for all evaluation criteria always have some form of sample selection bias correction included
Summary
Both the developed and the developing world have seen an increase in the frequency and severity of hydrometeorological disasters and their impacts. Natural hazard damage models predict the damages of a disaster given hazard characteristics such as the water depth of a flood (e.g., Merz, Kreibich, Schwarze, & Thieken, 2010) or the wind speed of a cyclone Damage models are increasingly used for providing impact information in early warning systems (e.g., Bachmann et al, 2016), and many national meteorological and hydrological organizations are attempting to move from hazard forecasts to impact-based forecasts (WMO, 2015) whereby damage models are essential Several actors, such as humanitarian organizations, can use these impactbased forecasts to initiate early actions that reduce risks just before a hazardous event (Coughlan de Perez et al, 2015). Damage models or so-called catastrophe models are applied in the insurance sector to determine premiums (Grossi & Kunreuther, 2005; Merz et al, 2010; Pielke, Landsea, Musulin, & Downton, 1999)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.