Correcting sampling bias in species distribution models (SDMs) is challenging. The difficulty lies in accurately identifying and quantifying bias and the scarcity of samples, which greatly impedes the implementation of bias correction. Current methods often adjust the distribution of presence or background points within geographic or environmental spaces to correct the sampling bias in probability estimation within SDMs. However, these methods may lead to information loss, rely on subjective assumptions, and often separate geography and environment when correcting for bias. This study proposes a novel and easily implementable method termed “aggregation background.” This method selects background data based on the aggregation degree of presence points in the geographic and environmental feature space, thereby approximating the representation and correction of sampling bias in the presence samples. We compared this new method with other prevalent sampling bias correction methods in the existing literature by analyzing ecological authenticity. Under varying biases and sample sizes, the aggregation background and geographic filtering methods achieved more accurate species distribution predictions compared to the target group background and other methods. Notably, when the sample size was small (≤70), the aggregation background was superior to that obtained using the geographic filtering method. These findings underscore the effectiveness of the aggregation background in improving bias correction using limited available presence sample data, without relying on assumptions about sampling bias. Our method provides a new approach for correcting complex unknown biases in SDMs.
Read full abstract