The lack of long term and well distributed precipitation observations on the Tibetan Plateau (TiP) with its complex terrain raises the need for other sources of precipitation data for this area. Satellite-based precipitation retrievals can fill those data gaps. Before precipitation rates can be retrieved from satellite imagery, the precipitating area needs to be classified properly. Here, we present a feasibility study of a precipitation area delineation scheme for the TiP based on multispectral data with data fusion from the geostationary orbit (GEO, Insat-3D and Elektro-L2) and a machine learning approach (Random Forest, RF). The GEO data are used as predictors for the RF model, extensively validated by independent GPM (Global Precipitation Measurement Mission) IMERG (Integrated Multi-satellitE Retrievals for GPM) gauge calibrated microwave (MW) best-quality precipitation estimates. To improve the RF model performance, we tested different optimization schemes. Here, we find that (1) using more precipitating pixels and reducing the amount of non-precipitating pixels during training greatly improved the classification results. The accuracy of the precipitation area delineation also benefits from (2) changing the temporal resolution into smaller segments. We particularly compared our results to the Infrared (IR) only precipitation product from GPM IMERG and found a markedly improved performance of the new multispectral product (Heidke Skill Score (HSS) of 0.19 (IR only) compared to 0.57 (new multispectral product)). Other studies with a precipitation area delineation obtained a probability of detection (POD) of 0.61, whereas our POD is comparable, with 0.56 on average. The new multispectral product performs best (worse) for precipitation rates above the 90th percentile (below the 10th percentile). Our results point to a clear strategy to improve the IMERG product in the absence of MW radiances.
Read full abstract