Flash flood is one of the most dangerous natural phenomena because of its high magnitudes and sudden occurrence, resulting in huge damages for people and properties. Our work aims to propose a state-of-the-art model for susceptibility mapping of the flash flood using the decision tree random subspace ensemble optimized by hybrid firefly–particle swarm optimization (HFPS), namely the HFPS-RSTree model. In this work, we used data from a flood inventory map consisting of 1866 polygons derived from Sentinel-1 C-band synthetic aperture radar (SAR) data and a field survey conducted in the northwest mountainous area of the Van Ban district, Lao Cai Province in Vietnam. A total of eleven flooding conditioning factors (soil type, geology, rainfall, river density, elevation, slope, aspect, topographic wetness index (TWI), normalized difference vegetation index (NDVI), plant curvature, and profile curvature) were used as explanatory variables. These indicators were compiled from a geological and mineral resources map, soil type map, and topographic map, ALOS PALSAR DEM 30 m, and Landsat-8 imagery. The HFPS-RSTree model was trained and verified using the inventory map and the eleven conditioning variables and then compared with four machine learning algorithms, i.e., the support vector machine (SVM), the random forests (RF), the C4.5 decision trees (C4.5 DT), and the logistic model trees (LMT) models. We employed a range of statistical standard metrics to assess the predictive performance of the proposed model. The results show that the HFPS-RSTree model had the best predictive performance and achieved better results than those of other benchmarks with the ability to predict flash flood, reaching an overall accuracy of over 90%. It can be concluded that the proposed approach provides new insights into flash flood prediction in mountainous regions.