Machine learning-based techniques for land subsidence simulation in an urban area
Machine learning-based techniques for land subsidence simulation in an urban area
23
- 10.1007/s11069-021-05171-0
- Jan 29, 2022
- Natural Hazards
12
- 10.1002/ldr.749
- Jul 1, 2006
- Land Degradation & Development
69
- 10.1080/15481603.2017.1331511
- May 25, 2017
- GIScience & Remote Sensing
70
- 10.1007/s12665-019-8254-8
- Apr 1, 2019
- Environmental Earth Sciences
213
- 10.1029/2008gl033814
- Aug 1, 2008
- Geophysical Research Letters
33
- 10.1111/wej.12688
- Feb 15, 2021
- Water and Environment Journal
217
- 10.1007/s00254-005-0010-6
- Jul 16, 2005
- Environmental Geology
46
- 10.1016/j.scitotenv.2021.149244
- Jul 24, 2021
- Science of The Total Environment
137
- 10.1007/s00254-007-0680-3
- Feb 24, 2007
- Environmental Geology
22
- 10.1007/bf02687252
- Jan 1, 1999
- Mine Water and the Environment
- Research Article
- 10.1080/02626667.2025.2494866
- May 16, 2025
- Hydrological Sciences Journal
ABSTRACT Rapid urbanization and extreme weather events exert significant pressure on urban geological environments, yet the impact of extreme floods on ground deformation, particularly the interplay between underground engineering, building density, and surface response, remains unclear. Using Enhanced Small Baseline Subset (E-SBAS) InSAR technology and machine learning methods, this study analysed the impact of the 2020 Chongqing flood on ground deformation patterns. Results shifted from uniform subsidence (−5.19 ± 8.66 mm/year) pre-flood to significant spatial differentiation (−0.16 ± 125.46 mm/year) post-flood. Areas within 250 m of subway lines shifted from subsidence (−25 to −5 mm/year) to uplift (0 to 15 mm/year), with high-risk zones (35%) exceeding the study area’s average (15%). Machine learning revealed land cover type (18.25%) and subway proximity (14.45%) to be key factors affecting flood susceptibility. These findings emphasize the importance of coordinated surface and underground development in urban disaster prevention.
- Research Article
- 10.1016/j.ijdrr.2025.105429
- Apr 1, 2025
- International Journal of Disaster Risk Reduction
Integrated assessment of flood and subsidence hazards: A strategic approach for risk mitigation and water sustainability
- Research Article
- 10.1007/s11269-025-04256-w
- Jun 13, 2025
- Water Resources Management
Artificial Intelligence Prediction of Water Quality of Complex Urban River Networks
- Research Article
- 10.3389/feart.2025.1522488
- Mar 10, 2025
- Frontiers in Earth Science
The geotechnical properties of soil are crucial in determining the stability of foundations and construction safety in regions with high groundwater levels, such as Warsak Road in Peshawar, Pakistan. Due to its proximity to the Warsak Dam and intersecting irrigation canals, the area experiences a consistently high water table, which significantly impacts soil stability, leading to potential issues such as excessive settlement, reduced shear strength, and increased structural instability. These groundwater conditions pose unique challenges for foundation stability, making it essential to develop a comprehensive understanding of the soil’s consolidation behavior and shear strength properties. To address these concerns, this study employs a combined experimental and numerical approach, aiming to evaluate these critical soil properties in detail. The experimental phase involved collecting three undisturbed soil samples from each of the five distinct sites along Warsak Road, spaced approximately 5 km apart. These samples were subjected to standardized laboratory tests, including grain size distribution, specific gravity, Atterberg Limits, direct shear, unconfined compression, and oedometer tests, per ASTM standards. To further validate the laboratory findings, numerical analysis using PLAXIS software was conducted, along with analytical evaluations using the Meyerhof and Vesic bearing capacity equations. This integrated methodology provided a comprehensive understanding of the soil’s behavior under varying conditions, revealing distinct variations in the average values of the three samples from each site. Specifically, Site 1 exhibited an average cohesion of 18.22 kN/m2, making it suitable for low-rise structures, whereas Site 2, with an average cohesion of 15.23 kN/m2, indicated the need for stabilization due to its high consolidation potential. Site 3, averaging 13.3 kN/m2, showed higher settlement risk, necessitating deep foundations, while Site 4, with the lowest average cohesion of 9.94 kN/m2, was deemed unsuitable for heavy loads without reinforcement. In contrast, Site 5, having the highest average cohesion of 20.2 kN/m2, demonstrated excellent stability, ideal for multi-story buildings and other heavy structures. The numerical results from PLAXIS offered a more accurate understanding of soil behavior compared to the traditional Meyerhof and Vesic methods, highlighting the necessity of integrating advanced numerical techniques with conventional approaches. Accordingly, targeted soil improvement measures are recommended for weak and highly compressible soils to ensure the long-term stability and safety of structures in the region.
- Research Article
6
- 10.1007/s12145-024-01236-3
- Feb 15, 2024
- Earth Science Informatics
K-Means Featurizer: A booster for intricate datasets
- Research Article
27
- 10.1016/j.jenvman.2024.122130
- Aug 23, 2024
- Journal of Environmental Management
Multifaceted anomaly detection framework for leachate monitoring in landfills
- Research Article
- 10.3390/su17052315
- Mar 6, 2025
- Sustainability
Resource conflicts constitute a major global issue in areas rich in natural resources. The modeling of factors influencing natural resource conflicts (NRCs), including environmental, health, socio-economic, political, and legal aspects, presents a significant challenge compounded by inadequate data. Quantitative research frequently emphasizes large-scale conflicts. This study presents a novel multilevel approach, SEFLAME-CM—Spatially Explicit Fuzzy Logic-Adapted Model for Conflict Management—for advancing understanding of the relationship between NRCs and drivers under territorial and rebel-based typologies at a community level. SEFLAME-CM is hypothesized to yield a more robust positive correlation between the risk of NRCs and the interacting conflict drivers, provided that the conflict drivers and input variables remain the same. Local knowledge from stakeholders is integrated into spatial decision-making tools to advance sustainable peace initiatives. We compared our model with spatial multi-criteria evaluation for conflict management (SMCE-CM) and spatial statistics. The results from the Moran’s I scatter plots of the overall conflicts of the SEFLAME-CM and SMCE-CM models exhibit substantial values of 0.99 and 0.98, respectively. Territorial resource violence due to environmental drivers increases coast-wards, more than that stemming from rebellion. Weighing fuzzy rules and conflict drivers enables equal comparison. Environmental variables, including proximity to arable land, mangrove ecosystems, polluted water, and oil infrastructures are key factors in NRCs. Conversely, socio-economic and political factors seem to be of lesser importance, contradicting prior research conclusions. In Third World nations, local communities emphasize food security and access to environmental services over local political matters amid competition for resources. The synergistic integration of fuzzy logic analysis and community perception to address sustainable peace while simultaneously connecting environmental and socio-economic factors is SEFLAME-CM’s contribution. This underscores the importance of a holistic approach to resource conflicts in communities and the dissemination of knowledge among specialists and local stakeholders in the sustainable management of resource disputes. The findings can inform national policies and international efforts in addressing the intricate underlying challenges while emphasizing the knowledge and needs of impacted communities. SEFLAME-CM, with improvements, proficiently illustrates the capacity to model intricate real-world issues.
- Research Article
3
- 10.3390/land13030322
- Mar 2, 2024
- Land
Land subsidence (LS) due to natural and human-driven forces (e.g., earthquakes and overexploitation of groundwater) has detrimental and irreversible impacts on the environmental, economic, and social aspects of human life. Thus, LS hazard mapping, monitoring, and prediction are important for scientists and decision-makers. This study evaluated the performance of seven machine learning approaches (MLAs), comprising six classification approaches and one regression approach, namely (1) classification and regression trees (CARTs), (2) boosted regression tree (BRT), (3) Bayesian linear regression (BLR), (4) support vector machine (SVM), (5) random forest (RF), (6) logistic regression (LogR), and (7) multiple linear regression (MLR), in generating LS susceptibility maps and predicting LS in two case studies (Semnan Plain and Kashmar Plain in Iran) with varying intrinsic characteristics and available data points. Multiple input variables (slope, aspect, groundwater drawdown, distance from the river, distance from the fault, lithology, land use, topographic wetness index (TWI), and normalized difference vegetation index (NDVI)), were used as predictors. BRT outperformed the other classification approaches in both case studies, with accuracy rates of 75% and 74% for Semnan and Kashmar plains, respectively. The MLR approach yielded a Mean Square Error (MSE) of 0.25 for Semnan plain and 0.32 for Kashmar plain. According to the BRT approach, the variables playing the most significant role in LS in Semnan Plain were groundwater drawdown (20.31%), distance from the river (17.11%), land use (14.98%), NDVI (12.75%), and lithology (11.93%). Moreover, the three most important factors in LS in Kashmar Plain were groundwater drawdown (35.31%), distance from the river (23.1%), and land use (12.98%). The results suggest that the BRT method is not significantly affected by data set size, but increasing the number of training set data points in MLR results in a decreased error rate.
- Research Article
- 10.1016/j.jsames.2025.105433
- Apr 1, 2025
- Journal of South American Earth Sciences
Monitoring land subsidence using Sentinel-1A, persistent scatterer InSAR, and machine learning techniques
- Research Article
- 10.1038/s41598-025-05376-4
- Jul 7, 2025
- Scientific Reports
Extracting induced polarization (IP) information from transient electromagnetic (TEM) signals is crucial for the exploration of deep mineral, oil, and gas resources.. Linear inversion technology is the preferred method for extracting IP information, but it is associated with three primary drawbacks: dependence on the initial conditions, susceptibility to falling into a local optimum, and a significant lack of uniqueness. To solve the above problems, this study presents an improved shuffle frog leaping algorithm (ISFLA) that incorporates tent chaotic distribution and an adaptive mobile factor, which is employed to extract IP information. First, a tent chaotic operator is adopted to enhance the initial population distribution, thereby improving the global search capability. Then, an adaptive mobile factor is designed to replace the random operator, balancing local and global searches. This adjustment increases solution accuracy and ensures stable convergence in the later stages. Finally, TEM inversion for a 1D layered geoelectric model with IP information is performed using the proposed ISFLA approach. The inversion results show that the ISFLA method can more effectively reconstruct the geoelectric structure, extract IP information, and exhibit greater robustness. Compared to other heuristic algorithms, the proposed method achieves superior global search ability and inversion accuracy, making it well-suited for IP information extraction.
- Research Article
- 10.1007/s43832-024-00073-1
- Apr 8, 2024
- Discover Water
Developing precise groundwater level (GWL) forecast models is essential for the optimal usage of limited groundwater resources and sustainable planning and management of water resources. In this study, an improved forecasting accuracy for up to 3 weeks ahead of GWLs in Bangladesh was achieved by employing a coupled Long Short Term Memory (LSTM) network-based deep learning algorithm and Maximal Overlap Discrete Wavelet Packet Transform (MODWPT) data preprocessing. The coupled LSTM-MODWPT model’s performance was compared with that of the LSTM model. For both standalone LSTM and LSTM-MODWPT models, the Random Forest feature selection approach was employed to select the ideal inputs from the candidate GWL lags. In the LSTM-MODWPT model, input GWL time series were decomposed using MODWPT. The ‘Fejér-Korovkin’ mother wavelet with a filter length of 18 was used to obtain a collection of scaling coefficients and wavelets for every single input time series. Model performance was assessed using five performance indices: Root Mean Squared Error; Scatter Index; Maximum Absolute Error; Median Absolute Deviation; and an a-20 index. The LSTM-MODWPT model outperformed standalone LSTM models for all time horizons in GWL forecasting. The percentage improvements in the forecasting accuracies were 36.28%, 32.97%, and 30.77%, respectively, for 1-, 2-, and 3-weeks ahead forecasts at the observation well GT3330001. Accordingly, the coupled LSTM-MODWPT model could potentially be used to enhance multiscale GWL forecasts. This research demonstrates that the coupled LSTM-MODWPT model could generate more precise GWL forecasts at the Bangladesh study site, with potential applications in other geographic locations globally.
- Research Article
18
- 10.3390/su141811598
- Sep 15, 2022
- Sustainability
Daily groundwater level is an indicator of groundwater resources. Accurate and reliable groundwater level (GWL) prediction is crucial for groundwater resources management and land subsidence risk assessment. In this study, a representative deep learning model, long short-term memory (LSTM), is adopted to predict groundwater level with the selected predictors by partial mutual information (PMI), and bootstrap is employed to generate different samples combination for training many LSTM models, and the predicted values by many LSTM models are used for the uncertainty assessment of groundwater level prediction. Two wells of different climate zones in the USA were used as a case study. Different significant predictors of GWL for two wells were identified by PMI from candidate predictors incorporating teleconnection patterns information. The results show that GWL is significantly affected by antecedent GWL, AO, Niño 3.4, Niño 1 + 2, and precipitation in humid areas, and by antecedent GWL, AO, Niño 3.4, Niño 3, Niño 1 + 2, and PNA in arid areas. Predictor selection can assist in improving the prediction performance of the LSTM model. The relationship between GWL and significant predictors were modeled by the LSTM model, and it achieved higher accuracy in humid areas, while the performance in arid areas was poorer due to limited precipitation information. The performance of LSTM was improved by increasing correlation coefficient (R2) values by 10% and 25% for 2 wells compared to generalized regression neural network (GRNN). Three uncertainty evaluation metrics indicate that LSTM reduced the uncertainty compared to GRNN model. LSTM coupling with PMI and bootstrap can be a promising approach for accurate and reliable groundwater level prediction for different climate zones.
- Research Article
5
- 10.3390/w15061115
- Mar 14, 2023
- Water
Wetland ecosystems with proper functioning provide various ecosystem services. Therefore, their conservation and restoration are of fundamental importance for sustainable development. This study used a deep learning model for groundwater level prediction to evaluate a wetland restoration project implemented in the Kushiro Wetland in Japan. The Kushiro Wetland had been degraded due to river improvement work. However, in 2010, a wetland restoration project was carried out to restore the meandering river channel, and a decade has passed since its completion. In this study, the wetland restoration project was evaluated by comparing the response of the groundwater level using a model that reproduced physical conditions with different characteristics before and after the restoration. At first, a deep learning model was created to predict groundwater levels pre- and post-restoration of a meandering river channel using observation data. Long short-term memory (LSTM) was used as the deep learning model. The most important aspect of this study was that LSTM was trained for each of the pre- and post-restoration periods when the hydrological and geological characteristics changed due to the river channel’s restoration. The trained LSTM model achieved high performance with a prediction error of the groundwater levels within 0.162 m at all observation points. Next, the LSTM models trained with the observation data of the post-restoration period were applied to evaluate the effectiveness of the meandering channel restoration. The results indicated that the meandering channel restoration improved hydrological processes in groundwater levels, i.e., their rainfall response and average groundwater water levels. Furthermore, the variable importance analysis of the explanatory variables in the LSTM model showed that river discharge and precipitation significantly contributed to groundwater level recovery in the Kushiro Wetland. These results indicated that the LSTM model could learn the differences in hydrological and geological characteristics’ changes due to channel restoration to groundwater levels. Furthermore, LSTM is a data-driven deep learning model, and by learning hydrological and geological conditions to identify factors that may affect groundwater levels, LSTM has the potential to become a powerful analysis method that can be used for environmental management and conservation issues.
- Research Article
12
- 10.5194/piahs-382-505-2020
- Apr 22, 2020
- Proceedings of the International Association of Hydrological Sciences
Abstract. With the rapid growth of data volume and the development of artificial intelligence technology, deep-learning methods are a new way to model land subsidence. We utilized a long short-term memory (LSTM) model, a deep-learning-based time-series processing method to model the land subsidence under multiple influencing factors. Land subsidence has non-linear and time dependency characteristics, which the LSTM model takes into account. This paper modelled the time variation in land subsidence for 38 months from 2011 to 2015. The input variables included the change in land subsidence detected by InSAR technology, the change in confined groundwater level, the thickness of the compressible layer and the permeability coefficient. The results show that the LSTM model performed well in areas where the subsidence is slight but poorly in places with severe subsidence.
- Research Article
- 10.1038/s41598-025-16454-y
- Aug 22, 2025
- Scientific reports
Intensive groundwater extraction and a severe 2021 drought have worsened land subsidence in Taiwan's Choshui Delta, highlighting the need for effective predictive modeling to guide mitigation. In this study, we develop a machine learning framework for subsidence analysis using electricity consumption data from pumping wells as a proxy for groundwater extraction. A long short-term memory (LSTM) neural network is trained to reconstruct missing subsidence records and forecast subsidence trends, while an artificial neural network links well electricity usage to groundwater level fluctuations. Using these tools, we identify groundwater-level decline from pumping as a key driver of subsidence. The LSTM model achieves high accuracy in reproducing historical subsidence and provides reliable predictions of subsidence behavior. Scenario simulations indicate that reducing groundwater pumping, simulated by lowering well electricity use, allows groundwater levels to recover and significantly slows the rate of land subsidence. To assess the effectiveness of pumping reduction strategies, two artificial scenarios were simulated. The average subsidence rate at the Xiutan Elementary School multi-layer compression monitoring well (MLCW) decreased from 2.23 cm/year (observed) to 1.94 cm/year in first scenario and 1.34 cm/year in second scenario, demonstrating the potential of groundwater control in mitigating land subsidence. These findings underscore the importance of integrating groundwater-use indicators into subsidence models and demonstrate that curtailing groundwater extraction can effectively mitigate land subsidence in vulnerable deltaic regions.
- Research Article
5
- 10.30630/joiv.7.3-2.2344
- Nov 30, 2023
- JOIV : International Journal on Informatics Visualization
Cryptocurrencies created by Nakamoto in 2009 have gained significant interest due to their potential for high returns. However, the cryptocurrency market's unpredictability makes it challenging to forecast prices accurately. To tackle this issue, a deep learning model has been developed that utilizes Long Short-Term Memory (LSTM) neural networks and Convolutional Neural Networks (CNNs) to predict cryptocurrency prices. LSTMs, a type of recurrent neural network, are well-suited for analyzing time series data and have been successful in various prediction applications. Additionally, CNNs, primarily used for image analysis tasks, can be employed to extract relevant patterns and characteristics from input data in Bitcoin price prediction applications. This study contributes to the existing related works on cryptocurrency price prediction by exploring various predictive models and techniques, which involve a machine learning model, deep learning model, time series analysis, and as well as a hybrid model that combines deep learning methods to predict cryptocurrency prices as well as enhance the accuracy and reliability of the price predictions. To ensure accurate predictions in this study, a trustworthy dataset from investing.com was sought. The dataset, sourced from investing.com, consists of 1826 time series data samples. The dataset covers the time frame from January 1, 2018, to December 31, 2022, providing data for a period of 5 years. Subsequently, pre-processing was conducted on the dataset to guarantee the quality of the input. As a result of absent values and concerns regarding the dataset's obsolescence, an alternative dataset was sourced to avoid these issues. The performance of the LSTM and CNN models was evaluated using root mean squared error (RMSE), mean squared error (MSE), mean absolute error (MAE) and R-squared (R2). It was observed that they outperformed each other to a certain degree in short-term forecasts compared to long-term predictions, where the R2Â values for LSTM range from 0.973 to 0.986, while for CNNs, they range from 0.972 to 0.988 for 1 day, 3 days and 7 days windows length. Nevertheless, the LSTM model demonstrated the most favorable performance with the lowest error rate. The RMSE values for the LSTM model ranged from 1203.97 to 1645.36, whereas the RMSE values for the CNNs model ranged from 1107.77 to 1670.93. As a result, the LSTM model exhibited a lower error rate in RMSE and achieved the highest accuracy in R2Â compared to the CNNs model. Considering these comparative outcomes, the LSTM model can be deemed as the most suitable model for this specific case
- Conference Article
- 10.30632/spwla-2023-0076
- Jun 10, 2023
Precise rock lithology identification from well logs is critical for reservoir characterization and field development. Traditional knowledge-based lithology interpretation is highly dependent on the interpreter’s experience and judgment, which could lead to erroneous decision making or biased prediction. To reduce human involvement and improve interpretation efficiency and consistency, a knowledge-constrained long short-term memory (LSTM) network solution is introduced. In this study, LSTM networks are applied with different constrains to obtain the mapping relations and validate the knowledge-constrained LSTM model accordingly. The entire workflow mainly includes input logging data preprocessing, different constrain validations during the LSTM model training, and validation processes. This study covers and compares the direct LSTM model without constrains, rectangular constrain LSTM (RCLSTM), and Gaussian window weighted constrain LSTM (GWLSTM). In particular, GWLSTM applies a sample cluster as input instead of single sample points. The weight of the sample point is controlled by a distance-correlated Gaussian window, which means the closer to the predicting point, the greater the impact on the prediction. LSTM, RCLSTM, and GWLSTM models are tested on a field data set of five wells in a typical sandstone gas reservoir. Two wells are used to train the network, while the other three wells are used for network assessment. The test results demonstrate that by applying LSTM networks to establish the mapping between the logging curves (e.g., CNL, DT, DEN, GR, and RD) and rock lithology, rock lithologies in target formation can be predicted from well logs. Moreover, the lithology predictions by the GWLSTM model are more accurate than those of conventional LSTM and RCLSTM models, especially for thin layers. In conclusion, GWLSTM networks improve lithology identification accuracy by taking stratigraphic sequences into consideration. And the Gaussian window constrains are more effective than rectangular window constrains for thin layer predictions. Lastly, GWLSTM doesn’t require a large training data set, which makes it advantageous for reservoirs with limited wells.
- Research Article
57
- 10.3389/fbioe.2020.00063
- Feb 12, 2020
- Frontiers in Bioengineering and Biotechnology
Falls in the elderly is a major public health concern due to its high prevalence, serious consequences and heavy burden on the society. Many falls in older people happen within a very short time, which makes it difficult to predict a fall before it occurs and then to provide protection for the person who is falling. The primary objective of this study was to develop deep neural networks for predicting a fall during its initiation and descending but before the body impacts to the ground so that a safety mechanism can be enabled to prevent fall-related injuries. We divided the falling process into three stages (non-fall, pre-impact fall and fall) and developed deep neutral networks to perform three-class classification. Three deep learning models, convolutional neural network (CNN), long short term memory (LSTM), and a novel hybrid model integrating both convolution and long short term memory (ConvLSTM) were proposed and evaluated on a large public dataset of various falls and activities of daily living (ADL) acquired with wearable inertial sensors (accelerometer and gyroscope). Fivefold cross validation results showed that the hybrid ConvLSTM model had mean sensitivities of 93.15, 93.78, and 96.00% for non-fall, pre-impact fall and fall, respectively, which were higher than both LSTM (except the fall class) and CNN models. ConvLSTM model also showed higher specificities for all three classes (96.59, 94.49, and 98.69%) than LSTM and CNN models. In addition, latency test on a microcontroller unit showed that ConvLSTM model had a short latency of 1.06 ms, which was much lower than LSTM model (3.15 ms) and comparable with CNN model (0.77 ms). High prediction accuracy (especially for pre-impact fall) and low latency on the microboard indicated that the proposed hybrid ConvLSTM model outperformed both LSTM and CNN models. These findings suggest that our proposed novel hybrid ConvLSTM model has great potential to be embedded into wearable inertial sensor-based systems to predict pre-impact fall in real-time so that protective devices could be triggered in time to prevent fall-related injuries for older people.
- Research Article
18
- 10.1016/j.scitotenv.2023.167482
- Oct 13, 2023
- Science of The Total Environment
Land subsidence prediction in Zhengzhou's main urban area using the GTWR and LSTM models combined with the Attention Mechanism
- Research Article
15
- 10.1016/j.jhydrol.2023.130076
- Aug 10, 2023
- Journal of Hydrology
Predicting the performance of green stormwater infrastructure using multivariate long short-term memory (LSTM) neural network
- Research Article
1
- 10.1016/j.jclepro.2024.144300
- Nov 28, 2024
- Journal of Cleaner Production
This study breaks new ground by using the Temporal Fusion Transformer (TFT) method for groundwater level prediction, addressing the complex dynamics of the Thames Basin aquifer in England. Our research combines extensive hydrological data collected from the Thames Basin with advanced machine learning, where a complex network of rivers and streams substantially affects groundwater dynamics. Unlike previous studies, this research focuses on long-term forecasting with deep learning, offering, for the first time, a 60-day prediction horizon based on daily data. To rigorously examine the model performance and robustness on new, unseen data, we applied the walk-forward validation method and other matrices such as RMSE and R2 coupled with the Holdout technique. The models used were Long Short-Term Memory (LSTM), Attention-based LSTM, LSTM with Bayesian optimisation, Attention-based LSTM with Bayesian optimisation and TFT. They were used on the basin's Chalk, Jurassic Limestone, and Lower greensand aquifers. Whilst both LSTM models were optimised using the Bayesian technique, TFT was applied for its inherent capability in complex time series. Our methodology processed historical groundwater and rainfall data from 2001 to 2023, accounting for the potential lag in aquifer response to the proximity of the river system. The dataset served as training, validation, and holdout for each model, focusing on capturing the dynamic temporal fluctuation. The results clearly showed the superiority of the TFT model in all aquifer types compared to other models across all horizons. The Limestone had the greatest result in the 7-day projections, with an RMSE of 0.02 and R2 of 0.98; Whilst the Chalk and Lower greensand, had RMSEs of 0.03 with R2 values of 0.75 and 0.95, respectively. The Limestone aquifer performed best for the 30-day horizon again (RMSE = 0.06, R2 = 0.85), with the Chalk and Lower greensand aquifer yielding RMSE of 0.04 and 0.12 and R2 values of 0.64 and 0.74, respectively. In the 60 days predictions, the best results were observed in the limestone aquifer with RMSE of 0.09 and R2 of 0.65 in holdout validation. However, in chalk and lower greensand aquifers, the TFT showed RMSEs of 0.05 and 0.15 and R2s of 0.45 and 0.58, respectively. Traditional LSTM models demonstrated limited predictive power compared to the main model TFT, while the attention mechanism slightly improved the accuracy. This study not only sets a new benchmark in hydrological modelling accuracy but also highlights the potential of advanced machine learning in managing complex aquifers and predicting the water table.
- Preprint Article
- 10.5194/egusphere-egu25-166
- Mar 18, 2025
Groundwater is an important water resource that is widely used worldwide for agricultural, industrial, and domestic purposes. In the case of Jeju Island, located in southern South Korea, groundwater is an indispensable water resource that accounts for 82% of the total water supply. Therefore, scientific prediction and management of groundwater levels are very important for the sustainable use of groundwater by citizens. This study additionally used precipitation data from the Baekrokdam Climate Change Observatory located on the summit of Jeju Island in artificial intelligence (AI) models to accurately predict one-month-ahead future groundwater levels for the mid-mountainous areas of Jeju Island, where groundwater levels are highly variable. In other words, the AI models compared and analyzed the improvement effect of the monthly groundwater level prediction performance for 1) using precipitation data from two rainfall stations, groundwater withdrawal data from two groundwater sources, and groundwater level data from two monitoring wells in the study area, and 2) adding precipitation data from Baekrokdam Climate Change Observatory. The study subjects are two groundwater level monitoring wells located at 435-471m above mean sea level in the southeast of Jeju Island. The AI models used to predict groundwater levels are Artificial Neural Network (ANN) and Long Short-Term Memory (LSTM), a deep learning AI model.As a result, when the Baekrokdam precipitation data were not used, the two AI models showed excellent groundwater level prediction performance with Nash-Sutcliffe efficiency (NSE) values of 0.871 or higher. The LSTM model showed relatively higher prediction performance for high and low groundwater levels than the ANN model. This means that the LSTM model adequately incorporates the seasonal effects of wet and dry periods into groundwater level simulations. The more volatile the observed groundwater level, the more difficult it is for the AI models to interpret the characteristics of groundwater level fluctuations, and the lower the performance of predicting future groundwater levels. When additional Baekrokdam precipitation data were used, the two AI models showed improved groundwater level prediction performance by having NSE values of 0.907 or higher. This means that the additional use of precipitation data located in the uppermost region provides more information to help interpret groundwater levels, allowing AI models to better interpret the characteristics of groundwater level fluctuations. In addition, the use of Baekrokdam precipitation data was more helpful in improving groundwater level prediction for the monitoring well, which has highly variable groundwater levels that are difficult to predict, and the ANN model with relatively low groundwater level prediction performance. When additional Baekrokdam precipitation data was used for a specific monitoring well, the groundwater level prediction performance of the ANN model was improved to a level comparable to that of the LSTM model, which is a deep learning AI, even with a relatively simple ANN model structure. This is an example of how important it is to use additional useful data in research using AI models.
- Book Chapter
- 10.1007/978-3-030-45183-7_16
- Jan 1, 2020
The long short-term memory (LSTM) model is widely used in multiple areas, mainly for speech recognition, natural language processing and activity recognition. In the last few years, we started to see many variants of LSTM for recurrent neural networks since its inception in 1997. However, there weren’t many studies that have addressed the LSTM’s gating mechanism. In this paper, we propose a novel LSTM framework where we modify the architecture of the LSTM unit by adding a new layer that we call the “outlier gate”. The latter controls the flow of information that goes into the LSTM cell. This added signal allows us to avoid both the carry-over effect that the outliers have on the forecasted point and a bias in the estimates of our LSTM model – caused by unusual or non-repetitive events. The proposed architecture led us to an end-to-end trainable model that we applied in this paper to a financial time-series forecasting problem. Our results demonstrate that the new proposed LSTM architecture achieves better performance than the state-of-the-art original LSTM model.
- Research Article
478
- 10.1016/j.jhydrol.2020.125188
- Jun 17, 2020
- Journal of Hydrology
Short-term runoff prediction with GRU and LSTM networks without requiring time step optimization during sample generation
- Research Article
- 10.57197/jdr-2025-0590
- Jan 1, 2025
- Journal of Disability Research
Wearable electroencephalography (EEG) devices for seizure detection accuracy and reliability are deep learning (DL) applications in the field of epilepsy diagnosis. In this study, we sought to increase the accuracy of seizure detection using advanced DL algorithms on the Children’s Hospital Boston - Massachusetts Institute of Technology (CHB-MIT) EEG database. First, a fully convolutional network (FCN) was trained and assessed using accuracy and recall/precision metrics, and the early stopping technique was used to avoid overfitting. To assess the performance, the FCN was evaluated in terms of various metrics, including accuracy, precision, recall, F1-score, and receiver operating characteristic (ROC)-area under the curve (AUC). In addition, two-dimensional (2D) convolutional neural networks (CNNs) and long short-term memory (LSTM) models were used to model the database, and their performance was thoroughly measured using different metrics, graphs, and confusion matrices. Using LSTM variants, such as standard LSTM, bidirectional LSTM, stacked LSTM, and LSTM attention mechanisms, hybrid convolutional LSTM (ConvLSTM) models were trained and compared. The comparison was conducted based on the training and validation accuracy and loss, as well as the graphs resulting from the precision–recall curves. Apart from DL approaches, EEG signal analysis using time–frequency techniques, such as wavelet transform and short-time Fourier transform, has also been investigated. These methods assisted in the analysis of the time–frequency features of EEG signals in combination with DL models. This study demonstrates that the performance of wearable EEG devices can be augmented using a combination of DL and seizure signal processing techniques. The FCN achieved an accuracy of 92%, with a recall for seizures of 33%, an F1-score of 0.03, and strong ROC-AUC results. The 2D CNN achieved 96% accuracy, a seizure recall of 70%, an F1-score of 0.12, and an ROC-AUC score of 78%. The baseline LSTM struggled with effectiveness at 53% accuracy with a seizure recall of 18%. In contrast, the LSTM model, which incorporated synthetic minority oversampling technique (SMOTE) balancing, was able to reach improvements of up to 89% accuracy, with a precision of 91%, a recall of 86%, an F1-score of 0.89, and a strong ROC curve performance. Among the models, the LSTM with SMOTE was the best performer, with 89% accuracy, 91% precision, 86% recall, and an F1-score of 0.89. These results provide evidence that applying techniques for data balancing in combination with certain DL network architectures significantly improves the detection of seizures using wearable EEG devices worn on the body. We believe that real-time monitoring and high-performance systems are feasible using optimized DL frameworks. The analysis of the performance of different models allows for the understanding of the possibilities of optimizing the architectures of DL algorithms for the modern diagnosis of epilepsy in real time. The source code used to carry out the experiments is publicly available at CHB-MIT EEG Dataset Python Scripts (https://www.kaggle.com/code/adnankust/adnaneeg1).
- New
- Research Article
- 10.1016/j.jenvman.2025.127831
- Nov 7, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127801
- Nov 7, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127873
- Nov 7, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127927
- Nov 7, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127910
- Nov 6, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127913
- Nov 6, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127741
- Nov 6, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127875
- Nov 6, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127814
- Nov 6, 2025
- Journal of environmental management
- New
- Research Article
- 10.1016/j.jenvman.2025.127453
- Nov 1, 2025
- Journal of environmental management
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.