The performance of regional groundwater level (GWL) prediction model hinges on understanding intricate spatiotemporal correlations among monitoring wells. In this study, a graph convolutional network (GCN) with a long short-term memory (LSTM) (GCN–LSTM) model is introduced for GWL prediction utilizing data from 16 wells located in the northeastern Xiangtan City, China. This model is designed to account for both the hybrid temporal dependencies and spatial autocorrelations among wells. It consists of two parts: the spatial part employs GCNs to extract spatial characteristics from a spatial self-similarity weight matrix and an attribute self-similarity weight matrix among wells; the temporal part utilizes a LSTM module to capture the temporal patterns of GWL sequences, along with monthly precipitation and temperature data. This model dynamically predicts changes in groundwater levels, achieving higher accuracy on average compared to single-well predictions using LSTM. By incorporating both temporal dependencies and spatial autocorrelations, the GCN–LSTM model demonstrated an average improvement in goodness-of-fit of approximately 11.21% over the LSTM-based model for individual wells. Its application holds significant reference value for the sustainable utilization and development of groundwater resources in Xiangtan City.

  • We proposed a novel model for groundwater level (GWL) prediction using a graph convolutional network combined with a long short-term memory (GCN–LSTM).

  • We designed spatial and attribute self-similarity weight matrices to effectively construct the GCN component.

  • The GCN-LSTM model outperforms individual well LSTM models in overall performance across all monitoring wells.

CNN

convolutional neural network

DL

deep learning

DT

decision tree

FC

fully connected

GCN

graph convolutional network

GWL

groundwater level

GNN

graphic neural network

GRU

gated recurrent unit

LSTM

long short-term memory

MAE

mean absolute error

ML

machine learning

RF

random forest

RNN

recurrent neural network

RMSE

root mean square error

Groundwater level (GWL) serves as an essential indicator of the groundwater environment and is influenced by topographical, geomorphological, and meteorologic factors (Scibek & Allen 2006; Zang et al. 2022). Achieving accurate GWL prediction relies on spatiotemporal modeling using rational influential factors and efficient methods (Arabameri et al. 2019). Some traditional and widely used hydrogeological investigation methods, e.g., field surveys, geophysical exploration, well drilling, experimental tests, and numerical simulations, are usually complex, expensive, and time-consuming (Goldman & Neubauer 1994; Qi et al. 2017). Although several groundwater numerical simulation software, e.g., MODFLOW and HYDRUS, include GWL modeling capabilities (Xu et al. 2012; Ghandehari et al. 2024; Zhang et al. 2024c), their practical application is limited by the need for extensive hydrogeological investigation and test data during the time-consuming model development and calibration processes, which is their main limitation in actual application. Additionally, geographical information system (GIS) and remote sensing (RS) techniques have been widely used in GWL modeling (Singh & Katpatal 2017; Qu et al. 2023). However, due to the fluctuating nature of GWL and its nonlinear relationship with multiple influencing factors, these experience-based and linear weighted models that rely on GIS and RS are often subjective and inaccurate in GWL prediction.

Data-driven machine learning (ML) methods aim to approximate the relationship between input influencing factors and output GWL through iterative leaning processes, without the need for explicitly defined physical parameters and models to describe the relationship (Rajaee et al. 2019; Uc-Castillo et al. 2023). Recently, shallow ML methods, e.g., support vector machines (SVMs) (Mackay et al. 2014), artificial neural networks (ANNs) (Gholami et al. 2015), decision trees (DTs) (Wu et al. 2022), random forests (RTs) (Akter et al. 2021; Khan et al. 2023), extreme gradient boosting (XGBoost) (Osman et al. 2021), adaptive neuro-fuzzy inference systems (ANFISs) (Milan et al. 2023; Boo et al. 2024b), and extreme learning machines (ELMs) (Poursaeid et al. 2022) have been increasingly used in GWL prediction to overcome the deficiencies of traditional methods. Although shallow ML models can capture nonlinear patterns in GWL sequences, the long-term prediction results often do not align well with actual observations.

GWL spatiotemporal prediction aims to extract the implicit, unknown, and meaningful relationship between GWL and the spatiotemporal sequence of influencing factors, in order to predict future GWLs based on this relationship. Therefore, it is necessary to consider not only the temporal interaction of different time points but also the interaction of different spatial locations in the spatiotemporal sequence of GWL and influencing factors (Evans et al. 2020; Ali et al. 2021). The spatial autocorrelation and temporal dependency also affect and interact with each other. Spatiotemporal sequence prediction methods can be based on regular grid and irregular graph data structures. Regular grid data, usually consisting of a series of images or transformed into an image-based model (Wang et al. 2022), can be processed using deep learning (DL) models, e.g., convolutional neural networks (CNNs) (Zhang et al. 2024a, 2024b), residual neural networks (ResNet) (Zhang et al. 2018), densely connected convolutional networks (DenseNet) (Xia et al. 2023), fully convolutional networks (FCNs) (Zhang et al. 2020, 2023b), and long short-term memory (LSTM) networks (Wu et al. 2021; Wunsch et al. 2021), to extract spatial characteristics, auxiliary information, and temporal patterns. However, the spatial distribution of realistic observation points is often irregular, making it difficult for regular grid-based CNNs to extract spatial characteristics from scattered points. To address this, Gori et al. (2005) proposed the concept of graphic neural networks (GNNs), using neural networks on non-Euclidean graphs. Kipf & Welling (2017) presented a scalable approach for semi-supervised learning on graph-structured data based on an efficient variant of CNNs, called graph convolutional networks (GCNs), which operate directly on graphs. GCN-based spatiotemporal prediction models have emerged as powerful tools in various fields, e.g., traffic flow prediction and social network analysis. Bai et al. (2023) proposed a spatiotemporal GNN based on gated convolution and topological attention (STGNN-GCTA), which models intricate spatiotemporal correlations in traffic flow with high accuracy and efficiency. Kumar et al. (2023) introduced a dynamic graph convolution LSTM (DyGCN–LSTM), which effectively captures complex spatial and temporal relationships among remote sensors, thereby enhancing traffic forecasting. Skarding et al. (2024) demonstrated that integrating heuristic methods with GCNs significantly improves dynamic link social network predictions. Therefore, to simultaneously capture spatial autocorrelation and temporal dependency, a GCN-based GWL spatiotemporal prediction model should be developed using irregular graphs to effectively model the intricate spatiotemporal dynamics of groundwater systems.

Using easily measurable and widely available influencing factors, GCN-based spatiotemporal methods can help improve the accuracy and efficiency of GWL sequence prediction. In this study, a GCN–LSTM model is proposed for GWL spatiotemporal prediction using GWL temporal sequence data from scattered monitoring wells and auxiliary spatiotemporal characteristics, e.g., temperature, precipitation, elevation, slope, slope aspect, and distance to streams. The model aims to reconstruct the nonlinear relationship between future GWL and historical GWL temporal sequences and influencing factors.

Study area

The study area is located in the northeastern Xiangtan City in the central-eastern Hunan Province, China, along the middle and lower reaches of the Xiangjiang River. It covers an area of about 187 km2. Xiangtan City experiences a typical mild and humid subtropical monsoon climate characterized by abundant rainfall, sufficient sunlight, and an average annual temperature of 18 °C. The region is known for its rich biological and crop resources. The entire area of Xiangtan City features a typical hilly landform, encompassing four mountains, the Xiangjiang River, and its two branches. The northwestern and southwestern parts of Xiangtan City have higher terrain, surrounded by four mountains, i.e., Shaofeng, Changshan, Xiaoxiashan, and Baozhongshan Mountains. In contrast, the northeastern and central parts of Xiangtan City have lower elevations and are traversed by the main stream of Xiangjiang River and its two branches, i.e., the Lianshui and Juanshui Rivers. The elevations in the study area are all below 1,000 m, and 16 monitoring wells are distributed in the northeastern part of Xiangtan City, as shown in Figure 1.
Figure 1

Spatial distribution of 16 monitoring wells: (a) terrain of Xiangtan City and location of the study area and (b) locations of 16 wells.

Figure 1

Spatial distribution of 16 monitoring wells: (a) terrain of Xiangtan City and location of the study area and (b) locations of 16 wells.

Close modal

The geological strata in the study area primarily consist of Quaternary Holocene alluvium and lacustrine deposits (Q4), Quaternary Middle Pleistocene alluvium and lacustrine deposits (Q2), and Mesozoic Upper Cretaceous red sandstone, calcareous mudstone, and sandy conglomerate (K2), as presented in Supplementary material, Table S1 and Figure S1. The Quaternary deposits are widely distributed near rivers and primarily comprise gravel, sand, peat, clay, etc. The Upper Cretaceous stratum in the study area was formed in a terrestrial sedimentary environment, consisting of sand-mudstone of near and shallow lacustric facies with gypsum interlayers deposited in salt lakes. The distribution of GWL is influenced by meteorologic and hydrological factors, while geological factors determine the rate and trend of GWL changes. The specific water yield of the aquifer, which varies according to the geological unit, plays a significant role in GWL changes when precipitation infiltrates into the underground to recharge the groundwater. A larger specific water yield results in less GWL rise, and vice versa.

GWL temporal sequence data

In this study, a total of 1,080 GWL original observations were collected from January of 2003 to December of 2017, with six records per month, from 16 wells in the study area observed by the Hunan Institute of Geological Disaster Investigation and Monitoring (http://hndk.hunan.gov.cn, 30 June, 2022), as presented in Supplementary material, Table S2 and Figure S2. Depths from the ground surface to the GWLs of the 16 wells range from 9.69 to 71.21 m.

The GWL data from 2003 to 2017 were divided into two parts, i.e., a training dataset (864 records) covering the period from 2003 to 2014, and a testing dataset (216 records) covering the period from 2015 to 2017. Subsequently, the GWL temporal sequence data were optimally partitioned into multiple periods using a fixed temporal step length L= 3 for all 16 wells. Each sampling interval occupies 5 days, resulting in an actual temporal span of L being 15 days. Each GWL period, of length L, along with its corresponding auxiliary spatiotemporal characteristics, was trained to predict the GWL for the subsequent observing time, as shown in Supplementary material, Figure S3. Thus, the GWL prediction task at a specific observing time was transformed into a regression problem solved by DL using the data from its previous period.

Auxiliary spatiotemporal characteristics

The spatial distribution of topographical and geomorphological factors, as presented in Figure 2, influences the GWLs of 16 wells in the study area. Surface water infiltration and atmospheric precipitation serve as the primary recharge sources for groundwater in this region. Additionally, during the periods of abundant water resources, the infiltration of rivers or lakes into adjacent groundwater aquifers also contributes to changes in GWLs. The Xiangjiang River and its branches act as the main channels for infiltration sources of groundwater in the area. To represent the impact of streams on GWLs, the distance between the monitoring wells and the streams was utilized as one of the influencing factors. The topographical influence factors of elevation and slope determine the direction and speed of surface water runoff, indirectly affecting the quantity of surface water infiltration and influencing GWLs (Rinderer et al. 2014; Erdbrügger et al. 2023; Warix et al. 2023). Additionally, slope and aspect have nonlinear, interactive, and time-dependent effects on near-surface solar radiation (Tian et al. 2001; Zou et al. 2007). Solar radiation serves as the primary energy source for ecosystems, significantly influencing the hydrological cycle and affecting GWLs by regulating plant evapotranspiration and soil moisture conditions.
Figure 2

Spatial characteristics of the study area: (a) distance from streams, (b) elevation, (c) slope, and (d) aspect.

Figure 2

Spatial characteristics of the study area: (a) distance from streams, (b) elevation, (c) slope, and (d) aspect.

Close modal
Monthly precipitation data from 2003 to 2017 were obtained from the Global Precipitation Measurement (GPM) dataset (https://disc.gsfc.nasa.gov/datasets), with a spatial resolution of approximately 10 km. Precipitation is a crucial influencing factor and a key source of groundwater, as it infiltrates the soil to recharge groundwater deposits and raise their levels(Osman et al. 2022). As shown in Figure 3(a), the precipitation data exhibit periodic fluctuations with seasonal changes, mirroring the corresponding GWL patterns in Well-4. Monthly temperature data from 2003 to 2017 were obtained from Chinese land surface temperature data product, with a spatial resolution of 5.6 km, which integrates advantages of the Moderate-resolution Imaging Spectroradiometer (MODIS) satellite and meteorologic station data, allowing for capturing annual, seasonal, and monthly changes in surface temperature (Zhao et al. 2020a). Figure 3(b) depicts the pronounced seasonal temperature changes that align with the GWL variations in Well-4. Due to the close proximity of the 16 monitoring wells, the monthly precipitation and temperature data exhibit low spatial variability and are identical across all wells, demonstrating periodic fluctuations.
Figure 3

(a) Precipitation and (b) temperature changes along with GWLs in Well-4.

Figure 3

(a) Precipitation and (b) temperature changes along with GWLs in Well-4.

Close modal
To ensure the robustness of the GWL prediction model, the GWL temporal sequence data and all auxiliary spatiotemporal characteristics were normalized using maximum-minimum normalization (Ahmed Osman et al. 2024) during the data preprocessing stage, as follows:
(1)
where x* ∈ [0,1] is the normalized value of a specific feature, x is the original value of the feature, xmax is the maximum value of the feature across all samples, and xmin is the minimum value of the feature across all samples.

In GWL prediction, DL methods only require GWL temporal sequences and auxiliary spatiotemporal characteristics as input data, without the need for detailed physical or hydrogeological characteristics. Consequently, DL methods are more effective in predicting dynamic changes and the high uncertainty of GWL. Moreover, DL methods can learn the intrinsic regularities of GWL from fewer training data, without requiring interaction information among multiple influencing factors (Rajaee et al. 2019).

For the prediction of GWL temporal sequences for an individual well, a GWL prediction model was constructed by combining a LSTM layer, a fully connected (FC) layer, and a dropout method. This model utilized historical GWLs and temporal meteorological information, i.e., precipitation and temperature. For the spatiotemporal prediction of GWLs across all monitoring wells, a GCN–LSTM model was proposed to simultaneously capture the spatial autocorrelations and temporal dependencies among GWL sequences and spatiotemporal influencing factors. The model incorporated the spatial location and attribute characteristics (distance from steams, elevation, slope, and aspect) of wells as additional inputs to a GCN for extracting the spatial autocorrelation between wells. The prediction results of GCN–LSTM model were compared with those of the LSTM model, which did not include a spatial autocorrelation extraction module, using a set of evaluation metrics.

GWL prediction of individual well by LSTM

Long short-term memory

Recurrent neural networks (RNNs) have a natural advantage in processing data with sequence changes. Sequence data are taken as input and are recursively processed in the transmission direction of the sequence, with all cyclic units linked in a chain (Hopfield 1982). However, standard RNNs suffer from the vanishing and exploding gradient problems when dealing with long-term dependencies (Rumelhart et al. 1986; Werbos 1990). LSTM, as a special type of RNN, overcomes these issues by introducing more complex gate units in the recursive module (Ghasemlounia et al. 2021). LSTM also has modules similar to RNN chains, but recursive modules have a more complex structure of gate units, as shown in Supplementary material, Figure S4. LSTM can effectively capture both short-term and long-term dependencies in the sequence data, enabling it to process long-term sequence data more efficiently (Hochreiter & Schmidhuber 1997).

Since GWL is a highly temporally dependent sequence data, ordinary DL models cannot capture the temporal dependency within the GWL sequence. However, LSTM can effectively capture the dependency between GWL sequence data. Therefore, LSTM was applied to process GWL and meteorological temporal sequence data in the GWL prediction of an individual well.

Model design

Multiple studies have demonstrated that stacked LSTM networks have the potential to generate highly accurate GWL predictions (Dey et al. 2021; Ghasemlounia et al. 2021; Boo et al. 2024a). Four models were designed for the GWL temporal sequence prediction of an individual well. These models consisted of two hidden layers, which included a LSTM layer with a FC layer or two LSTM layers, as shown in Supplementary material, Figure S5. Considering the limited data, non-linearity, and complexity of the GWL dynamic change system, and the large number of parameters in GWL DL prediction models, dropout (Srivastava et al. 2014) was used as a regularization method to overcome the overfitting and reduce training time. Dropout temporarily abandons a portion of neurons in the network during each training iteration according to a certain probability. This prevents the model from over-relying on specific neurons and reduces their cooperative adaptation ability.

GWL spatiotemporal prediction by GCN–LSTM

To simultaneously capture spatial autocorrelation and temporal dependence, GCNs integrated with gated CNNs (Yu et al. 2018), gated recurrent units (GRUs) (Zhao et al. 2020b), or LSTMs (Khodayar & Wang 2019; Ali et al. 2022) have been widely utilized in spatiotemporal sequence predictions. To leverage LSTM's advantage in processing temporal sequence data and GCN's advantage in learning spatial autocorrelation characteristics, a spatiotemporal prediction model of GWL based on GCN–LSTM is proposed, as shown in Figure 4. First, the spatial adjacency matrix among the monitoring wells is constructed as the spatial characteristics of GWLs as the input. The edges in the adjacency matrix are determined based on spatial distance, while the node attributes are the GWLs and their influencing factors at each well. The adjacency matrix is then input into the GCN–LSTM model, which adopts an encoder-decoder structure. In the encoder, multiple parallel GCN modules capture spatial autocorrelation among different wells through the graph structure and node characteristics. The extracted temporal sequence data with spatial autocorrelation are then passed to LSTM to capture further temporal dependency between the sequence data. Finally, the encoder generates an encoding vector, which is sent to the decoder. A FC network serves as a decoder to process the encoding vector features, convert the extracted spatiotemporal characteristic back to the original space, and output the predicted GWL for each well.
Figure 4

Structure of GCN–LSTM for GWL spatiotemporal prediction.

Figure 4

Structure of GCN–LSTM for GWL spatiotemporal prediction.

Close modal

Graph convolutional network

Similar to CNNs, GCNs are used for feature extraction. However, GCNs operate on graphs and extract spatial and attribute characteristics from graph to make predictions. For a graph G = (V, E), with N nodes, V represents the set of nodes, which are the objects in the graph, and E represents the set of edges, which are the connections among the objects. If each node Vi has D attribute features, then the attribute features can be represented by an N × D matrix X, and the graph can be represented by an N × N adjacency matrix A. These matrices X and A are the inputs to the GCN, as shown in Supplementary material, Figure S6. Currently, GCNs have been extensively applied in geosciences, e.g., slope deformation prediction (Ma et al. 2021), geochemical anomalies recognition (Guan et al. 2022), bedrock mapping (Zhang et al. 2023a), mineral prospectivity mapping (Zuo & Xu 2023), and three-dimensional geological modeling (Hillier et al. 2021).

In GCN, each node obtains its implicit characteristics from itself and all its neighbors. When there are multiple monitoring wells in the study area, GWL prediction can be regarded as a spatiotemporal sequence prediction. The dynamic GWL changes not only depend on time but also vary based on the locations of the wells. GCN can uncover the influence of spatial autocorrelations among wells to predict more accurate GWLs.

Expressing spatial characteristics by graph

A topological relationship among all wells was established using Delaunay triangulation, resulting in a spatial graph structure. The features are input in the form of a graph, and GCNs generate a new node representation by aggregating its connected nodes through edge connections. It is crucial to quantify the connection relationships among wells in the graph. An adjacency matrix was used to represent the spatial connections in the graph, thereby aggregating the node characteristics through the transformation of the adjacency matrix. If the 16 monitoring wells in the study area are considered as nodes, each well has a spatial autocorrelation and an attribute self-similarity relationship with other wells, which can be expressed as a spatial graph structure. Thus, the GWL of any well can be jointly represented by the GWLs of the other wells as follows:
(2)
where Xi represents the GWL of the ith well; W is a spatial autocorrelation matrix; represents an attribute self-similarity matrix; and N is the number of connected wells with the ith well.
The Euclidean distance was adopted to quantify spatial autocorrelation among wells and constructed the spatial weight matrix of edges using the inverse ratio of squared distance between wells. The spatial weight matrix W of the graph structure is calculated as follows:
(3)
where d is the Euclidean distance between the ith and jth wells.
The attribute self-similarity was measured between two wells using the cosine of the angle between their attribute vectors in the attribute vector space. A smaller angle between the attribute vectors indicates great attribute similarity between the two wells. For the ith well with an attribute vector and the jth well with an attribute vector , the attribute self-similarity between the two wells is expressed as the cosine of the angle between their attribute vectors:
(4)

Model evaluation

The performance of the GWL prediction models was evaluated using three common accuracy metrics (Khan et al. 2023; Boo et al. 2024a), i.e., coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE).

quantifies how well the regression curve fits the actual values. Its theoretical range is [–∞, 1], with higher values indicating better model performance, as follows:
(5)
RMSE represents the standard deviation of the differences between predicted and observed values, with a range of [0, + ∞]. Smaller values indicate a better model fit, as follows:
(6)
MAE measures the average absolute difference between predicted and observed values, also ranging from [0, + ∞]. Smaller values signify a better model fit, as follows:
(7)

Temporal sequence models based on LSTM

The models were initially trained using the training dataset and subsequently evaluated on the testing dataset. The average performances of the constructed four models for GWL prediction across all 16 wells were quantitatively evaluated using determination coefficient R2, RMSE, and MAE. The evaluation results on the testing dataset are presented in Table 1, which shows that the LSTM + FC model performs the best in terms of R2, RMSE, and MAE (R2 = 0.577, RMSE = 0.511 m, and MAE = 0.309 m), followed by the LSTM + LSTM model. The superior performance of the LSTM + FC model is likely due to its effective utilization the temporal sequence features captured by the LSTM network, integrated with the FC layer's efficient feature combination and transformation, thereby enhancing prediction accuracy. In contrast, the LSTM + Dropout + LSTM model performs the worst across all evaluation metrics. This maybe because, in the case of a small dataset, adding a Dropout layer can lead to overfitting or loss of crucial information, thereby reducing prediction accuracy.

Table 1

Average performances of four models for GWL prediction on the testing dataset across all 16 wells

ModelR2RMSE (m)MAE (m)
LSTM + FC 0.577 0.511 0.309 
LSTM + Dropout + FC 0.531 0.526 0.317 
LSTM + LSTM 0.556 0.517 0.311 
LSTM + Dropout + LSTM 0.513 0.578 0.380 
ModelR2RMSE (m)MAE (m)
LSTM + FC 0.577 0.511 0.309 
LSTM + Dropout + FC 0.531 0.526 0.317 
LSTM + LSTM 0.556 0.517 0.311 
LSTM + Dropout + LSTM 0.513 0.578 0.380 

Supplementary material, Figure S7 depicts the differences between the predicted and real GWL values of the four models from January 2015 to December 2017 across all the wells. Among the four models, the network consisting of a LSTM layer and a FC layer predicted GWL values that closely matching the true values, demonstrating good continuity and smooth transition. The LSTM + LSTM model also achieved good prediction results for GWLs. Comparison of these two models reveals that LSTM can capture the time dependence of long time series features of the well, while its fitting ability is insufficient. However, introducing the FC layer solves this problem and better captures the turning points of GWL peak changes, ensuring good continuity. The FC layer has strong nonlinear fitting capabilities and can optimize LSTM networks with poor fitting, thereby improving the model's prediction accuracy.

Spatiotemporal sequence model based on GCN–LSTM

Comparison with LSTM-based model for individual wells

LSTM-based models have proven effective for GWL prediction. Sheikh Khozani et al. (2022) combined LSTMs with auto-regressive integrated moving average (ARIMA) models, e.g., DTs and RTs, to capture both linear and nonlinear components of GWL fluctuations. Zeng et al. (2022) assessed lag times using the maximum information coefficient (MIC) algorithm and optimized LSTM hyperparameters with the grey wolf optimizer (GWO) to enhance GWL prediction accuracy. However, since GWLs represent typical spatiotemporal sequence data, it is challenging to account for their spatial autocorrelations using LSTM-based models (Wunsch et al. 2022). The GCN–LSTM model was trained using GWL data from 16 monitoring wells in the study area simultaneously and predicted the dynamic change trend of GWL for all wells from January 2015 to December 2017. Compared with the GWL prediction results from the best-performing LSTM-based model, which achieved R2 values ranging from 0.044 to 0.981 at individual wells, the proposed GCN–LSTM model, considering both temporal dependency and spatial autocorrelation, achieved R2 values ranging from 0.224 to 0.956. Supplementary material, Figure S8 shows that most of the coefficients R2 of GCN–LSTM models are larger than those of LSTM-based models, and most of the RMSEs, and MAEs of GCN–LSTM models are less than those of LSTM-based models. When predicting GWL at same well, the GCN–LSTM spatiotemporal model achieves an average goodness-of-fit approximately 11.21% higher than the LSTM-based temporal model for that specific well.

Taking four monitoring wells, i.e., Well-7, Well-8, Well-14, and Well-15, as examples, Figure 5 displays the differences between the real GWL values and the predicted values by the LSTM model and the GCN–LSTM model, respectively. Both models capture the change trend of GWL to some extent, however, the LSTM model tends to cause underestimation of the peaks in GWL, while the GCN–LSTM model is more sensitive to the peak and valley of GWL changes. It effectively captures the turning point of the GWL change trend with stronger timeliness and enhanced detail accuracy.
Figure 5

Comparison of GWL prediction results using the LSTM-based model and GCN–LSTM spatiotemporal model at four monitoring wells: (a) Well-7; (b) Well-8; (c) Well-14; and (d) Well-15.

Figure 5

Comparison of GWL prediction results using the LSTM-based model and GCN–LSTM spatiotemporal model at four monitoring wells: (a) Well-7; (b) Well-8; (c) Well-14; and (d) Well-15.

Close modal

The fluctuation of GWL is a spatiotemporal process, and the GCN–LSTM model, which considers both spatial autocorrelation and temporal dependency, provides good modeling results for the nonlinear and non-stationary GWL spatiotemporal sequence data. This model demonstrates higher accuracy compared to the LSTM-based model, which only considers the temporal dependency. Therefore, the proposed GCN–LSTM model represents a more comprehensive GWL prediction method, accurately and efficiently reflecting the dynamic changes of GWL in multiple monitoring wells.

GWL contour map

To explore the spatiotemporal dynamic change trend of GWL in the study area over the years, GWL contour maps were plotted for four different dates, i.e., 30 March 2016, 30 June 2016, 30 September 2016, and 30 December 2016. These contour maps, as shown in Figures 69, respectively, depict the predicted GWL values by the GCN–LSTM model compared to the true values. The overall change trends of the predicted GWLs by the GCN–LSTM model are in good agreement with the actual situations, further confirming the effectiveness of the GCN–LSTM model. The spatiotemporally predicted GWLs can aid in formulating reasonable groundwater management schemes, enabling the rational utilization of groundwater resources and promoting a positive cycle with the groundwater system.
Figure 6

Groundwater level contour maps of the study area on 30 March 2016: (a) true values and (b) predicted values.

Figure 6

Groundwater level contour maps of the study area on 30 March 2016: (a) true values and (b) predicted values.

Close modal
Figure 7

Groundwater level contour maps of the study area on 30 June 2016: (a) true values and (b) predicted values.

Figure 7

Groundwater level contour maps of the study area on 30 June 2016: (a) true values and (b) predicted values.

Close modal
Figure 8

Groundwater level contour maps of the study area on 30 September 2016: (a) true values and (b) predicted values.

Figure 8

Groundwater level contour maps of the study area on 30 September 2016: (a) true values and (b) predicted values.

Close modal
Figure 9

Groundwater level contour maps of the study area on 30 December 2016: (a) true values and (b) predicted values.

Figure 9

Groundwater level contour maps of the study area on 30 December 2016: (a) true values and (b) predicted values.

Close modal

This study proposes a multi-well GWL prediction model based on GCN–LSTM, which comprehensively considers the spatiotemporal influencing factors of GWL. The prediction accuracy and reliability of the GCN–LSTM model are verified using the GWL data from 16 monitoring wells in Xiangtan City. The GCN–LSTM GWL prediction model incorporates the temporal dependency and spatial autocorrelation by integrating the spatial autocorrelation and attribute self-similarity features of wells into a graph structure as input. Compared to the temporal sequence prediction model based solely on LSTM, the GCN–LSTM model effectively reflects the spatiotemporal characteristics of GWL dynamic changes in multiple wells, resulting in higher prediction accuracy and better detail in capturing the change trend of GWL in the study area.

Several limitations remain in this study that warrant further investigation. Firstly, the absence of a sensitivity analysis for the GWL spatiotemporal sequence prediction model restricts our understanding of how various factors may influence model predictions. Therefore, future research should prioritize conducting sensitivity analysis to enhance the robustness and applicability of the GWL spatiotemporal sequence model. Additionally, this study does not account for the impact of extreme weather conditions on GWL predictions, especially given the increasing frequency and intensity of such events due to climate change. Future efforts could improve the model's robustness by incorporating the GCN–LSTM framework with climate scenario simulations under various extreme weather conditions, thus evaluating its predictive capabilities in these contexts.

The authors would like to express their gratitude to the MapGIS Laboratory Co-Constructed by the National Engineering Research Centre for Geographic Information System of China and Central South University, for providing MapGIS® software (Wuhan Zondy Cyber-Tech Co. Ltd, Wuhan, China).

This study was supported by grants from the Hunan Provincial Natural Science Foundation (Grant Nos 2023JJ60188, 2023JJ60190, and 2022JJ30708), the Changsha Municipal Natural Science Foundation (Grant No. kq2208054), the Research Project of Hunan Vocational College of Engineering (GC22YB01), and the National Natural Science Foundation of China (Grant No. 42072326).

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Ahmed Osman
A. I.
,
Latif
S. D.
,
Wee Boo
K. B.
,
Ahmed
A. N.
,
Huang
Y. F.
&
El-Shafie
A.
(
2024
)
Advanced machine learning algorithm to predict the implication of climate change on groundwater level for protecting aquifer from depletion
,
Groundwater for Sustainable Development
,
25
,
101152
.
Akter
F.
,
Bishop
T. F. A.
&
Vervoort
R. W.
(
2021
)
Space-time modelling of groundwater level and salinity
,
Science of The Total Environment
,
776
,
145865
.
Ali
M. Z.
,
Chu
H.-J.
,
Tatas
&
Burbey
T. J.
(
2021
)
Spatio-temporal estimation of monthly groundwater levels from GPS-based land deformation
,
Environmental Modelling & Software
,
143
,
105123
.
Bai
D. W.
,
Xia
D. W.
,
Huang
D.
,
Hu
Y.
,
Li
Y. T.
&
Li
H. Q.
(
2023
)
Spatial-temporal graph neural network based on gated convolution and topological attention for traffic flow prediction
,
Applied Intelligence
,
53
,
30843
30864
.
Boo
K. B. W.
,
El-Shafie
A.
,
Othman
F.
,
Khan
M. M. H.
,
Birima
A. H.
&
Ahmed
A. N.
(
2024a
)
Groundwater level forecasting with machine learning models: A review
,
Water Research
,
252
,
121249
.
Boo
K. B. W.
,
El-Shafie
A.
,
Othman
F.
,
Sherif
M.
&
Ahmed
A. N.
(
2024b
)
Groundwater level forecasting using ensemble coactive neuro-fuzzy inference system
,
Science of The Total Environment
,
912
,
168760
.
Erdbrügger
J.
,
van Meerveld
I.
,
Seibert
J.
&
Bishop
K.
(
2023
)
Temporal and spatial variation in shallow groundwater gradients in a boreal headwater catchment
,
Journal of Hydrology
,
626
,
17
.
Evans
S. W.
,
Jones
N. L.
,
Williams
G. P.
,
Ames
D. P.
&
Nelson
E. J.
(
2020
)
Groundwater level mapping tool: An open source web application for assessing groundwater sustainability
,
Environmental Modelling & Software
,
131
,
104782
.
Ghasemlounia
R.
,
Gharehbaghi
A.
,
Ahmadi
F.
&
Saadatnejadgharahassanlou
H.
(
2021
)
Developing a novel framework for forecasting groundwater level fluctuations using Bi-directional Long Short-Term Memory (BiLSTM) deep neural network
,
Computers and Electronics in Agriculture
,
191
,
106568
.
Gholami
V.
,
Chau
K. W.
,
Fadaee
F.
,
Torkaman
J.
&
Ghaffari
A.
(
2015
)
Modeling of groundwater level fluctuations using dendrochronology in alluvial aquifers
,
Journal of Hydrology
,
529
,
1060
1069
.
Goldman
M.
&
Neubauer
F. M.
(
1994
)
Groundwater exploration using integrated geophysical techniques
,
Surveys in Geophysics
,
15
,
331
361
.
Gori
M.
,
Monfardini
G.
&
Scarselli
F.
(
2005
). '
A new model for learning in graph domains
’,
Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005
.
Guan
Q.
,
Ren
S.
,
Chen
L.
,
Yao
Y.
,
Hu
Y.
,
Wang
R.
,
Feng
B.
,
Gu
L.
&
Chen
W.
(
2022
)
Recognizing multivariate geochemical anomalies related to mineralization by using deep unsupervised graph learning
,
Natural Resources Research
,
31
,
2225
2245
.
Hillier
M.
,
Wellmann
F.
,
Brodaric
B.
,
de Kemp
E.
&
Schetselaar
E.
(
2021
)
Three-dimensional structural geological modeling using graph neural networks
,
Mathematical Geosciences
,
53
,
1725
1749
.
Hochreiter
S.
&
Schmidhuber
J.
(
1997
)
Long short-term memory
,
Neural Computation
,
9
,
1735
1780
.
Hopfield
J. J.
(
1982
)
Neural networks and physical systems with emergent collective computational abilities
,
Proceedings of the National Academy of Sciences
,
79
,
2554
2558
.
Khan
J.
,
Lee
E. K. Y.
,
Balobaid
A. S.
&
Kim
K.
(
2023
)
A comprehensive review of conventional, machine leaning, and deep learning models for groundwater level (GWL) forecasting
,
Applied Sciences-Basel
,
13
,
19
.
Khodayar
M.
&
Wang
J.
(
2019
)
Spatio-temporal graph deep neural network for short-term wind speed forecasting
,
Ieee Transactions on Sustainable Energy
,
10
,
670
681
.
Kipf
T. N.
&
Welling
M.
(
2017
). '
Semi-Supervised Classification with Graph Convolutional Networks
’,
5th International Conference on Learning Representations, ICLR 2017
.
Kumar
R.
,
Moreira
J. M.
&
Chandra
J.
(
2023
)
DyGCN-LSTM: A dynamic GCN-LSTM based encoder-decoder framework for multistep traffic prediction
,
Applied Intelligence
,
53
,
25388
25411
.
Ma
Z. J.
,
Mei
G.
,
Prezioso
E.
,
Zhang
Z. J.
&
Xu
N. X.
(
2021
)
A deep learning approach using graph convolutional networks for slope deformation prediction based on time-series displacement data
,
Neural Computing & Applications
,
33
,
14441
14457
.
Mackay
J. D.
,
Jackson
C. R.
&
Wang
L.
(
2014
)
A lumped conceptual model to simulate groundwater level time-series
,
Environmental Modelling & Software
,
61
,
229
245
.
Milan
S. G.
,
Kayhomayoon
Z.
,
Azar
N. A.
,
Berndtsson
R.
,
Ramezani
M. R.
&
Moghaddam
H. K.
(
2023
)
Using machine learning to determine acceptable levels of groundwater consumption in Iran
,
Sustainable Production and Consumption
,
35
,
388
400
.
Osman
A. I. A.
,
Ahmed
A. N.
,
Chow
M. F.
,
Huang
Y. F.
&
El-Shafie
A.
(
2021
)
Extreme gradient boosting (Xgboost) model to predict the groundwater levels in Selangor Malaysia
,
Ain Shams Engineering Journal
,
12
,
1545
1556
.
Osman
A. I. A.
,
Ahmed
A. N.
,
Huang
Y. F.
,
Kumar
P.
,
Birima
A. H.
,
Sherif
M.
,
Sefelnasr
A.
,
Ebraheemand
A. A.
&
El-Shafie
A.
(
2022
)
Past, present and perspective methodology for groundwater modeling-based machine learning approaches
,
Archives of Computational Methods in Engineering
,
29
,
3843
3859
.
Poursaeid
M.
,
Poursaeid
A. H.
&
Shabanlou
S.
(
2022
)
A comparative study of artificial intelligence models and A statistical method for groundwater level prediction
,
Water Resources Management
,
36
,
1499
1519
.
Rajaee
T.
,
Ebrahimi
H.
&
Nourani
V.
(
2019
)
A review of the artificial intelligence methods in groundwater level modeling
,
Journal Of Hydrology
,
572
,
336
351
.
Rumelhart
D. E.
,
Hinton
G. E.
&
Williams
R. J.
(
1986
)
Learning representations by back-propagating errors
,
Nature
,
323
,
533
536
.
Scibek
J.
&
Allen
D. M.
(
2006
)
Modeled impacts of predicted climate change on recharge and groundwater levels
,
Water Resources Research
,
42
, W11405.
Sheikh Khozani
Z.
,
Barzegari Banadkooki
F.
,
Ehteram
M.
,
Najah Ahmed
A.
&
El-Shafie
A.
(
2022
)
Combining autoregressive integrated moving average with Long Short-Term Memory neural network and optimisation algorithms for predicting ground water level
,
Journal of Cleaner Production
,
348
,
131224
.
Skarding
J.
,
Gabrys
B.
&
Musial
K.
(
2024
)
On the effectiveness of heterogeneous ensembles combining graph neural networks and heuristics for dynamic link prediction
,
Ieee Transactions on Network Science and Engineering
,
11
,
3250
3259
.
Srivastava
N.
,
Hinton
G. E.
,
Krizhevsky
A.
,
Sutskever
I.
&
Salakhutdinov
R. R.
(
2014
)
Dropout: A simple way to prevent neural networks from overfitting
,
Journal of Machine Learning Research
,
15
,
1929
1958
.
Tian
Y. Q.
,
Davies-Colley
R. J.
,
Gong
P.
&
Thorrold
B. W.
(
2001
)
Estimating solar radiation on slopes of arbitrary aspect
,
Agricultural and Forest Meteorology
,
109
,
67
74
.
Uc-Castillo
J. L.
,
Marín-Celestino
A. E.
,
Martínez-Cruz
D. A.
,
Tuxpan-Vargas
J.
&
Ramos-Leal
J. A.
(
2023
)
A systematic review and meta-analysis of groundwater level forecasting with machine learning techniques: Current status and future directions
,
Environmental Modelling & Software
,
168
,
105788
.
Wang
S.
,
Cao
J.
&
Yu
P. S.
(
2022
)
Deep learning for spatio-temporal data mining: A survey
,
Ieee Transactions on Knowledge and Data Engineering
,
34
,
3681
3700
.
Werbos
P. J.
(
1990
)
Backpropagation through time: What it does and how to do it
,
Proceedings of the IEEE
,
78
,
1550
1560
.
Wu
C.
,
Zhang
X.
,
Wang
W.
,
Lu
C.
,
Zhang
Y.
,
Qin
W.
,
Tick
G. R.
,
Liu
B.
&
Shu
L.
(
2021
)
Groundwater level modeling framework by combining the wavelet transform with a long short-term memory data-driven model
,
Science of The Total Environment
,
783
,
146948
.
Wu
C.
,
Wu
X.
,
Lu
C.
,
Sun
Q.
,
He
X.
,
Yan
L.
&
Qin
T.
(
2022
)
Characteristics and driving factors of lake level variations by climatic factors and groundwater level
,
Journal of Hydrology
,
608
,
127654
.
Xu
X.
,
Huang
G.
,
Zhan
H.
,
Qu
Z.
&
Huang
Q.
(
2012
)
Integration of SWAP and MODFLOW-2000 for modeling groundwater dynamics in shallow water table areas
,
Journal of Hydrology
,
412–413
,
170
181
.
Yu
B.
,
Yin
H.
&
Zhu
Z.
(
2018
). '
Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting
',
27th International Joint Conference on Artificial Intelligence, IJCAI 2018
.
International Joint Conferences on Artificial Intelligence Organization
.
Zhang
J.
,
Zheng
Y.
,
Qi
D.
,
Li
R.
,
Yi
X.
&
Li
T.
(
2018
)
Predicting citywide crowd flows using deep spatio-temporal residual networks
,
Artificial Intelligence
,
259
,
147
166
.
Zhang
J.
,
Zheng
Y.
,
Sun
J.
&
Qi
D.
(
2020
)
Flow prediction in spatio-temporal networks based on multitask deep learning
,
Ieee Transactions on Knowledge and Data Engineering
,
32
,
468
478
.
Zhang
B.
,
Li
M.
,
Huan
Y.
,
Khan
U.
,
Wang
L.
&
Wang
F.
(
2023a
)
Bedrock mapping based on terrain weighted directed graph convolutional network using stream sediment geochemical samplings
,
Transactions of Nonferrous Metals Society of China
,
33
,
2799
2814
.
Zhang
B.
,
Tang
J.
,
Huan
Y.
,
Song
L.
,
Shah
S. Y. A.
&
Wang
L.
(
2024a
)
Multi-scale convolutional neural networks (CNNs) for landslide inventory mapping from remote sensing imagery and landslide susceptibility mapping (LSM)
,
Geomatics, Natural Hazards and Risk
,
15
,
2383309
.
Zhang
B.
,
Xu
Z.
,
Wei
X.
,
Song
L.
,
Shah
S. Y. A.
,
Khan
U.
,
Du
L.
&
Li
X.
(
2024b
)
Deep subsurface pseudo-lithostratigraphic modeling based on three-Dimensional convolutional neural network (3D CNN) using inversed geophysical properties and shallow subsurface geological model
,
Lithosphere
,
2024
.
Zhao
B.
,
Mao
K. B.
,
Cai
Y. L.
,
Shi
J. C.
,
Li
Z. L.
,
Qin
Z. H.
,
Meng
X. J.
,
Shen
X. Y.
&
Guo
Z. H.
(
2020a
)
A combined Terra and Aqua MODIS land surface temperature and meteorological station data product for China from 2003 to 2017
,
Earth System Science Data
,
12
,
2555
2577
.
Zhao
L.
,
Song
Y.
,
Zhang
C.
,
Liu
Y.
,
Wang
P.
,
Lin
T.
,
Deng
M.
&
Li
H.
(
2020b
)
T-GCN: A temporal graph convolutional network for traffic prediction
,
Ieee Transactions on Intelligent Transportation Systems
,
21
,
3848
3858
.
Zuo
R. G.
&
Xu
Y.
(
2023
)
Graph deep learning model for mapping mineral prospectivity
,
Mathematical Geosciences
,
55
,
1
21
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data