In irrigation agriculture, predicting groundwater level (GWL) using deep learning models can help decision-makers coordinate surface water and groundwater usage, thus aiding in the sustainable development and utilization of groundwater. However, when making a long sequence prediction, prediction sequences often have severe delays affecting the availability of prediction results. In this paper, a new loss function is proposed to minimize the lag and oversmoothing on the prediction of GWLs. GWL, meteorology, and pumping data are collected via an irrigation Internet of Things system in Hutubi County, Xinjiang. Through Pearson's correlation analysis, historical potential evapotranspiration (ET0), groundwater extraction, and GWL were chosen to predict GWLs. Datasets were constructed through the proposed spatiotemporal data fusion method; then, the best model from the six deep learning models was selected by comparing the prediction capability of the datasets. Finally, the mean-squared error (MSE) loss function is replaced by the proposed loss function. Compared to the mean absolute error, MSE, and predicted sequence graphs, the new loss function significantly depresses the time delay with similar prediction accuracy.

  • An integrated loss function considering the Euclidean error, time warping, and time delay is proposed to depress the time-delay problem of the predicted sequences.

  • A novel data fusion approach was introduced by incorporating the groundwater extraction data from motive pumping wells into the prediction of groundwater levels.

To ensure regular access to quality food, natural resources such as land, air, and water must be sustainably managed and utilized (FAO et al. 2022). Agriculture is the industry with the largest water consumption and water intake. In China, the largest irrigated country in the world, agriculture accounts for 61.56% of the total water consumption, with irrigated lands accounting for 51.62% of the total area and irrigated grain output accounting for 75% of the total grain production (Ministry of Water Resources, PRC 2022). However, the effective utilization coefficient of farmland irrigation water in China is only 0.568, which is far lower than the level of 0.7–0.8 in developed countries (Ministry of Water Resources, PRC 2022). A large amount of groundwater consumed by the evapotranspiration of crops leads to a continuous decline in groundwater levels (GWLs). Strengthening irrigation groundwater resource management is both urgent and necessary.

With the development of Internet of Things (IoT) technologies, agricultural irrigation is gradually becoming data-driven. Through the use of soil humidity, soil temperature, and weather forecast data to predict the irrigation demand of farmland, accurate automatic irrigation can be achieved (Goap et al. 2018). The use of a deep convolutional neural network (DCNN) to accurately predict soil moisture, temperature, and humidity is able to support intelligent irrigation in crop production with high yields and low amounts of water (Kumar et al. 2022). Calibrating intelligent near-infrared spectral data using a DCNN architecture is expected to provide intelligent technical support to address the water cycle and conservation problems of agricultural farming (Cambra Baseca et al. 2019). Automated irrigation has been achieved with the support of soil moisture sensors and intelligent irrigation systems (Benedict 1988). The agricultural industry of today is data-centric, precise, and more intelligent than ever before (Ayaz et al. 2019). As one of the main sources of irrigation, the GWL needs to be predicted for sustainable use.

Data acquisition and parameter selection are key to predicting the GWL. Monitoring the dynamic GWL data can elucidate the current situation of groundwater utilization, enabling decision-makers to realize the sustainable use of groundwater and plan accordingly (He et al. 2021). Time-series predictions based on historical GWLs and rainfall are reliable methods for predicting future GWLs (Bloomfield et al. 2003). Human activities also have an obvious impact on GWLs. For example, groundwater extraction significantly affects the GWL (Marco & Mathieu 2017). In contrast, the GWLs in western and northwest China have decreased significantly. In addition to human activities, temperature also has a high correlation with GWLs (Zhao et al. 2019). China has built large-scale GWL monitoring and meteorological monitoring stations, which can effectively predict GWLs by considering a variety of data sources.

Besides the data, the model is crucial for the prediction as well. Many machine learning methods have been successfully used by researchers for data-driven GWL prediction based on a variety of data sources. For example, rainfall and aquifer depths can be well predicted by multiple linear regression models (Mogaji et al. 2015). Researchers have also used support vector machines and wavelet transforms to predict GWLs (Zhou et al. 2017a, 2017b). Annual and seasonal GWL trend studies have been performed using the Mann–Kendall (MK) method, and fitting and prediction analyses have been carried out using autoregressive integrated moving average (ARIMA) modeling (Satish Kumar & Venkata Rathnam 2019). In recent years, deep learning models for GWL prediction have also been highlighted. Combining empirical model decomposition (Zhang et al. 2022), principal component analysis (Kim et al. 2021), wavelet, and other pretreatment methods with deep learning models such as gated recurrent unit (GRU) (Gharehbaghi et al. 2022), and long short-term memory (LSTM) (Shin et al. 2020; Kim et al. 2021; Zhang et al. 2022) networks has achieved reliable results in accurately predicting GWLs. Despite the lack of research, a transformer has shown a strong potentiality in predicting GWL because it could capture long-distance dependence (Vaswani et al. 2017) and can adapt to time-series prediction tasks. In New York City, transformers have shown excellent performance in short-term power load forecasting (Ran et al. 2023), network data prediction (Kong et al. 2022), photovoltaic power generation forecasting (Tian et al. 2022), and financial time-series prediction (Preeti & Singh 2022). The high calculation time complexity of self-attention, the large memory footprint caused by high numbers of parameters, and the slow prediction speed of the step-by-step mechanism have led to the difficult large-scale generalization of transformer-based models (Zhou et al. 2021). The Informer, with its prob-attention mechanism, was therefore proposed, effectively solving the shortcomings of the transformer and achieving results beyond those of LSTM, ARIMA, and other models on power load prediction datasets (Zhou et al. 2021).

The accuracy of groundwater depth prediction modeling is of great significance to the rational utilization of groundwater resources, the sustainable development of regional social economies, and the evaluation of ecological environments (Zhang et al. 2019). The integrated response of GWLs to several factors including climate, topography, groundwater extraction, and hydrogeology and their interactions makes GWL simulation a challenging task (Hai et al. 2022). Although there are areas where monitoring data are lacking, distribution estimates of groundwater extraction based on groundwater balance models, land use, statistical information, and qualitative surveys can be used to predict underground water levels (Vu et al. 2020a). However, most studies have focused on prediction accuracy, and the time-delay problem of prediction series makes it difficult to apply these models in the actual project. We need effective methods to reduce the delay in prediction when predicting GWL, in addition to selecting the appropriate parameters and models.

In this study, groundwater extraction and meteorological data were obtained, along with GWL monitoring data from electromechanical wells, through a regional irrigation IoT system. The obtained data were preprocessed, including outlier correction, linear interpolation, and wavelet filtering. The most suitable prediction parameters were selected by performing the correlation analysis. Comparing the multiple time-scale prediction results of LSTM, GRU, multi-layer perceptron (MLP), one-dimensional convolutional neural network (1DCNN), the Transformer and the informer select the most suitable model. A new loss function is presented and its effectiveness in reducing the delay in predicted sequences is verified.

Study area and data

Hutubi County, belonging to the Changji Hui Autonomous Prefecture of Xinjiang Uygur Autonomous Region, is located in the central and northern parts of Xinjiang, with a total population of 220,000, geographical coordinates spanning from east longitude 86°5′ to 87°8′ and north latitude 43°7′ to 45°20′, and a total area of 9,518 km2. Hutubi County has a temperate continental arid and semi-arid climate. The average temperature in the plain area is 6.7 °C, the annual precipitation is 167 mm, the average frost-free period is 180 days, and the annual total sunshine duration is 3,090 h; the annual effective accumulated temperature, which is stable at more than 10 °C, is 3,553 °C. Cotton, corn, and wheat are the main crops in the county. From May to August, crops grow in the plain areas; the average sunshine hours are more than 10 h/d in the season of vigorous growth and more than 11 h in July. The combination of high evaporation rates and low rainfall has promoted the development of irrigated agriculture. However, due to the extensive exploitation of groundwater, the GWL in the county continues to decline. To ensure the safety and sustainable development of groundwater resources, the government has established a dual control monitoring system for well electricity to monitor and control the development of local groundwater resources.

Water-power control equipment (WPCE) (Figure 1), by installing ultrasonic flow meters at the wellhead pipelines, can accurately monitor the pumping of the motive pumping well (MPW) and has become an important component of the agricultural water conservancy IoT in Hutubi. The county has 1,656 WPCEs that access the irrigation IoT, covering almost all MPWs in the county. These WPCEs collect data on groundwater extraction in Hutubi County. The wide distribution of equipment and the short collection interval make the data that effectively reflect the spatial and temporal characteristics of groundwater extraction, which were grouped into seven groups based on latitude and longitude. The data used in this study are water withdrawal data from the Hutubi Plain area spanning from 2017 to 2022, as well as GWL monitoring data and meteorological data of the area during the same period.
Figure 1

Study area and distribution of GWL monitoring stations.

Figure 1

Study area and distribution of GWL monitoring stations.

Close modal
GWL is obtained using pressure-type GWL meters (Figure 1). The GWL data are the daily data from GWL monitoring stations (GWLMS) 1–7 distributed in the county. The climate data, collected using integrated meteorological monitoring stations, include daily rainfall, temperature, humidity, atmospheric pressure, and ET0. Figure 2 shows the IoT terminal equipment used to collect various types of data.
Figure 2

IoT monitoring devices.

Figure 2

IoT monitoring devices.

Close modal

Method

In this study, multi-source data were fused, and the Penman–Monteith (Allan et al. 1998) formula was used to calculate ET0. Figure 3 shows the forecast flow.
Figure 3

Forecast flowchart.

Figure 3

Forecast flowchart.

Close modal

Deep learning models

This study used the six models for the long time-series prediction of GWLs. They can be divided into four categories. Both LSTM and GRU are recurrent neural networks. The LSTM consists of three gate units (namely, a forgetting gate, an input gate, an output gate), and a memory unit (Zheng et al. 2017). The GRU is improved from the LSTM, combining the cell state and the hidden state into one state, and is controlled using the update gate and the reset gate (Cho et al. 2014). MLP is the most classic deep learning model, whose structure involves the full connection of multiple monolayers that are composed of a group of neurons (Benedict 1988). 1DCNN uses multiple convolution kernels to extract features by convolution calculation, and the output serves as the input to the next layer of convolution operations (Zhou et al. 2017a, 2017b).

The Transformer and the Informer connect encoders and decoders through a self-attention mechanism, which is a new deep learning structure.

We have the input, a multivariable time series:
(1)
where N is the variable dimension; is the length of the input time series; and is the input value of dimension i at time t.
The output is to predict the corresponding sequence:
(2)
where N is the variable dimension; is the length of the output time series; and is the output value of dimension i at time t.
Self-attention is defined based on a dot product between tuple inputs: calculated query vector, key vector, and value vector. The degree of correlation of the corresponding sequences was calculated. ProbSparse self-attention, based on the proposed measurement, allows each key to attend only to the dominant u queries, and the equation is as follows:
(3)
where is a sparse matrix containing only the top-u queries; K and V are keys and values, respectively; and d is the input dimension.
The Transformer relies on self-attention, and the Informer relies on ProbSparse self-attention. All six model structures are illustrated in Figure 4.
Figure 4

Structure chart of six deep learning models.

Figure 4

Structure chart of six deep learning models.

Close modal

Error measure and model evaluation

In this study, two error metrics are used to evaluate the forecasting results: mean-squared error (MSE) and mean absolute error (MAE); these are commonly used error evaluation indicators in underground water level prediction tasks (Hai et al. 2022):
(4)
(5)
where is the real measure of GWL; is the estimated value of GWL; and is the mean of .

Loss function

Deep neural networks can become a real-time tool to predict GWL for a certain period, just requiring input of historical data after iterative supervised learning. Supervised learning relies on the back-propagation mechanism, the most critical of which is to calculate the bias of the predicted and supervised values, and the function used to measure the bias is called the loss function.

Customarily, MSE is taken as the loss function when predicting GWL. However, when there are serious delay phenomena and shape errors in time-series prediction, it renders the prediction meaningless. Time delay refers to the delay of the predicted sequence compared to the real sequence. To address the delay phenomenon predicted by the model, a new loss function is designed to penalize Euclidean distance errors and time delays simultaneously, and the corresponding coefficients, α, β, and γ, can be determined according to the specific dataset.

The loss function based on Euclidean distance is defined as follows:
(6)
Differentiable dynamic time warping (Marco & Mathieu 2017) (soft DTW) is used to evaluate the shape difference between the predicted series and the original series. The shape loss function is calculated as follows:
(7)
The time loss index (TDI) (Le Guen & Thorne 2019) is used as a time loss function and is defined as follows:
(8)
The final comprehensive loss function is defined as follows:
(9)
Figure 5 shows the DTW and the TDI algorithm with a 20-unit length sequence as an example. Sequence similarity is usually calculated in a ‘one-to-one’ mode, representing deviations accumulating between corresponding points. DTW scales the time series and calculates the similarity, which can better reflect the overall shape similarity of the sequence. The dtw represents the inner product of the path matrix and the corresponding Euclidean distance vector. In Figure 5, A is the path matrix for calculating the similarity, the red point is 1, and the blank is 0. Δ is the corresponding Euclidean distance vector. The DTW formula is as follows:
(10)
Figure 5

Representation of the DTW and TDI algorithms.

Figure 5

Representation of the DTW and TDI algorithms.

Close modal
The ‘one-to-many’ model of DTW causes the predicted sequence to lag behind the supervised sequence. To alleviate this delay, TDI is used to penalize it. TDI represents the inner product of the path gradient matrix A* = ∇dtw() and the penalty matrix Ω = [ωij], defined as ωij = δ * (ij) 2/n2:
(11)

Data fusion method

Due to the different locations and different numbers of weather stations, electromechanical wells, and GWL monitoring stations, the data collection interval industry is not unified, and the spatiotemporal data fusion of the three types of data is needed. All parameters were resampled with a time interval of daily. MPWs were associated with the nearest GWLMSs based on the minimum distance calculated using geographic longitude and latitude coordinates. The clustering formula is as follows:
(12)
(13)
(14)
where is the longitude; is the latitude; is the parameter value of i MPW; is the parameter value of j GWLMS; and a is the number of MPWs in j aggregation.

All GWLMSs share the same sequence of meteorological data.

Parameter selection and data preprocessing

A correlation analysis was performed on the alternative parameters to determine the input parameters of the model. The results of the correlation analysis are shown in Table 1, where ET0 and daily water volume were associated with GWL, while other parameters showed little correlation with GWL.

Table 1

Result of Pearson's correlation coefficient

GWLET0Daily pumping volumeDaily rainfallTemperatureHumidityAtmospheric pressure
GWL       
ET0 0.243      
Daily pumping volume 0.26 0.661     
Daily rainfall 0.051 0.058 −0.032    
Temperature 0.084 0.761 0.488 −0.225   
Humidity 0.044 −0.132 −0.214 0.662 −0.109  
Atmospheric pressure 0.026 0.105 0.038 0.373 0.083 −0.026 
GWLET0Daily pumping volumeDaily rainfallTemperatureHumidityAtmospheric pressure
GWL       
ET0 0.243      
Daily pumping volume 0.26 0.661     
Daily rainfall 0.051 0.058 −0.032    
Temperature 0.084 0.761 0.488 −0.225   
Humidity 0.044 −0.132 −0.214 0.662 −0.109  
Atmospheric pressure 0.026 0.105 0.038 0.373 0.083 −0.026 

Bolding indicates that the corresponding parameter on the right has a high correlation with the groundwater level.

The three parameters were preprocessed. Figure 6 shows the cluster results of MPWs and pumping volume for each group. There are missing values in the GWL monitoring data from the pressure-type GWL meter. To address this issue, the linear interpolation method is employed to supplement the data, and wavelet hard threshold filtering is applied to remove ultrashort-term fluctuations. Figure 7 illustrates the GWLs and smoothed GWLs at each monitoring point over the last six years. Among the monitoring stations, GWLMS3 is characterized by a short-term, accelerated decline in GWL.
Figure 6

Cluster results of MPWs and pumping volume for each group.

Figure 6

Cluster results of MPWs and pumping volume for each group.

Close modal
Figure 7

GWL from pressure-type GWL meters.

Figure 7

GWL from pressure-type GWL meters.

Close modal
Over the last six years, ET0 has exhibited annual cyclical variations ranging from 0.34 to 19.22 mm/d. In the short term, ET0 random fluctuations are large. Wavelet hard threshold filtering is applied to remove short-term fluctuations, and retaining only the annual cycles helps to improve the accuracy of the predicted GWLs. Figure 8 shows the ET0 and high-frequency random fluctuation over the last six years in the Hutubi region.
Figure 8

Denoised ET0 over the last six years and high-frequency random fluctuation.

Figure 8

Denoised ET0 over the last six years and high-frequency random fluctuation.

Close modal

Selection of the deep learning model

Model building and data analysis were performed using Python3.0. The datasets are divided into large quantum sequences according to the set time step, and these subsequences are divided into a training set, a validation set, and a test set in the order of the time axis for model training and prediction. The ratio between the training set, the validation set, and the test set is 7:1:2. The model parameters are set as follows: heads: 2, padding: 0, dropout: 0.05, train epoch: 50, batch size: 64, initial learning rate: 0.0001, and Informer model Prob factor: 10. Each model is trained 30 times. Figure 9 presents the average MAE and average MSE of the model prediction with output lengths of 12, 24, and 36. The Informer performed the best among the six datasets because it has the smallest error on almost all datasets, and the average MAEs were 0.199, 0.231, and 0.339 m and MSEs were 0.264, 0.325, and 0.417 at three predicted lengths, respectively.
Figure 9

MAE and MSE of six models’ prediction.

Figure 9

MAE and MSE of six models’ prediction.

Close modal

Depress time delay

The Informer performed the best on the GWLMS1 dataset. Figure 10 shows the time delay and time warping in GWLMS1 and GWLMS4. The parameters of the loss function Lintegrated are set as follows: α = 0.75, β = 0.55, and γ = 0.1 after testing.
Figure 10

Time delay and time warping.

Figure 10

Time delay and time warping.

Close modal
Figure 11 shows the prediction sequences when the Lintegrated and MSE loss functions are selected for prediction on the GWLMS1 dataset. In contrast with MSE (blue line), selecting Lintegrated as the loss function yields predicted sequences (yellow line) that closely align with the changing trend of the original sequence and effectively mitigates the prediction delay problem.
Figure 11

Predicted time series using different loss functions.

Figure 11

Predicted time series using different loss functions.

Close modal

Meanwhile, we compared the MAE and MSE of the prediction results when using two loss functions, MSE and Lintegrated. Table 2 shows the results. Of the 18 group results, Lintegrated outperformed MSE on 11. In the remaining seven groups, the mean MAE using Lintegrated was only 0.037 m higher than that using MSE, and the mean MSE was 0.06 higher. This demonstrates that the Lintegrated loss function can achieve comparable or even superior results to MSE, further confirming the above conclusion.

Table 2

Performance of the Informer model on different datasets

Pre_daysLossMetricGWLMS
124567
12 Lintegrated MAE 0.154 0.165 0.190 0.162 0.310 0.206 
MSE 0.108 0.121 0.347 0.117 0.486 0.160 
MSE MAE 0.216 0.105 0.248 0.168 0.385 0.200 
MSE 0.278 0.152 0.489 0.178 0.551 0.148 
24 Lintegrated MAE 0.295 0.119 0.295 0.268 0.502 0.356 
MSE 0.276 0.389 0.425 0.371 0.816 0.185 
MSE MAE 0.371 0.220 0.300 0.290 0.532 0.352 
MSE 0.462 0.374 0.541 0.411 0.845 0.184 
36 Lintegrated MAE 0.327 0.185 0.633 0.420 0.640 0.316 
MSE 0.499 0.669 0.831 0.645 0.959 0.454 
MSE MAE 0.865 0.105 0.542 0.413 0.611 0.307 
MSE 0.960 0.491 0.756 0.562 0.982 0.442 
Pre_daysLossMetricGWLMS
124567
12 Lintegrated MAE 0.154 0.165 0.190 0.162 0.310 0.206 
MSE 0.108 0.121 0.347 0.117 0.486 0.160 
MSE MAE 0.216 0.105 0.248 0.168 0.385 0.200 
MSE 0.278 0.152 0.489 0.178 0.551 0.148 
24 Lintegrated MAE 0.295 0.119 0.295 0.268 0.502 0.356 
MSE 0.276 0.389 0.425 0.371 0.816 0.185 
MSE MAE 0.371 0.220 0.300 0.290 0.532 0.352 
MSE 0.462 0.374 0.541 0.411 0.845 0.184 
36 Lintegrated MAE 0.327 0.185 0.633 0.420 0.640 0.316 
MSE 0.499 0.669 0.831 0.645 0.959 0.454 
MSE MAE 0.865 0.105 0.542 0.413 0.611 0.307 
MSE 0.960 0.491 0.756 0.562 0.982 0.442 

Bolding indicates that the corresponding parameter on the right has a high correlation with the groundwater level.

In this study, groundwater extraction data, meteorological data, and GWL monitoring data were collected through an IoT system for irrigation. ET0, groundwater extraction, and GWL were selected to predict the underground water table based on the Pearson correlation analysis. The correlation of the rainfall with the water table is low due to very little rainfall in the study area. After the data were preprocessed, a novel data fusion approach was introduced by incorporating the groundwater extraction data from MPWs into the prediction of GWLs, constructing the GWLMS1–7 datasets.

Informer, was selected from seven models including LSTM, GRU, MLP, 1 DCNN, Transformer, and Informer, performing the best on six datasets. To address the time-delay problem of the predicted sequences, an integrated loss function considering the Euclidean error, time warping, and time delay is proposed. On six datasets, smaller MSE and MAE were achieved using the integrated loss function compared to the MSE loss function.

In the end, this paper summarizes the GWL prediction models selected in previous studies, as well as prediction time granularities and model performances. It is easy to deduce from Table 3 that our prediction error is acceptable compared to previous studies; results of MSE and MAE are comparable and even better.

Table 3

Groundwater level prediction results in different research studies

Research studiesMethodTime scaleDelay and shape errorEvaluation index
MAEMSE
Pham et al. (2022)  Random tree Quarterly Normal 0.40 0.36 
Zhang et al. (2021)  RBF Monthly Obvious 0.02 
Amin et al. (2022)  GRU Monthly Little 0.01 
Liang et al. (2021)  KNN-LSTM Monthly Obvious ≈0.16 ≈0.05 
Vu et al. (2020b)  LSTM Monthly Obvious 0.84 0.02 
Bum et al. (2021)  PCA-LSTM Daily Normal 0.04 
He et al. (2021)  1DCNN Daily Normal 0.04 
MunJu et al. (2020)  LSTM Daily 0.11 
Ours Improved Informer Daily Little 0.24 0.11 
Research studiesMethodTime scaleDelay and shape errorEvaluation index
MAEMSE
Pham et al. (2022)  Random tree Quarterly Normal 0.40 0.36 
Zhang et al. (2021)  RBF Monthly Obvious 0.02 
Amin et al. (2022)  GRU Monthly Little 0.01 
Liang et al. (2021)  KNN-LSTM Monthly Obvious ≈0.16 ≈0.05 
Vu et al. (2020b)  LSTM Monthly Obvious 0.84 0.02 
Bum et al. (2021)  PCA-LSTM Daily Normal 0.04 
He et al. (2021)  1DCNN Daily Normal 0.04 
MunJu et al. (2020)  LSTM Daily 0.11 
Ours Improved Informer Daily Little 0.24 0.11 

RBF, radial basis function; KNN, K-nearest neighbor; PCA, principal component analysis.

Thanks to Beijing Lianchuang Siyuan Measurement and Control Technology Co., Ltd. for providing the pumping data.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Allan
R.
,
Pereira
L.
&
Smith
M.
1998
Crop Evapotranspiration-Guidelines for Computing Crop Water Requirements-FAO Irrigation and Drainage Paper 56. Food and Agriculture Organization of the United Nations (FAO), Rome, Italy
.
Amin
G.
,
Redvan
G.
,
Farshad
A.
& Mohammad, A.
2022
Groundwater level prediction with meteorologically sensitive Gated Recurrent Unit (GRU) neural networks
.
Journal of Hydrology
612
(
PC
).
Ayaz
M.
,
Ammad-Uddin
M.
,
Sharif
Z.
,
Mansour
A.
&
Aggoune
E. H. M.
2019
Internet-of-things (IoT)-based smart agriculture: Toward making the fields talk
.
IEEE Access
7
,
129551
129583
.
http://doi.org/10.1109/ACCESS.2019.2932609
.
Benedict
A. K.
1988
Learning in the multilayer perceptron
.
Journal of Physics A: Mathematical and General
21
(
11
),
2643
.
Bloomfield
J. P.
,
Gaus
I.
&
Wade
S. D.
2003
A method for investigating the potential impacts of climate change scenarios on annual minimum groundwater levels
.
Water and Environment Journal
17
,
2
.
https://doi.org/10.1111/j.1747-6593.2003.tb00439.x
.
Bum
G. K.
,
Ik
C. H.
&
Rak
M. C.
2021
PCA-based multivariate LSTM model for predicting natural groundwater level variations in a time-series record affected by anthropogenic factors
.
Environmental Earth Sciences
80
(
18
).
Cambra Baseca
C.
,
Sendra
S.
,
Lloret
J.
&
Tomas
J.
2019
A smart decision system for digital farming
.
Agronomy
9
,
5
.
https://doi.org/10.3390/agronomy9050216
.
Cho
K.
,
Merrienboer
V. B.
,
Gülçehre
Ç.
,
Bougares
F.
,
Schwenk
H.
&
Bengio
Y.
2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. CoRR,abs/1406.1078
.
FAO, IFAD, UNICEF, WFP, and WHO
2022
The State of Food Security and Nutrition in the World 2022: Repurposing Food and Agricultural Policies to Make Healthy Diets More Affordable
.
FAO
,
Rome, Italy
.
https://doi.org/10.4060/cc0639en
.
Gharehbaghi
A.
,
Ghasemlounia
R.
,
Ahmadi
F.
&
Albaji
M.
2022
Groundwater level prediction with meteorologically sensitive gated recurrent unit (GRU) neural networks
.
Journal of Hydrology
612
(
Pt C
).
http://doi.org/10.1016/J.JHYDROL.2022.128262
.
Goap
A.
,
Sharma
D.
,
Shukla
A. K.
&
Rama Krishna
C.
2018
An IoT based smart irrigation management system using machine learning and open source technologies
.
Computers and Electronics in Agriculture
155
.
https://doi.org/10.1016/j.compag.2018.09.040
.
Hai
T.
,
Hameed, M. M., Marhoon, A., Zounemat-Kermani, M., Heddam, S., Kim, S., Sulaiman, S. O., Tan, M. L., Sa'adi, Z., Mehr, A. D., Allawi, M. F., Abba, S. I., Zain, J. M., Falah, M. W., Jamei, M., Bokde, N. D., Bayatvarkeshi, M., Al-Mukhtar, M., Bhagat, S. K., Tiyasha, T. & Yaseen, Z. M.
2022
Groundwater level prediction using machine learning models: A comprehensive review
.
Neurocomputing
489
.
https://doi.org/10.1016/j.neucom.2022.03.014
.
He
L.
,
Hou
M.
,
Chen
S.
,
Zhang
J.
,
Chen
J.
&
Qi
H.
2021
Construction of a spatio-temporal coupling model for groundwater level prediction: A case study of Changwu area, Yangtze River Delta region of China
.
Water Supply
21
(
7
).
https://doi.org/10.2166/ws.2021.140
.
Kim
G. B.
,
Hwang
C. I.
&
Choi
M. R.
2021
PCA-based multivariate LSTM model for predicting natural groundwater level variations in a time-series record affected by anthropogenic factors
.
Environmental Earth Sciences
80
(
18
).
http://doi.org/10.1007/S12665-021-09957-0
.
Kong
Q.
,
Zhang
X.
,
Zhang
C.
,
Zhou
L.
,
Yu
M.
,
He
Y.
,
Chen
Y.
,
Miao
Y.
&
Yuan
H.
2022
Network traffic prediction: Apply the transformer to time series forecasting
.
Mathematical Problems in Engineering
2022
,
8424398
.
http://doi.org/10.1155/2022/8424398
.
Kumar
P.
,
Udayakumar
A.
,
Anbarasa Kumar
A.
,
Senthamarai Kannan
K.
&
Krishnan
N.
2022
Multiparameter optimization system with DCNN in precision agriculture for advanced irrigation planning and scheduling based on soil moisture estimation
.
Environmental Monitoring and Assessment
195
(
1
).
https://doi.org/10.1007/s10661-022-10529-3
.
Le Guen
V.
&
Thome
N.
2019
Shape and time distortion loss for training deep time series forecasting models. 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada, pp. 1–13
.
Liang, H., Manqing, H., Suozhong, C., Junru, Z., Junyi, C. & Hui, Q.
2021
Construction of a spatio-temporal coupling model for groundwater level prediction: A case study of Changwu area, Yangtze River Delta region of China
.
Water Supply
21
(
7
).
Marco
C.
&
Mathieu
B.
2017
Soft-DTW: A differentiable loss func-tion for time-series. 34th Int. Conf. Mach. Learn. ICML 2017 2, (2017), 1483–1505
.
Ministry of Water Resources, PRC
2022
China Water Resources Bulletin 2021
.
Ministry of Water Resources, PRC
,
Beijing
.
Mogaji
K. A.
,
Lim
H. S.
&
Abdullah
K.
2015
Modeling of groundwater recharge using a multiple linear regression (MLR) recharge model developed from geophysical parameters: A case of groundwater resources management
.
Environmental Earth Sciences
73
(
3
).
https://doi.org/10.1007/s12665-014-3476-2
.
MunJu, S., SooHyoung, M., Goo, K. K., DukChul, M. & HyukJoon, K.
2020
Analysis of groundwater level variations caused by the changes in groundwater withdrawals using long short-term memory network
.
Hydrology
7
(
3
).
Pham, Q. B., Kumar, M., Di, N. F., Elbeltagi, A., Granata, F., Islam Abu Reza, Md. T., Talukdar, S., Nguyen, X. C., Ahmed, A. N. & Anh, D. T.
2022
Groundwater level prediction using machine learning algorithms in a drought-prone area
.
Neural Computing and Applications
34
(
13
).
Preeti
R. B.
&
Singh
R. P.
2022
A dual-stage advanced deep learning algorithm for long-term and long-sequence prediction for multivariate financial time series
.
Applied Soft Computing Journal
126
,
109317
.
http://doi.org/10.1016/J.ASOC.2022.109317
.
Ran
P.
,
Dong
K.
,
Liu
X.
&
Wang
J.
2023
Short-term load forecasting based on CEEMDAN and transformer
.
Electric Power Systems Research
214
(
Pt A
),
108885
.
http://doi.org/10.1016/J.EPSR.2022.108885
.
Satish Kumar
K.
&
Venkata Rathnam
E.
2019
Analysis and prediction of groundwater level trends using four variations of Mann Kendall tests and ARIMA modelling
.
Journal of the Geological Society of India
94
(
3
).
https://doi.org/10.1007/s12594-019-1308-4
.
Shin
M. J.
,
Moon
S. H.
,
Kang
K. G.
,
Moon
D. C.
&
Koh
H. J.
2020
Analysis of groundwater level variations caused by the changes in groundwater withdrawals using long short-term memory network
.
Hydrology
7
(
3
).
http://doi.org/10.3390/hydrology7030064
.
Tian
F.
,
Fan
X.
,
Wang
R.
,
Qin
H.
&
Fan
Y.
2022
A power forecasting method for ultra-short-term photovoltaic power generation using transformer model
.
Mathematical Problems in Engineering
2022
,
9421400
.
http://doi.org/10.1155/2022/9421400
.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L. & Polosukhin, I.
2017
Attention is all you need
. In:
Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA
,
New York, USA
.
Curran Associates Inc
.
Vu
M. T.
,
Jardani
A.
,
Massei
N.
&
Fournier
M.
2020a
Reconstruction of missing groundwater level data by using Long Short-Term Memory (LSTM) deep neural network
.
Journal of Hydrology
2020
(
prepublish
).
Vu
H. M.
,
Shanafield
M.
,
Nhat
T. T.
,
Partington
D.
&
Batelaan
O.
2020b
Mapping catchment scale unmonitored groundwater abstractions: Approaches based on soft data
.
Journal of Hydrology: Regional Studies
30
.
https://doi.org/10.1016/j.ejrh.2020.100695
.
Zhang
X. M.
,
Wang
Z. L.
,
Du
F. H.
,
Hu
Y. S.
&
Lu
L.
2019
Study on the influence of groundwater burial depth on soil water change in the shallow buried area of Huabei Plain
.
Water Conservation Irrigation
9
,
6
9
.
http://doi.org/10.3969/j.issn.1007-4929.2019.09.002
.
Zhang
H.
,
Zhao
J.
&
Chen
C.
2021
Groundwater level prediction based on neural networks: A case study in Linze, Northwestern China
.
E3S Web of Conferences
2021
,
266
.
Zhang
X.
,
Chen
H.
,
Zhu
G.
,
Zhao
D.
&
Duan
B.
2022
A new groundwater depth prediction model based on EMD-LSTM
.
Water Supply
22
(
6
).
http://doi.org/10.2166/WS.2022.230
.
Zheng
Z.
,
Chen
W.
,
Wu
X.
,
Chen
P.
&
Liu
J.
2017
LSTM network: A deep learning approach for short-term traffic forecast. IET Intelligent Transport Systems 11 (2), 68–75
.
Zhou
F.
,
Jin
L.
&
Dong
J.
2017a
Review of convolutional neural network research
.
Chinese Journal of Computers
40
(
6
).
Zhou
T.
,
Wang
F.
&
Zhi
Y.
2017b
Comparative analysis of ANN and SVM models combined with wavelet preprocess for groundwater depth prediction
.
Water
9
(
10
).
https://doi.org/10.3390/w9100781
.
Zhou
H. Y.
,
Zhang
S. H.
,
Peng
J. Q.
,
Zhang
S.
,
Li
J. X.
,
Xiong
H.
&
Zhang
W. C.
&
Intelligence Association Advancement Artificial
2021
Informer: Beyond efficient transformer for long sequence time-series forecasting
. In:
Proceedings of the 35th AAAI Conference on Artificial Intelligence/33rd Conference on Innovative Applications of Artificial Intelligence/11th Symposium on Educational Advances in Artificial Intelligence, Electr Network, Feb 02–09
,
California, USA
.
AAAI Press
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).