Global climate change and human activities have profoundly impacted the geological and hydrological processes in watersheds, increasing the challenges in streamflow prediction. In this study, we propose a streamflow prediction model based on deep learning and dual-mode decomposition specifically tailored for the Wujiang River basin, a significant tributary on the southern bank of the upper Yangtze River. This model effectively addresses the prediction error issue caused by high-frequency components through dual-mode decomposition of time-series data. The results demonstrate that employing the coupled complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)–variational mode decomposition (VMD)–temporal convolutional network (TCN)–long short-term memory network (LSTM) model reduces prediction errors by at least 35% compared to single decomposition models. Furthermore, compared to individual TCN or LSTM models, the TCN–LSTM coupled model exhibits greater stability and higher prediction accuracy during training, with reductions in mean absolute error and root mean square error by 43.13 and 24.57%, respectively. This model holds promising prospects for application and can provide crucial insights for water resource management.

  • A new two-stage modal decomposition method for runoff is proposed.

  • Complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) performs the primary modal decomposition and variational mode decomposition (VMD) performs the secondary modal decomposition for high-frequency intrinsic mode function.

  • The temporal convolutional network (TCN) model can effectively extract the implied features between runoff and time.

  • The CEEMDAN–VMD–TCN–long short-term memory network model achieves a prediction accuracy of over 95%.

The rapid changes in global climate and significant human interventions have profoundly impacted the geological and hydrological processes within river basins (Zhang X. Q. et al. 2023), thereby amplifying the challenges in runoff prediction (Shahid et al. 2018, 2021). The Wujiang River basin stands as one of the largest tributaries on the southern bank of the upper Yangtze River, rendering the establishment of accurate and efficient hydrological prediction models crucial for water resources management in the region. Existing runoff prediction models fall into two main categories: process-driven and data-driven (He et al. 2021, 2024; Xu et al. 2024). Compared to process-driven models, data-driven models offer strong operability, as they do not require consideration of the physical mechanisms underlying runoff occurrence (Zheng et al. 2024a, 2024b); instead, they only necessitate mathematical analysis of time series to establish the functional relationship between input and output variables (Zhao et al. 2023). In recent years, machine learning has experienced significant advancements in the hydrology field due to its powerful learning capabilities (Ghazi et al. 2022; Adera et al. 2024; Zeydalinejad et al. 2024). In contrast to traditional algorithms like support vector machine regression and random forest, the gating units within long short-term memory (LSTM) networks are capable of effectively capturing long-term dependencies and temporal correlations within data, thus finding widespread applications in hydrology (Vatanchi et al. 2023; Hu et al. 2024). Zhang et al. (2020) successfully developed a hybrid model for monthly runoff prediction at a control hydrological station in the upper reaches of the Jinjiang River, combining a multi-population genetic algorithm (MPGA) with LSTM neural networks. The MPGA–LSTM model exhibited exceptionally high accuracy. Temporal convolutional networks (TCNs) are capable of effectively capturing short-term local patterns. However, LSTM often encounters issues of vanishing or exploding gradients during training, especially with long sequence data. TCN's convolutional structure effectively alleviates this problem, enabling better gradient propagation and enhancing the model's training efficiency and stability. Based on the TCN–LSTM model, Zhao et al. (2022) successfully addressed the issue of incomplete landslide time information leading to low accuracy in rainfall threshold models, thereby improving the accuracy and efficiency of landslide meteorological warning models. Yao et al. (2023) proposed a method for predicting urban inundation depth using TCN–LSTM models based on data from inundation monitoring stations, achieving high prediction accuracy and enhancing urban inundation perception capabilities.

Long sequence runoff data exhibit complex nonlinear relationships, long-term dependencies, and temporal correlations, containing a wealth of sequential information. Ban et al. (2023) introduced a novel decomposition–integration coupled model, variational mode decomposition (VMD)–DBO–gated recurrent units (GRU), which effectively enhances prediction accuracy by coupling VMD and Dung Beetle Optimization (DBO) with GRU. Zhang, X et al. (2023) proposed a VMD–Singular Spectrum Analysis (SSA)–Bidirectional Long Short-Term Memory (BiLSTM) coupled model for monthly runoff prediction, successfully improving the accuracy and smoothness of downstream Yellow River monthly runoff prediction, and providing significant insights for water resources management. Yang et al. (2023) successfully enhanced the accuracy of runoff trend prediction in the middle reaches of the Huai River by combining the improved complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm, northern goshawk optimization algorithm, and LSTM algorithm. The aforementioned studies all indicate that the performance of decomposition–integration coupled models surpasses that of single prediction models (Choubin et al. 2019; Liu et al. 2019). The decomposition–integration approach can better capture the features of time series, thereby improving prediction accuracy. However, high-frequency components generated by single decomposition methods exhibit significant volatility and instability, increasing the difficulty of model simulation and affecting the overall accuracy of runoff sequence simulation. To further explore effective information within high-frequency components and enhance their stability, a two-stage decomposition process is employed on top of the single decomposition technique. Wang et al. (2017) employed a two-stage decomposition technique coupled with extreme learning machines optimized by the differential evolution algorithm for predicting the air quality index. The results demonstrated that the predictive accuracy of the two-stage decomposition coupled model significantly surpassed that of other considered models. Currently, the application of two-stage decomposition techniques in runoff prediction remains relatively limited.

Based on this, the present study proposes a two-stage decomposition prediction model, coupled model of adaptive noise-adaptive mode decomposition CEEMDAN–VMD–TCN–LSTM. The integration of the TCN–LSTM model effectively incorporates meteorological factors into runoff prediction, thereby enhancing the accuracy of forecasts. Building upon the traditional decomposition–integration framework, a two-stage decomposition is applied to the high-frequency modal components to reduce their nonstationary characteristics. Monthly runoff and meteorological data from the Wulong station, covering the period from 1957 to 2021, were selected to validate the performance of the CEEMDAN–VMD–TCN–LSTM coupled model in runoff simulation. This model is expected to open new research directions in the field of runoff prediction.

Data sources

The Yangtze River Hydrological Network's real-time water data collection of monthly runoff measurements from 1957 to 2021 at the typical Wulong station in the Wujiang River basin was chosen for this study. This dataset was chosen for its real-time and continuous monitoring capability, which provides a wealth of information about river flow over time. The Wujiang River basin is situated between latitudes 26°07′ and 30°22′N and longitudes 104°18′ and 109°22′E. The Wujiang River is the most extensive tributary on the south bank of the upper parts of the Yangtze River, with a basin area of around 87,900 km2. The basin covers 62 counties (cities and districts) in 10 prefecture-level administrative areas in China's Guizhou, Chongqing, Hubei, and Yunnan provinces (municipalities directly under the Central Government).

The Wujiang basin is located in a subtropical humid monsoon climate zone. The average annual temperature within the basin is maintained at 14.6°C. The region experiences between 120 and 150 rainy days each year, with an average annual sunshine duration of 1,250 h and a relative humidity of 80%. The annual precipitation in the Wu River basin consistently reaches approximately 1,150 mm. Figure 1 portrays the location of the study area.
Figure 1

Location map of the study area.

Figure 1

Location map of the study area.

Close modal
Figure 2 displays the patterns of the monthly runoff at Wulong station from 1957 to 2021. The data from months 1 to 624 were used for the training samples, while the data from months 625 to 780 were used for the test samples, from the monthly runoff data.
Figure 2

Monthly runoff patterns at Wulong hydrological station from 1957 to 2021.

Figure 2

Monthly runoff patterns at Wulong hydrological station from 1957 to 2021.

Close modal

Based on Figure 2, it can be observed that the runoff data from the Wulong hydrological station in the Wujiang River basin between 1957 and 2021 do not exhibit any significant trends. Although the overall trend of runoff shows a slight decrease, the decrease is negligible. Moreover, the majority of monthly runoff values during this period range from 0.5 × 108 to 3 × 108 m3.

Model prediction process

The basic assumptions of the CEEMDAN–VMD–TCN–LSTM model are as follows: The model assumes that (1) the fundamental statistical properties of runoff data remain stable within the considered time period, (2) the relationships between various components (intrinsic mode functions (IMF), Variational mode decomposition–Intrinsic Mode Function (VIMF), TCN, and LSTM) are linear, and (3) the available historical runoff data are sufficient to capture underlying patterns and dependencies.

The CEEMDAN–VMD–TCN–LSTM model for predicting long-term runoff is composed of two stages, as illustrated in Figure 3. First, the CEEMDAN decomposition is performed on the raw data monthly runoff, and the results are shown as several IMF and residual terms. Next, the high-frequency IMF components are decomposed in the second stage using VMD. TCN–LSTM prediction is then applied to the components obtained from the two-stage decomposition, with runoff prediction accomplished by feeding the TCN-extracted features into the LSTM network. To generate the final runoff prediction results, all component forecasts are integrated and rebuilt.
Figure 3

Model of prediction: CEEMDAN–VMD–TCN–LSTM flowchart.

Figure 3

Model of prediction: CEEMDAN–VMD–TCN–LSTM flowchart.

Close modal

Model introducing

Complete ensemble empirical mode decomposition with adaptive noise

The EEMD methodology has a problem with modal mixing and residual auxiliary noise in signal decomposition (Zhang & Zheng 2023). Torres et al. (2011) presented the CEEMDAN method, which is an enhanced version of the EEMD technique. The CEEMDAN algorithm has demonstrated to be effective with minimal reconstruction errors and fast processing times (Kala et al. 2019), making it a comprehensive solution for signal decomposition. Moreover, the CEEMDAN algorithm produces better modal decomposition results compared to other techniques (Zhang et al. 2021):

  • 1. Gaussian white noise is added to the original monthly runoff signal
    (1)
    where is the amplitude of white noise.
  • 2. The first modal component and the residual can be obtained by decomposing the monthly runoff signal by empirical mode decomposition (EMD) after adding Gaussian white noise
    (2)
    (3)
    where n is the amplitude of the added white noise.
  • 3. After conducting EMD, add white noise to the residual to get the second modal component and the residual
    (4)
    (5)
    where E is the symbol for the EMD decomposition and is the resultant Cth order modal component.
  • 4. The initial monthly runoff signal is decomposed into the following equation by repeating the previous stages for each additional white noise decomposition until the residuals produced cannot be further decomposed:
    (6)

Variational mode decomposition

The creation and resolution of variational issues is the central concept of VMD. VMD is an adaptive, fully nonrecursive signal processing method that can efficiently deal with high-frequency signals because it does not rely on a fixed time–frequency resolution, but instead determines the frequency of each modal function through variational optimization (Dragomiretskiy & Zosso 2014; Farshad et al. 2023):
(7)
where is the instantaneous amplitude function and is the nondecreasing instantaneous phase function. Then , defining as the instantaneous frequency of .

TCN–LSTM network

The TCN-–LSTM model is a powerful predictive tool that combines the strengths of both TCN and LSTM networks. The TCN, proposed by Bai et al. (2018), is a convolutional neural network model with a unique structure. This model is built upon traditional one-dimensional convolutional neural networks and incorporates dilated convolution, causal convolution, and residual connections to create a novel network architecture (Liu & Song 2019; Guo et al. 2022). TCN is well suited for addressing time-series problems. Dilated convolution allows for the expansion of input sampling from the previous layer, enabling the extraction of feature information from longer intervals and noncontinuous time-series data (Li et al. 2022). Causal convolution ensures that feature extraction is causal in nature, with the output at any given time step dependent solely on the input prior to that time step. Figure 4 illustrates the dilated causal convolution structure of TCN with a convolution kernel size of 2 and dilation factors of 1, 2, and 4 (Xu et al. 2021).
Figure 4

Dilated causal convolution structure of TCN.

Figure 4

Dilated causal convolution structure of TCN.

Close modal
LSTM networks excel in time-series prediction, and by integrating the TCN network's feature extraction, the LSTM network's processing efficiency is improved, leading to a more effective prediction model that can learn the complex interactions of time series (Cho et al. 2022; Liu et al. 2022). The structure of the LSTM memory cell is shown in Figure 5.
Figure 5

LSTM memory cell structure.

Figure 5

LSTM memory cell structure.

Close modal

Evaluation indicators

Mean absolute error (MAE) and root mean square error (RMSE) are two error measures we utilized to assess the precision of our model's predictions. Also, the accuracy of each component's prediction as well as the final forecast result was evaluated using the Nash–Sutcliffe efficiency (NSE) coefficient. A value of 1 for the NSE indicates high accuracy and good prediction performance, lending high credibility to the model (Zhang, X. Q. et al. 2023):
(8)
(9)
(10)
where represents the projected value, represents the actual value, and is the average of the actual values.

Bimodal decomposition

The monthly runoff data from 1957 to 2021 were subjected to single-mode decomposition with the addition of 500 times the noise. The noise standard deviation ratio was set to 0.3, allowing a maximum of 800 sifting iterations. The results of the single-mode runoff decomposition using CEEMDAN are shown in Figure 6.
Figure 6

CEEMDAN primary modal decomposition diagram.

Figure 6

CEEMDAN primary modal decomposition diagram.

Close modal

CEEMDAN decomposes the runoff data from Wulong station into nine IMF components and a residual term. The sample entropy of each component is calculated, where a higher entropy value indicates a more complex component. The entropy values of IMF1–IMF4 range from 0.98 to 0.63, indicating high-frequency components that reflect the irregular fluctuations in the runoff sequence. The entropy values of IMF5–IMF9 range from 0.50 to 0.35, representing low-frequency components that reveal the trend characteristics of the runoff sequence and exhibit regularity. In Figure 6, the IMF1–IMF4 components are characterized as high-frequency components. VMD is employed to further decompose the high-frequency components obtained after the initial CEEMDAN processing.

First, the IMF1–IMF4 components are aggregated into a single sequence, and then VMD is applied to this aggregated sequence for a secondary decomposition. The results of the bimodal decomposition are shown in Figure 7.
Figure 7

VMD second-stage decomposition.

Figure 7

VMD second-stage decomposition.

Close modal

After bimodal decomposition, five VIMF components and a residual term (VRes) are generated. In Figure 7, compared to the IMF1–IMF4 components obtained from CEEMDAN decomposition, the components after VMD decomposition exhibit reduced fluctuations and greater regularity. This makes them more conducive to capturing information within the components for subsequent use in the TCN–LSTM model and for improving runoff prediction.

Model parameter setting

After bimodal decomposition, TCN employs the rectified linear unit (ReLU) activation function, while LSTM utilizes the hyperbolic tangent (tanh) activation function. The most significant influences on predictive accuracy stem from the size of convolutional kernels, the number of convolutional layers, the number of LSTM layers, and the quantity of neurons. Consequently, determining the optimal values for these four elements becomes the focal point of building an enhanced predictive model for comparative experiments.

  • 1. Determining the size and number of convolutional kernels: Based on the bimodal decomposition and dual-channel convolution, two layers of LSTM (with 100 neurons each) were employed. The convolutional kernel size and the number of convolutional layers were varied in both channels. Comparative results for various errors are presented in Table 1.

  • 2. Determining the number of LSTM layers and neurons: Using bimodal decomposition and dual-channel convolution with convolutional kernels of (2,2,1) and (3,3,1), the number of LSTM layers and neurons were varied. Comparative results for different errors are presented in Table 2.

Table 1

Comparison of different TCN architectures

ModelConvolutional kernelsMAERMSE
Double-layer TCN + LSTM (1,1,1) and (2,2,1) 0.2367 0.2261 
Double-layer TCN + LSTM (2,2,1) and (3,3,1) 0.1022 0.1873 
Double-layer TCN + LSTM (3,3,1) and (4,4,1) 0.2094 0.2225 
Double-layer TCN + LSTM (4,4,1) and (5,5,1) 0.2381 0.2269 
Double-layer TCN + LSTM (5,5,1) and (6,6,1) 0.2534 0.2292 
Single-layer TCN + LSTM (2,2,1) and (3,3,1) 0.2987 0.2360 
ModelConvolutional kernelsMAERMSE
Double-layer TCN + LSTM (1,1,1) and (2,2,1) 0.2367 0.2261 
Double-layer TCN + LSTM (2,2,1) and (3,3,1) 0.1022 0.1873 
Double-layer TCN + LSTM (3,3,1) and (4,4,1) 0.2094 0.2225 
Double-layer TCN + LSTM (4,4,1) and (5,5,1) 0.2381 0.2269 
Double-layer TCN + LSTM (5,5,1) and (6,6,1) 0.2534 0.2292 
Single-layer TCN + LSTM (2,2,1) and (3,3,1) 0.2987 0.2360 
Table 2

Comparison of different LSTM structures

LSTM layersNeuronal numberMAERMSE
100 0.2937 0.2335 
200 0.2977 0.2360 
100 0.2150 0.2082 
200 0.2423 0.2172 
100 0.1022 0.1873 
200 0.2107 0.2141 
LSTM layersNeuronal numberMAERMSE
100 0.2937 0.2335 
200 0.2977 0.2360 
100 0.2150 0.2082 
200 0.2423 0.2172 
100 0.1022 0.1873 
200 0.2107 0.2141 

After comparison, the combination estimation model constructed with convolutional kernels (2,2,1) and (3,3,1), 2 Convolutional Neural Network (CNN) layers, 3 LSTM layers, and 100 neurons fits the experimental data in this paper better, delivering the best fitting performance.

Runoff prediction

After the bimodal decomposition with CEEMDAN–VMD, a total of 10 components and 2 residual terms are generated. These decomposition results are utilized in the TCN–LSTM prediction model. TCN can capture both long-term and short-term dependencies in the data, while LSTM is well suited for handling sequential data. The TCN–LSTM model optimizes the overall network structure, resulting in excellent predictive performance and a better fit to the measured dataset. It leverages the advantages of both TCNs and LSTM networks, enabling more effective processing of long-term data and enhanced high-order feature extraction from multisource data. The model parameters are set as follows: TCN has an expansion factor of [1, 2, 4, 8, 16] for each layer, and a convolutional kernel size of 3 × 3, with 64 convolutional kernels in each layer. LSTM uses a three-layer model with 64 hidden nodes and a tanh activation function.

The prediction results for each component are compared with the observed values, as shown in Figure 8. The final prediction results for Wulong station are compared with the observed values as shown in Figure 9.
Figure 8

Results of observed and predicted values for each component.

Figure 8

Results of observed and predicted values for each component.

Close modal
Figure 9

Plot of final prediction results of CEEMDAN–VMD–TCN–LSTM model.

Figure 9

Plot of final prediction results of CEEMDAN–VMD–TCN–LSTM model.

Close modal

In Figure 8, the prediction accuracy of IMF5 is notably high, with minimal deviations between the predicted and observed values. The variations in the predicted values align closely with the variations in the initial 624 months of IMF5 from the training samples (Figure 6). The fluctuation range of predicted values for IMF6-VRes generally corresponds to the corresponding training samples in Figure 6, and the error between predicted and observed values is small. These outcomes demonstrate the CEEMDAN–VMD–TCN–LSTM model's capability to effectively extract and reliably predict the trends and features of the training samples.

However, due to the substantial fluctuations and limited regularity of VIMF1–VIMF3, some errors are observed in the prediction results. Nevertheless, the fluctuation range of the predicted values for VIMF1–VIMF3 remains consistent with the training samples.

Dual-mode decomposition validation

To validate the effectiveness of bimodal decomposition, both the LSTM and TCN–LSTM prediction models were subjected to both single-mode and bimodal decomposition. Comparative analysis was performed to evaluate the prediction accuracy of these two prediction models under different decomposition modes.

As shown in Table 3, the CEEMDAN–VMD–LSTM model reduced the MAE and RMSE by 29.00% and 30.45%, respectively, compared to the CEEMDAN–LSTM model. Furthermore, the CEEMDAN–VMD–TCN–LSTM model exhibited a reduction of 47.78% in MAE and 35.26% in RMSE when compared to the CEEMDAN–TCN–LSTM model. These results indicate that bimodal decomposition significantly enhances the prediction model's accuracy (Zhang et al. 2024), further confirming the effectiveness of the CEEMDAN–VMD decomposition method.

Table 3

Prediction errors for different decomposition modes (108 m3/month)

Decomposing modalitiesPredictive modelsMAERMSE
One-stage decomposition CEEMDAN–LSTM 0.2286 0.3291 
CEEMDAN–TCN–LSTM 0.1957 0.2893 
Second-stage decomposition CEEMDAN–VMD–LSTM 0.1623 0.2289 
CEEMDAN–VMD–TCN–LSTM 0.1022 0.1873 
Decomposing modalitiesPredictive modelsMAERMSE
One-stage decomposition CEEMDAN–LSTM 0.2286 0.3291 
CEEMDAN–TCN–LSTM 0.1957 0.2893 
Second-stage decomposition CEEMDAN–VMD–LSTM 0.1623 0.2289 
CEEMDAN–VMD–TCN–LSTM 0.1022 0.1873 

Coupled prediction model validation

In order to verify that the TCN-LSTM prediction model can capture more implicit features and improve the prediction accuracy of the sequences, CEEMDAN-VMD-TCN-LSTM is therefore compared with the CEEMDAN-VMD-TCN and CEEMDAN-VMD-LSTM models. The results of inter-model error metrics are shown in Table 4.

Table 4

Simulation results of different models

Predictive modelsMAE (108m3/month)RMSE (108m3/month)NSE
TCN–LSTM 0.2629 0.3854 0.79 
CEEMDAN–VMD–TCN 0.1797 0.2483 0.91 
CEEMDAN–VMD–LSTM 0.1623 0.2289 0.93 
CEEMDAN–VMD–TCN–LSTM 0.1022 0.1873 0.95 
Predictive modelsMAE (108m3/month)RMSE (108m3/month)NSE
TCN–LSTM 0.2629 0.3854 0.79 
CEEMDAN–VMD–TCN 0.1797 0.2483 0.91 
CEEMDAN–VMD–LSTM 0.1623 0.2289 0.93 
CEEMDAN–VMD–TCN–LSTM 0.1022 0.1873 0.95 

The CEEMDAN–VMD–TCN–LSTM model effectively extracts the hidden features of time series, resulting in improved prediction accuracy (Ying et al. 2023). Compared to the CEEMDAN–VMD–LSTM model, the CEEMDAN–VMD–TCN–LSTM model reduced the MAE and RMSE by 37.03 and 18.17%, respectively. When compared to the CEEMDAN–VMD–TCN model, the CEEMDAN–VMD–TCN–LSTM model also demonstrates outstanding performance in terms of prediction accuracy, with a reduction in MAE and RMSE by 43.13 and 24.57%, respectively. The sole use of TCN–LSTM for runoff prediction yields relatively average results (Guo et al. 2024). In contrast, the CEEMDAN–VMD–TCN–LSTM model exhibits remarkable performance in terms of prediction accuracy, achieving a score of 0.95.

Multimodel comparison

The Taylor diagram in Figure 10 provides a more intuitive display of the accuracy of different models, facilitating the comparison of accuracy and reliability among the four models. In the diagram, each model is represented by a scatter point, where the position of the point indicates the correlation between the standard deviation of the model and the observed data. The closer the model's point is to the observed data point, the higher the correlation and consistency between the model's predictions and the observed values. The radial lines in the diagram represent correlation coefficients, while the horizontal and vertical axes represent standard deviations.
Figure 10

Model comparison Taylor diagram.

Figure 10

Model comparison Taylor diagram.

Close modal

While utilizing the CEEMDAN–VMD–TCN–LSTM model for runoff prediction, certain limitations persist. In this study, monthly runoff data from the Wulong station for the period 1957–2021 were utilized. However, due to the data being limited to a single station, the broad applicability of the model to other stations or under varying geographical conditions has not yet been verified. This limitation constrains the model's generalization capability across different basins. The study focuses primarily on runoff prediction at the monthly scale, and verification at finer temporal scales (such as daily or hourly) has not yet been conducted. Future research could explore the model's applicability using data at shorter time intervals.

This study proposes a runoff prediction model based on TCN–LSTM deep learning and CEEMDAN–VMD bimodal decomposition. Through bimodal decomposition, the prediction errors caused by high-frequency IMFs can be effectively addressed. The IMF processed by bimodal decomposition serves as input to the prediction model TCN–LSTM, resulting in MAE = 0.1022 × 108 m3/month and RMSE = 0.1873 × 108 m3/month. Compared to the prediction results of single-mode decomposition, the error is reduced by at least 35%. After dual modal decomposition using CEEMDAN and VMD, the MAE and RMSE of the TCN–LSTM model were reduced by 37.03 and 18.17%, respectively, compared to the standalone LSTM. The CEEMDAN–VMD–TCN–LSTM model demonstrates promising potential for application in runoff prediction.

RZ conceptualized the whole study, developed the methodology, wrote the original draft, and investigated and validated the work. ZZ supervised the process, wrote the review and edited the article, rendered support in data curation, and visualized the work.

This manuscript was supported by the Innovation Fund for Doctoral Students of North China University of Water Resources and Electric Power. The grant number is NCWUBC202303.

Rainfall data are available at https://data.cma.cn. Temperature data are available at https://www.geodata.cn.

The authors declare there is no conflict.

Adera
S.
,
Bellugi
D.
,
Dhakal
A.
&
Larsen
L.
(
2024
)
Streamflow prediction at the intersection of physics and machine learning: A case study of two Mediterranean-climate watersheds
,
Water Resources Research
,
60
(
7
),
e2023WR035790
.
https://doi.org/10.1029/2023WR035790
.
Bai
S.
,
Kolter
J.
&
Koltun
V.
(
2018
)
An empirical evaluation of generic convolutional and recurrent networks for sequence modeling
. ArXiv, 4 (19). Available at: https://arxiv.org/abs/1803.01271.
Ban
W. C.
,
Shen
L. D.
,
Chen
L.
&
Xu
C. T.
(
2023
)
Monthly runoff prediction based on variational modal decomposition combined with the dung beetle optimization algorithm for gated recurrent unit model
,
Environmental Monitoring and Assessment
,
195
,
1538
.
https://doi.org/10.1007/s10661-023-12102-y
.
Cho
M. W.
,
Kim
C. S.
,
Jung
K. Y.
&
Jung
H.
(
2022
)
Water level prediction model applying a long short-term memory (LSTM)–gated recurrent unit (GRU) method for flood prediction
,
Water
,
14
(
14
),
2221
.
https://doi.org/10.3390/W14142221
.
Choubin
B.
,
Solaimani
K.
,
Rezanezhad
F.
,
Roshan
M. H.
,
Malekian
A.
&
Shamshirband
S.
(
2019
)
Streamflow regionalization using a similarity approach in ungauged basins: Application of the geo-environmental signatures in the Karkheh River Basin, Iran
,
Catena
,
182
,
104128
.
https://doi.org/10.1016/j.catena.2019.104128
.
Dragomiretskiy
K.
&
Zosso
D.
(
2014
)
Variational mode decomposition
,
IEEE Transactions on Signal Processing
,
62
(
3
),
531
544
.
https://doi.org/10.1109/TSP.2013.2288675
.
Farshad
A.
,
Mansour
T.
&
Meysam
S.
(
2023
)
Streamflow prediction using a hybrid methodology based on variational mode decomposition (VMD) and machine learning approaches
,
Applied Water Science
,
13
(
6
),
135
.
https://doi.org/10.1007/S13201-023-01943-0
.
Ghazi
B.
,
Jeihouni
E.
,
Kisi
O.
,
Pham
Q.
&
Durin
B.
(
2022
)
Estimation of Tasuj aquifer response to main meteorological parameter variations under shared socioeconomic pathways scenarios
,
Theoretical and Applied Climatology
149
,
25
37
.
https://doi.org/10.1007/s00704-022-04025-4
.
Guo
Z.
,
Ling
K.
,
Zhong
S.
,
Miao
H.
&
Guo
W.
(
2024
)
Approach for short-term power load prediction utilizing the ICEEMDAN–LSTM–TCN–bagging model
,
Journal of Electrical Engineering & Technology
,
9
,
1
13
.
https://doi.org/10.1007/s42835-024-02040-1
.
He
F.
,
Wan
Q.
,
Wang
Y.
,
Wu
J.
,
Zhang
X.
&
Feng
Y.
(
2024
)
Daily runoff prediction with a seasonal decomposition-based deep GRU method
,
Water
,
16
,
618
.
https://doi.org/10.3390/w16040618
.
He
L.
,
Hou
M. Q.
,
Chen
S. Z.
,
Zhang
J. R.
,
Chen
J. Y.
&
Qi
H.
(
2021
)
Construction of a spatio-temporal coupling model for groundwater level prediction: A case study of Changwu area, Yangtze River Delta region of China
,
Water Supply
,
21
(
7
),
3790
3809
.
https://doi.org/10.2166/ws.2021.140
.
Hu
F.
,
Yang
Q.
,
Yang
J.
,
Luo
J.
&
Wang
Q.
(
2024
)
Incorporating multiple grid-based data in CNN-LSTM hybrid model for daily runoff prediction in the source region of the Yellow River Basin
,
Journal of Hydrology: Regional Studies
,
51
,
101652
.
Kala
A.
,
Ganesh Vaidyanathan
S.
&
Sharon Femi
P.
(
2019
)
CEEMDAN hybridized with LSTM model for forecasting monthly rainfall
,
Journal of Intelligent & Fuzzy Systems
,
43
(
3
),
2609
2617
.
https://doi.org/10.3233/JIFS-213064
.
Li
D.
,
Jiang
F. X.
,
Chen
M.
&
Tao
Q.
(
2022
)
Multi-step-ahead wind speed forecasting based on a hybrid decomposition method and temporal convolutional networks
,
Energy
,
238
(
Part C
),
121981
.
https://doi.org/10.1016/J.ENERGY.2021.121981
.
Liu
X.
&
Song
Q. H.
(
2019
)
CEEMDAN adaptive threshold denoising algorithm for seismic directions
,
Journal of Chongqing University
,
42
(
7
),
95
104
.
https://doi.org/10.11835/j.issn.1000-582X.2019.07.011
.
Liu
Y. M.
,
Song
J. Y.
,
Zhao
Z. Z.
,
Ye
G. W.
,
Liu
Z. B.
&
Zhou
Y.
(
2022
)
Adaptive residual life prediction for small samples of mechanical products based on feature matching preprocessor – LSTM
,
Applied Sciences
,
12
(
16
),
8236
.
https://doi.org/10.3390/APP12168236
.
Liu
Z. J.
,
Guo
S. L.
&
Xu
X. F.
(
2019
)
Research progress and prospect of Bayesian probabilistic hydrological forecasting
,
Journal of Water Conservancy
,
50
(
12
),
1467
1478
.
https://doi.org/10.13243/j.cnki.slxb.20190424
.
Shahid
M.
,
Rahman
K. U.
&
Haider
S.
(
2021
)
Quantitative assessment of regional land use and climate change impact on runoff across Gilgit watershed
,
Environmental Earth Sciences
,
80
,
743
.
https://doi.org/10.1007/s12665-021-10032-x
.
Torres
M. E.
,
Colominas
M. A.
,
Schlotthauer
G.
&
Flandrin
P.
(
2011
). '
A complete ensemble empirical mode decomposition with adaptive noise
',
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp.
4144
4147
.
Vatanchi
S. M.
,
Etemadfard
H.
&
Maghrebi
M. F.
(
2023
)
A comparative study on forecasting of long-term daily streamflow using ANN, ANFIS, BiLSTM and CNN-GRU-LSTM
,
Water Resources Management
,
37
,
4769
4785
.
https://doi.org/10.1007/s11269-023-03579-w
.
Xu
Y. H.
,
Hu
C. H.
,
Wu
Q.
,
Li
Z. C.
,
Jian
S. Q.
&
Chen
Y. Q.
(
2021
)
Application of temporal convolutional network for flood forecasting
,
Hydrology Research
,
52
(
6
),
1455
1468
.
https://doi.org/10.2166/NH.2021.021
.
Yang, C., Jiang, Y., Liu, Y., Liu, S. & Liu, F. (
2023
)
A novel model for runoff prediction based on the ICEEMDAN-NGO-LSTM coupling
,
Environmental Science and Pollution Research
,
30
,
82179
82188
.
https://doi.org/10.1007/s11356-023-28191-8
.
Yao
J.
,
Cai
Z.
,
Qian
Z.
&
Yang
B.
(
2023
)
A noval approach based on TCN-LSTM network for predicting waterlogging depth with waterlogging monitoring station
,
PLoS One
,
18
(
10
),
e0286821
.
https://doi.org/10.1371/journal.pone.0286821
.
Ying
R.
,
Siyuan
W.
&
Bisheng
X.
(
2023
)
Deep learning coupled model based on TCN-LSTM for particulate matter concentration prediction
,
Atmospheric Pollution Research
,
14
(
4
),
101703
.
Zeydalinejad
N.
,
Pour-Beyranvand
A.
,
Nassery
H. R.
&
Ghazi
B.
(
2024
)
Evaluating climate change impacts on snow cover and karst spring discharge in a data-scarce region: A case study of Iran
,
Acta Geophysica
,
7
(
1
), 4649–4669.
https://doi.org/10.1007/s11600-024-01400-9
.
Zhang, X. Q. & Zheng, Z. W. (2023) Prediction of suspended sediment concentration in the lower Yellow River in China based on the coupled CEEMD-NAR model, Environmental Science and Pollution Research, 30 (11), 30960–30971. https://doi.org/10.1007/s11356-022-24406-6.
Zhang, S., Yan, Z. J. & Xu, C. X. (2020) Monthly runoff prediction model based on MPGA-LSTM and its application, Hydropower Energy Science, 38 (5), 38–41.
https://doi.org/CNKI:SUN:SDNY.0.2020-05-010
.
Zhang, X. Q., Zheng, Z. W. & Wang, K. (2021) Prediction of runoff in the upper Yangtze River based on CEEMDAN-NAR model, Water Supply, 21 (7), 3307–3318.
https://doi.org/10.2166/WS.2021.121
.
Zhang, X., Wang, X., Li, H., Sun, S. & Liu, F. (2023) Monthly runoff prediction based on a coupled VMD-SSA-BiLSTM model, Scientific Reports, 13, 13149.
https://doi.org/10.1038/s41598-023-39606-4
.
Zhang, X. Q., Zheng, Z. W., Sun, S., Wen, Y. & Chen, H. (2023) Study on the driving factors of ecosystem service value under the dual influence of natural environment and human activities, Journal of Cleaner Production, 420, 138408.
https://doi.org/10.1016/j.jclepro.2023.138408.
Zhang, X., Ren, H., Liu, J., Zhang, Y. & Cheng, W. (2024) A monthly temperature prediction based on the CEEMDAN–BO–BiLSTM coupled model, Scientific Reports, 14 (1), 808.
https://doi.org/10.1038/s41598-024-51524-7
.
Zhao
S. R.
,
Yang
Z.
,
Zhang
S. T.
,
Wu
J. R.
,
Zhao
Z. X.
,
Jeng
D. S.
&
Wang
Y. G.
(
2023
)
Predictions of runoff and sediment discharge at the lower Yellow River Delta using basin irrigation data
,
Ecological Informatics
,
78
,
102385
.
Zhao
Y.
,
Chen
L. X.
&
Liang
M. J.
(
2022
)
Rainfall-induced landslide temporal probability prediction and meteorological early warning modeling based on LSTM TCN model
,
Bulletin of Geological Science and Technology
,
1
,
16
.
https://doi.org/10.19509/j.cnki.dzkq.tb20220657
.
Zheng
Z.
,
Zhang
X.
,
Yin
Q.
,
Liu
F.
,
Ren
H.
&
Zhao
R.
(
2024a
)
A novel optimization rainfall coupling model based on stepwise decomposition technique
,
Scientific Reports
,
14
,
15617
.
https://doi.org/10.1038/s41598-024-66663-0
.
Zheng
Z.
,
Yao
Y.
,
Zhang
X.
,
Zhao
Y.
&
Qi
Y.
(
2024b
)
Novel optimized coupled rainfall model simulation based on stepwise decomposition technique
,
Water Science and Technology
,
90
(
4
),
1164
1180
.
https://doi.org/10.2166/wst.2024.263
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).