Accurate forecasting of increasingly unpredictable river runoff is essential for effective water resource management in the face of climate change and human activities. This study uses four machine learning models of long short-term memory neural networks (LSTM), support vector machine (SVM), random forest, and artificial neural network models to improve runoff forecasting accuracy and explore combined forecasting models’ effectiveness. This study develops three advanced combined forecasting models (empirical mode decomposition (EMD)–LSTM, VMD–LSTM, wavelet analysis (WA)–LSTM) by combining preprocessing techniques of EMD, variational mode decomposition (VMD), and WA with the LSTM modeling method. These models use signal decomposition techniques to analyze 41 years of runoff data from the Huanren station (1980–2020). The findings reveal that the LSTM model outperforms the other three individual machine learning models when forecasting days with high runoff. Among the three decomposed combined models, the VMD–LSTM model demonstrates the best overall performance during the validation period, achieving root mean square error, Nash–Sutcliffe efficiency coefficient, and bias values of 52.14 m3/s, 0.96, and −0.002, respectively. The combination of LSTM with signal decomposition techniques shows promising potential for enhancing runoff prediction accuracy, with practical implications for water resource management and flood control strategies.

  • LSTM excels in preserving the integrity of rainfall and runoff data, leading to superior performance.

  • The LSTM model demonstrates a significant advantage in forecasting high-flow days compared to other single models.

  • The VMD–LSTM model achieves the best overall performance by effectively filtering out noise from the data.

Runoff forecasting is closely related to watershed hydrology and flood management, as it plays a crucial role in understanding and managing water resources within a watershed. Watershed characteristics, such as its size, shape, river, geology, and river network, determine how precipitation is distributed across the watershed and how it is converted into runoff (Abed-Elmdoust et al. 2016; Sarker et al. 2019; Gao et al. 2022; Singhal et al. 2024). With the ability to predict dynamic changes in river runoff, decision-makers can make informed choices about how to distribute and utilize water resources within a basin. This includes ensuring an equitable allocation of water for various purposes, such as agriculture, industry, and domestic use. It also enables them to plan for potential water shortages or excesses and take proactive measures to address these challenges. Furthermore, river runoff forecasting is essential for improving flood prevention and mitigation capabilities (Hu et al. 2020).

With the continuous advancement of artificial intelligence technology, an increasing number of machine learning models have been employed in the field of runoff forecasting. Some commonly used models include long short-term memory neural networks (LSTM), support vector machines (SVMs), random forest (RF) models, and artificial neural networks (ANNs). For instance, Lin & Cheng (2006) introduced a radial basis kernel function in the SVM modeling process and obtained better results than the ANN. Wang et al. (2009) compared the performance of the auto-regressive moving average model, ANN, adaptive network-based fuzzy inference system, genetic programming, and SVM models in predicting monthly runoff time series, with SVM demonstrating superior performance in long-term runoff prediction. Li et al. (2020) developed RF and SVM models for annual, flood, and dry period average incoming runoff in the Longjiang Reservoir in Yunnan and achieved high overall accuracy. However, the models' prediction accuracy at local extreme flows could be improved.

In further research, Lai et al. (2015) constructed a flood risk evaluation model based on the RF intelligent algorithm in the Dongjiang River basin. The model required minimal parameters and did not necessitate the setting of indicator weights and grading criteria, providing a simple implementation process for flood risk evaluation. Mou et al. (2009) focused on the Urumqi River source No. 1 glacier area and developed a feed-forward ANN model for runoff forecasting in the alpine cold region. The model not only yielded better simulation and prediction results but also provided insight into the network structure characteristics for studying glacier ablation runoff patterns. Zhu et al. (2005) tested an ANN model using water level station data located in the Beijiang River, Zhujiang Delta, demonstrating that good forecasting results could be achieved with an appropriate selection of input layer unit data and forecasting periods. Wang et al. (2022) utilized an ant-lion optimization algorithm-based LSTM network to study an earth-rock dam in southwest China, producing predictions that aligned well with real-world engineering. Yin et al. (2019) compared the LSTM model with the Xin'anjiang model for a gauging station in the Jinjiang River basin and found that the LSTM model outperformed the Xin'anjiang model across different forecasting periods. The aforementioned studies showcase the unique advantages of LSTM models in hydrological forecasting. However, most of these studies focus on single models and limited optimization of model parameters, without preprocessing of input data.

Runoff time series are complex and non-linear, making direct prediction challenging (Peng et al. 2017). Therefore, data preprocessing methods based on signal decomposition, such as empirical mode decomposition (EMD), wavelet analysis (WA), and variational mode decomposition (VMD), have been increasingly used in hydrological research. For instance, Napolitano et al. (2011) built an EMD–ANN using monitoring data from a gauging station in Maryland, USA and found that the combined forecasting model had higher accuracy compared to a single ANN model. Tayyab et al. (2019) developed a WA–ANN for monthly flow prediction in the Jinsha River Basin and demonstrated that WA improved the accuracy of flow prediction. Qi et al. (2022) combined VMD with LSTM models to construct a VMD–LSTM forecasting model for monthly runoff in the upper reaches of the Yellow River basin, achieving satisfactory prediction results. Liu & Wang (2022) proposed an EMD–LSTM model based on deep learning, which outperformed traditional LSTM models and showed greater adaptability in flood prevention and mitigation.

While machine learning models have been widely studied in hydrological prediction, their integration with signal decomposition methods, such as EMD, has not been extensively explored in the realm of hydrological forecasting. Therefore, our study aims to address this gap by examining the collective effectiveness of these models and signal decomposition techniques, providing valuational insights into their collaborative potential for enhancing the precision and reliability of hydrological forecasts. Based on the above background, this study investigates the runoff of the Huanren station between 1980 and 2020 using four single machine learning models (LSTM, SVM, RF, and ANN) and three combined models (EMD–LSTM, VMD–LSTM, and WA–LSTM). The comparison and analysis of the different models will shed light on the accuracy and effectiveness of each model in predicting runoff. It is of great significance to the water resource management and flood control strategies in the Hun River basin.

Study area

The Hun River is the largest right tributary of the Yalu River, originating from the southern foot of the Laogang Mountain Range in the Changbai Mountain System. It flows for a total length of 432 km, passing through the provinces of Jilin and Liaoning in a northeast-to-southwest direction. The section of the river above the Huanren Hydropower Station dam site spans 247 km, covering a controlled basin area of 10,400 km2. Within the watershed, there are 10 rainfall stations: Sanchazi, Badaojiang, Balishao, Tonghua, Sankeshu, Badaogou, Dongcun, Yezhugou, Huadian, and Huanren.

The Hun River basin is situated at the northern edge of the storm center in northeast China. It is characterized by mountainous terrain, steep hills, and abundant vegetation. The region experiences a temperate monsoon climate, with an average annual precipitation of 860 mm and an average runoff coefficient of 0.52. The majority of rainfall, about 70%, is concentrated between June and September, resulting in significant floods typically occurring from late July to mid-August. Due to the basin's undulating topography, the river's steep slope, and the limited storage capacity of the riverbed, heavy rainfall often leads to rapid rises and falls in water levels, with 80% of the total flood occurring within a 3-day period. Winter in the basin usually begins in November and ends in late March or early April, characterized by snowfall. In late March or early April, snow starts to thaw, leading to spring flooding as rivers and streams begin to thaw.

Data collection

This study focuses on the Huanren Hydropower Station and investigates the daily runoff data from 1980 to 2020 using four single forecasting models and three combined forecasting models. The study aims to analyze and predict the runoff at the Huanren Hydropower Station. Figure 1 shows the Hun River basin and the locations of the rainfall stations.
Figure 1

Location of the Huanren Hydropower Station basin and rainfall stations.

Figure 1

Location of the Huanren Hydropower Station basin and rainfall stations.

Close modal
Figure 2 summarizes the new framework developed in this study, which consists of three main components: data collection, decomposition methods, and forecasting models. These are explained in the following sections.
Figure 2

Framework for runoff prediction.

Figure 2

Framework for runoff prediction.

Close modal

Forecasting models

Long short-term memory neural network

LSTM is a type of recurrent neural network (RNN) specifically designed to address the problems of gradient disappearance and gradient explosion that often occur in traditional RNN (Rossi et al. 2024). The LSTM network consists of memory units and gating units, which work together to protect and control information using three main gates: the input gate, forget gate, and output gate. The working principle of this neural network at time t is illustrated in Figure 3. In the figure, ht represents the hidden state at time t, xt corresponds to the input at time t, and tanh refers to the hyperbolic tangent function. ct represents the entire cell state, while ft, it, and ot correspond to the forget gate, input gate, and output gate, respectively. The internal mechanism of the LSTM neural network can be described by the following:
(1)
(2)
(3)
(4)
(5)
(6)
where ft is the forget gate at time t; is the Sigmoid activation function, where a result of 1 indicates the complete retention of information from the previous layer, and a result of 0 indicates complete discarding of that information; xt is the input at time t; ht−1 is the hidden layer state at time t 1; it is the input gate at time t; is the update quantity of the memory cell; tanh is the hyperbolic tangent activation function; Ct is the state of the memory cell at the updating time step; denotes element-wise multiplication; Ct−1 is the state of the memory cell at time t 1; ot is the output gate at time t; ht is the hidden layer state at time t. ,, and correspond to the weight values of the forget gate, input gate, cell and output gate, respectively. , correspond to the bias (BIAS) terms of the forget gate, input gate, cell, and output gate, respectively.
Figure 3

Working principle of the LSTM neural network.

Figure 3

Working principle of the LSTM neural network.

Close modal

Support vector machine

The SVM was formally proposed by Vapnik (1995). It is a binary classification model that uses a linear classifier with the largest interval defined in the feature space to solve convex quadratic programming problems. For the basic principles and related formulas of the SVM model, refer Wang et al. (2003). In recent years, the application of SVM in hydrological work has been increasing, and some scholars have successfully applied SVM in groundwater prediction and flood forecasting (Wang et al. 2003; Swastik et al. 2019).

RF model

RF is an integrated learning algorithm proposed by Breiman (2001). It employs a bootstrap resampling method to draw multiple samples from the original data and constructs classification trees for each sampled dataset. The predictions of all classification trees are then combined through voting to obtain the final result. For the basic principles and related formulas of the RF model, refer Lai et al. (2015). RF is known for its robustness to outliers and noise, and it is less prone to overfitting. As a result, RF has been widely applied in various fields, such as genetic engineering, rock projects, and hydraulic engineering (Chen & Ishwaran 2012; Dong et al. 2013; Arash et al. 2023).

Artificial neural networks

ANNs are an information processing system that simulates the structure and function of brain neural networks using mathematical models (Hyo-Jin et al. 2024). For the basic principles and related formulas of the ANN model, see the paper of Zhu et al. (2005). Unlike traditional methods, ANN does not require a predefined model structure or an optimization algorithm for specific model parameters. As a result, it has been widely utilized in solving various practical problems, including river runoff forecasting, rainfall-runoff simulation, and water quality parameter forecasting (Shang et al. 1995; Li et al. 2002).

Decomposition methods

Empirical mode decomposition

EMD is a signal decomposition technique that breaks down a signal into physically meaningful intrinsic mode functions (IMFs) based on specific conditions (Masoud et al. 2023). The calculation steps of EMD are as follows:

  • ① Identify all the local extreme points of the original signal, x(t), and fit its upper envelope, xmax(t), and lower envelope, xmin(t), using the cubic spline interpolation function.

  • ② Calculate the average envelope, m(t), by taking the average of the upper and lower envelopes. Compute the difference sequence, h(t), between the original signal, x(t), and the average envelope: h(t) = x(t) m(t).

  • ③ Check if h(t) satisfies two conditions:

    • (a) At any point, the mean value of the upper and lower envelopes defined by the extreme points is zero.

    • (b) The mean value of h(t) at each instant is zero.

If these conditions are not satisfied, repeat the above steps with h(t) as the new signal until the conditions are met. Once the conditions are satisfied, h(t) becomes the first IMF, I1(t).

  • ④ Subtract I1(t) from the original signal and calculate the residual term, r1(t): r1(t) = x(t) − I1(t). Consider r(t) as the new original signal and repeat the above steps to decompose the remaining IMFs until a monotonic residual term is obtained.

Following these steps, EMD decomposes the original signal into a set of IMFs, which represent different frequency components of the signal.

Variational mode decomposition

VMD was proposed by Dragomiretskiy & Zosso (2014). It aims to decompose a signal into K components by determining the optimal center frequency and finite bandwidth through a search and solution process. This allows for a complete dissection of the signal's frequency domain and the separation of individual components. VMD effectively addresses the issue of mode mixing in EMD when dealing with non-linear and non-stationary signals. Furthermore, it is advantageous in handling noise-sensitive problems because VMD essentially consists of multiple adaptive Wiener filter groups.

The objective of VMD decomposition is to minimize the sum of estimated bandwidths for each mode while ensuring that the sum of all decomposed modes remains equal to the original signal. This variational constraint is expressed in the following equation:
(7)
where and correspond to the kth modal component and central frequency after decomposition, respectively; is the gradient operation; is the Dicla function; j is an imaginary unit; t is time; * is the convolution sign; is the tth data under the kth decomposed modality; is the predicted central frequency; is the set of central frequencies; s.t. is the constraint; K is the number of modalities to be decomposed; is the set of modal functions; f(t) is the original signal.
By solving Equation (7) and introducing the Lagrange multiplier, the constrained variational problem can be transformed into an unconstrained problem. This leads to the formulation of the extended Lagrange expression, which is shown in the following equation:
(8)
where λ and α are the Lagrange multiplier and quadratic penalty parameters, respectively.
The alternating direction method of multipliers algorithm is used to iteratively search for the optimal solution to the constrained variational model. This iterative process decomposes the original signal sequence into K narrowband modal components. The expressions for uk, ωk, and λ after each iteration are given in the following equations:
(9)
(10)
(11)
where , , , and correspond to the Fourier transforms of , , , and , respectively; ω is the frequency; γ is the noise tolerance.

Wavelet analysis

WA is a powerful method that allows for localized analysis of both time and frequency domains, overcoming the limitations of the Fourier transform (Lana et al. 2023). It has been a significant breakthrough in scientific methods. In practical scenarios, signals typically have low and relatively stable frequencies, while noise frequencies are often high. The wavelet transform involves decomposing the original signal using a set of low-pass and high-pass filter banks, which continuously decompose the signal into low-frequency approximate signals (caN) and high-frequency detailed signals (cd1, … , cdN) at different resolutions. This decomposition greatly enhances the local signal information. By reconstructing these low-frequency and high-frequency signals, a denoised signal can be obtained, which improves the noise immunity of the model.

A wavelet is a special kind of waveform, usually represented by the wavelet mother function . A certain translation in time and scaling at gives the wavelet function basis as follows:
(12)
where b is the translation factor, a is the scale factor.
The wavelet transform is a function of the scale parameter (a) and the translation parameter (b), representing a time-scale representation of the signal. The equation for the continuous wavelet transform is given by the following equation:
(13)
where is the continuous wavelet coefficients; is the original data; and is the conjugate function of .
To reduce the redundant data generated by the continuous wavelet transform, it is necessary to discretize the wavelet. This is achieved through the discrete wavelet transform, where the scale parameter (a) and the translation parameter (b) are discretized. The common approach for discretization is given by the following equation:
(14)
where . The corresponding discrete wavelet basis can be expressed as follows:
(15)
The discrete wavelet transform is given by the following equation:
(16)
where is the discrete wavelet coefficients; is the original data; is the conjugate function of .
The reconstruction process involves summing up the low and high-frequency signals, resulting in the final reconstructed signal. The reduction equation for this process is given by the following equation:
(17)
where is the restored data; is the low-pass filter; and is the high-pass filter.

Model evaluation metrics

The accuracy of the model is evaluated using three metrics: root mean square error (RMSE), Nash–Sutcliffe efficiency (NSE), and BIAS. These metrics are calculated as follows:
(18)
(19)
(20)
where i is the ith moment; N is the total time step; denotes predicted data; denotes real observations; denotes the average of the observations.

RMSE is a metric that quantifies the deviation between predicted and observed values. It ranges from [0, +∞], with 0 indicating the best fit of the model. NSE coefficient is used to assess the goodness of fit of hydrological model simulation results. Its value ranges from [−∞, 1], with 1 indicating perfect agreement between the simulated results and the observed data. BIAS is used to assess the discrepancy between the total water volume in the simulation results and the observed values. It takes values in the range of [−∞, +∞], with 0 being the ideal value. A positive value suggests an overall high water volume, while a negative value indicates a low water volume.

Results of input dimension selection

This study focused on analyzing the daily runoff and rainfall series of the Huanren station spanning from 1980 to 2020. The dataset was divided into two subsets for analysis – the training set, consisting of data from 1980 to 2012, and the validation set, comprising data from 2013 to 2020. To gain a deeper understanding of the temporal patterns and relationships within the runoff and rainfall series, the researchers employed the autocorrelation function (ACF) and partial autocorrelation function (PACF) plots. These plots provide visual representations of the correlation between current values and lagged values within the series. Figure 4 in the study showcases these plots. After careful examination of the ACF and PACF plots, the researchers determined the input variables for the model. It is found that the previous day's rainfall and runoff data exhibited significant correlations and dependencies with the current day's values. Therefore, these variables were chosen as the input for the model. By including the previous day's rainfall and runoff data as input variables, the model could effectively capture the temporal dynamics and improve the accuracy of predictions. This approach takes into account the influence of past values on the present, enabling the model to make informed predictions based on historical patterns observed in the dataset.
Figure 4

ACF and PACF maps of rainfall and runoff.

Figure 4

ACF and PACF maps of rainfall and runoff.

Close modal

Single model prediction results and analysis

The evaluation metrics for the four single models are presented in Table 1. The measured and predicted values during the training and validation period are compared in Figure 5. From the table, it is observed that the NSE values for all four single models are above 0.8, and the absolute values of BIAS are below 0.2. During the training period, the LSTM model performs best in terms of RMSE and NSE, with values of 48.71 m3/s and 0.96, respectively. The RMSE and NSE values for SVMs, RFs, and ANNs are 72.08 and 0.93, 66.60 m3/s and 0.94, and 71.56 and 0.93 m3/s, respectively. The ANN achieves the highest accuracy in terms of BIAS, with a value of 0.002. The BIAS values for the LSTM, SVM, and RF models are 0.13, −0.05, and −0.03, respectively.
Table 1

Evaluation indicators for the four single models

ModelsEvaluation indicatorsTraining periodValidation period
LSTM RMSE (m3/s) 48.71 70.11 
NSE 0.96 0.92 
BIAS 0.13 0.14 
SVM RMSE (m3/s) 72.08 72.45 
NSE 0.93 0.90 
BIAS −0.05 −0.04 
RF RMSE (m3/s) 66.60 78.47 
NSE 0.94 0.90 
BIAS −0.03 −0.06 
ANN RMSE (m3/s) 71.56 75.22 
NSE 0.93 0.90 
BIAS 0.002 0.006 
ModelsEvaluation indicatorsTraining periodValidation period
LSTM RMSE (m3/s) 48.71 70.11 
NSE 0.96 0.92 
BIAS 0.13 0.14 
SVM RMSE (m3/s) 72.08 72.45 
NSE 0.93 0.90 
BIAS −0.05 −0.04 
RF RMSE (m3/s) 66.60 78.47 
NSE 0.94 0.90 
BIAS −0.03 −0.06 
ANN RMSE (m3/s) 71.56 75.22 
NSE 0.93 0.90 
BIAS 0.002 0.006 
Figure 5

Prediction results for each single model.

Figure 5

Prediction results for each single model.

Close modal

During the validation period, the LSTM model also performs best in terms of RMSE and NSE, with values of 70.11 and 0.92 m3/s, respectively. The RMSE and NSE values for SVM, RF, and ANN are 72.45 and 0.90 m3/s, 78.47 and 0.90 m3/s, and 75.22 and 0.90 m3/s, respectively. ANN achieves the highest accuracy in terms of BIAS, with a value of 0.006. The BIAS values for the LSTM, SVM, and RF models are 0.14, −0.04, and −0.06, respectively. Based on the principles of the models, the LSTM model has an advantage in handling long time series data due to its gated structure. It effectively preserves the integrity of rainfall and runoff data, leading to better results in terms of RMSE and NSE during both the training and validation periods. However, the BIAS metric shows slightly lower accuracy for the LSTM model. This could be attributed to factors such as small rainfall and runoff values during the non-flood season, non-stationarity and non-linearity of the runoff sequence, and noise from human activities, which can adversely affect the prediction results and impact the BIAS metric.

In this study, we specifically identify days with daily runoff volumes of 2,000 m³/s and above as high-flow days during the validation period. To evaluate the prediction accuracy of each model on these high-flow days, we introduce the BIAS metric, as shown in Table 2. The results clearly indicate that the LSTM model outperforms the other models in forecasting high-flow days. To further enhance the prediction accuracy of the LSTM model, three signal decomposition methods, namely EMD, VMD, and WA, are introduced in this study. These methods are utilized to preprocess the data for the long runoff series and investigate the prediction accuracy of combined models, namely EMD–LSTM, VMD–LSTM, and WA–LSTM decomposition models.

Table 2

Daily forecast accuracy of high flows for each model

LSTMSVMRFANN
BIAS −0.0266 −0.1427 −0.1852 −0.1562 
LSTMSVMRFANN
BIAS −0.0266 −0.1427 −0.1852 −0.1562 

Decomposition results

Decomposition results of runoff series based on EMD

EMD was employed to decompose the daily runoff time series from 1980 to 2020 at the Huanren Station. The decomposition results are presented in Figure 6. EMD adaptively decomposes the original series into 10 IMFs and 1 trend component ® in descending order of frequencies. The irregularity observed in IMF1 can be attributed to the non-linear and noisy components present in the runoff series, further confirming the non-linearity and non-smoothness of the series. Following the decomposition, IMF1 exhibits the highest frequency and the largest amplitude, containing the primary information of the original sequence. On the other hand, IMF2–IMF10 exhibits decreasing frequencies and amplitudes, highlighting the local characteristics of the original sequence. Furthermore, the trend component indicates that the runoff at the Huanren Station exhibits an upward trend.
Figure 6

EMD-based decomposition of runoff sequences.

Figure 6

EMD-based decomposition of runoff sequences.

Close modal

Decomposition results of runoff series based on VMD

In this study, the researchers applied the variational mode decomposition (VMD) technique to decompose the daily runoff time series at the Huanren Station from 1980 to 2020. The results of the decomposition are analyzed and presented in Table 3. Table 3 displays the center frequencies of each sub-series corresponding to different values of K (the number of IMFs). Upon examination of the table, it was observed that at K = 8, the center frequencies of the two adjacent sub-series were 24 and 56 Hz, which were relatively close. This close proximity suggested the possibility of modal mixing, where the different modes interfere with each other. To avoid this interference, the researchers concluded that choosing K = 7 would be a more appropriate option. Based on this conclusion, the daily runoff sequence was decomposed into seven IMFs using the VMD decomposition technique. The decomposition results are visually presented in Figure 7, showcasing the seven IMFs obtained from the VMD decomposition process. This analysis and decomposition using VMD allowed for a better understanding of the underlying modes and frequencies present in the daily runoff time series. It provided valuable insights into the different temporal components contributing to the overall runoff variation, aiding in further analysis and modeling of the data.
Table 3

Center frequencies corresponding to different K-values

Number of modesCenter frequency (Hz)
       
92       
33 102      
33 101 213     
31 91 140 229    
31 88 135 202 253   
30 86 134 200 249 346  
24 56 99 140 203 251 349 
Number of modesCenter frequency (Hz)
       
92       
33 102      
33 101 213     
31 91 140 229    
31 88 135 202 253   
30 86 134 200 249 346  
24 56 99 140 203 251 349 
Figure 7

VMD-based decomposition of runoff sequences.

Figure 7

VMD-based decomposition of runoff sequences.

Close modal

WA-based decomposition results of runoff sequences

In this study, the selection of the wavelet basis function was a crucial step in the analysis of the data. Among the commonly used orthogonal wavelet bases, options, such as db1, db2, db3, and db4, were considered. After careful evaluation, the db4 wavelet basis function was chosen for several reasons. The db4 wavelet basis function is renowned for its balanced nature, exhibiting good local properties and a relatively smooth frequency response. These characteristics make it suitable for a wide range of signal-processing tasks, including signal denoising and compression. The db4 wavelet basis function has been widely employed and validated in various studies and applications. Considering the comprehensive evaluation of the available options, the db4 wavelet basis function was deemed the most appropriate choice for this study. It offered the potential to effectively capture the desired features and patterns within the runoff and rainfall time series, facilitating accurate analysis and modeling.

To determine the appropriate decomposition scale (J) for WA, a trial-and-error approach was employed. The decomposition scales were set as J = 1, 2, and 3. The LSTM model was utilized to generate prediction results for comparative analysis, aiming to identify the optimal decomposition scale, J. The decomposition plots of the runoff series at different scales are presented in Figure 8(a)–8(c) Additionally, the table of accuracy evaluation metrics for J = 1, 2, and 3 is provided in Table 4.
Table 4

Table of accuracy indicators for different decomposition scales

Scale of decompositionEvaluation indicatorsTraining periodValidation period
J = 1 RMSE (m3/s) 54.05 54.47 
NSE 0.96 0.95 
BIAS −0.17 −0.19 
J = 2 RMSE (m3/s) 84.22 87.38 
NSE 0.91 0.87 
BIAS 0.59 0.65 
J = 3 RMSE (m3/s) 75.46 77.27 
NSE 0.93 0.90 
BIAS 0.48 0.54 
Scale of decompositionEvaluation indicatorsTraining periodValidation period
J = 1 RMSE (m3/s) 54.05 54.47 
NSE 0.96 0.95 
BIAS −0.17 −0.19 
J = 2 RMSE (m3/s) 84.22 87.38 
NSE 0.91 0.87 
BIAS 0.59 0.65 
J = 3 RMSE (m3/s) 75.46 77.27 
NSE 0.93 0.90 
BIAS 0.48 0.54 
Figure 8

WA-based decomposition of runoff sequences: (a) J = 1, (b) J = 2, (c) J = 3.

Figure 8

WA-based decomposition of runoff sequences: (a) J = 1, (b) J = 2, (c) J = 3.

Close modal

Based on the aforementioned plots, it is evident that for this study, when employing the db4 wavelet basis function, the optimal decomposition scale is 1. The combined model prediction results in the validation period demonstrate that the RMSE for this scale is 54.47 m3/s, which is 37.66 and 29.51% lower than that of scales J = 2 and J = 3, respectively. Furthermore, the NSE for this scale is 0.95, representing a 9.20 and 5.56% improvement compared to scales J = 2 and J = 3, respectively. The absolute value of the BIAS is 0.19, indicating a reduction of 70.77 and 64.81% in comparison to scales J = 2 and J = 3, respectively.

Results and analysis of the combined forecast models

Table 5 presents the accuracy evaluation metrics for the three combined models and the single LSTM model. Additionally, Figure 9 illustrates the comparison between the measured and predicted values during the validation period. From Table 5, it is evident that the VMD–LSTM model exhibits the best overall performance among the three combined models. Its RMSE during the validation period is 52.14 m3/s, which is 31.57 and 4.28% lower than that of the EMD–LSTM model and the WA–LSTM model, respectively. Moreover, its NSE during the validation period is 0.96, which is 6.67 and 1.05% higher than that of the EMD–LSTM model and the WA–LSTM model, respectively. The absolute BIAS of the VMD–LSTM model during the validation period is 0.002, which is 96.67 and 98.95% lower than that of the EMD–LSTM model and the WA–LSTM model, respectively. Comparing the accuracy evaluation metrics of the VMD–LSTM model and the single LSTM model, it is evident that the VMD–LSTM model demonstrates improvements in all three evaluation metrics during the validation period. Specifically, the RMSE is reduced by 25.63% compared to the standalone LSTM model, the NSE is improved by 4.35%, and the absolute BIAS value is reduced by 98.57%. This suggests that the VMD method can effectively identify and filter out the noise present in the original runoff series and possesses strong non-linear processing capabilities. As a result, it addresses the modal mixing problem encountered in the EMD method and the noise sensitivity problem associated with the WA method. Consequently, the VMD–LSTM model enhances the smoothness and predictability of the runoff series.
Table 5

Accuracy evaluation indicators for decomposition combination models

ModelsEvaluation indicatorsTraining periodValidation period
LSTM RMSE (m3/s) 48.71 70.11 
NSE 0.96 0.92 
BIAS 0.13 0.14 
EMD–LSTM RMSE (m3/s) 48.63 76.19 
NSE 0.96 0.90 
BIAS 0.09 0.06 
VMD–LSTM RMSE (m3/s) 53.97 52.14 
NSE 0.96 0.96 
BIAS −0.002 −0.002 
WA–LSTM RMSE (m3/s) 54.05 54.47 
NSE 0.96 0.95 
BIAS −0.17 −0.19 
ModelsEvaluation indicatorsTraining periodValidation period
LSTM RMSE (m3/s) 48.71 70.11 
NSE 0.96 0.92 
BIAS 0.13 0.14 
EMD–LSTM RMSE (m3/s) 48.63 76.19 
NSE 0.96 0.90 
BIAS 0.09 0.06 
VMD–LSTM RMSE (m3/s) 53.97 52.14 
NSE 0.96 0.96 
BIAS −0.002 −0.002 
WA–LSTM RMSE (m3/s) 54.05 54.47 
NSE 0.96 0.95 
BIAS −0.17 −0.19 
Figure 9

Comparison of predicted and measured values in the validation period.

Figure 9

Comparison of predicted and measured values in the validation period.

Close modal

The findings of this study have significant implications for environmental protection and climate change adaptation. By improving runoff prediction through advanced machine learning and signal decomposition, we enhance water resource management and flood forecasting. Accurate predictions are vital for addressing climate change impacts, such as flooding and water scarcity (Sarker 2022). These improved models can assist policymakers and engineers in designing resilient infrastructure. Additionally, our approach aids in monitoring watershed health and protecting ecosystems at risk from climate change. Ultimately, our findings support informed decision-making on land use and water conservation, promoting sustainability and environmental stewardship.

In this study, four single machine learning models (LSTM, SVM, RF, and ANN) and three combined models (EMD–LSTM, VMD–LSTM, and WA–LSTM) were utilized to investigate the runoff of the Huanren station spanning from 1980 to 2020. The training set comprised data from 1980 to 2016, while the validation set encompassed data from 2017 to 2020. The LSTM model's applicability and accuracy were examined, and signal decomposition methods were employed to preprocess the runoff series in an attempt to enhance forecasting accuracy. The following conclusions were drawn:

  • (1) Among the four single models, the LSTM model demonstrated superior performance in terms of RMSE and NSE during both the training and validation periods, achieving values of 48.71 and 0.96 m3/s, and 70.11 and 0.91 m3/s, respectively. This can be attributed to the LSTM model's gate structure design, which overcomes the gradient disappearance and explosion issues associated with general RNN neural networks and effectively preserves the integrity of rainfall and runoff information in long time series.

  • (2) To evaluate the prediction accuracy of the four single models specifically on high-flow days, BIAS was introduced as an accuracy evaluation metric. The LSTM model exhibited an absolute BIAS value of 0.0266, which was 81.36, 85.64, and 82.97% lower compared to the SVM, RF, and ANN, respectively. This demonstrates the significant advantage of the LSTM model in forecasting high-flow days.

  • (3) Among the three combined models (EMD–LSTM, VMD–LSTM, and WA–LSTM), the VMD–LSTM model demonstrated the best overall performance, with RMSE, NSE, and BIAS values of 52.14 m3/s, 0.96, and −0.002, respectively, during the validation period. Notably, the VMD–LSTM model exhibited improved RMSE metrics during the validation period compared to the training period, indicating its ability to avoid overfitting issues and minimize generalization errors in the runoff prediction process.

  • (4) Comparing the VMD–LSTM model with the single LSTM model, it was observed that all three evaluation metrics displayed improved accuracy during the validation period. The VMD–LSTM model achieved a 98.57% reduction in the absolute value of BIAS compared to the single LSTM model. This highlights the effectiveness of the VMD method in identifying and filtering out noise from the original runoff sequence, thereby enhancing the smoothness and predictability of runoff sequences.

The results indicate that the LSTM model offers distinct advantages over the other three machine learning models in runoff forecasting, and the VMD–LSTM model significantly improves prediction accuracy. However, it should be known that only the previous day's rainfall and runoff are used as the inputs of these models in this study; further research can be undertaken to include some other variables in the inputs, such as humidity, temperature, and a large number of climate indices, which represent initial catchment conditions and the climate. Additionally, other methods such as EMD and singular spectrum decomposition can also be incorporated with the LSTM model to develop combined prediction models for further enhancing the accuracy of runoff predictions.

This study is supported by the National Key Research and Development Program of China (No. 2023YFC3006602) and the National Natural Science Foundation of China (Grant Nos. 52209028 and 51709108).

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Arash
V.
,
Jafar
J.
,
Sima
O.
,
Naser
S. H.
,
Sanaz
A. A.
&
José
A. F. O. C.
(
2023
)
An efficient random forest-based subset simulation method for reliability analysis of the marine structure piles subject to scour
,
Proceedings of the Institution of Civil Engineers – Maritime Engineering
, 176 (4), 176–192.
Breiman
L.
(
2001
)
Random forests
,
Machine Learning
,
45
(
1
),
5
32
.
Chen
X.
&
Ishwaran
H.
(
2012
)
Random forests for genomic data analysis
,
Genomics
,
99
(
6
),
323
329
.
Dong
L.
,
Li
X.
&
Peng
K.
(
2013
)
Prediction of rockburst classification using random forest
,
Transactions of Nonferrous Metals Society of China
,
23
(
2
),
472
477
.
Dragomiretskiy
K.
&
Zosso
D.
(
2014
)
Variational mode decomposition
,
IEEE Trans. Signal Processing
,
62
(
3
),
531
544
.
Gao
Y.
,
Sarker
S.
,
Sarker
T.
&
Leta
O. T.
(
2022
)
Analyzing the critical locations in response of constructed and planned dams on the Mekong River Basin for environmental integrity
,
Environmental Research Communications
,
4
(
10
),
101001
.
Hu
Q. F.
,
Cao
S. Y.
,
Yang
H. B.
,
Wang
Y. T.
,
Li
L. J.
&
Wang
L. H.
(
2020
)
Daily runoff predication using LSTM at the Ankang Station, Hanjing River
. Progress in Geography
,
39
(
04
),
636
642
.
(in chinese)
.
Hyo-Jin
P.
,
Jeong-Seok
K.
,
Woo
N. B.
&
Joo-Sung
K.
(
2024
)
ANN-based prediction models for green water events around a FPSO in irregular waves
,
Ocean Engineering
,
291
,
116408
.
Lai
C. G.
,
Chen
X. H.
,
Zhao
S. W.
,
Wang
Z. L.
&
Wu
X. S.
(
2015
)
A flood risk assessment model based on random forest and its application
,
Journal of Hydraulic Engineering
,
46
(
01
),
58
66
.
(in chinese)
.
Lana
K.
,
Nejka
P.
&
Helena
L.
(
2024
)
Evaluating transient phenomena by wavelet analysis: Early recovery to exercise
,
American Journal of Physiology. Heart and Circulatory Physiology
. 326 (1), H96–H102.
Li
H. Y.
,
Liu
H. B.
,
Yuan
X. M.
&
Liu
S. K.
(
2002
)
Peak value recognition theory of artificial neural network and its application to flood forecasting
,
Journal of Hydraulic Engineering
,
06
,
15
20
.
(in chinese)
.
Li
L. J.
,
Wang
Y. T.
,
Hu
Q. F.
,
Liu
D. Z.
,
Zhang
A. F.
&
Ba
Y. Q.
(
2020
)
Long-term inflow forecast of reservoir based on random forest and support vector machine
,
Hydro-Science and Engineering
,
04
,
33
40
.
(in chinese)
.
Lin
J. Y.
&
Cheng
C. T.
(
2006
)
Application of support vector machine method to long-term runoff forecast
,
Journal of Hydraulic Engineering
,
06
,
681
686
.
(in chinese)
.
Liu
Y.
&
Wang
L. H.
(
2022
)
Flood forecasting method based on an improved EMD-LSTM
,
Water Resources and Hydropower Engineering(Chinese and English)
,
53
(
01
),
35
44
.
(in Chinese)
.
Masoud
K.
,
Mehdi
J.
,
Mumtaz
A.
,
Anurag
M.
,
Chu
X. F.
,
Aitazaz
A. F.
&
Zaher
M. Y.
(
2023
)
Development of an enhanced bidirectional recurrent neural network combined with time-varying filter-based empirical mode decomposition to forecast weekly reference evapotranspiration
,
Agricultural Water Management
,
290
,
108604
.
Mou
L. Q.
,
Tian
F. Q.
&
Hu
H. P.
(
2009
)
Artificial neural network model of runoff prediction in high and cold mountainous regions: A case study in the source drainage area of Urumqi River
,
Journal of Hydroelectric Engineering
,
28
(
01
),
62
67
.
(in Chinese)
.
Qi
J. X.
,
Su
X. L.
,
Zhang
G. X.
&
Zhang
T.
(
2022
)
Research on monthly runoff prediction of VMD-LSTM model in different forecast periods
,
Agricultural Research in The Arid Areas
,
40
(
06
),
258
267
.
(in chinese)
.
Rossi
M. G.
,
Bezerra
d. S. L. C.
,
Vieira
S. M. E.
,
Roberto
d. O. R. H.
&
Schmidt
C. C. E.
(
2024
)
Simultaneous temporal and spectral evolution of ultrafast optical pulse propagation using a single bidirectional LSTM network
,
Optical Fiber Technology
,
82
,
103636
.
Sarker
S.
,
Veremyev
A.
,
Boginski
V.
&
Singh
A.
(
2019
)
Critical nodes in river networks
,
Scientific Reports
,
9
(
1
),
11178
.
Shang
J. C.
,
Liu
X. Q.
,
Zhang
Y. C.
,
Liu
Y. Q.
&
Du
J.
(
1995
)
Forecast of adaptive streamflow using multilayer neural networks
,
Water Resource And Power
,
01
,
52
56
.
(in Chinese)
.
Singhal
A.
,
Jaseem
M.
,
Divya
,
Sarker
S.
,
Prajapati
P.
,
Singh
A.
&
Jha
S. K.
(
2024
)
Identifying potential locations of hydrologic monitoring stations based on topographical and hydrological information
,
Water Resources Management
,
38
(
1
),
369
384
.
Swastik
B.
,
Balbhadra
T.
,
Ajay
K.
,
William
P. M.
,
Venkat
L.
&
Pratik
P.
(
2019
)
Streamflow forecasting using singular value decomposition and support vector machine for the Upper Rio Grande River Basin
,
JAWRA Journal of the American Water Resources Association
,
55
(
3
),
680
699
.
Tayyab
M.
,
Zhou
J.
,
Dong
X.
,
Ahmad
I.
&
Sun
N.
(
2019
)
Rainfall-runoff modeling at Jinsha River basin by integrated neural network with discrete wavelet transform
,
Meteorology and Atmospheric Physics
,
131
(
1
),
115
125
.
Vapnik
V. N.
(
1995
)
The Nature of Statistical Learning Theory
.
New York, NY
:
Springer
.
Wang
J. L.
,
Wu
J. S.
,
Sun
J. S.
&
Qi
X. B.
(
2003
)
Application of support vector machine method in prediction of groundwater level
,
Journal of Hydraulic Engineering
,
05
,
122
128
.
(in chinese)
.
Wang
W.-C.
,
Chau
K.-W.
,
Cheng
C.-T.
&
Qiu
L.
(
2009
)
A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series
,
Journal of Hydrology
,
374
(
3
),
294
306
.
Wang
X. L.
,
Li
K.
,
Zhang
Z. L.
,
Kong
L. X.
&
Chen
W. L.
(
2022
)
Coupled ALO-LSTM and feature attention mechanism prediction model for seepage pressure of earth-rock dam
,
Journal of Hydraulic Engineering
,
53
(
04
),
403
412
.
(in chinese)
.
Yin
Z. K.
,
Liao
W. H.
,
Wang
R. J.
&
Lei
X. H.
(
2019
)
Rainfall-runoff modelling and forecasting based on long short-term memory (LSTM)
,
South-to-North Water Transfers and Water Science & Technology
,
17
(
06
),
1
9 + 27
.
(in chinese)
.
Zhu
X. M.
,
Lu
C. N.
,
Wang
R. Y.
&
Bai
J. Y.
(
2005
)
Artificial neural network model for flood water level forecasting
,
Journal of Hydraulic Engineering
,
07
,
806
811
.
(in chinese)
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).