River stage prediction is indispensably a challenging task in flood-prone river basins to disseminate accurate early warning in advance. In this study, multivariate wavelet-based long short-term memory (WLSTM) models have been developed to predict river stage at six gauging stations of the Teesta River basin in India for 1, 3, and 5-day lead time, the comparison of which has been done with long short-term memory (LSTM) models. Various combinations of wavelet decomposed components were utilized to form different sub-series that were fed as input in WLSTM models. In terms of statistical indicators, both the models yielded exceptionally good results, but the root-mean-square error values of the WLSTM model for 1- and 3-day lead time were minimal compared to the LSTM model. However, the accuracy of the LSTM model in longer lead time prediction is noticeable. Specifically, the WLSTM model predicted the peak stage values more precisely compared to the LSTM model, indicating the potential of wavelet analysis to capture the variations and periodicities of the data by removing the noise. Though the WLSTM model marginally outperformed the LSTM model in prediction accuracy, the results highlight both models as feasible alternatives for longer lead time water level prediction.

  • River stage data decomposed with discrete wavelet transform are utilized in the long short-term memory (LSTM) model to develop a hybrid wavelet-based long short-term memory (WLSTM) model.

  • The WLSTM model enhanced the accuracy in peak stage prediction compared to the LSTM model.

  • The utilization of multivariate inputs in LSTM and WLSTM models highlights the influence of upstream stage values in downstream stage predictions.

  • Adjustment of hyperparameters in models produces better results.

Flood forecasting is an indispensable tool devised to predict water level or flow in advance and disseminate this information in the form of a warning. Early warnings are intended to help the people or local bodies make prompt decisions and relevant actions for evacuation and relocation (Chau et al. 2005). Flood forecasting is a part of flood management planning and development devised to withstand the foreseen disaster through competency. Flood forecasting models are utilized in predicting water level or river stage to provide prior intimation about the extreme event (Latt & Wittenberg 2014). Stage is defined as the level of water in the river measured from a specific datum (Subramanya 2021). People can easily infer the stage prediction results as the concept of rising water levels is convenient to understand. Over the years, physical models utilized for prediction involved intensive usage of various data and mathematical equations to describe the underlying hydrological processes (Thirumalaiah & Deo 2000). Sometimes these physical models face some significant issues in producing accurate and reliable flood forecasting. Moreover, the requirement of abundant data by these models makes it difficult to apply when data is inadequate (Tiwari & Chatterjee 2010). In this context, data-driven models have gained immense acceptability in the field of flood forecasting for their capability to capture the input–output relationship within limited data (Kasiviswanathan et al. 2016). Therefore, many researchers pondered the application of different popular machine learning (ML) models (Hadi et al. 2024) such as artificial neural network (ANN) (Liong et al. 2000; Campolo et al. 2003), genetic algorithm-based ANN (ANN-GA), and neuro-fuzzy (ANFIS) (Chau et al. 2005) for stage forecasting.

Despite several advantages, sometimes some standalone ML models fail to achieve accuracy for longer lead time predictions (Tiwari & Chatterjee 2010; Kasiviswanathan et al. 2016). Linh et al. (2021) concluded that the performance of the wavelet neural network hybrid ML model outperformed the traditional ANN and multiple linear regression models for predicting maximum monthly discharge by incorporating two climatic signals as input. Wavelet analysis exhibits a time–frequency representation of the time series data, thus providing extensive statistics about the data pattern (Daubechies 1990). The intrinsic characteristics of wavelet transform to decompose the non-stationary signal into various components of different resolution levels aid in improving the model predictability by capturing the periodicity and trend of the time series signal. Wavelet transform is extensively applied in various fields of hydrology, such as water level prediction, streamflow forecasting, rainfall–runoff, and water quality prediction (Nourani et al. 2008; Guimarães Santos & da Silva 2014; Seo et al. 2015; Barzegar et al. 2016). All these studies reveal that the wavelet-based ANN/ANFIS model has increased the correlation with the actual data, improving accuracy compared to the single ANN/ANFIS model for longer lead time prediction.

In recent times, among the deep learning techniques, the long short-term memory (LSTM) model has become immensely popular in prediction studies. Hochreiter & Schmidhuber (1997) introduced the LSTM model by improvising a recurrent neural network (RNN) to overcome the limitations of vanishing and exploding gradients. The advantage of LSTM lies in its capability to tackle long-term dependencies by creating a linkage between the sequential time series data. Apart from various interdisciplinary fields, the LSTM model has found its application in prediction of hydrological issues, especially rainfall–runoff (Yin et al. 2021), modelling reservoir operation (Zhang et al. 2018), rainfall and monthly streamflow (Ni et al. 2020; Dalkilic et al. 2023), evapotranspiration (Wang et al. 2023), and many more for better accuracy in longer lead time prediction. With the application of proper feature engineering and fine-tuning of the hyperparameters, the multivariate LSTM model yielded satisfactory performance for 1 week ahead discharge prediction of the Central Delaware River in New Jersey using seven stream-water variables (Khosravi et al. 2023). Similarly, the LSTM model has outperformed the nonlinear autoregressive with exogenous inputs for multi-step ahead prediction of river flow in the Kelantan River in Malaysia (Hayder et al. 2022). Furthermore, Faruq et al. (2020) employed the LSTM model as a deep learning technique to predict the water level of the Klang River basin in Malaysia for the next few hours and obtained almost accurate results when compared with the observed values.

As inferred from the aforementioned studies, deep neural networks (DNNs) are utilized to obtain higher accuracy for longer lead time prediction irrespective of hydrological challenges and geographical locations. Presently, the application of hybrid models like variational mode decomposition-based genetic algorithm Elman neural network model (Xing et al. 2022), adaptive step size cuckoo search algorithm-based LSTM and self-attention mechanism (ASCS-LSTM-ATT) model (Li et al. 2020), and particle swarm optimization-LSTM model (Ruma et al. 2023), and many more have gained immense attention, mainly for enhancing model performance. In this process, either the input data are processed with various data pre-processing methods or the parameters are optimized with some algorithms. The prime objective is to achieve better results in a longer lead time prediction, which is especially essential for anticipating a flood situation. Despite solving many hydrological issues, there is an evident gap in the literature addressing the river water level prediction utilizing the upstream water levels as inputs with DNNs. There are many river basins with gauging stations distributed across the mountainous and plain regions such as the Teesta River basin in India. Although the stations within this basin are monitored by the Central Water Commission, prediction studies of river water level or stage in this basin are not properly carried out. It is indispensable to predict the water levels in advance to prevent substantial loss as the Teesta River basin is a flood-prone basin experiencing sudden floods almost every year.

This research attempts to address the issue of the Teesta River basin by explaining the potentiality of wavelet analysis coupled with deep learning techniques and providing insight into the most researched topic of recent times. Influenced by the robustness and higher efficiency of the DNNs, multivariate multi-step LSTM models are developed to predict the daily river stage at six different gauging stations for 1-day, 3-day, and 5-day lead times. Finally, a hybrid deep learning model, a wavelet-based long short-term memory (WLSTM) model with multivariate input for multi-step prediction, is developed individually for all the stations. The uniqueness of this study lies in integrating wavelet transform with a deep learning algorithm, LSTM, for the prediction of the river water level of the Teesta River basin, which explains some new possibilities of leveraging deep learning approaches in flood forecasting.

Study area and data used

The Teesta River Basin lying within India is chosen for the present study. Geographically the area extends between 26°30′N to 28°0′N latitude and 88°0′E to 89°0′E longitude covering an area of 9786 km2 (Figure 1). Originating from a glacier, Teesta Khangse (Pahunri) in North Sikkim, the river flows through the hilly terrain of Sikkim and gradually encounters a mix of low hills and plains towards downstream. Finally, the river merges with the Brahmaputra River in Bangladesh after bifurcating from Jalpaiguri, just touching Mekhliganj in the Cooch Behar district. The entire river length is 309 km, 103 km drains through Sikkim, and 121 km flows through West Bengal. Under the ‘Teesta Barrage Project,’ Gajoldoba barrage has been constructed on the Teesta River in Jalpaiguri district. Glacial lakes and alpine vegetation are predominant at upper elevations, whereas tropical deciduous trees enwrap the low hills. The basin experiences incessant rainfall during the monsoon, often leading to tremulous landslides at higher altitudes and swollen riverbanks at plains. The average annual rainfall varies from 2,000 to 5,000 mm (Goyal & Goswami 2018). Very often flood events occur in the basin. The basin has an incredible flood history; some of the catastrophic flood events occurred in 1950, 1968, 1973, 1975, 1976, 1978, 1993, 1996, 2000, 2003, 2015, and 2017. For instance, in 2015, due to the sudden cloudburst in Sikkim and Darjeeling, an abrupt increase in water level led to the release of water from Gajoldoba barrage, which inundated Jalpaiguri and Maynaguri. More than 20,000 people were stranded, with 300 hectares of agricultural land submerged (Pal et al. 2016). Again in 2017, as reported by Ghosh & Ghosal (2021), heavy rainfall for a week triggered landslides, submergence of railway tracks, harmed numerous lives, and destruction of hectares of cultivable lands. But, the 1968 flood is the most remarkable and disastrous due to sudden cloudbursts and heavy landslides, which demolished numerous bridges and took an unprecedented death toll.
Figure 1

Study area map.

There are six gauging stations present within the study area (Figure 1) maintained by the Central Water Commission (CWC), New Delhi. These are Sankalan (North Sikkim), Khanitar (East Sikkim), Teesta Bazaar (West Bengal), Coronation (WB), Domohani (WB), and Mekhliganj (WB), chronologically from upstream to downstream. The stage data at Teesta Bazaar, Coronation, Domohani, and Mekhliganj gauging stations are characterized by the West Bengal Irrigation Department into three significant levels, i.e., preliminary-danger level (PDL), danger level (DL), and extreme danger level (EDL). Table 1 represents the PDL, DL, and EDL for Teesta Bazaar, Coronation, Domohani, and Mekhliganj. Daily stage data available during the monsoon season (1 May to 31 October) from 2006 to 2017 (2208 sets) were divided into two datasets. Daily stage data for year 2006–2014 (1656 patterns) were randomly divided for training (70%) and k-fold cross-validation (30%) and 2015–2017 (552 patterns) for testing. Some of the statistical properties of daily stage data are illustrated in Table 2.

Table 1

PDL, DL, and EDL for Teesta Bazaar, coronation, Domohani, and Mekhliganj

Gauging stationsPDL (m)DL (m)EDL (m)
Teesta Bazaar 210.40 211.00 213.00 
Coronation 149.40 150.00 153.60 
Domohani 85.60 85.95 86.30 
Mekhliganj 65.35 65.95 66.30 
Gauging stationsPDL (m)DL (m)EDL (m)
Teesta Bazaar 210.40 211.00 213.00 
Coronation 149.40 150.00 153.60 
Domohani 85.60 85.95 86.30 
Mekhliganj 65.35 65.95 66.30 
Table 2

Statistical properties of daily stage data of six gauging stations (1 May to 31 October) for the years 2006 to 2017

Gauging stationsSankalanKhanitarTeesta BazaarCoronationDomohaniMekhliganj
Datasets from 2006 to 2014 
Maximum (m) 760.85 295.32 210.00 150.50 89.13 65.84 
Minimum (m) 752.20 290.32 201.50 142.10 82.34 63.38 
Mean (m) 755.75 292.57 204.70 145.30 85.26 64.73 
Standard deviation (m) 1.57 0.74 1.73 1.32 0.49 0.44 
Skewness −0.11 −0.04 0.16 0.32 −0.26 −0.38 
Kurtosis −0.46 0.35 −0.69 −0.10 2.68 −0.27 
Datasets from 2015 to 2017 
Maximum (m) 758.11 293.28 210.34 146.61 85.85 65.61 
Minimum (m) 753.22 290.39 205.01 141.22 83.46 62.81 
Mean (m) 755.48 291.65 207.59 144.09 84.91 64.33 
Standard deviation (m) 1.27 0.66 1.34 1.17 0.56 0.58 
Skewness 0.22 −0.03 −0.04 −0.27 −0.75 −0.16 
Kurtosis −0.99 −0.86 −0.81 −0.51 −0.05 −0.42 
Gauging stationsSankalanKhanitarTeesta BazaarCoronationDomohaniMekhliganj
Datasets from 2006 to 2014 
Maximum (m) 760.85 295.32 210.00 150.50 89.13 65.84 
Minimum (m) 752.20 290.32 201.50 142.10 82.34 63.38 
Mean (m) 755.75 292.57 204.70 145.30 85.26 64.73 
Standard deviation (m) 1.57 0.74 1.73 1.32 0.49 0.44 
Skewness −0.11 −0.04 0.16 0.32 −0.26 −0.38 
Kurtosis −0.46 0.35 −0.69 −0.10 2.68 −0.27 
Datasets from 2015 to 2017 
Maximum (m) 758.11 293.28 210.34 146.61 85.85 65.61 
Minimum (m) 753.22 290.39 205.01 141.22 83.46 62.81 
Mean (m) 755.48 291.65 207.59 144.09 84.91 64.33 
Standard deviation (m) 1.27 0.66 1.34 1.17 0.56 0.58 
Skewness 0.22 −0.03 −0.04 −0.27 −0.75 −0.16 
Kurtosis −0.99 −0.86 −0.81 −0.51 −0.05 −0.42 

Long short-term memory model

LSTM, a recent breakthrough in the RNN model, introduced by Hochreiter & Schmidhuber (1997) is successfully applied in various fields like language translation, speech recognition, image captioning, and time series problems. The architecture of LSTM comprises memory cells operated through several gates to overcome the long-term dependency issues of conventional RNNs. The basic framework of the LSTM network is shown in Figure 2. It consists of two states, a hidden state known as short-term memory and a cell state known as long-term memory. The cell state facilitates transmitting the information to the next hidden units by filtering some of the information with the aid of structures called ‘gates’. The forget gate efficiently discards the unnecessary information, the input gate stores the information to update the cell state or inhibits further transmission of information, and the output gate controls whether the information of the memory cell is to be transmitted or retained to pass to the output from the memory cell. This functionality of the ‘gates’ structure to preserve or discard some part of the original information and consider some new information enables the LSTM network to learn dependency among the variables over the period while overcoming the issues of diminishing and exploding gradient.
Figure 2

Structure of a long short-term memory cell, where xt, ht, and Ct are the input, hidden, and current states, respectively.

Figure 2

Structure of a long short-term memory cell, where xt, ht, and Ct are the input, hidden, and current states, respectively.

Close modal

Wavelet analysis

Grossman & Morlet (1984) introduced wavelet analysis as an alternative to Fourier transformation for filtering time and frequency information from the original data. Moreover, the major disadvantage of the Fourier transform is that it lacks time information having only frequency resolution and is designed for stationary signals. Of late, wavelet analysis has been popularized in hydrological time series forecasting as it is efficient in treating the non-stationary variance of hydrological methods (Nourani et al. 2014; Sang et al. 2015). The notion behind these joint time–frequency representations is due to specific interest in a particular spectral component occurring at any instant in time. Wavelet functions can decompose non-stationary time series data into various components at distinct resolution levels. Wavelet analysis uses a fully scalable modulated window that is moved along the signal to obtain the spectral components at every position. This procedure is repeated a few times with a slightly shorter or longer window for every new cycle, resulting in a collection of time–frequency representations of the signal at different resolutions. This approach is known as multiresolution analysis. The original signal is decomposed in wavelet transform through two main parameters, scaling and shifting. There are two types of wavelet transform: continuous wavelet transform (CWT) and discrete wavelet transform (DWT). The CWT of time series f(t) is defined by Equation (1) as follows:
(1)
where is the wavelet coefficient, ‘*’ denotes complex conjugate, and is called the mother wavelet function. The wavelet function operates using the two parameters: dilation (scale) and translation (position) parameters denoted by ‘a’ and ‘b’. CWT calculation requires more computational time and generates a huge quantity of data for each parameter ‘a’ and ‘b’. This issue is resolved by selecting scales and positions based on the power of two termed dyadic scales and positions. Therefore, DWT utilizes dyadic sampling to reduce the amount of data producing a substantially faster algorithm. DWT adopted in this study is expressed by Equation (2) (Mallat 1989):
(2)
where m and n are the integers that adjust wavelet dilation and translation, respectively; is the fixed dilation step greater than one; and is the location parameter, which should be greater than zero. The most general and simplest choice for parameters are and (Nourani et al. 2009). The two functions on which DWT operates are scaling (low-pass filters) and translation (high-pass filters) functions. The DWT analyses the original signal by passing through high-pass and low-pass filters to disintegrate the signal into detailed information and coarse approximation. The approximations are the low-frequency components representing the trend of the signal, and the details are high-frequency components unveiling the rapidly changing features in the data series.

Model performance evaluation

Statistical indicators are required to evaluate the goodness of fit between observed and predicted values. In this study, the coefficient of determination (R2), Nash–Sutcliffe efficiency (NSE), and root-mean-square error (RMSE) are the statistical indicators utilized to assess the model accuracy for both the training as well as the testing phase. During training, the efficiency of the model is determined by comparing the model-generated output with a set of target data given to it. While testing the best model, the correlation between the actual values and predicted values for a new dataset is assessed through these statistical indicators. Based on specific values of these indicators, various performance ratings are assigned to the models illustrated in Table 3. The statistical indicators used in this study are expressed by Equations (3)–(5):

  • (i) Coefficient of determination (R2) – It is expressed as follows:
    (3)
  • (ii) NSE (Nash & Sutcliffe 1970) – It is expressed as follows:
    (4)
  • (iii) RMSE – It is expressed as follows:
    (5)
    where and are observed and predicted flows at time i, respectively; and are the average observed and predicted flows, respectively, and n is the number of data points.
Table 3

General performance rating of the statistical indicators (Moriasi et al. 2007)

Sl No.Performance ratingR2NSE
Very good 0.75–1 0.75–1 
Good 0.65–0.75 0.65–0.75 
Satisfactory 0.50–0.65 0.50–0.65 
Unsatisfactory <0.50 <0.50 
Sl No.Performance ratingR2NSE
Very good 0.75–1 0.75–1 
Good 0.65–0.75 0.65–0.75 
Satisfactory 0.50–0.65 0.50–0.65 
Unsatisfactory <0.50 <0.50 

Table 3 illustrates the performance rating for R2 and NSE values. For RMSE, the lesser the RMSE values, the better the performance (Latt & Wittenberg 2014).

Procedure for developing LSTM model

In this study, LSTM models were developed for each gauging station for predicting river stage for 1-, 3-, and 5-day ahead. Specifically, two types of LSTM models were developed: univariate multi-step LSTM model for Sankalan and multivariate multi-step LSTM model for Khanitar, Teesta Bazaar, Coronation, Domohani, and Mekhliganj. In the case of Sankalan, a stacked LSTM model with stage data of Sankalan as its input data was developed, whereas for other stations (Khanitar to Mekhliganj), a stacked LSTM model with multivariate inputs (stage data of the upstream station(s) and target station) was developed for the predicting stage. For multivariate multi-step models, it was found by trial that stage data of two upstream stations along with the stage data of the target station influenced most on the successive lead days prediction yielding better results. The available river stage data from the year 2006 to 2017 for all the gauging stations were divided into two parts, training (2006–2014) and testing (2015–2017). Again, the river stage data from 2006 to 2014 have been randomly divided into two portions, training (70%) and cross-validation (30%). K-fold cross-validation (k = 10) was applied to assess the accuracy prediction and generalization capability for an independent dataset, preventing the overfitting of the model. All the input stage data were normalized to range from 0 to 1 using min–max normalization given by Equation (6) prior to its utilization in the LSTM model. Normalization of input data is an essential task that prevents inconsistent weightage to different input variables by scaling the variables within the same range to treat them as variables alike (Yadav et al. 2018; Han & Morrison 2021).
(6)
where is the normalized value of X, X is the set of observed values, is the minimum value of X, and is the maximum value in X.
As required by the LSTM model, it is essential to rearrange the input data in the form of a three-dimensional (3D) tensor, which includes the number of samples, number of timesteps, and number of features. The number of samples resembles the total number of rows in the training data, and the number of features is the number of variables utilized in the model. The number of timesteps or lags are the past data points in a sequence under consideration. The sequential learning process of the LSTM model is achieved through the lags in the input layer, which aid in realizing the trend of the variable. Therefore, the selection of optimal timesteps is also vital along with the selection of hyperparameters such as learning rate, batch size, and epoch. To reduce the loss function (mean squared error in this case), the LSTM model is trained with Adam optimizer and rectified linear units as the activation function using the Keras library in Python. Initiating with varying the timesteps from 3, 4, and 5, various combinations of batch size (32, 64, and 128) were experimented. Depending on the performance of the LSTM models, a batch size of 32 with 3 timesteps was finalized. Furthermore, it was experimented with by varying the epochs (500, 600, 700, 800, 900, 1,000, 1,500), learning rates (0.001, 0.0001), and hidden neurons (30–100 with increments of 10). Initially, the model was trained with one hidden layer, and subsequently, another layer was added to improve the performance of the model. The number of epochs and hidden neurons varied for different LSTM models of different stations, but a learning rate of 0.0001 produced the best results. After evaluating the performance of the trained model and finalizing the optimal parameters through trial and error, the efficiency of the model was checked with the independent testing dataset of each station. The hyperparameters, especially the epochs and the hidden neurons of each layer, were different in the case of different LSTM models for six gauging stations. Figure 3 explains the methodology of the LSTM model development.
Figure 3

Methodology adopted for developing LSTM model.

Figure 3

Methodology adopted for developing LSTM model.

Close modal

Procedure for developing wavelet-based LSTM model

To generate the WLSTM models for six gauging stations, the original input data series of each station was decomposed into different components (DWCs) at various scales using DWT. The original time series data were decomposed into low frequencies (approximation) and high frequencies (details) by passing through low-pass and high-pass filters. The mother wavelet function was selected from the Daubechies family (db8) for this study. Equation (7) devised by Nourani et al. (2008) to identify the specific number of decomposition levels of the time series with DWT is utilized as follows:
(7)
where L and N are decomposition level and number of time series data, respectively.

In the present study, the original data series of the river stage was decomposed up to three levels (L = 3 approximately), producing the DWCs (D1, D2, D3, and A3). The detail components were extracted in such a way from the original series so that various combinations of approximation components can be used to develop the sub-series. Therefore, the new sub-series was formed by various combinations of approximation components. As a result, at least two or more approximation components were combined to form four different sub-series shown in Table 4. This procedure of forming four different sub-series was conducted for each gauging station, i.e., Sankalan, Khanitar, Teesta Bazaar, Coronation, Domohani, and Mekhliganj. The sub-series was fed as input in the LSTM model to form individual WLSTM models from each sub-series. Thus, four WLSTM models for each gauging station were prepared among which the best model was selected based on its performance.

Table 4

Combination of wavelet components to form sub-series

Sub-seriesInput combinationsIn terms of Q
I = A1+ A2 Q – D1 – D2 
I = A1+ A3 Q – D1 – D2 – D3 
I = A2+ A3 Q – D1 – D2 – D3 
I = A1+ A2 + A3 Q – D1 – D2 – D3 
Sub-seriesInput combinationsIn terms of Q
I = A1+ A2 Q – D1 – D2 
I = A1+ A3 Q – D1 – D2 – D3 
I = A2+ A3 Q – D1 – D2 – D3 
I = A1+ A2 + A3 Q – D1 – D2 – D3 

First, for Sankalan, each sub-series of Sankalan was taken, and they lagged up to three timesteps to form four individual WLSTM models. Then, the accuracy of these four WLSTM models was assessed to select the best-performing series. Subsequently, for stations that utilized multivariate inputs (Khanitar to Mekhliganj), the best-performing sub-series of the upstream stations were included as input along with the sub-series of the target station. The four sub-series generated for each gauging station were utilized as the input of the LSTM model to develop four types of WLSTM models for each station. Figure 4 explains the methodology of the WLSTM model adopted for this study. For the WLSTM model, the process of developing the LSTM model including the hyperparameter tuning is similar to the LSTM models as described in the earlier section. The batch size, timesteps, and learning rate are the same as LSTM models, but the epochs and hidden neurons vary according to the best model performance.
Figure 4

Methodology adopted for developing WLSTM models.

Figure 4

Methodology adopted for developing WLSTM models.

Close modal

Prediction of daily river stage with LSTM models

This section discusses the results of river stage prediction utilizing two different types of LSTM models for various lead times. With the optimal hyperparameters, both the univariate multi-step LSTM model for Sankalan and the multivariate multi-step LSTM model for Khanitar to Mekhliganj gauging stations yielded satisfactory results in terms of the statistical indicators (R2, NSE, and RMSE), which is presented in Table 5. As articulated by Table 5, the performance of the model reduced with the increase in lead time, especially the RMSE values for Sankalan, Teesta Bazaar, and Coronation are higher in the case of 5-day lead time. For 1-day lead, the predicted values hold a good agreement with the observed values for all the stations as reflected from the R2 and NSE values, which range from 0.9516 to 0.9068 and 0.9510 to 0.8978, respectively. In addition, the lower RMSE values ranging from 0.1645 to 0.3411 m support the congruency between the observed and predicted values. From the results of the statistical indicators (Table 5), it can be concluded that the LSTM models for Domohani and Mekhliganj gauging stations performed better than other gauging stations. The potentiality of the LSTM networks to learn from previous timesteps proves advantageous, which is effortlessly reflected in the predictions. With the testing data, the LSTM models perform much better with lesser errors for all the gauging stations. Even for 3-day and 5-day lead time, the LSTM models produced lesser error, especially for Domohani and Mekhliganj with RMSE values ranging from 0.2018 to 0.2545 m. It is evident from Figure 5 that the peak river stage was not exactly predicted for all the lead times, and underestimation is noticeable. However, for low and medium water levels, overestimation is visible. As the LSTM networks can retain important information from the previous timesteps, the predicted result tried to maintain the trend of the original river stage without any pre-processing of data.
Table 5

Results of the performance indicators during the training and testing phase for LSTM model using river stage at various lead time

1 day
Gauging stationsTraining
Testing
R2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9321 0.9319 0.3221 0.9516 0.951 0.2791 
Khanitar 0.8954 0.8951 0.3011 0.9068 0.8978 0.2099 
Teesta Bazaar 0.9113 0.9101 0.3976 0.9493 0.9357 0.3411 
Coronation 0.9228 0.9221 0.31 0.9421 0.9414 0.2807 
Domohani 0.8862 0.879 0.2876 0.9107 0.9095 0.1673 
Mekhliganj 0.8913 0.8902 0.2421 0.9213 0.9202 0.1645 
 3 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.891 0.8873 0.4676 0.8946 0.8938 0.4109 
Khanitar 0.7956 0.7548 0.3854 0.8104 0.7895 0.3009 
Teesta Bazaar 0.8434 0.8219 0.5618 0.886 0.8423 0.5335 
Coronation 0.8207 0.8043 0.5234 0.8474 0.8385 0.4674 
Domohani 0.8289 0.8242 0.279 0.847 0.8389 0.2236 
Mekhliganj 0.8567 0.8521 0.2355 0.8815 0.8797 0.2018 
5 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.7577 0.7431 0.6221 0.794 0.7927 0.5752 
Khanitar 0.7225 0.7065 0.3839 0.7599 0.7179 0.3474 
Teesta Bazaar 0.8293 0.7536 0.6409 0.8726 0.7937 0.6103 
Coronation 0.8012 0.7852 0.5971 0.833 0.7705 0.5596 
Domohani 0.7798 0.7402 0.3054 0.8069 0.7927 0.2545 
Mekhliganj 0.8364 0.8275 0.2738 0.866 0.8615 0.2165 
1 day
Gauging stationsTraining
Testing
R2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9321 0.9319 0.3221 0.9516 0.951 0.2791 
Khanitar 0.8954 0.8951 0.3011 0.9068 0.8978 0.2099 
Teesta Bazaar 0.9113 0.9101 0.3976 0.9493 0.9357 0.3411 
Coronation 0.9228 0.9221 0.31 0.9421 0.9414 0.2807 
Domohani 0.8862 0.879 0.2876 0.9107 0.9095 0.1673 
Mekhliganj 0.8913 0.8902 0.2421 0.9213 0.9202 0.1645 
 3 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.891 0.8873 0.4676 0.8946 0.8938 0.4109 
Khanitar 0.7956 0.7548 0.3854 0.8104 0.7895 0.3009 
Teesta Bazaar 0.8434 0.8219 0.5618 0.886 0.8423 0.5335 
Coronation 0.8207 0.8043 0.5234 0.8474 0.8385 0.4674 
Domohani 0.8289 0.8242 0.279 0.847 0.8389 0.2236 
Mekhliganj 0.8567 0.8521 0.2355 0.8815 0.8797 0.2018 
5 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.7577 0.7431 0.6221 0.794 0.7927 0.5752 
Khanitar 0.7225 0.7065 0.3839 0.7599 0.7179 0.3474 
Teesta Bazaar 0.8293 0.7536 0.6409 0.8726 0.7937 0.6103 
Coronation 0.8012 0.7852 0.5971 0.833 0.7705 0.5596 
Domohani 0.7798 0.7402 0.3054 0.8069 0.7927 0.2545 
Mekhliganj 0.8364 0.8275 0.2738 0.866 0.8615 0.2165 
Figure 5

Predicted river stage for 1-day, 3-day, and 5-day lead time using LSTM model at (a) Sankalan, (b) Khanitar, (c) Teesta Bazaar, (d) Coronation, (e) Domohani, and (f) Mekhliganj.

Figure 5

Predicted river stage for 1-day, 3-day, and 5-day lead time using LSTM model at (a) Sankalan, (b) Khanitar, (c) Teesta Bazaar, (d) Coronation, (e) Domohani, and (f) Mekhliganj.

Close modal

Prediction of daily river stage with WLSTM models

To extrude the benefit of wavelet transform in prediction results, decomposed wavelet components are utilized as input while training the LSTM models. The coupling of pre-processing technique with the LSTM model has improved the results over the traditional LSTM models irrespective of gauging stations. The results of the statistical indicators are presented in Table 6. As interpreted by Table 6, the R2 and NSE values are extremely satisfactory, ranging from 0.9957 to 0.9880 and 0.9956 to 0.9866 for 1 day, 0.9608 to 0.9021 and 0.9558 to 0.8987 for 3 days, and 0.9053 to 0.7768 and 0.9015 to 0.7631 for 5 days. A minimal error difference is noticed between the predicted river stage values by WLSTM models and observed values for all the gauging stations, which is depicted in Figure 6. This observation is further substantiated by the RMSE values (Table 6), which range from 0.0369 to 0.1339 m for 1 day, 0.1224 to 0.3386 m for 3 days, and 0.1754 to 0.5702 m for 5 days. For Sankalan, the prediction of peak river stage is better than other stations for all the lead times, but the lower water levels are overestimated with the increase in lead time. For 1-day lead, Khanitar and Teesta Bazaar also predicted the peak river stage proficiently. The predicted river stages captured the trend of the observed stage data in the case of all lead times. Figure 6 elucidates the intrinsic property of the wavelet to understand the trend and periodicity of the original values and replicate it in prediction results.
Table 6

Results of the performance indicators during the training and testing phase for WLSTM model using river stage at various lead time

1 day
Gauging stationsTraining
Testing
R2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9823 0.9812 0.1142 0.9941 0.9941 0.0971 
Khanitar 0.9771 0.9742 0.121 0.988 0.9866 0.076 
Teesta Bazaar 0.9798 0.9778 0.1591 0.99 0.9893 0.1339 
Coronation 0.9882 0.9721 0.1327 0.995 0.9944 0.0867 
Domohani 0.9642 0.9611 0.121 0.9957 0.9956 0.0369 
Mekhliganj 0.9769 0.9684 0.1224 0.9943 0.9916 0.0535 
3 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9391 0.9337 0.3321 0.9515 0.9515 0.2776 
Khanitar 0.8723 0.8628 0.2772 0.9021 0.8987 0.2087 
Teesta Bazaar 0.9348 0.9253 0.3907 0.9499 0.9318 0.3386 
Coronation 0.9056 0.8967 0.411 0.9525 0.9415 0.2812 
Domohani 0.9273 0.9045 0.2564 0.9511 0.9491 0.1256 
Mekhliganj 0.9433 0.9386 0.1726 0.9608 0.9558 0.1224 
5 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.8604 0.8575 0.5677 0.8616 0.8609 0.4711 
Khanitar 0.7671 0.7421 0.4753 0.7768 0.7631 0.3184 
Teesta Bazaar 0.8744 0.8439 0.6679 0.8924 0.8083 0.5702 
Coronation 0.8045 0.7787 0.6329 0.8202 0.7866 0.5396 
Domohani 0.8773 0.8691 0.2691 0.9053 0.9015 0.1754 
Mekhliganj 0.8548 0.8472 0.2194 0.902 0.8967 0.1871 
1 day
Gauging stationsTraining
Testing
R2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9823 0.9812 0.1142 0.9941 0.9941 0.0971 
Khanitar 0.9771 0.9742 0.121 0.988 0.9866 0.076 
Teesta Bazaar 0.9798 0.9778 0.1591 0.99 0.9893 0.1339 
Coronation 0.9882 0.9721 0.1327 0.995 0.9944 0.0867 
Domohani 0.9642 0.9611 0.121 0.9957 0.9956 0.0369 
Mekhliganj 0.9769 0.9684 0.1224 0.9943 0.9916 0.0535 
3 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.9391 0.9337 0.3321 0.9515 0.9515 0.2776 
Khanitar 0.8723 0.8628 0.2772 0.9021 0.8987 0.2087 
Teesta Bazaar 0.9348 0.9253 0.3907 0.9499 0.9318 0.3386 
Coronation 0.9056 0.8967 0.411 0.9525 0.9415 0.2812 
Domohani 0.9273 0.9045 0.2564 0.9511 0.9491 0.1256 
Mekhliganj 0.9433 0.9386 0.1726 0.9608 0.9558 0.1224 
5 days
Training
Testing
Gauging stationsR2NSERMSE (m)R2NSERMSE (m)
Sankalan 0.8604 0.8575 0.5677 0.8616 0.8609 0.4711 
Khanitar 0.7671 0.7421 0.4753 0.7768 0.7631 0.3184 
Teesta Bazaar 0.8744 0.8439 0.6679 0.8924 0.8083 0.5702 
Coronation 0.8045 0.7787 0.6329 0.8202 0.7866 0.5396 
Domohani 0.8773 0.8691 0.2691 0.9053 0.9015 0.1754 
Mekhliganj 0.8548 0.8472 0.2194 0.902 0.8967 0.1871 
Figure 6

Predicted river stage for 1-day, 3-day, and 5-day lead time using WLSTM model at (a) Sankalan, (b) Khanitar, (c) Teesta Bazaar, (d) Coronation, (e) Domohani, and (f) Mekhliganj.

Figure 6

Predicted river stage for 1-day, 3-day, and 5-day lead time using WLSTM model at (a) Sankalan, (b) Khanitar, (c) Teesta Bazaar, (d) Coronation, (e) Domohani, and (f) Mekhliganj.

Close modal

The results demonstrate the application of LSTM and WLSTM models for the prediction of river stage for the six gauging stations for 1-, 3-, and 5-day lead time. In general, the range of R2 and NSE values from 1 day to 5 days is 0.95–0.72 and 0.99–0.76 for LSTM and WLSTM models, respectively. The range of RMSE values is 0.16–0.61 and 0.03–0.57 m. The results prove that the performance of both models slightly deteriorated with the increase in lead days accordingly the congruency between observed and predicted values gradually fell apart with the advancement of days. However, the LSTM models of Sankalan, Domohani, and Mekhliganj perform well compared to the other gauging stations considering the results of all the statistical indicators (R2, NSE, and RMSE). Although the LSTM models of Teesta Bazaar and Coronation perform better in terms of R2 and NSE, the error difference is more compared to other stations as inferred from the RMSE values. Overall, in LSTM models, the agreement between the observed and predicted stage for Mekhliganj is better than that of other stations for all the lead times as the RMSE error is lower. The idea of utilizing multivariate inputs (river stage values of the upstream gauging stations) has proven beneficial in enhancing the accuracy of the model and implementing an idea of natural phenomena in which the downstream water level is influenced by its upstream values. As the LSTM model network learns from the previous timesteps and decides which important data to retain and carry forward for the next iteration, it highlights its ability to capture long-term dependencies in the sequential data. The model discards the redundant information to retain necessary information in the memory for achieving high prediction accuracy. Moreover, the unique architecture of the LSTM model helps to interpret the spatiotemporal correlation of the data while processing the time series data. By incorporating the wavelet components as input in the LSTM model, the results of the statistical indicators of WLSTM models improved with respect to LSTM models. The sequential data handling capacity of the LSTM model and the ability of wavelet transform to discard noise from the data collectively lead to an improvement in prediction accuracy for the WLSTM model. As a result, the R2 and NSE values of WLSTM models have escalated and RMSE values indicated lower errors.

The performance of all the gauging stations was incredibly good, while the WLSTM model for Domohani performed evenly well for all the lead time. In general, the LSTM and WLSTM models are considered very good for modelling river stage prediction for longer lead times as per the performance ratings by Moriasi et al. (2007). Simultaneously, the percentage error in peak stage prediction is equally important to understand the accuracy of the models towards high-water level prediction. Figure 7(a), (b), and (c) represents the percentage error in peak stage prediction of LSTM and WLSTM models for 1-, 3-, and 5-day lead time, respectively. For 1-day lead time, the WLSTM model produces less error for all the stations except for Mekhliganj. Subsequently, for 3-day lead time, the WLSTM model outperforms the LSTM model in peak prediction with exceptionally minimum error for Sankalan, Teesta Bazaar, and Domohani gauging stations. However, for 5-day lead time, the LSTM models for Khanitar and Teesta Bazaar have outperformed the WLSTM model in peak prediction.
Figure 7

Percentage error in peak stage prediction for (a) 1-day, (b) 3-day, and (c) 5-day lead time.

Figure 7

Percentage error in peak stage prediction for (a) 1-day, (b) 3-day, and (c) 5-day lead time.

Close modal
As mentioned earlier, the intimation of flood situation within West Bengal is envisaged through the rise in the river stage above PDL; therefore, proper prediction of river stage above PDL is crucial. Within the testing phase (2015–2017), specifically at Domohani and Mekhliganj, there are a few peak river stages that have exceeded PDL. A comparison of the actual peak with the predicted peak by LSTM and WLSTM models exceeding PDL at Domohani and Mekhliganj is depicted in Figures 8 and 9, respectively. For Domohani, the LSTM model predicted all four peaks exceeding PDL better compared to the WLSTM model. However, for Mekhliganj, the WLSTM model slightly overpredicted both peaks compared to the LSTM model. The water level predictions beyond the PDL of these two downstream stations are immensely beneficial during flood situations to issue pre-emptive warnings as downstream is more likely to be inundated. Commonly government agencies issue early warnings based on the rise of water levels. In this regard, water level prediction with lead days by both models is advantageous to evaluate the situation beforehand and plan accordingly.
Figure 8

Comparison of actual peak and predicted peak exceeding PDL at Domohani by (a) the LSTM model and (b) the WLSTM model.

Figure 8

Comparison of actual peak and predicted peak exceeding PDL at Domohani by (a) the LSTM model and (b) the WLSTM model.

Close modal
Figure 9

Comparison of actual peak and predicted peak exceeding PDL at Mekhliganj by (a) the LSTM model and (b) the WLSTM model.

Figure 9

Comparison of actual peak and predicted peak exceeding PDL at Mekhliganj by (a) the LSTM model and (b) the WLSTM model.

Close modal

Thus, LSTM models are a more reliable choice than the traditional models for longer lead time predictions because they can capture long-term dependencies very well (Adli Zakaria et al. 2023). Similarly, Atashi et al. (2022) recommended the LSTM model over seasonal autoregressive integrated moving average (SARIMA) and random forest models for achieving more accurate results in floodwater level prediction of the Red River basin. Even, LSTM has outperformed other deep learning models like gated recurrent unit and bidirectional LSTM (Bi-LSTM) in hourly water level prediction at Kien Giang river in Vietnam as reported by Hieu et al. (2023). The appropriate usage of disintegrated wavelet components (details and approximation) in the ML models enhances the predictive capability of the model presenting them as a better predictive tool than conventional models (Anh et al. 2018). The study by Xie et al. (2021) suggests that the WLSTM model has stronger generalization capability over the LSTM, wavelet-artificial neural network (WANN), and wavelet-autoregressive integrated moving average (WA-ARIMA) models while predicting the water level at the Yangtze River in China. The methodology of combining wavelet with deep learning algorithm for prediction purposes aids in depicting the nonlinear relationships between the predictor and response variables by capturing the pattern of the data series proficiently. From the findings of various researchers, it can be concluded that hybrid models are superior to conventional models and deep learning models are also an alternative to traditional ML models. Moreover, many researchers (Le et al. 2019; Liu et al. 2020; Mehedi et al. 2022) experimented with various hyperparameters (epochs, hidden neurons) through trial and error similar to this study to achieve the best prediction results. However, the approach is not limited to this methodology, implementation of various hybridization and optimization techniques with the models, and significant parametric modifications are recommended to explore the outcomes in this prediction study.

This study explores the utilization of wavelet analysis with a deep learning algorithm and a comparative analysis of LSTM and WLSTM models to predict the daily river stage at six gauging stations for 1-, 3-, and 5-day lead time. The LSTM and WLSTM models utilized multivariate inputs (river stage values of upstream stations) for all the gauging stations except Sankalan and produced satisfactory results in terms of the statistical indicators (R2, NSE, and RMSE) by adjusting the hyperparameters. Overall, the LSTM model for Mekhliganj performed extremely well with minimum RMSE values even up to 5-day lead time. Advantageously, the LSTM model can be a choice for Mekhliganj even for a longer lead time; warnings can be disseminated earlier since it is the catchment outlet. Regarding the peak stage prediction, for all the stations, the WLSTM model outperforms the LSTM model for 1-day and 3-day lead time. Simultaneously, the LSTM model yields a minimal error in peak stage prediction for 5-day lead time at Sankalan, Khanitar, and Teesta Bazaar gauging stations. Specifically, for Domohani and Mekhliganj, where peak river stages exceeding the PDL were prominent in some instances, these models well predicted those peaks for 1-day lead which was gradually underestimated with the increase in the forecast period. The recursive framework of the LSTM model enables it to solve the long-term dependencies of time series problems. Thus, it can be concluded that both LSTM and WLSTM models are recommended for overall performance with lower errors and proficient peak stage prediction for a certain lead time. In most of the gauging stations, the predicted river stage by the WLSTM model adhered to the observed variation of the time series, but in some cases, overestimation and underestimation were noted with the increase in lead time. This implies that the indigenous property of DWT disintegrates the noise from the original data which enable the model to capture the trend as well as to understand the orientation of the data for better prediction. However, the degradation of the model's performance with the advancement of lead time is considered one of the limitations of the model and further improvement in the LSTM or WLSTM model for enhancing accuracy with increasing forecast period needs to be explored. The variation in the performances of both the LSTM and WLSTM models for different gauging stations also highlights the data sensitivity of the LSTM models. However, the LSTM model handles noisy data but fails to quantify the uncertainties in the prediction. In this regard, integrating the various types of data and optimizing hyperparameters can enhance the robustness of the model for better accuracy. This study of river stage prediction at various gauging stations of the Teesta River basin is indispensably significant in this flood-prone basin to issue early warnings of rising water levels to the inhabitants during emergencies and save them from adversity.

The authors duly thank the Central Water Commission, New Delhi, for furnishing the data support to carry out this study.

No funding was received for conducting this study.

S.C. conceptualized the study, collected data, performed the analysis, and wrote the first draft of the manuscript. S.B. edited the previous version of the manuscript and supervised the entire study. All authors read and approved the final manuscript.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Adli Zakaria
M. N.
,
Ahmed
A. N.
,
Abdul Malek
M.
,
Birima
A. H.
,
Hayet Khan
M. M.
,
Sherif
M.
&
Elshafie
A.
2023
Exploring machine learning algorithms for accurate water level forecasting in Muda river, Malaysia
.
Heliyon
9
(
7
),
e17689
.
https://doi.org/10.1016/j.heliyon.2023.e17689
.
Anh
N. T. N.
,
Dat
N. Q.
,
Van
N. T.
,
Doanh
N. N.
&
An
N. L.
2018
Wavelet-artificial neural network model for water level forecasting
. In:
Proceedings of the 2018 3rd IEEE International Conference on Research in Intelligent and Computing in Engineering, RICE 2018
.
IEEE
, pp. 1–6.
https://doi.org/10.1109/RICE.2018.8509064
.
Atashi
V.
,
Gorji
H. T.
,
Shahabi
S. M.
,
Kardan
R.
&
Lim
Y. H.
2022
Water level forecasting using deep learning time-series analysis: A case study of Red River of the North
.
Water (Switzerland)
14
(
12
),
1
18
.
https://doi.org/10.3390/w14121971
.
Barzegar
R.
,
Adamowski
J.
&
Moghaddam
A. A.
2016
Application of wavelet-artificial intelligence hybrid models for water quality prediction: A case study in Aji-Chay River, Iran
.
Stochastic Environmental Research and Risk Assessment
30
(
7
),
1797
1819
.
https://doi.org/10.1007/s00477-016-1213-y
.
Campolo
M.
,
Soldati
A.
&
Andreussi
P.
2003
Artificial neural network approach to flood forecasting in the River Arno
.
Hydrological Sciences Journal
48
(
3
),
381
398
.
https://doi.org/10.1623/hysj.48.3.381.45286
.
Chau
K. W.
,
Wu
C. L.
&
Li
Y. S.
2005
Comparison of several flood forecasting models in Yangtze River
.
Journal of Hydrologic Engineering, ASCE
10
(
6
),
485
491
.
Dalkilic
H. Y.
,
Kumar
D.
,
Samui
P.
,
Dixon
B.
,
Yesilyurt
S. N.
&
Katipoğlu
O. M.
2023
Application of deep learning approaches to predict monthly stream flows
.
Environmental Monitoring and Assessment
195
(
6
),
1
14
.
https://doi.org/10.1007/s10661-023-11331-5
.
Daubechies
I.
1990
The wavelet transform, time-frequency localization and signal analysis
.
IEEE Transactions on Information Theory
36
(
5
),
6
7
.
Faruq
A.
,
Abdullah
S. S.
,
Marto
A.
,
Bakar
M. A. A.
,
Samin
&
Mubin
A.
2020
River water level forecasting for flood warning system using deep learning long short-term memory network
.
IOP Conference Series: Materials Science and Engineering
821
(
1
),
1
9
.
https://doi.org/10.1088/1757-899X/821/1/012026
.
Ghosh
M.
&
Ghosal
S.
2021
Climate change vulnerability of rural households in flood-prone areas of Himalayan foothills, West Bengal, India
.
Environment, Development and Sustainability
23
,
2570
2595
.
https://doi.org/10.1007/s10668-020-00687-0
.
Goyal
M. K.
&
Goswami
U. P.
,
2018
Teesta river and its ecosystem
. In:
The Indian Rivers, Springer Hydrogeology
(
Singh
D.
, ed.).
Springer
,
Singapore
, pp.
537
551
.
https://doi.org/10.1007/978-981-10-2984-4_37
.
Grossman
A.
&
Morlet
J.
1984
Decomposition of hardy function into square integrable wavelets of constant shape
.
SIAM Journal on Mathematical Analysis
15
(
4
),
723
736
.
Guimarães Santos
C. A.
&
da Silva
G. B. L.
2014
Daily streamflow forecasting using wavelet transform and artificial neural network hybrid models
.
Hydrological Sciences Journal
59
(
2
),
312
324
.
https://doi.org/10.1080/02626667.2013.800944
.
Hadi
F. A. A.
,
Mohd Sidek
L.
,
Ahmed Salih
G. H.
,
Basri
H.
,
Sammen
S. S.
,
Mohd Dom
N.
,
Muhammad Ali
Z.
&
Ahmed
A. N.
2024
Machine learning techniques for flood forecasting
.
Journal of Hydroinformatics
26
(
4
),
1
21
.
https://doi.org/10.2166/hydro.2024.208
.
Han
H.
&
Morrison
R. R.
2021
Data-driven approaches for runoff prediction using distributed data
.
Stochastic Environmental Research and Risk Assessment
36
,
2153
2171
.
https://doi.org/10.1007/s00477-021-01993-3
.
Hayder
G.
,
Solihin
M. I.
&
Najwa
M. R. N.
2022
Multi-step-ahead prediction of river flow using NARX neural networks and deep learning LSTM
.
H2Open Journal
5
(
1
),
42
59
.
https://doi.org/10.2166/h2oj.2022.134
.
Hieu
T. T.
,
Chieu
T. Q.
,
Quang
D. N.
&
Hieu
N. D.
2023
Water level prediction using deep learning models: A case study of the Kien Giang River, Quang Binh Province
.
River
2
(
4
),
468
479
.
https://doi.org/10.1002/rvr2.63
.
Hochreiter
S.
&
Schmidhuber
J.
1997
Long short-term memory
.
Neural Computing
9
,
1735
1780
.
Kasiviswanathan
K. S.
,
He
J.
,
Sudheer
K. P.
&
Tay
J. H.
2016
Potential application of wavelet neural network ensemble to forecast streamflow for flood management
.
Journal of Hydrology
536
,
161
173
.
https://doi.org/10.1016/j.jhydrol.2016.02.044
.
Khosravi
M.
,
Duti
B. M.
,
Yazdan
M. M. S.
,
Ghoochani
S.
,
Nazemi
N.
&
Shabanian
H.
2023
Multivariate multi-step long short-term memory neural network for simultaneous stream-water variable prediction
.
Eng
4
(
3
),
1933
1950
.
https://doi.org/10.3390/eng4030109
.
Latt
Z. Z.
&
Wittenberg
H.
2014
Improving flood forecasting in a developing country: A comparative study of stepwise multiple linear regression and artificial neural network
.
Water Resources Management
28
(
8
),
2109
2128
.
https://doi.org/10.1007/s11269-014-0600-8
.
Le
X. H.
,
Ho
H. V.
,
Lee
G.
&
Jung
S.
2019
Application of long short-term memory (LSTM) neural network for flood forecasting
.
Water (Switzerland)
11
(
7
),
1387
.
https://doi.org/10.3390/w11071387
.
Li
K.
,
Wan
D.
,
Zhu
Y.
,
Yao
C.
,
Yu
Y.
,
Si
C.
&
Ruan
X.
2020
The applicability of ASCS_LSTM_ATT model for water level prediction in small- and medium-sized basins in China
.
Journal of Hydroinformatics
22
(
6
),
1693
1717
.
https://doi.org/10.2166/HYDRO.2020.043
.
Linh
N. T. T.
,
Ruigar
H.
,
Golian
S.
,
Bawoke
G. T.
,
Gupta
V.
,
Rahman
K. U.
,
Sankaran
A.
&
Pham
Q. B.
2021
Flood prediction based on climatic signals using wavelet neural network
.
Acta Geophysica
69
,
1413
1426
.
https://doi.org/10.1007/s11600-021-00620-7
.
Liong
S.-Y.
,
Lim
W.-H.
&
Paudyal
G. N.
2000
River stage forecasting in Bangladesh: Neural network approach
.
Journal of Computing in Civil Engineering
14
,
1
8
.
Liu
D.
,
Jiang
W.
,
Mu
L.
&
Wang
S.
2020
Streamflow prediction using deep learning neural network: Case study of Yangtze river
.
IEEE Access
8
,
90069
90086
.
https://doi.org/10.1109/ACCESS.2020.2993874
.
Mallat
S. G.
1989
A theory for multiresolution signal decomposition: The wavelet representation
.
IEEE Transactions on Pattern Analysis and Machine Intelligence
11
(
7
),
674
693
.
Mehedi
M. A. A.
,
Khosravi
M.
,
Yazdan
M. M. S.
&
Shabanian
H.
2022
Exploring temporal dynamics of river discharge using univariate long short-term memory (LSTM) recurrent neural network at East Branch of Delaware River
.
Hydrology
9
(
11
),
202
.
https://doi.org/10.3390/hydrology9110202
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
Transactions of the ASABE
50
(
3
),
885
900
.
Ni
L.
,
Wang
D.
,
Singh
V. P.
,
Wu
J.
,
Wang
Y.
,
Tao
Y.
&
Zhang
J.
2020
Streamflow and rainfall forecasting by two long short-term memory-based models
.
Journal of Hydrology
583
,
124296
.
https://doi.org/10.1016/j.jhydrol.2019.124296
.
Nourani
V.
,
Alami
M. T.
&
Aminfar
M. H.
2008
A combined neural-wavelet model for prediction of Ligvanchai watershed precipitation
.
Engineering Applications of Artificial Intelligence
22
(
3
),
466
472
.
https://doi.org/10.1016/j.engappai.2008.09.003
.
Nourani
V.
,
Komasi
M.
&
Mano
A.
2009
A multivariate ANN-wavelet approach for rainfall-runoff modeling
.
Water Resources Management
23
(
14
),
2877
2894
.
https://doi.org/10.1007/s11269-009-9414-5
.
Nourani
V.
,
Hosseini Baghanam
A.
,
Adamowski
J.
&
Kisi
O.
2014
Applications of hybrid wavelet-artificial intelligence models in hydrology: A review
. Journal of Hydrology 514, 358–377.
https://doi.org/10.1016/j.jhydrol.2014.03.057
.
Pal
R.
,
Biswas
S. S.
,
Mondal
B.
&
Pramanik
M. K.
2016
Landslides and floods in the Tista Basin (Darjeeling and Jalpaiguri Districts): Historical evidence, causes and consequences
.
The Journal of Indian Geophysical Union
20
(
2
),
66
72
.
Ruma
J. F.
,
Adnan
M. S. G.
,
Dewan
A.
&
Rahman
R. M.
2023
Particle swarm optimization based LSTM networks for water level forecasting: A case study on Bangladesh river network
.
Results in Engineering
17
,
100951
.
https://doi.org/10.1016/j.rineng.2023.100951
.
Sang
Y. F.
,
Wang
Z.
&
Liu
C.
2015
Wavelet neural modeling for hydrologic time series forecasting with uncertainty evaluation
.
Water Resources Management
29
(
6
),
1789
1801
.
https://doi.org/10.1007/s11269-014-0911-9
.
Seo
Y.
,
Kim
S.
,
Kisi
O.
&
Singh
V. P.
2015
Daily water level forecasting using wavelet decomposition and artificial intelligence techniques
.
Journal of Hydrology
520
,
224
243
.
https://doi.org/10.1016/j.jhydrol.2014.11.050
.
Subramanya
K.
2021
Engineering Hydrology
(5th edn).
Tata McGraw-Hill
,
New Delhi
.
Thirumalaiah
K.
&
Deo
M. C.
2000
Hydrological forecasting using neural networks
.
Journal of Hydrologic Engineering ASCE
5
(
2
),
180
189
.
Tiwari
M. K.
&
Chatterjee
C.
2010
Uncertainty assessment and ensemble flood forecasting using bootstrap based artificial neural networks (BANNs)
.
Journal of Hydrology
382
,
20
33
.
https://doi.org/10.1016/j.jhydrol.2009.12.013
.
Wang
S.
,
Yu
X.
,
Li
Y.
,
Wang
S.
&
Meng
C.
2023
Application of a hybrid deep learning approach with attention mechanism for evapotranspiration prediction: A case study from the Mount Tai region, China
.
Earth Science Informatics
16
(
4
),
3469
3487
.
https://doi.org/10.1007/s12145-023-01103-7
.
Xie
Z.
,
Liu
Q.
&
Cao
Y.
2021
Hybrid deep learning modeling for water level prediction in Yangtze River
.
Intelligent Automation and Soft Computing
28
(
1
),
153
166
.
https://doi.org/10.32604/iasc.2021.016246
.
Xing
W. Y.
,
Bai
Y. L.
,
Ding
L.
,
Yu
Q. H.
&
Song
W.
2022
Application of a hybrid model based on GA-ELMAN neural networks and VMD double processing in water level prediction
.
Journal of Hydroinformatics
24
(
4
),
818
837
.
https://doi.org/10.2166/hydro.2022.016
.
Yadav
A.
,
Chatterjee
S.
&
Equeenuddin
S. M.
2018
Prediction of suspended sediment yield by artificial neural network and traditional mathematical model in Mahanadi river basin, India
.
Sustainable Water Resources Management
4
(
4
),
745
759
.
https://doi.org/10.1007/s40899-017-0160-1
.
Yin
H.
,
Zhang
X.
,
Wang
F.
,
Zhang
Y.
,
Xia
R.
&
Jin
J.
2021
Rainfall-runoff modeling using LSTM-based multi-state-vector sequence-to-sequence model
.
Journal of Hydrology
598
,
126378
.
https://doi.org/10.1016/j.jhydrol.2021.126378
.
Zhang
D.
,
Lin
J.
,
Peng
Q.
,
Wang
D.
,
Yang
T.
,
Sorooshian
S.
,
Liu
X.
&
Zhuang
J.
2018
Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm
.
Journal of Hydrology
565
,
720
736
.
https://doi.org/10.1016/j.jhydrol.2018.08.050
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).