Significant strides have been made in advancing streamflow predictions, notably with the introduction of cutting-edge machine-learning models. Predominantly, long short-term memories (LSTMs) and convolution neural networks (CNNs) have been widely employed in this domain. While LSTMs are applicable in both rainfall-runoff and time-series settings, CNN–LSTMs have primarily been utilized in rainfall-runoff scenarios. In this study, we extend the application of CNN–LSTMs to time-series settings, leveraging lagged streamflow data in conjunction with precipitation and temperature data to predict streamflow. Our results show a substantial improvement in predictive performance in 21 out of 32 HUC8 basins in Nebraska, showcasing noteworthy increases in the Kling–Gupta efficiency values. These results highlight the effectiveness of CNN–LSTMs in time-series settings, particularly for spatiotemporal hydrological modeling, for more accurate and robust streamflow predictions.

  • A convolutional neural network (CNN)– long short-term memory (LSTM) model was developed for time-series forecasting of streamflow in Nebraska by combining CNN for spatial data and LSTM for sequence data.

  • A substantial improvement was observed in 66% of the basins for this model compared to the standalone LSTM.

  • This superior performance was achieved just by using gridded precipitation and 2-m temperature as exogenous inputs.

Streamflow prediction is one of the most challenging tasks in water resources management and is vital for various sectors, e.g. agriculture, disaster mitigation, and planning and management of water resources (Le et al. 2021; Pokharel et al. 2023). It is an integrated practice involving complex nonlinear physical processes that operate at multiple temporal and spatial scales within a watershed (Girihagama et al. 2022; Dehghani et al. 2023). Accurate streamflow prediction allows for advanced warning of potential floods, giving people in flood-prone areas time to prepare and implement flood control measures. Due to the proliferation of high-resolution and remote sensing datasets at both high spatial and temporal resolutions, the use of deep learning (DL) algorithms has gained traction in the field of hydrology, especially in streamflow prediction (Kratzert et al. 2018, 2019; Yan et al. 2019; Ding et al. 2020; Xiang et al. 2020; Hunt et al. 2022; Dehghani et al. 2023; Pokharel et al. 2023; Vatanchi et al. 2023).

Streamflow is typically modeled in two different ways. The first method involves time-series modeling, where we incorporate both lagged predictors and lagged target values over a specific time period (look-back window) to make predictions. In contrast, the second method, i.e., rainfall-runoff modeling, focuses solely on the predictors up to a given time (t) without considering the lagged target values.

The long short-term memory (LSTM) (Hochreiter & Schmidhuber 1997) model has gained traction and is widely used in streamflow prediction due to its ability to store and regulate information over time. It is well-suited for time-series modeling because it can effectively remember long-term dependencies and handle sequential data (Bengio et al. 1994). In streamflow prediction, where the data often exhibit temporal patterns, LSTM can learn from historical information, including lagged predictors and target values, making them suitable for streamflow prediction (Khandelwal et al. 2020; Majeske et al. 2022).

Convolution neural networks (CNNs) are DL networks, which are widely used for image and video recognition tasks. They are well known for their ability to learn spatial patterns within gridded data, such as images/videos, which is why this DL network is suitable for processing gridded spatial data in water resources. CNNs are a feedforward network like artificial neural networks (ANN). However, CNNs are more advanced than ANN with convolution and pooling layers, which help automatically extract the essential features from the input layer (Shu et al. 2021). CNNs can be 1-D, 2-D, and 3-D depending on the shape of the input layer. In 1-D CNN, the inputs are vectors, whereas in 2-D CNN, the inputs are in gridded/matrix form. The majority of the studies in hydrology involving CNN utilized the 1-D version of it (Haidar & Verma 2018; Chong et al. 2020; Hussain et al. 2020), although a few studies applied 2-D CNN (Huang et al. 2020; Chen et al. 2021).

CNN-LSTM is a hybrid neural network that combines the strengths of both CNN and LSTM. It is a DL architecture designed for sequence prediction problems with spatial inputs like images or videos (Donahue et al. 2017). The CNN model is used for feature extraction, while the LSTM model is used for interpreting the features across timesteps. Only a few studies in hydrology have applied CNN–2-D-LSTM in comparison to the widely used LSTM model (Anderson & Radić 2022; Li et al. 2023; Ng et al. 2023). Anderson & Radić (2022) used CNN-LSTM in rainfall-runoff settings using spatial data to predict streamflow at 226 stream gauges across southwestern Canada. They used gridded climate reanalysis data to train and predict streamflow and demonstrated the efficiency and suitability of the CNN-LSTM model for spatiotemporal hydrological modeling. Li et al. (2023) proposed a CNN-LSTM in rainfall-runoff settings that introduces actual evaporation as an additional training target and demonstrates the value of spatial information in hydrological modeling. Mohammed & Corzo (2024) applied CNN-LSTM in regional rainfall-runoff settings to simultaneously predict daily streamflow in 86 catchments across the US, adding the value by using spatial signals in the model. However, to the author's knowledge, no studies have investigated using CNN-LSTM in time-series settings. The contribution of this study is as follows:

  • 1. Applying and testing LSTM and CNN-LSTM models for 32 Nebraska HUC8 basins in time-series settings.

  • 2. Comparing the efficiency of those models in predicting streamflow for Nebraska HUC8 basins.

  • 3. Investigating the added value of using CNN-LSTM in time-series settings for streamflow prediction.

In this study, we used 32 HUC8 basins in Nebraska, which have at least 20 years of streamflow data available (Figure 1). Diverse physical and geographical features characterize these basins, including a range of landscapes such as the Great Plains, the Sand Hills, and the Ogallala Aquifer. The area of the basins ranged from 1,366 to 13,352 km2, with each exhibiting unique hydrological characteristics, such as different soil types, land-use patterns, and water availability.
Figure 1

32 HUC8 basins in Nebraska, as considered in this study.

Figure 1

32 HUC8 basins in Nebraska, as considered in this study.

Close modal

In this study, we used data from three sources. Precipitation (m) and 2-m temperature (K) gridded data were obtained from ERA5-Land (9 km × 9 km) (Muñoz-Sabater et al. 2021), which is the fifth-generation atmospheric reanalysis from the European Centre for Medium-Range Weather Forecasts (ECMWF). Daily streamflow (cfs) data were collected from the United States Geological Survey (USGS) and the Nebraska Department of Natural Resources (NeDNR). The datasets from the ERA5-Land were aggregated on daily timescales by taking an average over the basin for LSTM since LSTM models cannot process gridded data. For the CNN-LSTM model, gridded data were used.

We know that hydrology is complex and has many underlying physics that govern the streamflow. However, we are only taking into consideration precipitation and temperature because these are the two principal meteorological inputs for hydrological modeling (Essou et al. 2016; Anderson & Radić 2022). This approach showcases the enhanced improvement in streamflow prediction achieved by using CNN-LSTM in time-series settings. Thus, taking a parsimonious approach, we tried to build a simpler model with few predictor variables to compare the efficacy of models.

Convolution neural networks

CNN (LeCun et al. 1998) is a DL model designed to process spatial data with a grid pattern like images and videos. CNN consists of three building block layers such as convolution, pooling, and fully connected (Figure 2) (Yamashita et al. 2018; Khosravi et al. 2020). The convolution layer applies a filter to the input tensor to extract features by detecting the spatial pattern. The convolution layer slides filters, also known as kernels, over the input tensor to compute dot products between the filter and input at each location, resulting in feature maps. The pooling layer reduces the spatial size of the feature map by down-sampling it, reducing the number of parameters and computations in the network. Max pooling is commonly used as it takes the maximum activation value in each filter region. The fully connected layer connects every neuron in one layer to every neuron in the next layer, takes the previous layer's output, and produces the network's final output.
Figure 2

Architecture of the CNN (drawn with the help of an online tool created by LeNail (2019)).

Figure 2

Architecture of the CNN (drawn with the help of an online tool created by LeNail (2019)).

Close modal

In CNN, strides are the number of pixels the filter moves each time during the convolution operation. Padding refers to adding pixels of zero around the input image borders. Padding is done so that when the filters slide over the edge pixels, there is meaningful information for it to process rather than an empty space. The output size depends on the input dimensions, strides, padding, and filter size.

Long short-term memory

LSTM (Hochreiter & Schmidhuber 1997) is an evolution form of the recurrent neural network (RNN), which was developed to address the problem of vanishing and exploding gradients in RNN. LSTM consists of three main gates such as the forget, the input, and the output (Figure 3). As the name suggests, the forget gate determines which information should be discarded based on the previous hidden state (ht−1) and current input data (Xt). The input gate adds the new information into the network's long-term memory based on the previous hidden state (ht−1) and the current input data (Xt). The output gate determines the next hidden state (ht) and is used to make the final predictions.
Figure 3

An architecture of the LSTM model.

Figure 3

An architecture of the LSTM model.

Close modal

Time-series LSTM

In this experiment, we utilized the LSTM neural network for time-series modeling. The lagged values of predictors (precipitation and 2-m temperature) and the lagged streamflow values were input to calculate streamflow at time t. The gridded datasets underwent basin averaging, as LSTM models cannot directly handle spatial data in their raw form. They were aggregated to form coherent input sequences for the LSTM model. The objective was to forecast streamflow at time t by examining tn lagged predictors along with the target value.

Time-series CNN-LSTM

In this experiment, we developed an individual sequential CNN-LSTM model for streamflow forecasting. The goal was to leverage the complementary strengths of CNN for extracting spatial features and LSTM for modeling temporal dynamics (Figure 4). Analogous to video frames, we treated the daily gridded meteorological input data (precipitation and 2-m temperature) as image-like frames. Each frame had a defined spatial structure spanning geographic coordinates (latitude and longitude), similar to image rows and columns. The frames also comprised precipitation and temperature channels like RGB color channels. Stacked over the model's 6-month or 182-day input sequence, these daily weather ‘images’ formed an input ‘video’ for precipitation and temperature over the catchment area.
Figure 4

Overview of the CNN-LSTM model architecture in a time-series setting.

Figure 4

Overview of the CNN-LSTM model architecture in a time-series setting.

Close modal

The strategy for applying a CNN-LSTM model for time-series analysis involved representing lagged streamflow values as channels. The original 1-D discharge time series underwent a transformation into a 2-D tensor by duplicating values across the spatial grid. This integration allowed the streamflow data to be seamlessly concatenated with meteorology grids along the channel dimension. The model's input was comprised of a 6-month long video encompassing 182 frames. Each frame was equipped with three channels representing precipitation, 2-m temperature, and streamflow. These frames underwent a series of time-distributed convolution and pooling layers, as depicted in Figure 2. The CNN component efficiently extracted crucial features from each frame, generating 182 vectors that encapsulate spatially learned features for a specific day. Subsequently, these feature vectors traversed through the LSTM layer to capture temporal dependencies. The dense layer further mapped the discharge values derived from the LSTM output.

The initial step in our experiment involved transforming the datasets into the requisite format for model training. Then, it was split into training and testing sets in a 70:30 ratio. The validation split was 0.3, meaning 30% of the training data was used for validation. The datasets were normalized using the standard scaler method, subtracting the mean and dividing by the standard deviation. Only the training period datasets were used to calculate the mean and standard deviation. For consistency and to see the effectiveness of models, we used the same parameter across all the experiments carried out. The activation function, batch size, optimizer, and dropout were kept the same for all the experiments. We used Relu as the activation function, Adam as the optimizer, and the epoch and batch size were kept 100 and 50, respectively. Dropout, kernel initializer (he_uniform), and gradient norm were used to avoid overfitting, vanishing, and exploding gradient problems. The models were trained using the mean square error as the loss function (Pokharel & Ghimire 2023). The number of LSTM layers and neurons was kept the same for the LSTM and CNN-LSTM models. Table 1 shows the CNN-LSTM internal architecture. The use of 1 × 1 convolutional filters followed by an increase to 3 × 3 filters provided a way to manage model complexity, control the number of feature maps, and reduce computational requirements in CNNs, leading to more efficient and effective network architectures (Lin et al. 2013; Szegedy et al. 2014, 2015; Anderson & Radić 2022). We also used a single LSTM layer with 80 neurons, as it was found in the previous studies that using a single LSTM layer with more neurons performed better than multiple layers with fewer neurons (Kratzert et al. 2018; Anderson & Radić 2022).

Table 1

Description of CNN-LSTM model layers

LayersType of layersDescription
Convolution 32 filters, 1 × 1 
Convolution 16 filters, 3 × 3 
Max Pooling2D 1 × 1 
Convolution 32 filters, 1 × 1 
Max Pooling2D 1 × 1 
Dropout 0.3 
LSTM 80 
Dense 
LayersType of layersDescription
Convolution 32 filters, 1 × 1 
Convolution 16 filters, 3 × 3 
Max Pooling2D 1 × 1 
Convolution 32 filters, 1 × 1 
Max Pooling2D 1 × 1 
Dropout 0.3 
LSTM 80 
Dense 

All the basins were trained separately using the same configuration as discussed above. Finally, all the model performances were evaluated using Kling–Gupta efficiency (KGE) (Gupta et al. 2009).

This study used the KGE (Gupta et al. 2009) to assess the models' predictive performance. The KGE is a widely used metric in hydrology that integrates three common types of errors in a model, i.e., bias, variability, and timing (correlation), and is written as:
where r is the linear correlation coefficient between the observations and simulations. is the ratio of the coefficients of variation, and is the ratio of the means. The KGE can range from −∞ to 1, with a value of 1 indicating perfect correspondence between simulations and observations.

Comparison of individual CNN-LSTM and LSTM performance

In this section, we compare the performance of LSTM and CNN-LSTM in time-series settings across 32 Nebraska HUC8 basins. The model's predictions were used to evaluate the model's ability to simulate streamflow during the testing phase. Out of 32 river basins analyzed, the CNN-LSTM model performed better in 62% (21 basins) compared to a standard LSTM model for streamflow prediction. The maximum increment in model performance (the KGE metric) was from 0.39 to 0.91 (Figure 6) as we moved from LSTM to CNN-LSTM, with an overall median KGE improvement from 0.76 to 0.78 (Figure 5). Figure 6 illustrates the performance comparison between the CNN-LSTM and LSTM models for streamflow prediction in the Frenchman basin. The CNN-LSTM model, with a KGE of 0.91, significantly outperforms the LSTM model with a KGE of 0.39. In the full timeline, CNN-LSTM (blue dashed line) closely follows the observed streamflow, particularly during extreme flow events (highlighted in red), capturing both the magnitude and timing of these peaks better than LSTM (orange dashed line). The zoomed-in view further demonstrates CNN-LSTM superior ability to model streamflow, especially during high-flow periods where LSTM underestimates the discharge. This highlights CNN-LSTM capacity to capture spatial dependencies and handle complex flow patterns, making it a more reliable model for predicting extreme streamflow fluctuations. Despite the superior performance of CNN-LSTM in most basins, a deeper investigation into basin characteristics such as elevation, land use, soil type, and surface roughness revealed no clear pattern explaining the varying degrees of improvement, opening new avenues for future research.
Figure 5

Distribution of KGE values in LSTM and CNN-LSTM in time-series settings.

Figure 5

Distribution of KGE values in LSTM and CNN-LSTM in time-series settings.

Close modal
Figure 6

Discharge predictions for the different models at the Frenchman basin.

Figure 6

Discharge predictions for the different models at the Frenchman basin.

Close modal
The line plot (Figure 7) comparing the KGE values of the LSTM and CNN-LSTM models across different basins presents several important insights. Overall, CNN-LSTM outperforms LSTM in most basins, showing higher KGE values, particularly in ‘Calamus,’ ‘Frenchman,’ and ‘Upper Big Blue,’ where the CNN-LSTM model exhibits significant improvement. However, there are exceptions, such as in the ‘Middle Platte-Prairie’ and ‘Lower Elkhorn’ basins, where the LSTM outperforms CNN-LSTM. Additionally, in basins like ‘Cedar’ and ‘South Fork Republican,’ both models demonstrate relatively lower performance. In cases like ‘Lower North Platte’ and ‘Lower Platte-Shell,’ the models perform similarly, with only marginal differences in KGE values. These results suggest that while CNN-LSTM generally enhances predictive accuracy, the extent of improvement varies by basin, implying that local basin characteristics play a role in the models' effectiveness. In 11 basins, the CNN-LSTM model underperformed compared to the LSTM model. These basins were characterized by a high density of flood control structures, potentially affecting the CNN-LSTM model's ability to match the spatial signal with streamflow, especially in regions with extensive cropland and irrigation. Flood control structures like dams and reservoirs disrupt natural precipitation–discharge relationships in several ways. They regulate flow rates, redirect water, retain and slowly release floods, and dampen extreme signals (Gunnell et al. 2019; Hovis et al. 2021). Additionally, their operating policies change over time, reducing the relevance of older training data (Sung et al. 2018). While these structures are vital for managing regional water resources, they significantly alter historical signals, making it challenging for data-driven models to predict streamflow in these basins accurately.
Figure 7

Comparison of LSTM and CNN-LSTM KGE values across basins.

Figure 7

Comparison of LSTM and CNN-LSTM KGE values across basins.

Close modal

In considering the applicability of the CNN-LSTM model in regions with established streamflow gauges, its usefulness remains evident. This model proves advantageous due to the persistent challenges associated with achieving accurate streamflow measurements. These include the influence of backwater from ice or debris, such as log jams, and algae and aquatic growth in the stream (Kiang et al. 2018). Additionally, sediment movement and the potential malfunction of measurement equipment further increase the chance of inaccurate streamflow measurement. The precision and accuracy of these measurements are of high importance, as they directly influence the reliability of hydrologic models, and any inaccuracies directly translate into uncertainties within hydrologic models (Dos Santos et al. 2021). Furthermore, accurate streamflow is critical for various sectors, such as water resources planning and management, agriculture water management, disaster mitigation, and infrastructure safety.

This study presented a new approach for implementing a CNN-LSTM model using lagged streamflow data in a time-series setting. The performance of the CNN-LSTM and LSTM models was compared across 32 basins in Nebraska. The key finding is that the CNN-LSTM model, trained on 1-D streamflow data and gridded precipitation and temperature inputs, demonstrates improved performance over the LSTM model in 21 of the 32 test basins. These results suggest the feasibility and potential benefits of applying CNN-LSTM (time-series setting) models for streamflow prediction tasks. The model's ability to capture intricate spatial patterns and signals between predictors and targets surpasses the capabilities of standard LSTM recurrent networks. However, the study also uncovered degradation in CNN-LSTM performance as compared to the standard LSTM in some basins that are characterized by high streamflow magnitudes and extensive irrigated croplands.

The value of this study lies in demonstrating a new approach of using CNN-LSTM for streamflow prediction and its potential benefits. While this study used the same set of hyperparameters to showcase the efficacy of CNN-LSTM in time-series settings throughout all basins, it should be noted that other variables such as solar radiation, wind, vapor pressure, and some basin static attributes such as slope, soil, and land characteristics could result in even better model performances.

The authors acknowledge funding from the University of Nebraska Collaboration Initiative Grant (Award Number 32370).

The authors declare there is no conflict.

Anderson
S.
&
Radić
V.
(
2022
)
Evaluation and interpretation of convolutional long short-term memory networks for regional hydrological modelling
,
Hydrology and Earth System Sciences
,
26
(
3
),
795
825
.
https://doi.org/10.5194/hess-26-795-2022
.
Bengio
Y.
,
Simard
P.
&
Frasconi
P.
(
1994
)
Learning long-term dependencies with gradient descent is difficult
,
IEEE Transactions on Neural Networks
,
5
(
2
),
157
166
.
https://doi.org/10.1109/72.279181
.
Chen
C.
,
Hui
Q.
,
Xie
W.
,
Wan
S.
,
Zhou
Y.
&
Pei
Q.
(
2021
)
Convolutional neural networks for forecasting flood process in internet-of-things enabled smart city
,
Computer Networks
,
186
,
107744
.
https://doi.org/10.1016/J.COMNET.2020.107744
.
Chong
K. L.
,
Lai
S. H.
,
Yao
Y.
,
Ahmed
A. N.
,
Jaafar
W. Z. W.
&
El-Shafie
A.
(
2020
)
Performance enhancement model for rainfall forecasting utilizing integrated wavelet-convolutional neural network
,
Water Resources Management
,
34
(
8
),
2371
2387
.
https://doi.org/10.1007/S11269-020-02554-Z/FIGURES/7
.
Dehghani
A.
,
Moazam
H. M. Z. H.
,
Mortazavizadeh
F.
,
Ranjbar
V.
,
Mirzaei
M.
,
Mortezavi
S.
,
Ng
J. L.
&
Dehghani
A.
(
2023
)
Comparative evaluation of LSTM, CNN, and ConvLSTM for hourly short-term streamflow forecasting using deep learning approaches
,
Ecological Informatics
,
75
(
May
),
102119
.
https://doi.org/10.1016/j.ecoinf.2023.102119
.
Ding
Y.
,
Zhu
Y.
,
Feng
J.
,
Zhang
P.
&
Cheng
Z.
(
2020
)
Interpretable spatio-temporal attention LSTM model for flood forecasting
,
Neurocomputing
,
403
,
348
359
.
https://doi.org/10.1016/J.NEUCOM.2020.04.110
.
Donahue
J.
,
Hendricks
L. A.
,
Rohrbach
M.
,
Venugopalan
S.
,
Guadarrama
S.
,
Saenko
K.
&
Darrell
T.
(
2017
)
Long-term recurrent convolutional networks for visual recognition and description
,
IEEE Transactions on Pattern Analysis and Machine Intelligence
,
39
(
4
),
677
691
.
https://doi.org/10.1109/TPAMI.2016.2599174
.
Dos Santos
C. I.
,
Chaffe
P. L. B.
,
Perez
A. B. A.
,
Arienti
P. F.
&
J. H. M.
(
2021
)
Precision and accuracy of streamflow measurements in headwater streams during baseflow
,
RBRH
,
26
,
e8
.
https://doi.org/10.1590/2318-0331.262120200135
.
Essou
G. R. C.
,
Sabarly
F.
,
Lucas-Picher
P.
,
Brissette
F.
&
Poulin
A.
(
2016
)
Can precipitation and temperature from meteorological reanalyses be used for hydrological modeling?
,
Journal of Hydrometeorology
,
17
(
7
),
1929
1950
.
https://doi.org/10.1175/JHM-D-15-0138.1
.
Girihagama
L.
,
Naveed Khaliq
M.
,
Lamontagne
P.
,
Perdikaris
J.
,
Roy
R.
,
Sushama
L.
&
Elshorbagy
A.
(
2022
)
Streamflow modelling and forecasting for Canadian watersheds using LSTM networks with attention mechanism
,
Neural Computing and Applications
,
34
(
22
),
19995
20015
.
https://doi.org/10.1007/s00521-022-07523-8
.
Gunnell
K.
,
Mulligan
M.
,
Francis
R. A.
&
Hole
D. G.
(
2019
)
Evaluating natural infrastructure for flood management within the watersheds of selected global cities
,
Science of the Total Environment
,
670
,
411
424
.
https://doi.org/10.1016/J.SCITOTENV.2019.03.212
.
Gupta
H. V.
,
Kling
H.
,
Yilmaz
K. K.
&
Martinez
G. F.
(
2009
)
Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling
,
Journal of Hydrology
,
377
(
1–2
),
80
91
.
https://doi.org/10.1016/J.JHYDROL.2009.08.003
.
Haidar
A.
&
Verma
B.
(
2018
)
Monthly rainfall forecasting using one-dimensional deep convolutional neural network
,
IEEE Access
,
6
,
69053
69063
.
https://doi.org/10.1109/ACCESS.2018.2880044
.
Hochreiter
S.
&
Schmidhuber
J.
(
1997
)
Long short-term memory
,
Neural Computation
,
9
(
8
),
1735
1780
.
https://doi.org/10.1162/NECO.1997.9.8.1735
.
Hovis
M.
,
Hollinger
J. C.
,
Cubbage
F.
,
Shear
T.
,
Doll
B.
,
Kurki-Fox
J. J.
,
Line
D.
,
Fox
A.
,
Baldwin
M.
,
Klondike
T.
,
Lovejoy
M.
,
Evans
B.
,
West
J.
&
Potter
T.
(
2021
)
Natural infrastructure practices as potential flood storage and reduction for farms and rural communities in the North Carolina Coastal Plain
,
Sustainability 2021
,
13
(
16
),
9309
.
https://doi.org/10.3390/SU13169309
.
Huang
C.
,
Zhang
J.
,
Cao
L.
,
Wang
L.
,
Luo
X.
,
Wang
J. H.
&
Bensoussan
A.
(
2020
)
Robust forecasting of river-flow based on convolutional neural network
,
IEEE Transactions on Sustainable Computing
,
5
(
4
),
594
600
.
https://doi.org/10.1109/TSUSC.2020.2983097
.
Hunt
K. M. R.
,
Matthews
G. R.
,
Pappenberger
F.
&
Prudhomme
C.
(
2022
)
Using a long short-term memory (LSTM) neural network to boost river streamflow forecasts over the western United States
,
Hydrology and Earth System Sciences Discussions
, 26 (21), 5449–5472.
https://doi.org/10.5194/hess-2022-53
.
Hussain
D.
,
Hussain
T.
,
Khan
A. A.
,
Naqvi
S. A. A.
&
Jamil
A.
(
2020
)
A deep learning approach for hydrological time-series prediction: A case study of Gilgit river basin
,
Earth Science Informatics
,
13
(
3
),
915
927
.
https://doi.org/10.1007/S12145-020-00477-2/FIGURES/8
.
Khandelwal
A.
,
Xu
S.
,
Li
X.
,
Jia
X.
,
Stienbach
M.
,
Duffy
C.
,
Nieber
J.
&
Kumar
V.
(
2020
)
Physics Guided Machine Learning Methods for Hydrology. http://arxiv.org/abs/2012.02854
.
Khosravi
K.
,
Panahi
M.
,
Golkarian
A.
,
Keesstra
S. D.
,
Saco
P. M.
,
Bui
D. T.
&
Lee
S.
(
2020
)
Convolutional neural network approach for spatial prediction of flood hazard at national scale of Iran
,
Journal of Hydrology
,
591
,
125552
.
https://doi.org/10.1016/J.JHYDROL.2020.125552
.
Kiang
J. E.
,
Gazoorian
C.
,
McMillan
H.
,
Coxon
G.
,
Le Coz
J.
,
Westerberg
I. K.
,
Belleville
A.
,
Sevrez
D.
,
Sikorska
A. E.
,
Petersen-Øverleir
A.
,
Reitan
T.
,
Freer
J.
,
Renard
B.
,
Mansanarez
V.
&
Mason
R.
(
2018
)
A comparison of methods for streamflow uncertainty estimation
,
Water Resources Research
,
54
(
10
),
7149
7176
.
https://doi.org/10.1029/2018WR022708
.
Kratzert
F.
,
Klotz
D.
,
Brenner
C.
,
Schulz
K.
&
Herrnegger
M.
(
2018
)
Rainfall-runoff modelling using long short-term memory (LSTM) networks
,
Hydrology and Earth System Sciences
,
22
(
11
),
6005
6022
.
https://doi.org/10.5194/HESS-22-6005-2018
.
Kratzert
F.
,
Klotz
D.
,
Shalev
G.
,
Klambauer
G.
,
Hochreiter
S.
&
Nearing
G.
(
2019
)
Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets
,
Hydrology and Earth System Sciences
,
23
(
12
),
5089
5110
.
https://doi.org/10.5194/HESS-23-5089-2019
.
Le
X. H.
,
Nguyen
D. H.
,
Jung
S.
,
Yeon
M.
&
Lee
G.
(
2021
)
Comparison of deep learning techniques for river streamflow forecasting
,
IEEE Access
,
9
,
71805
71820
.
https://doi.org/10.1109/ACCESS.2021.3077703
.
LeCun
Y.
,
Bottou
L.
,
Bengio
Y.
&
Haffner
P.
(
1998
)
Gradient-based learning applied to document recognition
,
Proceedings of the IEEE
,
86
(
11
),
2278
2323
.
https://doi.org/10.1109/5.726791
.
LeNail
A.
(
2019
)
NN-SVG: Publication-ready neural network architecture schematics
,
Journal of Open Source Software
,
4
(
33
),
747
.
https://doi.org/10.21105/joss.00747
.
Li
B.
,
Li
R.
,
Sun
T.
,
Gong
A.
,
Tian
F.
,
Khan
M. Y. A.
&
Ni
G.
(
2023
)
Improving LSTM hydrological modeling with spatiotemporal deep learning and multi-task learning: A case study of three mountainous areas on the Tibetan Plateau
,
Journal of Hydrology
,
620
, 129401.
https://doi.org/10.1016/j.jhydrol.2023.129401
.
Lin
M.
,
Chen
Q.
&
Yan
S.
(
2013
). '
Network in Network
',
2nd International Conference on Learning Representations, ICLR 2014 – Conference Track Proceedings
.
Majeske
N.
,
Zhang
X.
,
Sabaj
M.
,
Gong
L.
,
Zhu
C.
&
Azad
A.
(
2022
)
Inductive predictions of hydrologic events using a long short-term memory network and the soil and water assessment tool
,
Environmental Modelling & Software
,
152
,
105400
.
https://doi.org/10.1016/J.ENVSOFT.2022.105400
.
Mohammed
A.
&
Corzo
G.
(
2024
)
Spatiotemporal convolutional long short-term memory for regional streamflow predictions
,
Journal of Environmental Management
,
350
,
119585
.
https://doi.org/10.1016/J.JENVMAN.2023.119585
.
Muñoz-Sabater
J.
,
Dutra
E.
,
Agustí-Panareda
A.
,
Albergel
C.
,
Arduini
G.
,
Balsamo
G.
,
Boussetta
S.
,
Choulga
M.
,
Harrigan
S.
,
Hersbach
H.
,
Martens
B.
,
Miralles
D. G.
,
Piles
M.
,
Rodríguez-Fernández
N. J.
,
Zsoter
E.
,
Buontempo
C.
&
Thépaut
J. N.
(
2021
)
ERA5-Land: A state-of-the-art global reanalysis dataset for land applications
,
Earth System Science Data
,
13
(
9
),
4349
4383
.
https://doi.org/10.5194/ESSD-13-4349-2021
.
Ng
K. W.
,
Huang
Y. F.
,
Koo
C. H.
,
Chong
K. L.
,
El-Shafie
A.
&
Ahmed
A. N.
(
2023
)
A review of hybrid deep learning applications for streamflow forecasting
,
Journal of Hydrology
,
625
,
130141
.
https://doi.org/10.1016/j.jhydrol.2023.130141
.
Pokharel
S.
&
Ghimire
P.
(
2023
)
Data-driven ML models for accurate prediction of energy consumption in a low-energy house: A comparative study of XGBoost, Random Forest, Decision Tree, and Support Vector Machine
,
Journal of Innovations in Engineering Education
,
6
(
1
),
12
20
.
https://doi.org/10.3126/jiee.v6i1.54965
.
Pokharel
S.
,
Roy
T.
&
Admiraal
D.
(
2023
)
Effects of mass balance, energy balance, and storage-discharge constraints on LSTM for streamflow prediction
,
Environmental Modelling and Software
,
166
,
105730
.
https://doi.org/10.1016/j.envsoft.2023.105730
.
Shu
X.
,
Ding
W.
,
Peng
Y.
,
Wang
Z.
,
Wu
J.
&
Li
M.
(
2021
)
Monthly streamflow forecasting using convolutional neural network
,
Water Resources Management
,
35
(
15
),
5089
5104
.
https://doi.org/10.1007/S11269-021-02961-W/FIGURES/5
.
Sung
K.
,
Jeong
H.
,
Sangwan
N.
&
Yu
D. J.
(
2018
)
Effects of flood control strategies on flood resilience under sociohydrological disturbances
,
Water Resources Research
,
54
(
4
),
2661
2680
.
https://doi.org/10.1002/2017WR021440
.
Szegedy
C.
,
Liu
W.
,
Jia
Y.
,
Sermanet
P.
,
Reed
S.
,
Anguelov
D.
,
Erhan
D.
,
Vanhoucke
V.
&
Rabinovich
A.
(
2014
). '
Going deeper with convolutions
’,
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
,
07–12 June-2015
, pp.
1
9
.
https://doi.org/10.1109/CVPR.2015.7298594
.
Szegedy
C.
,
Vanhoucke
V.
,
Ioffe
S.
,
Shlens
J.
&
Wojna
Z.
(
2015
). '
Rethinking the inception architecture for computer vision
',
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
.
2016 December
, pp.
2818
2826
.
https://doi.org/10.1109/CVPR.2016.308
.
Vatanchi
S. M.
,
Etemadfard
H.
,
Maghrebi
M. F.
&
Shad
R.
(
2023
)
A comparative study on forecasting of long-term daily streamflow using ANN, ANFIS, BiLSTM and CNN-GRU-LSTM
,
Water Resources Management
,
37
(
12
),
4769
4785
.
https://doi.org/10.1007/S11269-023-03579-W/TABLES/4
.
Xiang
Z.
,
Yan
J.
&
Demir
I.
(
2020
)
A rainfall-runoff model with LSTM-based sequence-to-sequence learning
,
Water Resources Research
,
56
(
1
),
e2019WR025326
.
https://doi.org/10.1029/2019WR025326
.
Yamashita
R.
,
Nishio
M.
,
Do
R. K. G.
&
Togashi
K.
(
2018
)
Convolutional neural networks: An overview and application in radiology
,
Insights into Imaging
,
9
(
4
),
611
629
.
https://doi.org/10.1007/S13244-018-0639-9/FIGURES/15
.
Yan
L.
,
Feng
J.
&
Hang
T.
(
2019
)
Small watershed stream-flow forecasting based on LSTM
,
Advances in Intelligent Systems and Computing
,
935
,
1006
1014
.
https://doi.org/10.1007/978-3-030-19063-7_79
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).