Abstract
The computational prediction of wave propagation in dam-break floods is a long-standing problem in hydrodynamics and hydrology. We show that a reservoir computing echo state network (RC-ESN) that is well-trained on a minimal amount of data can accurately predict the long-term dynamic behavior of a one-dimensional dam-break flood. We solve the de Saint-Venant equations for a one-dimensional dam-break flood scenario using the Lax–Wendroff numerical scheme and train the RC-ESN model. The results demonstrate that the RC-ESN model has good prediction ability, as it predicts wave propagation behavior 286 time-steps ahead with a root mean square error smaller than 0.01, outperforming the conventional long short-term memory (LSTM) model, which only predicts 81 time-steps ahead. We also provide a sensitivity analysis of prediction accuracy for RC-ESN's key parameters such as training set size, reservoir size, and spectral radius. Results indicate that the RC-ESN is less dependent on training set size, with a medium reservoir size of 1,200–2,600 sufficient. We confirm that the spectral radius has a complex influence on the prediction accuracy and currently recommend a smaller spectral radius. Even when the initial flow depth of the dam break is changed, the prediction horizon of RC-ESN remains greater than that of LSTM.
HIGHLIGHTS
A machine learning model for predicting wave propagation in dam-break floods is presented.
The proposed RC-ESN model well predicts wave propagation 286 time-steps ahead.
The prediction ability of the RC-ESN model significantly outperforms the LSTM model.
The model is not sensitive to the training dataset size but is influenced by the spectral radius.
INTRODUCTION
With the incidence of recurring heavy rainfall, snowmelt, and other extreme abnormal weather around the world in recent years, the resulting floods have become increasingly widespread (Wu et al. 2014). Dam-break floods have become a very important topic for engineers due to their sudden occurrence, fast expansion, and need for immediate response. In the dynamics of dam-break floods, wave propagation is responsible for catastrophic consequences such as the loss of life and properties in downstream areas (Schubert & Sanders 2012). Therefore, the computational prediction of wave propagation in dam-break floods has long been an issue in hydrodynamics and hydrological engineering practice.
The propagation of water waves in dam-break floods is a dynamic process in which the spatial–temporal variance is usually more essential than the spatial pattern that eventually formed (Li et al. 2017). Conventionally, real experience and knowledge, reproduction of historical events, and experiment tests, as well as physical and computer models, can be used to tackle the wave propagation problem in dam-break floods (Aureli et al. 2021). A dam-break wave is a flow resulting from a sudden release of a mass of water in a channel. The two-dimensional, depth-averaged Navier–Stokes (N-S) equations based on the shallow water assumption could well characterize wave propagation in a dam-break flood in most instances. For one-dimensional applications, the continuity and momentum conservation in the wave propagation process generates the de Saint-Venant (S-V) equations (Barré de Saint-Venant 1871), which are strongly related to the N-S equations.
The governing laws describing wave propagation in the dam-break flood are expressed in the forms of hyperbolic partial differential equations (PDEs) in either the one-dimensional S-V equations or the two-dimensional simplified N-S equations, requiring an adequate numerical method to solve these PDEs to obtain efficient and accurate results. Up-to-date, the finite differential method (FDM) (Gąsiorowski & Szymkiewicz 2022; Qi et al. 2022; Mo et al. 2023), finite element method (FEM) (Anisha et al. 2023), and finite volume method (FVM) (Hariri et al. 2022) have found a large fan community (Seyedashraf & Akhtari 2017). Remarkable studies refer to, e.g., the Lax–Wendroff scheme (Lax & Wendroff 1960) and the leapfrog scheme (Fauzi & Memi Mayasari 2021). Thus far, the traditional PDE-based numerical models have been the main approaches for describing the wave propagation in dam-break floods, with good agreement between the theoretical solution and flume experimental observations. Nevertheless, in order to achieve high accuracy, these modeling techniques necessitate large datasets and frequently necessitate significant tweaking and re-tuning, even with little changes in parameters. To address this issue, some researchers (Seyedashraf et al. 2018) are starting to try alternative approaches, rather than solving a problem directly, by offering a desirable balance between accuracy and computational cost.
Recently, artificial intelligence-driven models based on various deep neural networks have had a growing impact on assisting research, such as spatial and temporal forecasting of physical processes through PDE solutions. Due to their superior nonlinear approximation ability, deep learning neural networks have demonstrated obvious advantages for enhancing the simulation and prediction of nonlinear dynamics systems such as fluid, climate, and dynamical system control (Kutz 2017; Gentine et al. 2018; Duraisamy et al. 2019; Peters 2019; San et al. 2022; Sharma et al. 2023). One appeal of the machine learning approach is its capacity to accelerate and enhance the prediction of complicated dynamic systems (Chattopadhyay et al. 2020; Drikakis & Sofos 2023). Deep learning neural networks have been proposed for spatio-temporal forecasting and are increasingly employed to assist in modeling chaotic (Pathak et al. 2018) and turbulent systems (Ling et al. 2016), with promising results (McDermott & Wikle 2017, 2019; Vlachas et al. 2018; Raissi et al. 2019). The deep learning method will also be employed in this study to make a preliminary attempt at one-dimensional dam-break wave propagation and predict the flow depth directly. Given that this is a preliminary exploration stage, the comparatively basic one-dimensional dam-break problem is studied here, and its trustworthy numerical solution can provide data for this research while reducing the influence of other factors.
Solutions of PDEs for wave propagation take the form of time-series and spatio-temporal data, which include velocities and flow depths. In this regard, the long short-term memory (LSTM) model, which is most suitable for the prediction of sequence data, has attracted a lot of attention in recent years (Goodfellow et al. 2016). Srivastava et al. (2015) employed a convolution encoder–decoder architecture with an LSTM module to propagate the latent space into the future. Sorteberg et al. (2020) used LSTM to construct a wave propagation prediction neural network that could reasonably predict up to 80 time-steps into the future. The LSTM is the most prominent study in overcoming the difficulties that recurrent neural networks (RNNs) have in learning long-term dependence on data as well as gradient disappearance and explosion concerns (Bengio et al. 1993; Pascanu et al. 2013; Bynagari 2020). A notable study on dam-break floods related to Fotiadis et al. (2020) employed LSTM as a baseline for predicting surface wave propagation under two-dimensional images. In this paper, we also use LSTM to compare with the method. In contrast to images, we will directly employ flow depth data for training.
The echo state network (ESN) (Jaeger 2001) is primarily used in this paper to predict the flow depth of a one-dimensional dam-break wave. The ESN, along with the liquid state machines (LSMs) (Maass et al. 2002), is referred to as reservoir computing (RC) (Verstraeten et al. 2007). The reservoir computing echo state network (RC-ESN) employs a single training procedure rather than a large number of repetitions as in the back-propagation through time (BPTT) algorithm (Werbos 1990). Notably, the reservoir network structure of RC-ESN is connected by a large number of neurons, and the weights of the reservoir connection matrix need to be initialized in advance, enabling the RC-ESN more stable than other neural networks, as substantiated by many prior studies (Skowronski & Harris 2007; Tong et al. 2007; Lin et al. 2009; Li et al. 2012). This can reduce the computational amount of training and avoid the local optimal situation in the optimization algorithm of gradient descent to a certain extent.
Our goal is to use a data-driven model based on the deep learning neural network to predict the spatio-temporal propagation of water waves in a dam-break flood. This research makes three major contributions. Firstly, we propose employing RC-ESN to predict dam-break waves directly based on numerical data, as opposed to using images to predict. Second, we compare RC-ESN to LSTM and discover that RC-ESN enhances long-term forecast accuracy. Finally, we analyze the influence of RC-ESN model parameters and the prediction effect under various initial conditions of dam break.
METHODS
In this paper, the dam-break wave is expressed in the form of the overall flow depth value at a fixed position. The RC-ESN model is used to learn the relationship between the dam-break waves at before and after moments, and this is used to continuously predict the dam-break wave. The LSTM model is used to learn the relationship between the dam-break waves at multiple moments and the next moment and the flow depth of the time series is predicted.
PDEs and the numerical solutions for dam-break flood
The dam-break flood is simulated in a 20 m long flume in this one-dimensional case, as shown in Figure 1(a). The flume's bottom is horizontal with a layer of 0.6 m thick water covering it initially in the downstream direction. A 1.8 m high dam exists upstream of the flume, filling water on the upstream side. The dam is suddenly removed at the start of the simulation, and the water in the reservoir is released onto the wet bed, generating a dam-break flood downstream.
To improve the simulation result, we set a relatively small time increment , which resulted in hundreds of dam-break flood simulations with 100,000 individual time-steps. Simulation results containing of each time step are well recorded (as illustrated in Figure 1(b) and 1(c)). All of the simulated results have been uploaded to the GitHub repository for free access along with this paper.
We sample the flow depth at each time step to generate the training and test datasets from the above numerical solutions. The dimension of flow depth is 200, as are the dimensions of the network's input and output layers. If the flood dam break is a two-dimensional or three-dimensional scene, the flow depth data need to be integrated into a one-dimensional array according to the method in this paper, which will lead to a sharp increase in the amount of input of these models and affect the effect of these models. We construct a training set consisting of sequence samples and a test set of the next 500 sequence samples from to . In our study, we randomly selected 28 sets of such training/test data, each with a length of . Different datasets generate different time periods () for training the machine learning model. Therefore, 28 datasets are trained and tested, namely , , …, and , as shown in Figure 1(d) and Table 1.
TP . | Training set () . | Test set . | RC-ESN . | LSTM . |
---|---|---|---|---|
Sensitivity analysis | ||||
TP . | Training set () . | Test set . | RC-ESN . | LSTM . |
---|---|---|---|---|
Sensitivity analysis | ||||
RC-ESN model
The reservoir has a size of K and is described by a weighted adjacency matrix , whose largest absolute eigenvalue is the network spectral radius (Jaeger 2001). For a given value of , we choose the values of all the elements of randomly from a uniform distribution and rescale all the values so that its largest eigenvalue is , ensuring that the reservoir system meets the necessary condition of stability, namely echo state property (Jaeger 2001). The reservoir size and the spectral radius are used in this study to achieve the best prediction performance. Sensitivity analysis of the prediction accuracy concerning K and refers to the following discussion section and Table 1.
LSTM network
To further support the long-term prediction performance of the proposed RC-ESN model, the LSTM network, commonly used in many up-to-date studies (Alizadeh et al. 2021; Hayder et al. 2022), is selected as a comparison.
Evaluation method for prediction performance
Here, the temporal average of flow depths at each grid is denoted by . The score ranges from −1.0 to 1.0. If the prediction is accurate, the score equals 1.0.
RESULTS
Considering the different frequencies of flow depth change during the dam-break flood development process, the prediction effect will be influenced by the flow depth at different moments. Therefore, as shown in Figure 1(d), we randomly selected 28 training/test sets from different moments and set the training set size to . Sensitivity analysis of the prediction accuracy concerning N refers to the following discussion section.
The RMSE gradually increases as the prediction time step increases. To display the prediction effect more intuitively, we refer to the prediction time step when RMSE reaches 0.01 as the prediction horizon, implying that the prediction error is acceptable within the prediction horizon. Figure 4 shows the RMSE values of the prediction and numerical solution as the prediction time step increases under two special TPs, namely the longest and shortest prediction horizons of RC-ESN and LSTM, respectively. Figure 4(a) shows that when the time period is , the prediction horizons of the RC-ESN and LSTM methods are the shortest, with the prediction horizon of RC-ESN being 49, much better than the LSTM model. When the time period is , the RC-ESN method has the longest prediction horizon of 286, while the LSTM method has the longest prediction horizon of 81 when the time period is . In addition, Figure 4(b) compares RMSE for all TPs at the corresponding time for the above several prediction horizons. It can be seen that different TPs have a certain influence on the prediction effect, and the longer the prediction time step, the greater the influence.
Figure 5 presents the numerical solution and prediction flow depth of the RC-ESN and LSTM methods for the TPs and prediction horizons mentioned in Figure 4. The figure shows the prediction effect of both models and the flow depth of dam-break waves at two instantaneous time steps, and , during the time period , when the prediction horizon of both models is the shortest. In this case, it is demonstrated that, despite having the worst prediction effect during such TPs, the RC-ESN model can predict wave propagation behavior because the predicted surface wave well reproduces the numerical solution. The figure also shows the prediction results under the time period , during which the RC-ESN performs best, as well as the flow depth of numerical solution and both models at . The figures demonstrate that the RC-ESN prediction result is close to the numerical solution at , whereas the LSTM prediction result obviously deviates. Although the local error fluctuation is observed in the RC-ESN's result at , the overall RMSE of the RC-ESN model is lower owing to that the dam-break wave is predicted to be stagnant in the LSTM's results during the time period , with which the LSTM achieves the best performance. It should be noted that the error between the numerical solution and the prediction of both models gradually grows as the number of predicted steps increases.
As a conclusion of the result analysis, the RMSE of the RC-ESN reaches 0.01 later than that of the LSTM regardless of the influence of the TPs, demonstrating that the RC-ESN model has a stronger ability for prediction than the conventional LSTM model.
DISCUSSION
Comparison of the computational complexity of the RC-ESN and LSTM
Here, we compare the computational complexity of the RC-ESN and LSTM using multiply-accumulate operations (MACs). A MAC contains a multiplication operation and an addition operation, totaling roughly twice the number of floating-point operations (FLOPs). The MACs and the total params of the RC-ESN and LSTM are shown in Table 2 for , , , and . It can be seen that RC-ESN is smaller than LSTM in both the params and MACs, especially the MACs of RC-ESN are three orders of magnitude smaller.
Model . | Params . | MACs . |
---|---|---|
RC-ESN | 0.28M | 2.52M |
LSTM | 8.49M | 6.68G |
Model . | Params . | MACs . |
---|---|---|
RC-ESN | 0.28M | 2.52M |
LSTM | 8.49M | 6.68G |
Sensitivity analysis on the training set size
Figure 6(b) demonstrates the variation of the prediction horizon against the training set size N. The prediction horizon of the RC-ESN and the LSTM both arise with the growth of training set size N. The RC-ESN model clearly exhibits a positive effect of training set size N on the prediction horizon. The prediction horizon expands to 200 time-steps ahead when the training set size is increased to . However, in the case of the LSTM model, this effect is not so obvious, with several exceptional uncertainties existing when the training set size is beyond . In general, when compared to the LSTM, the prediction ability and the accuracy of RC-ESN are less dependent on the training set size, which is a significant advantage when the dataset available for training is limited, as in the case of predicting wave propagation in dam-break floods.
Sensitivity analysis on the reservoir size K and spectral radius
The above analysis supports that compared to the conventional LSTM model, the proposed RC-ESN model has witnessed a better performance for predicting wave propagation behavior of the one-dimensional dam-break flood. In this section, we focus on the other major concern of deep learning, i.e., the requirement for large reservoirs, which can be computationally taxing.
Dam-break wave prediction under different initial flow depths
CONCLUSION
In this paper, we show that a data-driven RC-ESN model, well-trained with numerical solutions of de Saint-Venant equations, can predict the long-term dynamic behavior of a one-dimensional dam-break flood with satisfactory accuracy. When compared to the numerical solution, the proposed RC-ESN model achieves the best performance of ahead predicting wave propagation 286 time-steps among all 28 TPs, with RMSE smaller than 0.01. It outperforms the conventional LSTM model, which reaches a comparable RMSE only 81 time-steps ahead.
According to the sensitivity analysis, the proposed RC-ESN model is less dependent on the training set size than the conventional LSTM model, which is a significant advantage when the dataset is available for training. Although increasing reservoir size K is theoretically positive for improving prediction performance, we find that a medium reservoir size K = 1,200∼2,600 is sufficient when considering the overall balance of computational efficiency and prediction performance. However, it is confirmed that the spectral radius has a complex influence on the prediction performance of the RC-ESN model. We recommend using or a smaller spectral radius for better prediction performance in this one-dimensional dam-break flood scenario. In validating the preceding conclusion, we change the initial flow depth of the dam-break problem and find that the prediction horizon of RC-ESN is larger than that of LSTM in different TPs.
Our findings support the favorable role of the data-driven and RC-ESN model in predicting wave propagation in a one-dimensional dam-break flood. When the change of dam-break waves in a certain period of time is known, this work can quickly predict the evolution of dam-break waves in the following time, which may aid the rapid disaster prediction of dam-break floods. The study in this paper is limited to the one-dimensional ideal dam-break scenario. At the same time, the proposed RC-ESN prediction method has a poor prediction effect over 300 time-steps, and a certain amount of data is still needed for training before the prediction. However, for a two-dimensional or even three-dimensional scenario, which is closer to the real engineering problem, more training data, such as the wave propagation velocities along with different directions, should be included, increasing the difficulty of training a high-performance neural network. This aspect obviously needs more research and should continue to be investigated in order to achieve rapid prediction of dam-break waves in actual scenarios. In the follow-up study, we will consider the flow depth and flow velocity of the dam-break flood in two or three dimensions. Methods such as convolutional neural networks are used to encode and decode the data, and the RC-ESN model is combined for prediction.
ACKNOWLEDGEMENTS
This study was financially supported by the National Natural Science Foundation of China (Grant No. 52078493); the Natural Science Foundation of Hunan Province (Grant No. 2022JJ30700); the Natural Science Foundation for Excellent Young Scholars of Hunan (Grant No. 2021JJ20057); the Innovation Provincial Program of Hunan Province (Grant No. 2020RC3002); the Science and Technology Plan Project of Changsha (Grant No. kq2206014); the Innovation Driven Program of Central South University (Grant No. 2023CXQD033); and the Fundamental Research Funds for the Central Universities of Central South University (Grant No. 2022ZZTS0660). These financial supports are gratefully acknowledged. We also extend our gratitude to associate editor Gwo-Fong Lin and two reviewers for their insightful comments.
DATA AVAILABILITY STATEMENT
All relevant data are available from an online repository or repositories. The data that support the findings of this study are available in the GitHub repository [lcl1527/dam-break-ESN-LSTM], at HYPERLINK "https://github.com/lcl1527/dam-break-ESN-LSTM" lcl1527/dam-break-ESN-LSTM (github.com). The numerical simulation is performed using MATLAB2019. RC-ESN and LSTM training and prediction are performed using PyTorch. The source codes used in this work are freely available online in the GitHub repository: HYPERLINK "https://github.com/lcl1527/dam-break-ESN-LSTM" lcl1527/dam-break-ESN-LSTM (github.com).
CONFLICT OF INTEREST
The authors declare there is no conflict.