Discrepancies between estimated sediment and actual yields are large. The sedimentation often causes severe damage due to a lack of on-site measurement data in the current disaster prevention system. This paper presents a robust early-warning system in which the statistical analysis used to predict sedimentation is conducted within the context of the underlying physical mechanisms. This three-phase early-warning system employs data collection, data generation, and AI (Artificial Intelligence) prediction. Data collection involves the use of HEC-HMS (Hydrologic Engineering Center – Hydrologic Modeling System) to transform measured precipitation data into flow discharge from various sub-catchments. Empirical formulas related to landslide volume and soil erosion are then used to establish suitable boundary conditions for data generation. Finally, a 2D model, SRH-2D (Sediment and River Hydraulic-Two Dimension model), is used to simulate data pertaining to temporal variations in sediment flux under various storm event scenarios. The simulated data are then used as input for the training and testing three artificial neural networks. The primary strength of this study is to improve the prediction ability in large-scale sedimentation issues in mountain rivers. These models returned good prediction results with 1–6 h lead times by interdisciplinary integration with the hydraulic and statistical fields.

  • The sediment flux in the mountain basin was successfully predicted with field data and data generated by numerical modeling.

  • The accuracy of sediment flux predictions could be improved by adopting various data sources and greatly improves the results of conventional that rely on a single variable.

  • The sediment flux prediction obtained a reasonable result, and there were no difficulties in predicting underlying trends.

ANN

Artificial neural network

ARIMA

Auto regressive integrated moving average

BPNN

Back-propagation neural networks

CC

Correlation coefficient

CN

Curve number

EnKF

Ensemble Kalman filter

HEC-HMS

Hydrologic Engineering Center – Hydrologic Modeling System

LVMDR

Landslide volume equation modified for the Dahan River basin

MAE

Mean absolute error

ML

Machine learning

RBFNN

Radial basis function neural network

SEEDR

Soil erosion empirical formula modified for the Dahan River basin

SRH-2D

Sediment and river hydraulic-two dimension model

SVM

Support Vector machines

RMSE

Root mean square error

USBR

United States Bureau of Reclamation

USLE

Universal soil loss equation

Watershed sedimentation has been identified as a severe problem worldwide (Fredriksen 1970; Colosimo & Wilcock 2007; Kim et al. 2013; Huang et al. 2018; Siswanto & Francés 2019), resulting in river erosion, excess sediment, reservoir deposition, and a compromised water supply for downstream populations (Nagle et al. 1999; Dang et al. 2010; Huang et al. 2019b). A reasonable early-warning system needs to be developed to face the significant serious sedimentation disaster due to climate change. Sedimentation is generally investigated using empirical analysis, numerical simulations, and/or field measurements (Aksoy & Kavvas 2005; Garcia 2008; Walling 2017), and the collected references are described below.

The empirical methods used to estimate debris volume, sediment discharge, and river deposition are generally tied to historical events (Boomer et al. 2008; Gartner et al. 2014). Numerical simulations are widely used to estimate sediment yield and erosion volume (Morgan et al. 1998; Chou 2010; Alatorre et al. 2012). Field measurements are generally implemented using aerial photos, satellite images, UAVs, and Lidar (Jain & Kothyari 2000; Green & Westbrook 2009; Mackey & Roering 2011; Peter et al. 2014). These methods are well suited to assessing the scale of disasters and formulating mitigation strategies after the fact (Koi et al. 2008; Hsu et al. 2012); however, it is difficult to predict upcoming sedimentation hazards. In this study, we sought to establish a practical early-warning system for watershed sedimentation based on empirical, numerical, and machine learning (ML) methods.

ML methods have been widely applied, for example, Suwarno et al. (2021) used a decision tree to develop a lava flood early-warning system; Chutiman et al. (2022) presented a forecasting model for daily maximum rainfall during tropical cyclones based on grey theory; Ekwueme (2022) built an ARIMA (auto regressive integrated moving average) to forecast the flood intensity under the climate change scenario. Furthermore, fusions of numerical modeling with ML (e.g., artificial neural network (ANN)) have also been mentioned in the past few years. ANN and fuzzy models have also been used to predict long-term flow discharge and water levels. Yu et al. (2004) combined chaos theory and support vector machines (SVM) with big data to improve daily flow predictions. Sannasiraj et al. (2004) combined deterministic and stochastic models to facilitate predictions of daily tidal variation. Karri et al. (2013) used an Ensemble Kalman filter (EnKF)-based method to improve the forecast ability of deterministic models. Loukas & Vasiliades (2014) implemented a hydrological model in conjunction with ANNs to forecast rainfall runoff. Young & Liu (2015) developed a hybrid model based on numerical models and ANNs to train and test flow discharge during typhoon events. Humphrey et al. (2016) integrated a hydrological model output with a Bayesian ANN to predict the monthly flow.

ANNs have also been used to predict daily sediment concentration (Zounemat-Kermani et al. 2016; Malik et al. 2017; Huang et al. 2019a) and sediment yields along rivers (Babovic 2009). Predicting watershed sedimentation should account for precipitation, river discharge, landslide volume, soil erosion, and river sediment load. However, few studies have successfully predicted sediment concentrations using multiple factors.

An accurate forecast of suspended sediment concentration is critical for evacuation strategy and reservoir management. A reasonable prediction outcome of the watershed can strive for more time to formulate the plan. The prediction outcome can be a reference and applied to other field sites. This paper presents a three-stage early-warning system to predict sediment concentrations in mountainous regions. The proposed system comprises a rainfall-runoff model, two sediment estimation models, and three ML methods, SVM, BPNN (back-propagation neural networks), and RBFNN (radial basis function neural network). In the first stage, the HEC-HMS rainfall-runoff model was used to derive inflow discharge boundary conditions based on data pertaining to typhoon events between 2008 and 2016. Empirical formulas were applied to the output of HEC-HMS to estimate inflow sediment boundary conditions. The SRH-2D shallow water model was then used to generate a large volume of simulated data pertaining to sediment concentrations during the aforementioned typhoon events to be used as an input for the three ANNs, SVM, BPNN, and RBFNN in assessing the efficacy of the proposed scheme. Figure 1 presents a flowchart of the proposed three-phase method.
Figure 1

Flowchart of proposed three-phase early-warning system.

Figure 1

Flowchart of proposed three-phase early-warning system.

Close modal

Dahan river basin

In Taiwan, sedimentation disasters are generally initiated by typhoon events, such as the 2004 Typhoon Aere (Chang et al. 2008), the 2008 Typhoon Sinlaku (Lee et al. 2006), and the 2009 Typhoon Morakot (Tsou et al. 2011). The Dahan River basin upstream from the S Reservoir (capacity of 309,120,000 m3) is highly prone to sedimentation and worth studying. The research area shows in Figure 2.
Figure 2

Location of Dahan River basin.

Figure 2

Location of Dahan River basin.

Close modal

This study collected hydrological data on water elevation, flowrate, and sediment discharge within the Dahan River basin from 2008 to 2016, including six typhoon events: Fungwong, Sinlaku, Jangmi, Morakot, Fitow, and Soudelor. Four other typhoon events were selected as test cases. This information is listed in Table 1.

Table 1

Historical data related to the Dahan River basin from 2008 to 2016 typhoon events

YearEventDuration (h)Peak discharge (m3/s)Sediment discharge (ton/s)Landslide volume (m3)Volume of soil eroded (m3)Total (m3)TrainingTesting
2008 Fungwong 103 1,433 18.23 50,362.88 50,190.28 100,553.16  
2008 Sinlaku 136 2,532 71.70 50,816.78 51,523.34 102,340.12  
2008 Jangmi 60 3,292 100.98 57,623.23 63,623.83 121,247.06  
2009 Morakot 143 2,280 61.60 112,189.91 97,504.13 209,694.04  
2013 Soulik 73 5,457 444.40 105,562.94 94,855.69 200,418.63  
2013 Fitow 89 1,736 144.0 108,599.56 55,447.39 164,046.95  
2013 Trami 87 1,137 39.00 116,816.57 71,017.99 187,834.56  
2015 Soudelor 66 5,634 103.03 108,594.59 96,090.21 204,684.80  
2015 Dujuan 60 3,802 60.23 79,551.57 81,986.90 161,538.47  
2016 Megi 41 4,267 99.10 98,433.89 91,779.24 190,213.13  
YearEventDuration (h)Peak discharge (m3/s)Sediment discharge (ton/s)Landslide volume (m3)Volume of soil eroded (m3)Total (m3)TrainingTesting
2008 Fungwong 103 1,433 18.23 50,362.88 50,190.28 100,553.16  
2008 Sinlaku 136 2,532 71.70 50,816.78 51,523.34 102,340.12  
2008 Jangmi 60 3,292 100.98 57,623.23 63,623.83 121,247.06  
2009 Morakot 143 2,280 61.60 112,189.91 97,504.13 209,694.04  
2013 Soulik 73 5,457 444.40 105,562.94 94,855.69 200,418.63  
2013 Fitow 89 1,736 144.0 108,599.56 55,447.39 164,046.95  
2013 Trami 87 1,137 39.00 116,816.57 71,017.99 187,834.56  
2015 Soudelor 66 5,634 103.03 108,594.59 96,090.21 204,684.80  
2015 Dujuan 60 3,802 60.23 79,551.57 81,986.90 161,538.47  
2016 Megi 41 4,267 99.10 98,433.89 91,779.24 190,213.13  

Boundary conditions

Figure 3 shows the proposed three-phase sediment flow prediction scheme. The first stage of the proposed three-phase early-warning system was to collect hydrological data and applied HEC-HMS to produce runoff volume from seven precipitation stations and five gauge stations. Then we collected geomorphological data, including bed elevation, material, and roughness to obtain the sediment yield estimation including the landslide and soil erosion volume surrounding the watershed. The runoff data and sediment yield amount could be the boundary condition of the SRH-2D to simulate the sediment concentration to be the training and testing input of the ML prediction. Note that the proposed scheme combines a runoff model and hydraulic sediment model with empirical data pertaining to landslide volume and soil erosion to enable the use of ML methods to generate reasonable predictions related to sediment concentrations in the watershed. Note that the proposed scheme combines a runoff model and hydraulic sediment model with empirical data pertaining to landslide volume and soil erosion to enable the use of ML methods to generate reasonable predictions related to sediment concentrations in the watershed.
Figure 3

Proposed three-phase sediment flow prediction scheme for Dahan River basin.

Figure 3

Proposed three-phase sediment flow prediction scheme for Dahan River basin.

Close modal

HEC-HMS

HEC-HMS is a numerical hydrologic model developed by the US Army Corps of Engineers. HEC-HMS has been applied to research on flooding and sedimentation in several regions includes Taiwan watershed (Hussain et al. 2021; Chiang et al. 2022; González et al. 2022). The control module for HEC-HMS can be divided into a basin model, a meteorologic model, and a control specification. Essentially, the model is used to transform rainfall data into runoff information. The study used HEC-HMS to obtain inflow boundary discharge during typhoons and calculate inflow sediment volumes under various return-period cases. Figure 4 is the application domain and sub-catchments of HEC-HMS. It illustrates the application domain, nine sub-catchment locations, and the areas of the Dahan River basin to which HEC-HMS was applied. Table 2 lists the calibrated parameter values, including slope, river length, curve number (CN), S, lag time, and in each sub-catchment.
Table 2

Model and parameter settings for HEC-HMS

CatchmentSlope (%)River length (m)CN (−)S (mm)Lag time (h)Ia (mm)
27.85 27,755.70 62 6.13 3.66 31.13 
37.00 23,406.81 45 12.00 4.27 62.00 
35.9 50,733.66 46 11.80 7.76 59.60 
37.50 39,949.53 35 18.60 8.55 94.30 
33.94 10,819.18 50 10.00 2.11 50.80 
37.03 8,136.94 45 12.20 1.83 62.00 
37.64 10,322.36 40 15.00 2.50 76.20 
35.32 13,858.34 48 10.80 2.65 55.00 
36.28 7,391.10 40 15.00 1.95 76.20 
CatchmentSlope (%)River length (m)CN (−)S (mm)Lag time (h)Ia (mm)
27.85 27,755.70 62 6.13 3.66 31.13 
37.00 23,406.81 45 12.00 4.27 62.00 
35.9 50,733.66 46 11.80 7.76 59.60 
37.50 39,949.53 35 18.60 8.55 94.30 
33.94 10,819.18 50 10.00 2.11 50.80 
37.03 8,136.94 45 12.20 1.83 62.00 
37.64 10,322.36 40 15.00 2.50 76.20 
35.32 13,858.34 48 10.80 2.65 55.00 
36.28 7,391.10 40 15.00 1.95 76.20 
Figure 4

(a) Application domain in Dahan River basin; (b) sub-catchments in Dahan River basin.

Figure 4

(a) Application domain in Dahan River basin; (b) sub-catchments in Dahan River basin.

Close modal

Sediment yield estimation

The watershed sediment yield was defined as the sum of the landslide volume and soil erosion in this study, and this information was treated as an inflow boundary condition for SRH-2D. The landslide volume equation modified for the Dahan River basin (LVMDR) and soil erosion empirical formula modified for the Dahan River basin (SEEDR) were used to estimate the watershed sediment yield as shown in Equations (1)–(3). The LVMDR was based on Equation (1) and implemented with Equation (2) to estimate the landslide volume under different precipitation scenarios.

It was also necessary to calculate soil erosion as another sediment source for deriving boundary conditions for the numerical model. The universal soil loss equation (USLE) is widely used to estimate erosion volume; however, it is necessary to determine the parameters for specific field sites, which can have a profound effect on estimation accuracy. The SEEDR was developed based on the above relationship over the last three decades, as shown in Equation (3), and erosion as a function of rainfall in the Dahan River basin shown in Figure 5 based on the measured data.
Figure 5

Erosion as a function of rainfall in the Dahan River basin.

Figure 5

Erosion as a function of rainfall in the Dahan River basin.

Close modal
Finally, HEC-HMS results were substituted into LVMDR and SEEDR to calculate the landslide volume and the extent of soil erosion during typhoons as well as the watershed sediment yield (see Table 1). As an example, consider the use of estimated inflow discharge from Section 3.1 and sediment volume from Section 3.2 as inflow boundary conditions for SRH-2D, as follows (Leung et al. 2012):
(1)
(2)
where Y refers to landslide percentage (%); A refers to the area of each watershed; a indicates the landslide area; C and K are constants and each value of the corresponding sub-watershed is 0.00154, 0.00053; 0.00324, 0.00078; 0.00071, 030034, respectively (Sinotech Engineering Consultants Inc. 2010); refers to cumulative precipitation; refers to critical precipitation; h refers to soil average depth of each sub-catchment; refers to slope of the landslide location.
(3)
where H indicates the erosion depth (mm). The number of collected data was 803 (Lai & Lin 2019).

SRH-2D

Technically speaking, the sedimentation issue in mountain region rivers needs to consider the x- and y-direction flow mechanism and the z-direction often be ignored due to the shallow water condition. Therefore, the 2-D shallow water model is a better choice because it can consider both computational efficiency and simulation accuracy. SRH-2D is a 2-D hydraulic model developed by the USBR (United States Bureau of Reclamation) for use in estimating erosion and deposition patterns along rivers during flood events (Meyer-Peter & Müller 1948; Engelund & Hansen 1967; Parker 1990; Lai & Greimann 2010). SRH-2D was widely used in Taiwan rivers, especially in Dahan River. In addition, the sensitive parameters were calibrated and validated by using field data. Hence, this study chooses SRH-2D to be the numerical tool to investigate the sediment transport topic (Lai & Wu 2013; Wu et al. 2016; Chiu et al. 2019; Ho et al. 2021). The grid and bed elevation of the simulation region are shown in Figure 6. The grid included 7,764 triangular cells to enhance fitting with the irregular riverbank and bed shape (see Figure 6(a)). Inflow and outflow boundary conditions were estimated for Yufeng, Lengjiao, and Xiayun. The four other sub-inflow boundaries are shown in Figure 6(b). We also calibrated several sensitive parameters to provide a relative optimal solution, such as time-step (1 s), Manning's n (0.025–0.035), active thickness (0.2 m), and adaptation length (50–250 m).
Figure 6

Simulation region: (a) grid; (b) bed elevation.

Figure 6

Simulation region: (a) grid; (b) bed elevation.

Close modal

Machine learning

This study employed three machine learning methods (SVM, BPNN, and RBFNN) for the third phase. SVM is a supervised learning approach developed by Vapnik (1999) for statistical analysis. The structure of SVM is transformed into a quadratic planning issue to minimize computing time, and structured risk minimization is used to maintain accuracy. SVM has proven highly effective in dealing with classification and prediction problems. BPNNs are multilayer feed-forward ANNs using one input and output layer with one or more hidden layers. In the current study, we employed the gradient descent method to solve the energy function and thereby minimize error. RBFNN is another ML method with a simple topological structure used to solve nonlinear system identification.

The prediction performance of ANNs depends on a careful training and testing process. ANNs have been used to predict variations in rainfall-runoff, runoff-discharge, and spatial and temporal sediment concentrations (Afan et al. 2015; Knighton et al. 2019; Pakdaman et al. 2020; Jhong et al. 2021). In the proposed early-warning system, these algorithms are applied to historical data related to precipitation and simulation data related to sediment flow.

Root mean square error (RMSE), mean absolute error (MAE), and correlation coefficient (CC) were used to assess prediction accuracy. RMSE provides high sensitivity during peak times and CC is well suited to identifying linear dependence between predictions and observations (Chadalawada & Babovic 2017). Note that RMSE and MAE values are inversely proportional to prediction performance, wherein perfect prediction would generate a CC value of 1.0. The forms of the evaluation indexes (RMSE, MAE, and CC) are outlined below:
(4)
(5)
(6)
where is the observed sediment flux; is the predicted sediment flux; is the average observed sediment flux; and is the average predicted sediment flux.

HEC-HMS and SRH-2D

The relationship between rainfall and discharge is a critical factor in estimating river sedimentation due to the reliance of sediment threshold velocity on the accuracy of the discharge value. In other words, the reliability of sediment simulations depends on the accurate river discharge estimation. Hence, the discharge and sediment concentration simulation outcomes need to be confirmed by comparing it with the measured data.

Figure 7 is the comparison of simulation and measurement based on HEC-HMS and SRH-2D modeling. Figure 7(a)–7(d) shows the discharge associated with typhoons Fitow (discharge calibration) and Trami (discharge validation) at Yufeng and Xiayun monitoring stations based on HEC-HMS modeling. In the calibration cases, the simulation results reflected hydrological phenomena over a wide range of discharge volumes, as shown in Figure 7(a) and 7(b). The differences between the measured peak discharge and simulated values were 5.7% at Yufeng station and 2.1% at Xiayun station. Note that the simulations presented a consistent trend between peak and low discharge. The validation cases illustrate the effectiveness of HEC-HMS (see Figure 7(c) and 7(d)). The difference between the measured peak discharge and simulated values was 6.8% at Yufeng station and 5.0% at Xiayun station. These simulation results demonstrate the effectiveness of this modeling scheme in describing variations in discharge in the Dahan River basin.
Figure 7

Comparison of simulation and measurement based on HEC-HMS and SRH-2D.

Figure 7

Comparison of simulation and measurement based on HEC-HMS and SRH-2D.

Close modal

Sediment transport is affected by inflow discharge and sediment volume. The estimation approach of the inflow boundary condition was confirmed in Sections 3.1 and 3.2. Figure 7(e) and 7(f) presents a comparison of measured and simulated sediment concentration values associated with typhoons Fitow (sediment calibration) and Trami (sediment validation) at Xiayun station based on SRH-2D modeling. We can see that SRH-2D effectively described sedimentation as long as accurate inflow discharge data were available. The trends in peak concentration were very close during the two typhoons with an absolute difference of less than 20%. These results demonstrate the efficacy of these simulations in capturing temporal variations in sediment flow.

Optimizing data input

We used input data pertaining to two scenarios (I and II) to assess the effectiveness of the ML networks in predicting sediment concentrations (see Table 3). In Scenario I, the only input was time-series discharge (Q) data. In Scenario II, the input data included time-series Q and sediment term (S). The two scenarios were addressed using four parameter combinations: (1) 1 h before the current time, (2) 1 h before and at the current time, (3) 1 and 2 h before, and (4) 1 and 2 h before and at the current time. Table 3 lists the optimal input for the proposed prediction models; the results of optimizing data input shows in Figure 8.
Table 3

Input selection and the calibration values of three models

Case Scenario1234
Qt−1 Qt−1, Qt Qt−1, Qt−2 Qt−2, Qt−1, Qt      
II Qt−1, St−1 Qt−2, Qt−1, St−1 Qt−2, St−2 Qt−2, Qt−1, St−1, St−2      
BPNN
RBFNN
SVM
IRMSEMAECCRMSEMAECCRMSEMAECC
10.57 5.80 0.97 11.00 5.52 0.97 17.40 11.80 0.94 
9.80 5.12 0.98 9.80 5.57 0.98 17.57 11.57 0.94 
11.54 5.45 0.96 12.80 6.80 0.94 20.25 12.29 0.92 
9.80 5.10 0.98 9.50 5.40 0.98 16.00 11.46 0.94 
Average 10.43 5.37 0.97 10.78 5.82 0.97 17.81 11.78 0.94 
BPNN
RBFNN
SVM
IIRMSEMAECCRMSEMAECCRMSEMAECC
9.00 4.00 0.98 7.96 2.90 0.99 8.47 2.88 0.99 
9.40 4.80 0.98 9.86 3.80 0.98 12.83 4.49 0.96 
10.80 5.30 0.97 11.00 6.00 0.95 15.00 7.38 0.93 
9.50 5.00 0.98 9.00 3.90 0.98 12.73 4.45 0.97 
Average 9.68 5.03 0.98 9.46 4.15 0.98 12.26 4.80 0.96 
Case Scenario1234
Qt−1 Qt−1, Qt Qt−1, Qt−2 Qt−2, Qt−1, Qt      
II Qt−1, St−1 Qt−2, Qt−1, St−1 Qt−2, St−2 Qt−2, Qt−1, St−1, St−2      
BPNN
RBFNN
SVM
IRMSEMAECCRMSEMAECCRMSEMAECC
10.57 5.80 0.97 11.00 5.52 0.97 17.40 11.80 0.94 
9.80 5.12 0.98 9.80 5.57 0.98 17.57 11.57 0.94 
11.54 5.45 0.96 12.80 6.80 0.94 20.25 12.29 0.92 
9.80 5.10 0.98 9.50 5.40 0.98 16.00 11.46 0.94 
Average 10.43 5.37 0.97 10.78 5.82 0.97 17.81 11.78 0.94 
BPNN
RBFNN
SVM
IIRMSEMAECCRMSEMAECCRMSEMAECC
9.00 4.00 0.98 7.96 2.90 0.99 8.47 2.88 0.99 
9.40 4.80 0.98 9.86 3.80 0.98 12.83 4.49 0.96 
10.80 5.30 0.97 11.00 6.00 0.95 15.00 7.38 0.93 
9.50 5.00 0.98 9.00 3.90 0.98 12.73 4.45 0.97 
Average 9.68 5.03 0.98 9.46 4.15 0.98 12.26 4.80 0.96 
Figure 8

Results of optimizing data input: (a) scatter plots; (b) sediment discharge hydrograph.

Figure 8

Results of optimizing data input: (a) scatter plots; (b) sediment discharge hydrograph.

Close modal

In Scenario I, combination 4 resulted in the best performance, indicating that sediment prediction accuracy depended on the availability of input discharge data. In Scenario II, combination 1 resulted in the best performance. Note that the overall performance in Scenario II was much better than in Scenario I. The prediction results obtained using BPNN, RBFNN, and SVM were much better in Case II-1 than in Case I-4. For example, the comparison of SVM between two cases shows RMSE reduces from 17.81 to 8.47, MAE reduces from 11.78 to 2.88, and CC increases from 0.94 to 0.99. The sediment term was indicated as a representative factor because the evolution indexes, including RMSE, MAE, and CC of BPNN, RBFNN, and SVM, were showed comprehensiveness better results in scenario II.

Figure 8(a) compares the prediction results with the measured data. Overall, BPNN, RBFNN, and SVM provided satisfactory results under both scenarios regardless of the input data combination, as evidenced by the strong correlation between predictions and observations. These results demonstrate that the discharge term must be considered to capture sediment transport under flood conditions, and the sediment term must also be used to improve predictions pertaining to the sedimentation process. All three prediction models (BPNN, RBFNN, and SVM) presented a similar trend, making them suitable for predicting sediment movement patterns and generating sediment discharge hydrographs (see Figure 8(b)). In conjunction with sediment yield estimation, our numerical models (HEC-HMS and SRH-2D) provided high-quality input data for the prediction models reflecting the mechanism underlying flow and sedimentation along the river.

ML results

In this section, we discuss the performance of the three prediction models (SVM, BPNN, and RBFNN) (see Table 4). During the training stage, the RMSE and MAE values decreased with an increase in forecast lead time, whereas CC presented the opposite trend. The three models achieved the best performance when given a 1-h lead time. The RMSE, MAE, and CC values gradually degraded as the lead time was extended from 1 to 6 h, as follows: 1 h (0.99, 0.99, 0.99), 3 h (0.94, 0.92, 0.91), 4 h (0.89, 0.87, 0.86), and 6 h (0.82, 0.78, 0.75).

Table 4

Estimated values of the three models in the training and testing stage

TypeBPNN
RBFNN
SVM
RMSEMAECCRMSEMAECCRMSEMAECC
Training 
T + 1 7.00 3.55 0.99 6.62 2.43 0.99 7.19 2.51 0.99 
T + 2 10.86 4.21 0.97 11.84 4.54 0.97 12.88 4.54 0.96 
T + 3 16.66 6.46 0.94 19.37 7.42 0.92 20.78 7.13 0.91 
T + 4 22.35 8.95 0.89 24.21 9.79 0.87 25.93 9.20 0.86 
T + 5 25.43 10.44 0.86 28.04 11.88 0.83 30.00 11.03 0.80 
T + 6 29.08 12.45 0.82 31.17 13.83 0.78 33.40 12.71 0.75 
Testing 
T + 1 8.34 6.45 0.99 8.52 5.59 0.99 8.47 4.94 0.99 
T + 2 14.53 10.16 0.96 15.7 10.72 0.94 15.45 9.44 0.95 
T + 3 18.34 10.98 0.93 20.24 11.00 0.91 21.50 9.56 0.90 
T + 4 23.59 12.02 0.88 25.21 11.84 0.86 26.75 10.98 0.84 
T + 5 27.29 14.90 0.83 29.13 15.06 0.81 31.07 13.75 0.78 
T + 6 30.95 19.13 0.79 32.47 18.73 0.76 34.70 16.72 0.72 
TypeBPNN
RBFNN
SVM
RMSEMAECCRMSEMAECCRMSEMAECC
Training 
T + 1 7.00 3.55 0.99 6.62 2.43 0.99 7.19 2.51 0.99 
T + 2 10.86 4.21 0.97 11.84 4.54 0.97 12.88 4.54 0.96 
T + 3 16.66 6.46 0.94 19.37 7.42 0.92 20.78 7.13 0.91 
T + 4 22.35 8.95 0.89 24.21 9.79 0.87 25.93 9.20 0.86 
T + 5 25.43 10.44 0.86 28.04 11.88 0.83 30.00 11.03 0.80 
T + 6 29.08 12.45 0.82 31.17 13.83 0.78 33.40 12.71 0.75 
Testing 
T + 1 8.34 6.45 0.99 8.52 5.59 0.99 8.47 4.94 0.99 
T + 2 14.53 10.16 0.96 15.7 10.72 0.94 15.45 9.44 0.95 
T + 3 18.34 10.98 0.93 20.24 11.00 0.91 21.50 9.56 0.90 
T + 4 23.59 12.02 0.88 25.21 11.84 0.86 26.75 10.98 0.84 
T + 5 27.29 14.90 0.83 29.13 15.06 0.81 31.07 13.75 0.78 
T + 6 30.95 19.13 0.79 32.47 18.73 0.76 34.70 16.72 0.72 

Unit: RMSE, MAE is ton/s.

In the testing phase, the RMSE, MAE, and CC of the models were similar to that observed in the training stage. The RMSE, MAE, and CC values gradually degraded as the lead time was extended from 1 to 6 h, as follows: 1 h (0.99, 0.99, 0.99), 3 h (0.93, 0.91, and 0.90), 4 h (+0.7 for all three), and 6 h (+0.7 for all three). These results demonstrate that the proposed prediction model would be able to sound an alarm for disaster prevention/evacuation from roughly 3 h in advance, which is quite reasonable for mountain basins under the effects of extreme weather conditions.

Figure 9 shows the spatial variations of the sediment flow prediction in different lead times (1–6 h). With a short lead time (1 and 2 h), all three prediction models generated accurate estimates of sediment flow (all flow levels). With a longer lead time (4 and 5 h), the degree of error gradually increased, particularly in areas with higher sediment flow. With a relatively long lead time (5 or 6 h), the discrepancies became quite large with predictions tending toward underestimates of sediment flow. Again, the effects were more pronounced in areas with higher sediment flow.
Figure 9

Spatial variations of the sediment flow prediction in different lead times (scatter points).

Figure 9

Spatial variations of the sediment flow prediction in different lead times (scatter points).

Close modal
Figure 10 presents temporal variations in the prediction of sediment flow as a function of lead time (1–6 h). All three models effectively captured the overall trend; however, predictions veered from observations as discharge peaked. This can be attributed to the difficulties in measuring discharge during typhoon events. Once again, prediction accuracy was inversely proportional to lead time.
Figure 10

Time-series variations of the sediment flow prediction in different lead times.

Figure 10

Time-series variations of the sediment flow prediction in different lead times.

Close modal

Overall evaluation

Insufficient data are the major factor in the difficulty in improving the prediction ability. This interdisciplinary study integrates hydrology, hydraulic, and sediment transportation methods to produce sufficient data to develop the three-stage early-warning system. The critical meaning is that the physical mechanism should be embedded in the prediction model to solve the uncertainty problem of the unstable measured data. This study combines HEC-HMS, LVMDR, SEEDR, and SRH-2D to establish the precipitation–discharge–sedimentation relationship and confirms that reasonable discharge can improve the accuracy of sediment concentration. Furthermore, the input optimization shows that multiple inputs (discharge and sediment concentration) obtain better prediction results than the single input (sediment concentration). Note that the historical references major used a single input to establish the prediction model, and this study proves that effective, sufficient, and believable multiple inputs can improve the prediction accuracy. The primary strength of this study is to obtain better prediction accuracy in mountain rivers during large-scale sedimentation events. The effectiveness of the prediction models in terms of temporal and spatial variation indicates their reliability in forecasting adverse events. The error associated with long lead times can be attributed to the inclusion of irrelevant information.

This paper presents a multi-phase early-warning system based on ML models that account for fluid mechanics and sedimentation dynamics. We obtained a reasonable conversion of precipitation into flow discharge using a numerical model of the hydrology in the study area for use in estimating soil erosion based on empirical formulas. This made it possible to establish a well-calibrated hydraulic model tasked with generating flow data (for training and testing) to compensate for the lack of data in typical field measurements. Our robust results demonstrated that the accuracy of sediment flow predictions could be improved by adopting various data sources, including rainfall, inflow discharge, landslide volume, soil erosion, and sediment. This approach greatly improves the results of conventional models that rely on a single variable, such as inflow discharge. This research demonstrates that sedimentation can be predicted with a high degree of accuracy, even in mountain basins and during intense short-term events, such as typhoons. This proof-of-concept is a feasible approach to dealing with sedimentation issues in other fields. The most important is that the proposed early-warning system not only strives for the government to be the reference of emergency evacuation strategy but also provides reservoir sediment information.

The financial support for this study was partially provided by a grant from the Ministry of Science and Technology (project number: 110-2625-M-002-024-MY3; 111-2625-M-035-007-MY3) and partially provided by the Research Centre of Climate Change and Sustainable Development of National Taiwan University.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Afan
H. A.
,
El-Shafie
A.
,
Yaseen
Z. M.
,
Hameed
M. M.
,
Wan Mohtar
W. H. M.
&
Hussain
A.
2015
ANN based sediment prediction model utilizing different input scenarios
.
Water Resources Management
29
(
4
),
1231
1245
.
Aksoy
H.
&
Kavvas
M. L.
2005
A review of hillslope and watershed scale erosion and sediment transport models
.
Catena
64
(
2–3
),
247
271
.
Alatorre
L. C.
,
Beguería
S.
,
Lana-Renault
N.
,
Navas
A.
&
García-Ruiz
J. M.
2012
Soil erosion and sediment delivery in a mountain catchment under scenarios of land use change using a spatially distributed numerical model
.
Hydrology and Earth System Sciences
16
(
5
),
1321
1334
.
Babovic
V.
2009
Introducing knowledge into learning based on genetic programming
.
Journal of Hydroinformatics
11
(
3–4
),
181
193
.
Chadalawada
J.
&
Babovic
V.
2017
Review and comparison of performance indices for automatic model induction
.
Journal of Hydroinformatics
21
(
1
),
13
31
.
Chang
K.
,
Chiang
S.-H.
&
Lei
F.
2008
Analysing the relationship between typhoon-triggered landslides and critical rainfall conditions
.
Earth Surface Processes and Landforms
33
(
8
),
1261
1271
.
Chiu, Y. J., Lee, H. Y., Wang, T. L., Yu, J., Lin, Y. T. & Yuan, Y. 2019 Modeling sediment yields and stream stability due to sediment-related disaster in Shihmen Reservoir watershed in Taiwan. Water 11 (2), 332.
Chutiman, N., Chiangpradit, M., Kong-ied, B., Busababodhin, P., Chaiyasaen, C. & Guayjarernpanishk, P. 2022 Daily maximum rainfall forecast affected by tropical cyclones using grey theory. Civil Engineering Journal 8 (8), 1565–1573.
Colosimo
M. F.
&
Wilcock
P. R.
2007
Alluvial sedimentation and erosion in an urbanizing watershed, Gwynns Falls, Maryland 1
.
JAWRA Journal of the American Water Resources Association
43
(
2
),
499
521
.
Dang
T. H.
,
Coynel
A.
,
Orange
D.
,
Blanc
G.
,
Etcheber
H.
&
Le
L. A.
2010
Long-term monitoring (1960–2008) of the river-sediment transport in the Red River Watershed (Vietnam): temporal variability and dam-reservoir impact
.
Science of the Total Environment
408
(
20
),
4654
4664
.
Engelund
F.
&
Hansen
E.
1967
A Monograph on Sediment Transport in Alluvial Streams
.
Technical University of Denmark 0stervoldgade 10
,
Copenhagen K
.
Fredriksen
R. L.
1970
Erosion and Sedimentation Following Road Construction and Timber Harvest on Unstable Soils in Three Small Western Oregon Watersheds
, Vol.
104
.
Pacific Northwest Forest and Range Experiment Station, US Department of Agriculture
. Portland, OR
Garcia
M.
2008
Sedimentation Engineering: Processes, Measurements, Modeling, and Practice
. American Society of Civil Engineers, Reston, VA.
González
C. A. R.
,
Rodríguez-Pérez
Á. M.
,
Mancera
J. J. C.
,
Torres
J. A. H.
,
Carmona
N. G.
&
García
M. I. B.
2022
Applied methodology based on HEC-HMS for reservoir filling estimation due to soil erosion
.
Journal of Hydrology and Hydromechanics
70
(
3
),
341
356
.
Green
K. C.
&
Westbrook
C. J.
2009
Changes in riparian area structure, channel hydraulics, and sediment yield following loss of beaver dams
.
Journal of Ecosystems and Management
. 10 (1), 68–79.
Ho
H.-C.
,
Chiang
Y.-M.
,
Lin
C.-C.
,
Lee
H.-Y.
&
Huang
C.-C.
2021
Development of an interdisciplinary prediction system combining sediment transport simulation and ensemble method
.
Water
13 (18), 2588.
Hsu
S. M.
,
Wen
H. Y.
,
Chen
N. C.
,
Hsu
S. Y.
&
Chi
S. Y.
2012
Using an integrated method to estimate watershed sediment yield during heavy rain period: a case study in Hualien County, Taiwan
.
Natural Hazards and Earth System Sciences
12
(
6
),
1949
1960
.
Huang
C.-C.
,
Lai
J.-S.
,
Lee
F.-Z.
&
Tan
Y.-C.
2018
Physical model-based investigation of reservoir sedimentation processes
.
Water
10
(
4
),
352
.
Huang
C.-C.
,
Lai
Y. G.
,
Lai
J.-S.
&
Tan
Y.-C.
2019b
Field and numerical modeling study of turbidity current in Shimen Reservoir during typhoon events
.
Journal of Hydraulic Engineering
145
(
5
),
05019003
.
Jain
M. K.
&
Kothyari
U. C.
2000
Estimation of soil erosion and sediment yield using GIS
.
Hydrological Sciences Journal
45
(
5
),
771
786
.
Karri
R. R.
,
Badwe
A.
,
Wang
X.
,
El Serafy
G.
,
Sumihar
J.
,
Babovic
V.
&
Gerritsen
H.
2013
Application of data assimilation for improving forecast of water levels and residual currents in Singapore regional waters
.
Ocean Dynamics
63
(
1
),
43
61
.
Koi
T.
,
Hotta
N.
,
Ishigaki
I.
,
Matuzaki
N.
,
Uchiyama
Y.
&
Suzuki
M.
2008
Prolonged impact of earthquake-induced landslides on sediment yield in a mountain watershed: the Tanzawa region, Japan
.
Geomorphology
101
(
4
),
692
702
.
Lai
Y. G.
&
Greimann
B. P.
2010
Predicting contraction scour with a two-dimensional depth-averaged model
.
Journal of Hydraulic Research
48
(
3
),
383
387
.
Lai
C. N.
&
Lin
B. S.
2019
Establishment and application of soil erosion empirical at different spatial scales in Taiwan
.
Sinotech Engineering Consultants, Inc
142
,
31
41
.
Lai, Y. G. & Wu, K. 2013 Modeling of vertical and lateral erosion on the Chosui River, Taiwan. In: World Environmental and Water Resources Congress 2013: Showcasing the Future. American Society of Civil Engineers, Reston, VA, pp. 1747–1756.
Lee
H.-Y.
,
Lin
Y.-T.
&
Chiu
Y.-J.
2006
Quantitative estimation of reservoir sedimentation from three typhoon events
.
Journal of Hydrologic Engineering
11
(
4
),
362
370
.
Leung
W.-Y.
,
Lin
B.-S.
,
Hsu
C.-K.
&
Chi
S.-Y.
2012
A simplified method on the landslide volume estimation of storm events for watershed
.
Journal of Soil and Water Conservation Technology
7
(
2
),
82
95
.
Loukas
A.
&
Vasiliades
L.
2014
Streamflow simulation methods for ungauged and poorly gauged watersheds
.
Natural Hazards and Earth System Sciences
14
(
7
),
1641
1661
.
Mackey
B. H.
&
Roering
J. J.
2011
Sediment yield, spatial characteristics, and the long-term evolution of active earthflows determined from airborne LiDAR and historical aerial photographs, Eel River, California
.
Bulletin
123
(
7–8
),
1560
1576
.
Malik
A.
,
Kumar
A.
&
Piri
J.
2017
Daily suspended sediment concentration simulation using hydrological data of Pranhita River Basin, India
.
Computers and Electronics in Agriculture
138
,
20
28
.
Meyer-Peter
E.
&
Müller
R.
1948
Formulas for Bed-Load transport
. In
IAHSR 2nd Meeting, Stockholm, Appendix 2
.
Morgan
R. P. C.
,
Quinton
J. N.
,
Smith
R. E.
,
Govers
G.
,
Poesen
J. W. A.
,
Auerswald
K.
,
Chisci
G.
,
Torri
D.
&
Styczen
M. E.
1998
The European Soil Erosion Model (EUROSEM): a dynamic approach for predicting sediment transport from fields and small catchments
.
Earth Surface Processes and Landforms: The Journal of the British Geomorphological Group
23
(
6
),
527
544
.
Nagle
G. N.
,
Fahey
T. J.
&
Lassoie
J. R.
1999
Management of sedimentation in tropical watersheds
.
Environmental Management-New York
23
,
441
452
.
Pakdaman
M.
,
Falamarzi
Y.
,
Babaeian
I.
&
Javanshiri
Z.
2020
Post-processing of the North American multi-model ensemble for monthly forecast of precipitation based on neural network models
.
Theoretical and Applied Climatology
141
(
1
),
405
417
.
Parker
G.
1990
Surface-based bedload transport relation for gravel rivers
.
Journal of Hydraulic Research
28
(
4
),
417
436
.
Sannasiraj
S. A.
,
Zhang
H.
,
Babovic
V.
&
Chan
E. S.
2004
Enhancing tidal prediction accuracy in a deterministic model using chaos theory
.
Advances in Water Resources
27
(
7
),
761
772
.
Sinotech Engineering Consultants, Inc.
2010
Study on Historical Migration and Its Mechanism of Heavy Rainfall-Induced Sediment Disaster in Shih-Men Watershed
.
Soil and Water Conservation Bureau, Council of Agriculture, Executive Yuan
.
Suwarno
I.
,
Ma'arif
A.
,
Maharani Raharja
N.
,
Nurjanah
A.
,
Ikhsan
J.
&
Mutiarin
D.
2021
IoT-based lava flood early warning system with rainfall intensity monitoring and disaster communication technology
.
Emerging Science Journal
4
,
154
166
.
Tsou
C.-Y.
,
Feng
Z.-Y.
&
Chigira
M.
2011
Catastrophic landslide induced by Typhoon Morakot, Shiaolin, Taiwan
.
Geomorphology
127
(
3
),
166
178
.
Vapnik
V.
1999
The Nature of Statistical Learning Theory
.
Springer Science & Business Media, New York
.
Walling
D. E.
2017
Measuring sediment yield from river basins
. In:
Soil Erosion Research Methods
.
Routledge
, London, pp.
39
82
.
Wu
K. W.
,
Yeh
K. C.
,
Liao
C. T.
&
Lai
Y. G.
2016
Bedrock channel morphological modeling on the river in Taiwan
. In:
River Sedimentation
.
CRC Press
, Boca Raton, FL, pp.
760
767
.
Yu
X.
,
Liong
S.-Y.
&
Babovic
V.
2004
EC-SVM approach for real-time hydrologic forecasting
.
Journal of Hydroinformatics
6
(
3
),
209
223
.
Zounemat-Kermani
M.
,
Kişi
Ö.
,
Adamowski
J.
&
Ramezani-Charmahineh
A.
2016
Evaluation of data driven models for river suspended sediment concentration modeling
.
Journal of Hydrology
535
,
457
472
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data