Abstract
Discrepancies between estimated sediment and actual yields are large. The sedimentation often causes severe damage due to a lack of on-site measurement data in the current disaster prevention system. This paper presents a robust early-warning system in which the statistical analysis used to predict sedimentation is conducted within the context of the underlying physical mechanisms. This three-phase early-warning system employs data collection, data generation, and AI (Artificial Intelligence) prediction. Data collection involves the use of HEC-HMS (Hydrologic Engineering Center – Hydrologic Modeling System) to transform measured precipitation data into flow discharge from various sub-catchments. Empirical formulas related to landslide volume and soil erosion are then used to establish suitable boundary conditions for data generation. Finally, a 2D model, SRH-2D (Sediment and River Hydraulic-Two Dimension model), is used to simulate data pertaining to temporal variations in sediment flux under various storm event scenarios. The simulated data are then used as input for the training and testing three artificial neural networks. The primary strength of this study is to improve the prediction ability in large-scale sedimentation issues in mountain rivers. These models returned good prediction results with 1–6 h lead times by interdisciplinary integration with the hydraulic and statistical fields.
HIGHLIGHTS
The sediment flux in the mountain basin was successfully predicted with field data and data generated by numerical modeling.
The accuracy of sediment flux predictions could be improved by adopting various data sources and greatly improves the results of conventional that rely on a single variable.
The sediment flux prediction obtained a reasonable result, and there were no difficulties in predicting underlying trends.
ACRONYMS LIST
- ANN
Artificial neural network
- ARIMA
Auto regressive integrated moving average
- BPNN
Back-propagation neural networks
- CC
Correlation coefficient
- CN
Curve number
- EnKF
Ensemble Kalman filter
- HEC-HMS
Hydrologic Engineering Center – Hydrologic Modeling System
- LVMDR
Landslide volume equation modified for the Dahan River basin
- MAE
Mean absolute error
- ML
Machine learning
- RBFNN
Radial basis function neural network
- SEEDR
Soil erosion empirical formula modified for the Dahan River basin
- SRH-2D
Sediment and river hydraulic-two dimension model
- SVM
Support Vector machines
- RMSE
Root mean square error
- USBR
United States Bureau of Reclamation
- USLE
Universal soil loss equation
INTRODUCTION
Watershed sedimentation has been identified as a severe problem worldwide (Fredriksen 1970; Colosimo & Wilcock 2007; Kim et al. 2013; Huang et al. 2018; Siswanto & Francés 2019), resulting in river erosion, excess sediment, reservoir deposition, and a compromised water supply for downstream populations (Nagle et al. 1999; Dang et al. 2010; Huang et al. 2019b). A reasonable early-warning system needs to be developed to face the significant serious sedimentation disaster due to climate change. Sedimentation is generally investigated using empirical analysis, numerical simulations, and/or field measurements (Aksoy & Kavvas 2005; Garcia 2008; Walling 2017), and the collected references are described below.
The empirical methods used to estimate debris volume, sediment discharge, and river deposition are generally tied to historical events (Boomer et al. 2008; Gartner et al. 2014). Numerical simulations are widely used to estimate sediment yield and erosion volume (Morgan et al. 1998; Chou 2010; Alatorre et al. 2012). Field measurements are generally implemented using aerial photos, satellite images, UAVs, and Lidar (Jain & Kothyari 2000; Green & Westbrook 2009; Mackey & Roering 2011; Peter et al. 2014). These methods are well suited to assessing the scale of disasters and formulating mitigation strategies after the fact (Koi et al. 2008; Hsu et al. 2012); however, it is difficult to predict upcoming sedimentation hazards. In this study, we sought to establish a practical early-warning system for watershed sedimentation based on empirical, numerical, and machine learning (ML) methods.
ML methods have been widely applied, for example, Suwarno et al. (2021) used a decision tree to develop a lava flood early-warning system; Chutiman et al. (2022) presented a forecasting model for daily maximum rainfall during tropical cyclones based on grey theory; Ekwueme (2022) built an ARIMA (auto regressive integrated moving average) to forecast the flood intensity under the climate change scenario. Furthermore, fusions of numerical modeling with ML (e.g., artificial neural network (ANN)) have also been mentioned in the past few years. ANN and fuzzy models have also been used to predict long-term flow discharge and water levels. Yu et al. (2004) combined chaos theory and support vector machines (SVM) with big data to improve daily flow predictions. Sannasiraj et al. (2004) combined deterministic and stochastic models to facilitate predictions of daily tidal variation. Karri et al. (2013) used an Ensemble Kalman filter (EnKF)-based method to improve the forecast ability of deterministic models. Loukas & Vasiliades (2014) implemented a hydrological model in conjunction with ANNs to forecast rainfall runoff. Young & Liu (2015) developed a hybrid model based on numerical models and ANNs to train and test flow discharge during typhoon events. Humphrey et al. (2016) integrated a hydrological model output with a Bayesian ANN to predict the monthly flow.
ANNs have also been used to predict daily sediment concentration (Zounemat-Kermani et al. 2016; Malik et al. 2017; Huang et al. 2019a) and sediment yields along rivers (Babovic 2009). Predicting watershed sedimentation should account for precipitation, river discharge, landslide volume, soil erosion, and river sediment load. However, few studies have successfully predicted sediment concentrations using multiple factors.
STUDY BACKGROUND
Dahan river basin
This study collected hydrological data on water elevation, flowrate, and sediment discharge within the Dahan River basin from 2008 to 2016, including six typhoon events: Fungwong, Sinlaku, Jangmi, Morakot, Fitow, and Soudelor. Four other typhoon events were selected as test cases. This information is listed in Table 1.
Historical data related to the Dahan River basin from 2008 to 2016 typhoon events
Year . | Event . | Duration (h) . | Peak discharge (m3/s) . | Sediment discharge (ton/s) . | Landslide volume (m3) . | Volume of soil eroded (m3) . | Total (m3) . | Training . | Testing . |
---|---|---|---|---|---|---|---|---|---|
2008 | Fungwong | 103 | 1,433 | 18.23 | 50,362.88 | 50,190.28 | 100,553.16 | o | |
2008 | Sinlaku | 136 | 2,532 | 71.70 | 50,816.78 | 51,523.34 | 102,340.12 | o | |
2008 | Jangmi | 60 | 3,292 | 100.98 | 57,623.23 | 63,623.83 | 121,247.06 | o | |
2009 | Morakot | 143 | 2,280 | 61.60 | 112,189.91 | 97,504.13 | 209,694.04 | o | |
2013 | Soulik | 73 | 5,457 | 444.40 | 105,562.94 | 94,855.69 | 200,418.63 | o | |
2013 | Fitow | 89 | 1,736 | 144.0 | 108,599.56 | 55,447.39 | 164,046.95 | o | |
2013 | Trami | 87 | 1,137 | 39.00 | 116,816.57 | 71,017.99 | 187,834.56 | o | |
2015 | Soudelor | 66 | 5,634 | 103.03 | 108,594.59 | 96,090.21 | 204,684.80 | o | |
2015 | Dujuan | 60 | 3,802 | 60.23 | 79,551.57 | 81,986.90 | 161,538.47 | o | |
2016 | Megi | 41 | 4,267 | 99.10 | 98,433.89 | 91,779.24 | 190,213.13 | o |
Year . | Event . | Duration (h) . | Peak discharge (m3/s) . | Sediment discharge (ton/s) . | Landslide volume (m3) . | Volume of soil eroded (m3) . | Total (m3) . | Training . | Testing . |
---|---|---|---|---|---|---|---|---|---|
2008 | Fungwong | 103 | 1,433 | 18.23 | 50,362.88 | 50,190.28 | 100,553.16 | o | |
2008 | Sinlaku | 136 | 2,532 | 71.70 | 50,816.78 | 51,523.34 | 102,340.12 | o | |
2008 | Jangmi | 60 | 3,292 | 100.98 | 57,623.23 | 63,623.83 | 121,247.06 | o | |
2009 | Morakot | 143 | 2,280 | 61.60 | 112,189.91 | 97,504.13 | 209,694.04 | o | |
2013 | Soulik | 73 | 5,457 | 444.40 | 105,562.94 | 94,855.69 | 200,418.63 | o | |
2013 | Fitow | 89 | 1,736 | 144.0 | 108,599.56 | 55,447.39 | 164,046.95 | o | |
2013 | Trami | 87 | 1,137 | 39.00 | 116,816.57 | 71,017.99 | 187,834.56 | o | |
2015 | Soudelor | 66 | 5,634 | 103.03 | 108,594.59 | 96,090.21 | 204,684.80 | o | |
2015 | Dujuan | 60 | 3,802 | 60.23 | 79,551.57 | 81,986.90 | 161,538.47 | o | |
2016 | Megi | 41 | 4,267 | 99.10 | 98,433.89 | 91,779.24 | 190,213.13 | o |
Boundary conditions
Proposed three-phase sediment flow prediction scheme for Dahan River basin.
METHODOLOGY
HEC-HMS

Model and parameter settings for HEC-HMS
Catchment . | Slope (%) . | River length (m) . | CN (−) . | S (mm) . | Lag time (h) . | Ia (mm) . |
---|---|---|---|---|---|---|
1 | 27.85 | 27,755.70 | 62 | 6.13 | 3.66 | 31.13 |
2 | 37.00 | 23,406.81 | 45 | 12.00 | 4.27 | 62.00 |
3 | 35.9 | 50,733.66 | 46 | 11.80 | 7.76 | 59.60 |
4 | 37.50 | 39,949.53 | 35 | 18.60 | 8.55 | 94.30 |
5 | 33.94 | 10,819.18 | 50 | 10.00 | 2.11 | 50.80 |
6 | 37.03 | 8,136.94 | 45 | 12.20 | 1.83 | 62.00 |
7 | 37.64 | 10,322.36 | 40 | 15.00 | 2.50 | 76.20 |
8 | 35.32 | 13,858.34 | 48 | 10.80 | 2.65 | 55.00 |
9 | 36.28 | 7,391.10 | 40 | 15.00 | 1.95 | 76.20 |
Catchment . | Slope (%) . | River length (m) . | CN (−) . | S (mm) . | Lag time (h) . | Ia (mm) . |
---|---|---|---|---|---|---|
1 | 27.85 | 27,755.70 | 62 | 6.13 | 3.66 | 31.13 |
2 | 37.00 | 23,406.81 | 45 | 12.00 | 4.27 | 62.00 |
3 | 35.9 | 50,733.66 | 46 | 11.80 | 7.76 | 59.60 |
4 | 37.50 | 39,949.53 | 35 | 18.60 | 8.55 | 94.30 |
5 | 33.94 | 10,819.18 | 50 | 10.00 | 2.11 | 50.80 |
6 | 37.03 | 8,136.94 | 45 | 12.20 | 1.83 | 62.00 |
7 | 37.64 | 10,322.36 | 40 | 15.00 | 2.50 | 76.20 |
8 | 35.32 | 13,858.34 | 48 | 10.80 | 2.65 | 55.00 |
9 | 36.28 | 7,391.10 | 40 | 15.00 | 1.95 | 76.20 |
(a) Application domain in Dahan River basin; (b) sub-catchments in Dahan River basin.
(a) Application domain in Dahan River basin; (b) sub-catchments in Dahan River basin.
Sediment yield estimation
The watershed sediment yield was defined as the sum of the landslide volume and soil erosion in this study, and this information was treated as an inflow boundary condition for SRH-2D. The landslide volume equation modified for the Dahan River basin (LVMDR) and soil erosion empirical formula modified for the Dahan River basin (SEEDR) were used to estimate the watershed sediment yield as shown in Equations (1)–(3). The LVMDR was based on Equation (1) and implemented with Equation (2) to estimate the landslide volume under different precipitation scenarios.



SRH-2D
Machine learning
This study employed three machine learning methods (SVM, BPNN, and RBFNN) for the third phase. SVM is a supervised learning approach developed by Vapnik (1999) for statistical analysis. The structure of SVM is transformed into a quadratic planning issue to minimize computing time, and structured risk minimization is used to maintain accuracy. SVM has proven highly effective in dealing with classification and prediction problems. BPNNs are multilayer feed-forward ANNs using one input and output layer with one or more hidden layers. In the current study, we employed the gradient descent method to solve the energy function and thereby minimize error. RBFNN is another ML method with a simple topological structure used to solve nonlinear system identification.
The prediction performance of ANNs depends on a careful training and testing process. ANNs have been used to predict variations in rainfall-runoff, runoff-discharge, and spatial and temporal sediment concentrations (Afan et al. 2015; Knighton et al. 2019; Pakdaman et al. 2020; Jhong et al. 2021). In the proposed early-warning system, these algorithms are applied to historical data related to precipitation and simulation data related to sediment flow.




RESULTS AND DISCUSSION
HEC-HMS and SRH-2D
The relationship between rainfall and discharge is a critical factor in estimating river sedimentation due to the reliance of sediment threshold velocity on the accuracy of the discharge value. In other words, the reliability of sediment simulations depends on the accurate river discharge estimation. Hence, the discharge and sediment concentration simulation outcomes need to be confirmed by comparing it with the measured data.
Comparison of simulation and measurement based on HEC-HMS and SRH-2D.
Sediment transport is affected by inflow discharge and sediment volume. The estimation approach of the inflow boundary condition was confirmed in Sections 3.1 and 3.2. Figure 7(e) and 7(f) presents a comparison of measured and simulated sediment concentration values associated with typhoons Fitow (sediment calibration) and Trami (sediment validation) at Xiayun station based on SRH-2D modeling. We can see that SRH-2D effectively described sedimentation as long as accurate inflow discharge data were available. The trends in peak concentration were very close during the two typhoons with an absolute difference of less than 20%. These results demonstrate the efficacy of these simulations in capturing temporal variations in sediment flow.
Optimizing data input
Input selection and the calibration values of three models
Case Scenario . | 1 . | 2 . | 3 . | 4 . | . | . | . | . | . |
---|---|---|---|---|---|---|---|---|---|
I | Qt−1 | Qt−1, Qt | Qt−1, Qt−2 | Qt−2, Qt−1, Qt | |||||
II | Qt−1, St−1 | Qt−2, Qt−1, St−1 | Qt−2, St−2 | Qt−2, Qt−1, St−1, St−2 | |||||
. | BPNN . | RBFNN . | SVM . | ||||||
I . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . |
1 | 10.57 | 5.80 | 0.97 | 11.00 | 5.52 | 0.97 | 17.40 | 11.80 | 0.94 |
2 | 9.80 | 5.12 | 0.98 | 9.80 | 5.57 | 0.98 | 17.57 | 11.57 | 0.94 |
3 | 11.54 | 5.45 | 0.96 | 12.80 | 6.80 | 0.94 | 20.25 | 12.29 | 0.92 |
4 | 9.80 | 5.10 | 0.98 | 9.50 | 5.40 | 0.98 | 16.00 | 11.46 | 0.94 |
Average | 10.43 | 5.37 | 0.97 | 10.78 | 5.82 | 0.97 | 17.81 | 11.78 | 0.94 |
. | BPNN . | RBFNN . | SVM . | ||||||
II . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . |
1 | 9.00 | 4.00 | 0.98 | 7.96 | 2.90 | 0.99 | 8.47 | 2.88 | 0.99 |
2 | 9.40 | 4.80 | 0.98 | 9.86 | 3.80 | 0.98 | 12.83 | 4.49 | 0.96 |
3 | 10.80 | 5.30 | 0.97 | 11.00 | 6.00 | 0.95 | 15.00 | 7.38 | 0.93 |
4 | 9.50 | 5.00 | 0.98 | 9.00 | 3.90 | 0.98 | 12.73 | 4.45 | 0.97 |
Average | 9.68 | 5.03 | 0.98 | 9.46 | 4.15 | 0.98 | 12.26 | 4.80 | 0.96 |
Case Scenario . | 1 . | 2 . | 3 . | 4 . | . | . | . | . | . |
---|---|---|---|---|---|---|---|---|---|
I | Qt−1 | Qt−1, Qt | Qt−1, Qt−2 | Qt−2, Qt−1, Qt | |||||
II | Qt−1, St−1 | Qt−2, Qt−1, St−1 | Qt−2, St−2 | Qt−2, Qt−1, St−1, St−2 | |||||
. | BPNN . | RBFNN . | SVM . | ||||||
I . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . |
1 | 10.57 | 5.80 | 0.97 | 11.00 | 5.52 | 0.97 | 17.40 | 11.80 | 0.94 |
2 | 9.80 | 5.12 | 0.98 | 9.80 | 5.57 | 0.98 | 17.57 | 11.57 | 0.94 |
3 | 11.54 | 5.45 | 0.96 | 12.80 | 6.80 | 0.94 | 20.25 | 12.29 | 0.92 |
4 | 9.80 | 5.10 | 0.98 | 9.50 | 5.40 | 0.98 | 16.00 | 11.46 | 0.94 |
Average | 10.43 | 5.37 | 0.97 | 10.78 | 5.82 | 0.97 | 17.81 | 11.78 | 0.94 |
. | BPNN . | RBFNN . | SVM . | ||||||
II . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . |
1 | 9.00 | 4.00 | 0.98 | 7.96 | 2.90 | 0.99 | 8.47 | 2.88 | 0.99 |
2 | 9.40 | 4.80 | 0.98 | 9.86 | 3.80 | 0.98 | 12.83 | 4.49 | 0.96 |
3 | 10.80 | 5.30 | 0.97 | 11.00 | 6.00 | 0.95 | 15.00 | 7.38 | 0.93 |
4 | 9.50 | 5.00 | 0.98 | 9.00 | 3.90 | 0.98 | 12.73 | 4.45 | 0.97 |
Average | 9.68 | 5.03 | 0.98 | 9.46 | 4.15 | 0.98 | 12.26 | 4.80 | 0.96 |
Results of optimizing data input: (a) scatter plots; (b) sediment discharge hydrograph.
Results of optimizing data input: (a) scatter plots; (b) sediment discharge hydrograph.
In Scenario I, combination 4 resulted in the best performance, indicating that sediment prediction accuracy depended on the availability of input discharge data. In Scenario II, combination 1 resulted in the best performance. Note that the overall performance in Scenario II was much better than in Scenario I. The prediction results obtained using BPNN, RBFNN, and SVM were much better in Case II-1 than in Case I-4. For example, the comparison of SVM between two cases shows RMSE reduces from 17.81 to 8.47, MAE reduces from 11.78 to 2.88, and CC increases from 0.94 to 0.99. The sediment term was indicated as a representative factor because the evolution indexes, including RMSE, MAE, and CC of BPNN, RBFNN, and SVM, were showed comprehensiveness better results in scenario II.
Figure 8(a) compares the prediction results with the measured data. Overall, BPNN, RBFNN, and SVM provided satisfactory results under both scenarios regardless of the input data combination, as evidenced by the strong correlation between predictions and observations. These results demonstrate that the discharge term must be considered to capture sediment transport under flood conditions, and the sediment term must also be used to improve predictions pertaining to the sedimentation process. All three prediction models (BPNN, RBFNN, and SVM) presented a similar trend, making them suitable for predicting sediment movement patterns and generating sediment discharge hydrographs (see Figure 8(b)). In conjunction with sediment yield estimation, our numerical models (HEC-HMS and SRH-2D) provided high-quality input data for the prediction models reflecting the mechanism underlying flow and sedimentation along the river.
ML results
In this section, we discuss the performance of the three prediction models (SVM, BPNN, and RBFNN) (see Table 4). During the training stage, the RMSE and MAE values decreased with an increase in forecast lead time, whereas CC presented the opposite trend. The three models achieved the best performance when given a 1-h lead time. The RMSE, MAE, and CC values gradually degraded as the lead time was extended from 1 to 6 h, as follows: 1 h (0.99, 0.99, 0.99), 3 h (0.94, 0.92, 0.91), 4 h (0.89, 0.87, 0.86), and 6 h (0.82, 0.78, 0.75).
Estimated values of the three models in the training and testing stage
Type . | BPNN . | RBFNN . | SVM . | ||||||
---|---|---|---|---|---|---|---|---|---|
RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | |
Training | |||||||||
T + 1 | 7.00 | 3.55 | 0.99 | 6.62 | 2.43 | 0.99 | 7.19 | 2.51 | 0.99 |
T + 2 | 10.86 | 4.21 | 0.97 | 11.84 | 4.54 | 0.97 | 12.88 | 4.54 | 0.96 |
T + 3 | 16.66 | 6.46 | 0.94 | 19.37 | 7.42 | 0.92 | 20.78 | 7.13 | 0.91 |
T + 4 | 22.35 | 8.95 | 0.89 | 24.21 | 9.79 | 0.87 | 25.93 | 9.20 | 0.86 |
T + 5 | 25.43 | 10.44 | 0.86 | 28.04 | 11.88 | 0.83 | 30.00 | 11.03 | 0.80 |
T + 6 | 29.08 | 12.45 | 0.82 | 31.17 | 13.83 | 0.78 | 33.40 | 12.71 | 0.75 |
Testing | |||||||||
T + 1 | 8.34 | 6.45 | 0.99 | 8.52 | 5.59 | 0.99 | 8.47 | 4.94 | 0.99 |
T + 2 | 14.53 | 10.16 | 0.96 | 15.7 | 10.72 | 0.94 | 15.45 | 9.44 | 0.95 |
T + 3 | 18.34 | 10.98 | 0.93 | 20.24 | 11.00 | 0.91 | 21.50 | 9.56 | 0.90 |
T + 4 | 23.59 | 12.02 | 0.88 | 25.21 | 11.84 | 0.86 | 26.75 | 10.98 | 0.84 |
T + 5 | 27.29 | 14.90 | 0.83 | 29.13 | 15.06 | 0.81 | 31.07 | 13.75 | 0.78 |
T + 6 | 30.95 | 19.13 | 0.79 | 32.47 | 18.73 | 0.76 | 34.70 | 16.72 | 0.72 |
Type . | BPNN . | RBFNN . | SVM . | ||||||
---|---|---|---|---|---|---|---|---|---|
RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | RMSE . | MAE . | CC . | |
Training | |||||||||
T + 1 | 7.00 | 3.55 | 0.99 | 6.62 | 2.43 | 0.99 | 7.19 | 2.51 | 0.99 |
T + 2 | 10.86 | 4.21 | 0.97 | 11.84 | 4.54 | 0.97 | 12.88 | 4.54 | 0.96 |
T + 3 | 16.66 | 6.46 | 0.94 | 19.37 | 7.42 | 0.92 | 20.78 | 7.13 | 0.91 |
T + 4 | 22.35 | 8.95 | 0.89 | 24.21 | 9.79 | 0.87 | 25.93 | 9.20 | 0.86 |
T + 5 | 25.43 | 10.44 | 0.86 | 28.04 | 11.88 | 0.83 | 30.00 | 11.03 | 0.80 |
T + 6 | 29.08 | 12.45 | 0.82 | 31.17 | 13.83 | 0.78 | 33.40 | 12.71 | 0.75 |
Testing | |||||||||
T + 1 | 8.34 | 6.45 | 0.99 | 8.52 | 5.59 | 0.99 | 8.47 | 4.94 | 0.99 |
T + 2 | 14.53 | 10.16 | 0.96 | 15.7 | 10.72 | 0.94 | 15.45 | 9.44 | 0.95 |
T + 3 | 18.34 | 10.98 | 0.93 | 20.24 | 11.00 | 0.91 | 21.50 | 9.56 | 0.90 |
T + 4 | 23.59 | 12.02 | 0.88 | 25.21 | 11.84 | 0.86 | 26.75 | 10.98 | 0.84 |
T + 5 | 27.29 | 14.90 | 0.83 | 29.13 | 15.06 | 0.81 | 31.07 | 13.75 | 0.78 |
T + 6 | 30.95 | 19.13 | 0.79 | 32.47 | 18.73 | 0.76 | 34.70 | 16.72 | 0.72 |
Unit: RMSE, MAE is ton/s.
In the testing phase, the RMSE, MAE, and CC of the models were similar to that observed in the training stage. The RMSE, MAE, and CC values gradually degraded as the lead time was extended from 1 to 6 h, as follows: 1 h (0.99, 0.99, 0.99), 3 h (0.93, 0.91, and 0.90), 4 h (+0.7 for all three), and 6 h (+0.7 for all three). These results demonstrate that the proposed prediction model would be able to sound an alarm for disaster prevention/evacuation from roughly 3 h in advance, which is quite reasonable for mountain basins under the effects of extreme weather conditions.
Spatial variations of the sediment flow prediction in different lead times (scatter points).
Spatial variations of the sediment flow prediction in different lead times (scatter points).
Time-series variations of the sediment flow prediction in different lead times.
Time-series variations of the sediment flow prediction in different lead times.
Overall evaluation
Insufficient data are the major factor in the difficulty in improving the prediction ability. This interdisciplinary study integrates hydrology, hydraulic, and sediment transportation methods to produce sufficient data to develop the three-stage early-warning system. The critical meaning is that the physical mechanism should be embedded in the prediction model to solve the uncertainty problem of the unstable measured data. This study combines HEC-HMS, LVMDR, SEEDR, and SRH-2D to establish the precipitation–discharge–sedimentation relationship and confirms that reasonable discharge can improve the accuracy of sediment concentration. Furthermore, the input optimization shows that multiple inputs (discharge and sediment concentration) obtain better prediction results than the single input (sediment concentration). Note that the historical references major used a single input to establish the prediction model, and this study proves that effective, sufficient, and believable multiple inputs can improve the prediction accuracy. The primary strength of this study is to obtain better prediction accuracy in mountain rivers during large-scale sedimentation events. The effectiveness of the prediction models in terms of temporal and spatial variation indicates their reliability in forecasting adverse events. The error associated with long lead times can be attributed to the inclusion of irrelevant information.
CONCLUSIONS
This paper presents a multi-phase early-warning system based on ML models that account for fluid mechanics and sedimentation dynamics. We obtained a reasonable conversion of precipitation into flow discharge using a numerical model of the hydrology in the study area for use in estimating soil erosion based on empirical formulas. This made it possible to establish a well-calibrated hydraulic model tasked with generating flow data (for training and testing) to compensate for the lack of data in typical field measurements. Our robust results demonstrated that the accuracy of sediment flow predictions could be improved by adopting various data sources, including rainfall, inflow discharge, landslide volume, soil erosion, and sediment. This approach greatly improves the results of conventional models that rely on a single variable, such as inflow discharge. This research demonstrates that sedimentation can be predicted with a high degree of accuracy, even in mountain basins and during intense short-term events, such as typhoons. This proof-of-concept is a feasible approach to dealing with sedimentation issues in other fields. The most important is that the proposed early-warning system not only strives for the government to be the reference of emergency evacuation strategy but also provides reservoir sediment information.
ACKNOWLEDGEMENT
The financial support for this study was partially provided by a grant from the Ministry of Science and Technology (project number: 110-2625-M-002-024-MY3; 111-2625-M-035-007-MY3) and partially provided by the Research Centre of Climate Change and Sustainable Development of National Taiwan University.
DATA AVAILABILITY STATEMENT
All relevant data are included in the paper or its Supplementary Information.
CONFLICT OF INTEREST
The authors declare there is no conflict.