Abstract
In this research, water's surface elevation in compound channels with converging and diverging floodplains using soft computing models including the Multi-Layer Perceptron Neural Network (MLPNN), Group Method of Data Handling (GMDH), Neuro-Fuzzy Group Method of Data Handling (NF-GMDH) and Support Vector Machine (SVM) was modeled and predicted. For this purpose, laboratory data published in this field were used. Parameters including convergence angle (with a positive sign) and divergence angle (with a negative sign), relative depth, and relative distance were used as input variables. The results showed that all the used models have appropriate performance. However, the best performance was related to the SVM model with statistical indicators R2 = 0.998 and RMSE = 0.008 in the testing stage. The use of the adaptive fuzzy approach in developing the GMDH model led to a remarkable increase in accuracy so that its statistical indicators in the testing stage reached R2 = 0.985 and RMSE = 0.203. It was found that the best performance of the activation and kernel functions in the development of the MLPNN model and the SVM is related to the sigmoid and radial tangent functions.
HIGHLIGHT
Water’s surface elevation in compound channels with converging and diverging floodplains using Multi-Layer Perceptron Neural Network (MLPNN).
Water’s surface elevation in compound channels with converging and diverging floodplains using Group Method of Data Handling (GMDH).
Water’s surface elevation in compound channels with converging and diverging floodplains using Neuro-Fuzzy Group Method of Data Handling (NF-GMDH).
Water’s surface elevation in compound channels with converging and diverging floodplains using Support Vector Machine (SVM).
NOTATIONS
- WSE
water's surface elevation
- GMDH
Group Method of Data Handling
- NF-GMDH
Neuro-Fuzzy Group Method of Data Handling
- SVM
Support Vector Machine
- MLPNN
Multi-Layer Perceptron Neural Network
- SVR
Support Vector Regression
- H
water depth in the main channel
- h
water depth in the floodplain
- Dr
relative depth of the flow
- θ
angle of divergence and convergence of the section
- x/L
relative distance from the start of divergence and convergence of the section
- R2
coefficient of determination
- RMSE
root mean square error
- SI
scattering index
- DDR
developed discrepancy ratio
INTRODUCTION
Floods are natural phenomena that cause countless damages to industrial facilities, agricultural lands, and urban communities. When floods occur, the water level in the main channel of rivers increases and enters the surrounding floodplains. In this condition, the cross-section of the passing stream is out of the normal state and becomes a compound section. The compound section refers to the main channel and floodplains. Due to the relatively high roughness of floodplains compared with the main channel, the flow velocity in the main channel is higher than in floodplains. This difference in velocity generates longitudinal and vertical eddies at the border of the main channel and the floodplain(s), resulting in the loss of flow energy (Parsaie et al. 2017a). These eddies are responsible for the momentum transfer between the main channel and the floodplain (Devi & Khatua 2016). Besides these turbulent structures, there are also secondary currents with a longitudinal axis (Shiono & Knight 1991). Secondary currents are created as a result of turbulence inhomogeneities in the channel bottom and side walls (Kang & Choi 2005) and increase due to the development of the mixing layer at the interface between the main channel and the floodplain. Thus, the hydraulics of compound open channels are much more complicated than for single open channels. The complexity of the flow in a non-prismatic compound open channel (non-uniform flow) is many times higher than in a prismatic compound open channel. Tang & Knight (2009) studied the effect of the roughness of a prismatic floodplain on discharge capacity, bedform geometry, flow resistance, and sediment transport in the main channel.
Non-prismatic compound channels according to the geometrical changes of the floodplains are categorized into three: converging, diverging, and diagonal. A channel is called a converging compound channel when both floodplains move toward the main channel and a diverging compound channel is when the floodplains move away from the main channel. When one of the floodplains moves away from the main channel and the other approaches the main channel, it is known as a diagonal compound channel (Das et al. 2017). Bousmar et al. (2006) investigated the flow hydraulics in non-prismatic compound channels. They declared that the velocity gradient at the border of the main channel and floodplains of a non-prismatic compound channel is more than that of a prismatic compound channel. Rezaei & Knight (2009) stated the conventional Shiono and Knight's method (SKM) cannot successfully model the hydraulic behavior of flow in compound channels with non-prismatic floodplains. Parsaie et al. (2017a) conducted a study to investigate the effects of the roughness of floodplains and the divergence angle of floodplains.
Nowadays, with the advancing of soft computing methods in hydraulic and river engineering, researchers have been motivated to investigate the performance of these methods in solving various river engineering problems. In this regard, various soft computing models were used to estimate the hydraulic properties of compound channels, for example, flow discharge, shear stress, and Manning's coefficient in prismatic and non-prismatic compound channels (meander and prismatic and non-prismatic floodplains). Najafzadeh & Zahiri (2015) predicted the discharge in prismatic compound channels using the Group Method of Data Handling (GMDH) model trained by two optimization algorithms. Azamathulla & Zahiri (2012), Zahiri & Azamathulla (2014), Parsaie & Haghiabi (2017), and Parsaie et al. (2017b) predicted discharge in compound open channels using Support Vector Machine (SVM), GMDH, Adaptive Neuro-Fuzzy Inference System (ANFIS), Multi-Layer Perceptron Neural Network (MLPNN), and Multivariate Adaptive Regression Splines (MARS) techniques. Flow discharge in non-prismatic compound channels was successfully estimated by SVM, GMDH, ANFIS, MLPNN, and MARS methods using soft computing models (Das et al. 2020, 2021; Yonesi et al. 2022).
Estimating the water's surface elevation at the time of a flood is one of the most important challenges in flood management and control. Das & Khatua (2019) obtained the water's surface elevation in compound channels with a converging floodplain using computational methods. They performed several statistical error analyses for the obtained results and concluded that their computational model was in good agreement with the laboratory results of other researchers. Naik et al. (2022) investigated the water's surface elevation in non-prismatic compound channels with converging floodplains using the Gene Expression Programming (GEP) model and their findings showed that the water's surface elevation obtained from GEP has a good correlation with experimental data and data from other studies data (R2 = 0.99 and RMSE = 0.028 for training data and R2 = 0.99 and RMSE = 0.027 for test data). Nowadays, modeling flow in rivers based on the concept of compound channels is one of the modern methods of flood simulation. Estimation of water's surface elevation in floods is one of the most important outputs of modeling to determine the extent of floodplains and flooded areas. Considering that the compound channel has much hydraulic complexity, the necessity of research and efforts to estimate the discharge and the water's surface elevation in these sections is still valid. In this research, the estimation of the water's surface elevation in compound channels has been conducted using soft computing techniques including SVM, GMDH, ANFIS, MLPNN, and MARS methods. In this research, due to the double complexity of hydraulic flow in non-prismatic compound channels and also the inefficiency of numerical models such as the SKM model, the use of soft computing models can be a way to estimate hydraulic properties in such channels.
MATERIALS AND METHODS
The aim of this research is to model and predict the water's surface elevation in compound open channels with diverging and converging floodplains using the soft computing models of MLPNN, GMDH, Neuro-Fuzzy Group Method of Data Handling (NF-GMDH), and SVM. Since soft computing techniques are data-driven models, it is necessary to collect relevant data from the results of the experimental studies. Therefore, firstly the experimental studies are introduced from which the results have been used. In this regard, the data related to the water's surface elevation in the converging channels were collected from Bousmar (2002) and the data related to the water's surface elevation in diverging channels were collected from Bousmar et al. (2006). In the following, the aforementioned soft computing models are introduced.
Experimental setup
Converging floodplains
Bousmar (2002) conducted experimental studies in a 10-m-long and 1.2-m-wide flume at the Hydraulics Laboratory of the Catholic University of Leuven, Belgium. The bed slope of the main channel was 0.99 × 10−3. In this flume, a symmetrical compound channel was built, consisting of the main channel with a width of 400 mm and a depth of 50 mm and two floodplains with a width of 400 mm and a height of 50 mm. The Manning roughness coefficient was 0.0107. He investigated two convergence angles of 3.8° and 11.3° considering two convergence lengths of 6 and 2 m. The discharge in this channel was measured by an electromagnetic flow meter capable of measuring the discharge between 2 and 28 L/s. The level of the water's surface elevation along the compound channel was also measured in the channel axis of the main channel using an automatic needle depth gauge.
Diverging floodplains
3D view and top view of geometries: diverging geometries (a) Dv6 and (b) Dv4, and converging geometries (c) Cv6 and (d) Cv2 (Proust et al. 2009).
3D view and top view of geometries: diverging geometries (a) Dv6 and (b) Dv4, and converging geometries (c) Cv6 and (d) Cv2 (Proust et al. 2009).





Specification of experiments
Row . | Channel type . | Test name . | ![]() | Q (L/s) . |
---|---|---|---|---|
1 | Converge | CV2/0.2/10 | 11.2 | 10 |
2 | Converge | CV2/0.3/12 | 11.2 | 12 |
3 | Converge | CV2/0.5/16 | 11.2 | 16 |
4 | Converge | CV6/0.2/10 | 3.8 | 10 |
5 | Converge | CV6/0.3/12 | 3.8 | 12 |
6 | Converge | CV6/0.5/16 | 3.8 | 16 |
10 | Diverge | DV4/0.2/12 | 5.7 | 12 |
11 | Diverge | DV4/0.3/16 | 5.7 | 16 |
12 | Diverge | DV4/0.5/20 | 5.7 | 20 |
13 | Diverge | DV6/0.2/12 | 3.8 | 12 |
14 | Diverge | DV6/0.3/16 | 3.8 | 16 |
15 | Diverge | DV6/0.5/20 | 3.8 | 20 |
Row . | Channel type . | Test name . | ![]() | Q (L/s) . |
---|---|---|---|---|
1 | Converge | CV2/0.2/10 | 11.2 | 10 |
2 | Converge | CV2/0.3/12 | 11.2 | 12 |
3 | Converge | CV2/0.5/16 | 11.2 | 16 |
4 | Converge | CV6/0.2/10 | 3.8 | 10 |
5 | Converge | CV6/0.3/12 | 3.8 | 12 |
6 | Converge | CV6/0.5/16 | 3.8 | 16 |
10 | Diverge | DV4/0.2/12 | 5.7 | 12 |
11 | Diverge | DV4/0.3/16 | 5.7 | 16 |
12 | Diverge | DV4/0.5/20 | 5.7 | 20 |
13 | Diverge | DV6/0.2/12 | 3.8 | 12 |
14 | Diverge | DV6/0.3/16 | 3.8 | 16 |
15 | Diverge | DV6/0.5/20 | 3.8 | 20 |
Soft computing methods
Group method of data handling






Development of the NF-GMDH model









In the NF-GMDH network, there are two fuzzy rules in each neuron and a total of six unknown coefficients, which include four Gaussian parameters and two weighting coefficients. Six unknown parameters in each neuron were obtained using optimizer algorithms.
SVM model
The SVM model is an efficient learning system based on the theory of constrained optimization, which uses the inductive principle of structural error minimization and leads to a general optimal solution. This model was first presented by Vapnik (1999). The SVM model has been divided into two main groups: (a) the Support Vector Classification model and (b) the Support Vector Regression (SVR) model. The SVM classification model is used to solve data classification problems that are placed in different classes, and the SVM regression model is used to solve forecasting problems (Azamathulla et al. 2016).







In building an efficient SVM model, model parameters must be accurately calculated using an optimization method. These parameters are kernel function type, kernel function parameter , tuning parameter C, and accuracy parameter ɛ corresponding to the maximum error in the ɛ-insensitive area.
Evaluation criteria

RESULTS AND DISCUSSION
In this part, the results of the development of the soft computing models of MLPNN, GMDH, NF-GMDH, and SVM to estimate the water's surface elevation in compound open channels with converging and diverging floodplains are presented. First, the MLPNN model is developed. The development of the MLPNN model, as mentioned in the Materials and Methods section, is a multi-step process based on trial and error; however, it is helpful to use the experiences of other researchers. The design of the MLPNN model structure includes determining the number of hidden layers, the number of neurons in each hidden layer, and the activation function and training algorithm. In this research, the recommendations of Parsaie et al. (2018) have been used to design the structure of the MLPNN. According to their suggestion, first, a hidden layer that has some neurons equal to the number of input features should be considered. Then, various types of well-known (suitable) transfer functions should be used and evaluated. During the development of the MLPNN model, activation functions including the logarithm of the sigmoid (logsig), the tangent of the sigmoid (tansig), and the radial basis function (RBF) were evaluated, and their results are presented in Table 2. As can be seen from this table, the best performance is for the log-sigmoid and RBF. However, both transfer functions were selected as superior functions. The structure of the MLPNN model is shown in Figure 2. As is clear from this figure, the developed model has a hidden layer with four neurons.
The performance of transfer functions investigated in the development of the MLPNN model
Name . | R2 . | RMSE . | SI . |
---|---|---|---|
tan-sigmoid | 0.990 | 0.158 | 0.035 |
log-sigmoid | 0.996 | 0.091 | 0.035 |
RBF | 0.997 | 0.091 | 0.091 |
Name . | R2 . | RMSE . | SI . |
---|---|---|---|
tan-sigmoid | 0.990 | 0.158 | 0.035 |
log-sigmoid | 0.996 | 0.091 | 0.035 |
RBF | 0.997 | 0.091 | 0.091 |
The results of the developed models in estimating the water level in the compound channels with converging and diverging floodplains.
The results of the developed models in estimating the water level in the compound channels with converging and diverging floodplains.
The structure of the GMDH network developed to estimate the water level in compound channels with converging and diverging floodplains.
The structure of the GMDH network developed to estimate the water level in compound channels with converging and diverging floodplains.
Considering that there are only three variables in the inputs and considering the development criteria of the GMDH model, only three neurons can be produced in the first layer. According to the development rules of the GMDH model, not all these neurons (in the first hidden layer) can participate in forming the next (second) hidden layer, therefore, two neurons were selected. In the forming of the next hidden layer, since there were only two neurons in the second hidden layer, only one neuron can be prepared, which also plays the role of the output neuron. The statistical error indicators of the GMDH model in the training stages are R2 = 0.965 and RMSE = 0.309, and in the testing stage they are R2 = 0.964 and RMSE = 0.310. The results of the developed GMDH model in different stages of development (training and testing) are presented in Figure 3. The SI of this model's performance in the training and testing stages is 0.038. Comparing the statistical indices of the GMDH model with the MLPNN model in different development stages shows that the accuracy of the GMDH model is partially lower, although the SI of its results is significantly higher. As is clear from Figure 3, this model has more scatter than the MLPNN model. However, its statistical indicators are slightly less accurate. In the following, the results obtained from the application of adaptive fuzzy concepts in the development of the GMDH model are presented. The structure of the developed NF-GMDH model is shown in Figure 5.
As is clear from this figure, the NF-GMDH model has two hidden layers in which there are three neurons in both layers. The statistical error indicators of the NF-GMDH model in the training stage are R2 = 0.996 and RMSE = 0.097, and in the testing stage they are R2 = 0.997 and RMSE = 0.097. The SI of the NF-GMDH model in the training and testing stages is 0.012 and 0.011, respectively. The performance of the NF-GMDH model in the development stages is shown in Figure 3. Examining and comparing the performance of the NF-GMDH model with the previous models (GMDH and MLPNN) shows that the accuracy of the NF-GMDH model has increased significantly compared with the GMDH model, and compared with the MLPNN model, they have almost the same statistical indicators.
Performance of transfer functions investigated in SVM model development
Name . | R2 . | RMSE . | SI . |
---|---|---|---|
Polynomial | 0.940 | 0.085 | 0.022 |
RBF | 0.996 | 0.065 | 0.007 |
Sigmoid | 0.997 | 0.053 | 0.009 |
Name . | R2 . | RMSE . | SI . |
---|---|---|---|
Polynomial | 0.940 | 0.085 | 0.022 |
RBF | 0.996 | 0.065 | 0.007 |
Sigmoid | 0.997 | 0.053 | 0.009 |
An NF-GMDH model structure to estimate the water level in the compound channels with converging and diverging floodplains.
An NF-GMDH model structure to estimate the water level in the compound channels with converging and diverging floodplains.
The structure of the SVM network developed to estimate the water level in the compound channels with converging and diverging floodplains.
The structure of the SVM network developed to estimate the water level in the compound channels with converging and diverging floodplains.
The statistical indicators of the developed SVM model in the training phase are R2 = 0.999 and RMSE = 0.053, and in the testing stage, they are R2 = 0.997 and RMSE = 0.051. The SI of the developed SVM model is equal to 0.007 in the training stage, and 0.006 in the testing stage. Comparing the performance of this model with the previous models (MLPNN, GMDH, and NF-GMDH) shows that the accuracy of this model is significantly better than the GMDH model and comparable to the MLPNN and NF-GMDH models.



Taylor diagram of the performance of the developed models in the training stage.
Taylor diagram of the performance of the developed models in the testing stage.

The performance of SVM, MLPNN, and NF-GMDH models in estimating the water level in the compound channel with a converging floodplain.
The performance of SVM, MLPNN, and NF-GMDH models in estimating the water level in the compound channel with a converging floodplain.
The performance of SVM, MLPNN, and NF-GMDH models in estimating the water level in the compound channel with a diverging floodplain.
The performance of SVM, MLPNN, and NF-GMDH models in estimating the water level in the compound channel with a diverging floodplain.
Comparison with previous findings
This section presents the results of the development of data mining models including the SVM model and the NF-GMDH data model compared with other soft computing models proposed by research workers. Kaushik & Kumar (2023) investigated the water's surface elevation in non-prismatic compound channels using machine learning techniques, including GEP, artificial neural networks (ANNs), and SVMs. They also presented a new equation using the GEP model to predict the water's surface elevation. Their results showed that the ANN model predicted the water's surface elevation more accurately among the used models, so that R2 = 0.999 and RMSE = 0.003, while in the current research, the best performance was related to the SVM model with statistical indices of R2 = 0.998 and RMSE = 0.008 in the test phase. The use of the adaptive fuzzy approach in the development of the GMDH model resulted in a remarkable increase in the accuracy of this model, the statistical indicators reaching values of R2 = 0.985 and RMSE = 0.203 in the testing stage. Another advantage of the current research that makes it unique is finding effective parameters in predicting the water's surface elevation in compound channels, which was discussed in detail in the Results and Discussion section.
CONCLUSION
In this research, the water's surface elevation in compound channels with converging and diverging floodplains was modeled and estimated using the soft computing models of MLPNN, GMDH, NF-GMDH, and SVM. For this purpose, data related to the angle of convergence and divergence, relative depth, and longitudinal distance were used for both these types of floodplain. The results showed that all the soft computing models have good accuracy in estimating the water's surface elevation in such compound channels. However, the best performance and accuracy were related to the SVM. It was found that the use of adaptive fuzzy logic in the development of the GMDH model significantly increases the accuracy of the GMDH. The sensitivity analysis of the developed models, along with the structure of the GMDH model, showed that the relative depth parameter and the relative distance, respectively, have the most effect in modeling and estimating the water's surface elevation in compound channels with converging and diverging floodplains.
DATA AVAILABILITY STATEMENT
All relevant data are available from https://dial.uclouvain.be/downloader/downloader.php?pid=boreal:4996&datastream=PDF_01.
CONFLICT OF INTEREST
The authors declare there is no conflict.