Aeration is a cost-effective and efficient method for increasing the available oxygen or dissolved oxygen content in water bodies, which is crucial for the existence of aquatic life. However, conventional techniques for estimating aeration in different hydraulic structures are time-consuming and incorrect ways to approximate aeration. Therefore, new, computationally more efficient, and more accurate methods are required. In this article, three machine learning models are presented: (1) ELM (extreme learning machine) model, (2) online sequential extreme learning machine model, and (3) I-ELM (incremental extreme learning machine) model. These models assess the air conditioning capacity of the three variants of Piano Key Weirs (PKWs), denoted as A, B, and C, about Cd, Cs, and Cu, which are the three most important parameters for aeration efficiency at different temperatures. The model performance is evaluated and compared based on mean squared error, root-mean-square error, correlation coefficient, mean absolute error, and Nash–Sutcliffe efficiency. This research concludes that I-ELM is the best-performing model for complete available data that are time invariant.

  • Aeration is crucial for maintaining water quality, particularly in rivers and streams.

  • The utilization of multiple machine learning models enables us to predict aeration efficiency over various types of piano key weirs.

  • This study offers a valuable tool for optimizing the design and advancement of piano key weirs.

The design of hydraulic structures and sewerage supports aeration to ensure their good performance and environmental sustainability. In the process of water treatment, aeration enriches oxygen levels in the water and purifies water by removing impurities. Importantly, cavitation does not occur through aeration. Aeration supports hydraulic structure design and sewerage to ensure their good performance and environmental sustainability. Dissolved oxygen (DO) required for different species of aquatic organisms depends on species, habitat type, and specific ecological considerations. Crabs and oysters that live at the bottom need a minimum of 4 mg/L DO, while shallow-water fish require up to 4–15 mg/L DO for survival. The necessity of DO in aquatic organisms relies upon factors like temperature of water, salinity, nutrient concentration, as well as the presence of pollutants (Baylar & Bagatur 2000; Emiroglu & Baylar 2003). In general, aeration is the process that enhances the DO level in water bodies and is crucial for the existence of the aquatic life. It prevents the formation of anaerobic conditions (i.e., those that produce harmful gases like methane or hydrogen sulphide). It saves our environment and also plays a crucial role in combating climate change.

In hydraulic engineering, aeration serves to improve the flow characteristics of water by reducing its weight so that it can become more buoyant (Baylar et al. 2011). This prevents vapour pockets as well as reducing pressure fluctuation promoting efficiency, stability, and safety for hydraulic structures (Singh & Kumar 2022b). Hydraulic structures are built to decrease cavitation and increase oxygen content in flowing water (Rathinakumar et al. 2014) during extreme weather events such as heavy rainfall and droughts. Weirs and other water management structures can be used to manage water flow, reduce the risk of flooding, and maintain water levels during droughts. Properly managed weirs can also help mitigate the impacts of temperature increases on water quality by promoting aeration and maintaining oxygen levels. The last 10 years have seen growing interest in the study of aeration performance of various free flow structures such as triangular, rectangular, labyrinth, and PKWs for efficiency improvement and stability reasons. These studies aim to understand the flow dynamics of different types of weirs and how they affect the aeration and mixing of air in the water while the flow occurs over such types of weirs.

Although the water flows through the hydraulic structure for a short duration, the DO level increases significantly due to turbulence flow that facilitates the mixing of air with flowing water (Emiroglu & Baylar 2003; Guenther et al. 2013). However, these effects are transient, and the DO concentration returns to the normal level as the water flows in plane areas that are far away from the hydraulic structure (Wormleaton & Tsang 2000). This phenomenon mitigates greenhouse gas emissions and prevents the Earth's surface temperature from rising.

The efficient use of a weir for aeration in rivers or streams was initially shown by Gameson (1957). Weirs are known to stir the water and cause turbulence, which raises the concentration of DO. The work by Baylar & Bagatur (2000) was a landmark in the fields of hydrology and for improving the water quality. The effectiveness of the weir aeration depends on many factors such as the weir's size and placement, water flow rate, desired level of aeration, and the cost of equipment and maintenance (Baylar et al. 2001).

This study has several advantages as a continuation of earlier works on the impact of hydraulic structures on water quality. Experimental studies (Apted & Novak 1973; Ervine & Elsawy 1975; Gulliver & Rindels 1993; Baylar & Bagatur 2000; Baylar et al. 2001) examined the aeration capabilities of various types of weirs and found that the labyrinth weir exhibits a higher entrainment rate compared with the linear weir (Avery & Novak 1978; Nakasone 1987; Wormleaton & Soufiani 1998; Emiroglu & Baylar 2003). Many attempts have been made to estimate the ratio of air/water flow and specifically aeration performance of various hydraulic structures like weirs in detail (Bagatur & Sekerdag 2003; Baylar & Ozkan 2006; Baylar et al. 2010, 2011). From a range of PKW experiments, it was discovered that the water with low heads flowing down through these weirs is naturally aerated. In fluid dynamics, ‘nappe’ represents a sheet of water passing over in a structure such as a weir or a dam. The characteristics of the nappe while flowing is another important aspect to consider the design of weirs based on vibrations, the flow of nature that could be laminar or turbulent, fluctuations, or surging of the flow (Baylar et al. 2011; Jaiswal & Goel 2019; Jaiswal & Goel 2020).

Crookston & Tullis (2012) performed research on the labyrinth weir by examining four different ways that the nappe was ventilated: clinging, aerated, partially aerated, and drowned. Their examination presented the understanding of the processes involved in the creation of nappes. Moreover, the research emphasizes the substantial impact of fluctuating negative pressures beneath the overspill nappe, underscoring their capacity to trigger seismic tremors. In addition, researchers examined the consequences of intentionally increasing the oxygen content in the flow of the nappe (Leite Ribeiro et al. 2007). Usually, the size of the nappe depends on how high it falls and how much it spills over the top. When dealing with bigger runs or narrow outlet keys, there can be conflicts between water from lateral ridges, but these conflicts would not significantly affect the efficiency of the output if flow restrictions are kept constant (Machiels 2012).

In addition, Jaiswal & Goel (2020) applied various soft computing methods to forecast air movement around triangular weirs. They noticed that the Gaussian process and M5P techniques accurately predict the amount of oxygen added. Moreover, other approaches include adaptive neuro-fuzzy inference systems (ANFIS) and linear regression indicating PKW oxygenation (Komal et al. 2017).

In the specific field of hydraulic engineering, the scope of using soft computing techniques for the prediction of aeration efficiency for different hydraulic structures has been increased recently. The reason for their choice is that they are intelligent, fast, and precognitive. These methods find favour on account of their intelligence, speed computation, and predictive ability. If implemented, these models would reduce scale effects and simplify the processes involved in model building. In water resources management and hydraulic engineering, different tools of soft computing techniques have been used (Luxmi et al. 2022; Srinivas & Tiwari 2022). Machine learning and soft computing methodologies have recently received attention from more researchers regarding estimating aeration potentials for different weirs (Verma et al. 2022).

Baylar et al. (2011) investigated using gene expression programming (GEP) to estimate the oxygen transfer efficiency in stepped cascade aeration. Singh et al. (2021) attempted to analyse and assess labyrinth weir aeration efficiency by adopting diverse methods like the M5P model and support vector machine. Kumar et al. (2020) implemented a plunge jet analysis using artificial neural networks. Some of the other tools used for this purpose include generalized regression neural network, ANFIS, and multivariate adaptive regression splines and many others, especially for estimating oxygen transfer rates into water and lakes (Baylar et al. 2008). It is with proper recognition of these facts that studies by Baylar et al. (2009a, 2009b) have shown that it is feasible to apply a support vector machine for predicting the efficiency of aeration by plunging overfall jets from weirs. Other forms of machine learning techniques such as K-nearest neighbours were also tried in the process of aeration performance estimation used for predicting aeration performance using Parshall flumes, including random forest regression (Sangeeta et al. 2021). To derive these models, Aradhana et al. (2021) sought to identify the machine learning-based methodologies, which were adaptable for subsequent analysis of aeration capability of labyrinth weirs. Bansal et al. (2023) performed energy calculations on PKW with the help of extreme gradient boosting (XGBoost) and GEP. A recent literature review indicates that many machine leaning techniques have been applied to predict the performance of different hydraulic systems in terms of aeration. By promoting aeration, weirs help to improve water quality by encouraging the breakdown of organic materials through aerobic processes. This can reduce the levels of pollutants and prevent the growth of harmful algae, which thrive in low-oxygen environments.

In this continuation, this article used three different machine learning techniques to predict the aeration performance of various kinds of PKWs, i.e., types A, B, and C (refer to Figure 1). In this work, the models are built by utilizing the data collected by Singh & Kumar (2022a). The various parameters , and are used in the training procedure of extreme learning machine (ELM), online sequential extreme learning machine (OS-ELM), and I-ELM. The aeration efficiency is considered a target parameter. Ultimately, the authors suggested comparing ELM-, OS-ELM-, and I-ELM-based strategies to conventional techniques.
Figure 1

PKW (types A, B, and C) (derived from Lempérière et al. 2011).

Figure 1

PKW (types A, B, and C) (derived from Lempérière et al. 2011).

Close modal

Laboratory setup and dataset used

This study makes use of data published by Singh & Kumar (2022a) on the aeration performance of various types of PKWs. Singh & Kumar (2022a) performed a series of laboratory experiments using a horizontal flume of a rectangular cross-section of width 0.516 m, height 0.6 m, and length 10 m. The discharge flow rate was determined using an electromagnetic flowmeter. Also, its accuracy tolerance should be within the range of ±0.2%. The aeration efficiency and flow regime over the PKWs were investigated. To ensure proper aeration throughout the operation, the level of the water pool downstream is kept above the bubble's deepest penetration. The DO & temperature were measured upstream (u/s) and downstream (d/s) of the PKW using a Thermo-Scientific Orion Star A223 DO Portable Meter (Figure 2) that is calibrated. The water that flows over the weir is clean. The storage tank is filled with clean water before starting each experiment. The details of the experiment setup for the different PKWs are presented in Table 1.
Table 1

Range and different specifications of the experimental setup taken to collect data for different PKWs

Model No.Range of Q (m3/s)Wi (m)Wo (m)P (m)B (m)Bi (m)Bo (m)Range of drop height h (m)Range of aeration efficiency (E20)N (no. of cycles)
PKW-A 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.142 0.142 0.20–0.40 0.185–0.983 
PKW-B 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.284 0.20–0.40 0.157–0.631 
PKW-C 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.284 0.20–0.40 0.151–0.771 
Model No.Range of Q (m3/s)Wi (m)Wo (m)P (m)B (m)Bi (m)Bo (m)Range of drop height h (m)Range of aeration efficiency (E20)N (no. of cycles)
PKW-A 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.142 0.142 0.20–0.40 0.185–0.983 
PKW-B 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.284 0.20–0.40 0.157–0.631 
PKW-C 0.003–0.0155 1.28 0.088 0.069 0.20 0.427 0.284 0.20–0.40 0.151–0.771 
Figure 2

Laboratory schematic PKW aeration apparatus (adopted from Singh & Kumar 2022a).

Figure 2

Laboratory schematic PKW aeration apparatus (adopted from Singh & Kumar 2022a).

Close modal

Methodology

The quantity of air that gets dissolved in the water determines aeration efficiency. As water flows through hydraulic structures, the oxygen concentration rate changes with time in the air–water phase and may be described as follows (Gameson 1957; Gulliver et al. 1990):
(1)
where is the gas molecule mass transfer rate, is the coefficient of liquid-mass transfer, C is the degree of oxygen concentration, A is the area of the surface that is available for aeration, V is the volume of water with which gas molecules transfer, is the concentration at saturation point in equilibrium or ambient conditions, and t is the time. The aeration efficiency is determined by integrating Equation (1) and is given as follows:
(2)
where are the concentrations of DO downstream of the hydraulic framework, DO upstream of the hydraulic framework, and the DO at a saturated level, respectively, and r is the oxygen deficit ratio.
The aeration efficiency E = 0 and 1.0 indicate that zero transfer and the transfer up to the saturation point had appeared at the structure, respectively. Researchers employ a temperature adjustment factor to assess aeration efficiency because temperature significantly impacts aeration efficiency. The aeration efficiency is adjusted at 20 °C by creating the mass exchange comparability relationship and is denoted as E20 (Gulliver et al. 1990).
(3)
where the exponent f is expressed as follows and depends on instantaneous temperature.
(4)
where T denotes the measured temperature in degrees Celsius. The machine learning models are fed with , and at different temperatures for all three PKWs to calculate their aeration efficiencies. The different machine learning models applied are explained in the following subsection.

ELM and its application

ELM is a kind of feedforward neural network initially suggested by Huang et al. (2004, 2006). It is designed for simplicity and fast training. It has been demonstrated that the typical ELM with additive or Radial Basis Function (RBF) activation function (Huang et al. 2006) can approximate any function universally. These models have been applied to many problems, including classification, regression, and feature extractions (Rong et al. 2008; Huang et al. 2010; Wang et al. 2011; Lim et al. 2013). The ELM model's concept is to randomly assign hidden biases and input weights, which are solved analytically for the output weights, instead of using iterative optimization methods like backpropagation, gradient descent, etc. The model is applied to predict the aeration efficiency E for a given value of at different heights of PKWs and at temperature 33°C.

The structure of the ELM is depicted in Figure 3, where signifies the input weights assigned between the input and the hidden layers. The threshold for the hidden layer node is denoted by . Initially, both the input weights and biases have been selected randomly, and symbolizes the output weight that bridges the output layer to the hidden layer, which is determined analytically. and P denote the number of nodes in the input, hidden, and output layers, respectively. Y is the prediction of the model.
Figure 3

ELM model structure with a single hidden layer.

Figure 3

ELM model structure with a single hidden layer.

Close modal
The given dataset in the ELM model with H nodes of the hidden layer and with activation function can be designed as follows:
(5)
where is the weight vector that bridges the input node and the ith hidden node, and signifies the threshold of the ith hidden node. The weight vector connects the ith hidden node to the output nodes. signifies the inner product. The activation functions include ‘Sigmoid’ and ‘Sine’.
Equation (5) can be written as follows:
(6)
where and .
The optimization object of the ELM model is given as follows:
(7)
where is the prediction made by the ELM model for the ith sample with the sample having an accurate label . The variable N stands for the total number of samples in the dataset. The optimization challenge at hand can be effectively addressed using the Moore–Penrose generalized inverse method, which is expressed mathematically as follows:
(8)

OS-ELM and its application

Liang et al. (2006) suggested an online version of ELM. OS-ELM is a modification of ELM algorithms. Single hidden layer feedforward neural networks, such as the online sequential ELM model and ELM model, are well known for their ability to understand complex nonlinear functions. The critical difference between the two is that the OS-ELM is designed for online learning, while the ELM is intended for batch learning. All training data are fed once in ELM, while in OS-ELM, large training data are fed in modules. OS-ELM is used in case of memory constraints. OS-ELM is relatively fast in learning as ELM, while it has an advantage over ELM in learning from the newly arrived data. The hidden layer weights of OS-ELM are created at random and maintained as constant, whereas the adjustment of output weights takes place incrementally with the arrival of new data. This enables the model to acquire knowledge rapidly and adjust to shifting trends in the data. OS-ELM can use existing data to update the model, without having to train the whole dataset again, which can be faster and cheaper. OS-ELM can be used for various tasks such as recognizing images and sounds, analysing biological data, and controlling systems. The specifications of algorithms are outlined below. The OS-ELM algorithm's mathematical equations can be classified into two steps: the initialization step and the online learning step.

Initialization step:

  • • A set of training data is fed to start the training process, involving unique and arbitrary samples such that .

  • • Input parameters are assigned randomly, as in the ELM model, and the initial output matrix is computed:

  • from the hidden layer

  • • Calculate the initial output weight matrix from Equation (8):
  • • Set and , where represents the count of data chunks.

Online learning step:

  • For distinct arbitrary samples, the given segment of new training data is considered.

  • Determine the hidden layer output matrix similarly as described in Equation (6).

  • Determine the output matrix by applying the following mathematical relation:
where .

The algorithm used in OS-ELM is as follows:

Input:

  • : hidden nodes count;

  • : count of data chunks;

  • The training data chunk is denoted by :

Outputs:

  • 1. //Initialization;

  • 2. for ;

  • 3. set input weight randomly;

  • 4. set hidden layer bias randomly;

  • 5. compute L0 (the hidden layer output matrix);

  • 6. determine (the output weight matrix);

  • 7.

  • 8. //Learning process;

  • 9. while k is less than K;

  • 10. determine the output matrix of the hidden layer Lk+1; and
  • 11. calculate the output weight matrix ;

  • 12. ;

I-ELM and its application

The incremental ELM is an extension of the ELM algorithm that is introduced (Huang & Chen 2008). It is designed to handle incremental learning. A limited number of hidden nodes are initially present in the model. The hidden layer's input weights and biases are chosen at random. The model is trained on the first batch of the data samples using the ELM training method. The new hidden nodes are added, and adjustments in the biases and the input weights of the already current hidden nodes have been updated incrementally. It uses simple linear least-square methods to calculate the output weight, considering the new data samples and previously learned data.

In addition to being able to handle the nonlinearity of the data and high dimensionality in problems with large scale, it can also quickly and efficiently adapt to new data samples. However, some drawbacks include its dependence on initial conditions and the need for fine-tuning for optimal performance. The structure of the incremental ELM model is given in Figure 4. It consists of I (inputs), H (hidden nodes), and P (outputs). represents the weight vector of order for the currently hidden layer neuron, whose elements are evenly spaced random numbers between [−1, 1].
Figure 4

The structure of incremental extreme learning machine model with I inputs, H hidden nodes, and outputs.

Figure 4

The structure of incremental extreme learning machine model with I inputs, H hidden nodes, and outputs.

Close modal
is the 's node bias and its value is taken randomly from the evenly divided set [−1, 1] when the hidden layer's activation function is additive and calculated as given in Equation (9):
(9)
where x represents an input vector. denotes the output weight vector of the order . The dataset to train the model is considered, where X represents I inputs in each dataset and Y represents P output of each N dataset. The I-ELM algorithm's training steps are outlined here as follows:

Step 1: Initially, let and H be the maximum hidden nodes. The predicted accuracy is predetermined to be . The output Y is set as the initial values for the residuals E, which represents the variance between the network's actual and desired outputs.

Step 2: Steps taken to train the model: While and .

  • 1. , where H is the count of hidden nodes.

  • 2. The newly expanded hidden layer neuron 's input weight and bias are generated at random.

  • 3. Determine the output of the activation function for .
    (10)
  • 4. Determine the output vector :
    (11)
  • 5. Determine the output weight for as given in Equation (12):
    (12)
  • 6. Once the new hidden node has been increased, the residual error is determined as follows:
    (13)

The output weight, as determined in Equation (12), decreases the error of the network quickly. Repeat the previous steps until the residuals are less than the expected error . The training process should be restarted when and the error is more than the anticipated error . Restarting the process of training is advised since it is typically because of the random calculation of the input weights (i.e.,) and biases (i.e., ). The user can check to see if the trained network satisfies the requirements using the provided testing set .

Model performance metrics

The performances of the proposed models are determined using five different statistical measures: (1) mean squared error (MSE), (2) root-mean-square error (RMSE), (3) mean absolute error (MAE), (4) correlation coefficient (CC), and (5) Nash–Sutcliffe efficiency (NSE). The MSE determines the average square of the error, which is the deviation between the actual values and predicted ones. A better model will have a value of MSE close to 0. MSE is calculated as follows:
(14)
where N is the count of data points in the given dataset, and EActual and EPredicted are actual and predicted aeration efficiency, respectively.
RMSE is utilized to calculate the error between actual and predicted values. The value falls within the range of 0 and 1. A value nearer to 0 results in a better and well-fit model. RMSE is calculated as follows:
(15)
MAE calculates the mean of the absolute error, and its lower value indicates a better prediction of a given machine model.
(16)
The CC indicates the relation between two variables' relative movement. It ranges from −1 to +1, whereas a correlation of 0 means no relationship between two variables. The CC is calculated as follows:
(17)
The NSE is a statistical measure utilized to measure the effectiveness of environmental and hydrological systems, representing the ratio of the model's error to potential error. The value of NSE is from to 1. A value closer to 1 represents a perfect match of the model to the actual data. The NSE is calculated as follows:
(18)

The aforementioned five metrics compare the degree of accuracy of the applied machine learning techniques. The section on simulation and results presents a comparison of the outcomes. The aforementioned three described models are trained and simulated for the given dataset, and results are presented in the following section.

The fundamental concept of fluid discharge over a Piano Key Weir involves three primary elements: flow across the inlet key, lateral flow, and outlet key along the side crest (Machiels 2012; Khassaf et al. 2015). These components work together, leading to a complex three-dimensional flow pattern illustrated in Figure 5. Water tends to flow more readily from the side crest toward the outlet key as the head increases, reducing hydraulic efficacy until two discharging streams merge. Consequently, the PKW exhibits behaviour akin to a linear weir.
Figure 5

Flow pattern over PKW.

Figure 5

Flow pattern over PKW.

Close modal

Observation of water movement along the edge of the crest displays two discernible patterns. The initial pattern, closer to the upstream region, shows no signs of aeration. In contrast, the second pattern, located far from the sidewall crest, exhibits clear aeration. This aeration zone becomes more pronounced as the water discharge progresses downstream. As depicted in Figure 5, the flow over the PKW structure has a high degree of ventilation and a three-dimensional nature. Notably, areas of splashing and spraying are evident both inside the outlet keys and in the vicinity of the base of the structure (Singh & Kumar 2022b).

Concerning the trajectory of (head), there is a gradual expansion in the area where spraying and sprinkling occur. The planar jet starts far from the crest. In contrast, the region of air circulation undergoes a substantial increase as rises. This expansion is partly attributed to the rise in local velocity, which results in elevated and turbulent mixing and advection levels. This particular flow pattern enhances the aeration efficiency, as the intricate flow behaviour leads to an increase in air bubbles over a prolonged duration and distance (Eslinger & Crookston 2020).

As per existing research, hydraulic structures' effectiveness in aeration is impacted by factors that include the temperature of the flowing water, water quality, depth of the tailwater, height of the drop, and the discharge of water. It was seen that the weir's aeration performance is highly affected by the drop height as well as the discharge rate across it (Singh & Kumar 2022a).

The experimental data collected by Singh & Kumar (2022a) for types A, B, and C are used to train ELM, OS-ELM, and I-ELM models. The ELM and its variant are used as powerful tools for regression tasks. Here, regression establishes the relationship between the output and the corresponding input dataset. Aeration efficiency (E) depends on many factors, as explained in Section 1; the objective of applying different machine models is to examine how accurately the machine models establish a relation between the output E and inputs . The dataset contains three features and one output. The features affect the aeration efficiency predominantly as given in Equation (2) and are also compatible with ELM and its variants. There are two groupings within the entire dataset: (1) training dataset and (2) testing dataset. Eighty percent of the overall dataset is the training dataset, which is chosen at random, with the remaining 20% going toward testing.

Graphical visualization

The results demonstrate the predicted outcome of different models and their correlation with varying input parameters with several graphs. The findings based on the various parametric studies over type A PKW and its correlation with input parameters are presented in the following subsection.

Simulation results for type A PKW

The input dataset of 3,207 different sets of three input attributes is taken to simulate different models. Hence, the size of the input dataset is . The total 2,565 number of different sets, which is 80% of the total available dataset, is employed randomly for the training of the model. The remaining 20% of different sets are used to test the model. The aeration efficiency, E, which is the output of machine models, is a vector of size . Figure 6(a) compares the predicted aeration efficiency with the actual aeration efficiency when the ELM model is used. Figure 6(b) and 6(c) presents the results when OS-ELM and I-ELM are employed to forecast the aeration efficiency of type A PKW.
Figure 6

The predicted aeration efficiency versus actual aeration efficiency of type A PKW (a) when ELM model is applied, (b) with OS-ELM model, and (c) with I-ELM model. (d) Results all the three models overlaid in a single plot.

Figure 6

The predicted aeration efficiency versus actual aeration efficiency of type A PKW (a) when ELM model is applied, (b) with OS-ELM model, and (c) with I-ELM model. (d) Results all the three models overlaid in a single plot.

Close modal
Figure 6(d) represents the results of all three applied machine models in a single graph overlaying each other to compare the results at a glance. The value of deviation of the predicted aeration efficiency from the actual aeration efficiency is calculated by MSE. The minimum MSE means the corresponding model is more efficient. Figure 7 represents absolute error, i.e. the variation between the actual and predicted values for all testing dataset.
Figure 7

The absolute error of each of 642 testing datasets predicted by ELM, OS-ELM, and I-ELM in case of type A PKW.

Figure 7

The absolute error of each of 642 testing datasets predicted by ELM, OS-ELM, and I-ELM in case of type A PKW.

Close modal

Simulation results for type B PKW

The simulation results for PKW-B are presented similarly to those for type A PKW mentioned earlier. All three machine models are then used to evaluate the aeration efficiency of PKW-B, and the results are presented in this section. Figure 8(a)–8(c) shows the predicted aeration efficiency versus actual aeration efficiency as predicted by ELM, OS-ELM, and I-ELM, respectively. Figure 8(d) plots the results of all three models overlaying each other but represented with different colour schemes.
Figure 8

The predicted aeration efficiency versus actual aeration efficiency of PKW-B with the application of (a) ELM, (b) OS-ELM, and (c) I-ELM model. (d) All results are presented on the same plot for comparison.

Figure 8

The predicted aeration efficiency versus actual aeration efficiency of PKW-B with the application of (a) ELM, (b) OS-ELM, and (c) I-ELM model. (d) All results are presented on the same plot for comparison.

Close modal
The deflection of the predicted value from the actual value for the same testing dataset corresponding to the PKW-B is plotted similarly and indicated by the absolute error, as given in Figure 9.
Figure 9

The absolute error of each of 642 testing datasets predicted by ELM, OS-ELM, and I-ELM in case of PKW-B.

Figure 9

The absolute error of each of 642 testing datasets predicted by ELM, OS-ELM, and I-ELM in case of PKW-B.

Close modal

Simulation results for PKW-C

This section represents the results when all three machine learning algorithms are applied to predict the aeration efficiency of type C similar to PKW (types A and B), and simulations are presented in Figures 10 and 11.
Figure 10

The predicted aeration efficiency versus actual aeration efficiency when (a) ELM model, (b) OS-ELM model, and (c) I-ELM model are used for PKW-C. (d) All results are presents in a single plot for better visual comparison.

Figure 10

The predicted aeration efficiency versus actual aeration efficiency when (a) ELM model, (b) OS-ELM model, and (c) I-ELM model are used for PKW-C. (d) All results are presents in a single plot for better visual comparison.

Close modal
Figure 11

The absolute error for testing datasets predicted by ELM, OS-ELM, and I-ELM in case of PKW-C.

Figure 11

The absolute error for testing datasets predicted by ELM, OS-ELM, and I-ELM in case of PKW-C.

Close modal

After analysing the overall pattern of data points on the graphs plotted between actual and predicted aeration efficiency, it is found that the data points are clustered diagonally in the case of ELM and I-ELM, suggesting that the predictions nearly reflect the actual values. Hence, ELM and I-ELM predict the results more precisely and accurately. The results deviate from the accurate values, and therefore, the absolute error is maximum in the case of OS-ELM as the input data are not streaming time series data.

Comparison of prediction of ELM, OS-ELM, and I-ELM

In Section 3.1, three machine learning techniques – ELM, OS-ELM, and I-ELM – are provided for comparison in terms of prediction accuracy. Then their performances are evaluated using five metrics described in Section 2.6, which are MSE, RMSE, MAE, CC, and NSE. The accuracy of the prediction is inversely proportional to MSE, RMSE, and MAE, while CC and NSE are directly proportional to the accuracy of the models in prediction aeration efficiency.

MAE, MSE, and RMSE values are the minimum for I-ELM and the maximum for OS-ELM. The value of CC and NSE is the maximum and close to unity for I-ELM compared with the other two models, as shown in Table 2. Hence, the prediction made by I-ELM is nearer to the actual value of aeration efficiency in all three cases of PKWs. The results given by OS-ELM deviate the most from the actual value, hence the least efficient model for the given dataset and conditions. The accuracy of the ELM lies between the remaining two. The complete dataset is available to feed at once in the models and does not change rapidly with time. Hence, ELM is more efficient than OS-ELM, which is more suitable when the data arrive sequentially. It is observed that for the available complete dataset, ELM and I-ELM perform better than OS-ELM. I-ELM can process data in batch and incremental modes that can handle vast data. It provides better results for the conditions in this research work. The model can be placed as OS-ELM, ELM, and I-ELM to increase their performance, as concluded in Table 2.

Table 2

MSE and RMSE of predicted aeration efficiency made by ELM, OS-ELM, and I-ELM for type A PKW, PKW-B, and PKW-C

ModelELM
OS-ELM
I-ELM
PKW typeABCABCABC
MSE 0.0000204 0.000029 0.000024 0.00054 0.00064 0.00049 0.0000201 0.000023 0.000019 
RMSE 0.00451 0.00540 0.00493 0.0232 0.0252 0.0221 0.00448 0.00485 0.00426 
MAE 0.005176 0.00487 0.004084 0.01522 0.013949 0.012374 0.002446 0.002097 0.002404 
CC 0.998982 0.998855 0.999318 0.992188 0.992143 0.992197 0.999668 0.99975 0.999643 
NSE 0.997842 0.997584 0.998556 0.984389 0.984231 0.984106 0.99932 0.999465 0.999169 
ModelELM
OS-ELM
I-ELM
PKW typeABCABCABC
MSE 0.0000204 0.000029 0.000024 0.00054 0.00064 0.00049 0.0000201 0.000023 0.000019 
RMSE 0.00451 0.00540 0.00493 0.0232 0.0252 0.0221 0.00448 0.00485 0.00426 
MAE 0.005176 0.00487 0.004084 0.01522 0.013949 0.012374 0.002446 0.002097 0.002404 
CC 0.998982 0.998855 0.999318 0.992188 0.992143 0.992197 0.999668 0.99975 0.999643 
NSE 0.997842 0.997584 0.998556 0.984389 0.984231 0.984106 0.99932 0.999465 0.999169 

Accurately assessing the aeration performance of the weir structures is critical for hydraulic engineers and designers because aeration is the most effective way to enhance oxygen content in flowing water and prevent the anaerobic conditions in the water bodies, which are critical for the survival of aquatic life. To this end, the dataset obtained experimentally to calculate the aeration efficiency of different types of PKWs is used to train different extreme machine learning models and their variants like OS-ELM and I-ELM. Hyperparameters of all the applied models are correctly tuned. The major portion of the dataset is allocated for training the model, while the residual portion is utilized for testing the learning of different models. The prediction efficiency of three models, ELM, OS-ELM, and I-ELM, is compared while calculating the aeration efficiency of different hydraulic structures of type A, B, and C PKWs. It is found that for a complete available dataset that does not change rapidly with time, I-ELM performs the best, whereas, for these conditions, the prediction made in OS-ELM deviates the most from the actual value. The presented work can be extended to establish a well-defined relation between the input and output variables where the input dataset is unorganized. The performance of the models can be further evaluated under different data types, conditions, and problems from different domains.

The authors express their gratitude to the faculty, staff, and technical team of Delhi Technological University's Department of Civil Engineering for their help and assistance in conducting this research.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Apted
R. W.
&
Novak
P.
1973
Some studies of oxygen uptake at weirs
. In:
Proceedings of the XV Congress, IAHR Paper B23
, Vol.
23
.
Istanbul, Turkey
:
International Association for Hydraulic Research
, pp.
177
186
.
Aradhana
A.
,
Singh
B.
&
Sihag
P.
2021
Predictive models for estimation of labyrinth weir aeration efficiency
.
Journal of Achievements in Material and Manufacturing Engineering
105
(
1
),
18
32
.
Avery
S.
&
Novak
P.
1978
Oxygen transfer at hydraulic structures
.
Journal of Hydraulic Engineering, ASCE
104
(
HY11
),
1521
1540
.
Bagatur
T.
&
Sekerdag
N.
2003
Air-entrainment characteristics in a plunging water jet system using rectangular nozzles with rounded ends
.
Water SA
29
(
1
),
35
38
.
Baylar
A.
&
Bagatur
T.
2000
Study of aeration efficiency at weirs
.
Turkish Journal of Engineering and Environmental Sciences
24
(
4
),
255
264
.
Baylar
A.
&
Ozkan
F.
2006
Applications of Venturi principle to water aeration systems
.
Environmental Fluid Mechanics
6
(
4
),
341
357
.
Baylar
A.
,
Bagatur
T.
&
Tuna
A.
2001
Aeration performance of triangular-notch weirs
.
Water Environmental Journal
15
(
3
),
203
206
.
Baylar
A.
,
Hanbay
D.
&
Ozpolat
E.
2008
An expert system for predicting aeration performance of weirs by using ANFIS
.
Expert Systems with Applications
35
(
3
),
1214
1222
.
Baylar
A.
,
Unsal
M.
&
Ozkan
F.
2009a
Hydraulic structure in water aeration process
.
Water, Air, & Soil Pollution
210
,
87
100
.
Baylar
A.
,
Unsal
M.
&
Ozkan
F.
2010
Hydraulic structures in water aeration processes
.
Water, Air, & Soil Pollution
210
(
1–4
),
87
100
.
Baylar
A.
,
Unsal
M.
&
Ozkan
F.
2011
GEP modeling of oxygen transfer efficiency prediction in aeration cascades. KSCE
.
Journal of Civil Engineering
15
(
5
),
799
804
.
Crookston
B. M.
&
Tullis
B. P.
2012
Hydraulic design and analysis of Labyrinth weirs: Part-II: Nappe Aeration, Instability and Vibration
.
Journal of Irrigation & Drainage Engineering, ASCE
138
(
8
),
757
765
.
Emiroglu
M. E.
&
Baylar
A.
2003
The effect of broad crested weir shape on air entrainment
.
Journal of Hydraulic Research
41
(
6
),
649
655
.
Ervine
D. A.
&
Elsawy
E. M.
1975
Effect of a falling nappe on river aeration
. In
16th Congress of the International Associations for Hydraulic Research Sao Paulo, Paper No. C45
,
Brazil
, pp.
390
397
.
Eslinger
K. R.
&
Crookston
B. M.
2020
Energy Dissipation of Type A Piano Key Weirs
.
Basel, Switzerland
:
MDPI
.
Gameson
A. L. H.
1957
Weirs and aeration of rivers
.
Journal of the Institution of Water Engineers
11
(
5
),
477
490
.
Guenther
P.
,
Felder
S.
&
Chanson
H.
2013
Flow aeration, cavity processes and energy dissipation on flat and pooled stepped spillways for embankments
.
Environmental Fluid Mechanics
13
(
5
),
503
525
.
Gulliver
J. S.
&
Rindels
A. J.
1993
Measurement of air-water oxygen transfer at hydraulic structures
.
Journal of Hydraulic Engineering ASCE
119
(
3
),
327
349
.
Gulliver
J. S.
,
Thene
J. R.
&
Rindels
A. J.
1990
Indexing gas transfer in self-aerated flows
.
Journal of Environmental Engineering, ASCE
116
(
3
),
503
523
.
Huang
G. B.
&
Chen
L.
2008
Enhanced random search based incremental extreme learning machine
.
Neurocomputing
71
,
3060
3068
.
Huang
G. B.
,
Zhu
Q. Y.
&
Siew
C. K.
2004
Extreme learning machine: A new learning scheme of feedforward neural networks
, Vol.
2
(
25–29
). In:
Proceedings of International Joint Conference on Neural Networks (IJCNN2004)
, pp.
985
990
.
Huang
G. B.
,
Zhu
Q. Y.
&
Siew
C. K.
2006
Extreme learning machine: Theory and applications
.
Neurocomputing
70
(
1–3
),
489
501
.
Huang
G. B.
,
Ding
X.
&
Zhou
H.
2010
Optimization method based extreme learning machine for classification
.
Neurocomputing
74
,
155
163
.
Jaiswal
A.
,
Goel
A.
,
2019
Aeration through weirs: A critical review
. In:
Sustainable Engineering, Lecture Notes in Civil Engineering
, Vol.
30
(
Agnihotri
A.
,
Reddy
K.
&
Bansal
A.
, eds.).
Singapore
:
Springer
.
Jaiswal
A.
, &
Goel
A.
,
2020
Evaluation of aeration efficiency of triangular weirs by using Gaussian process and M5P approaches
. In:
Advances Engineering Optimization Through Intelligent Techniques. Advances in Intelligent Systems and Computing
, Vol.
949
, 749–756. (
Venkata
R.
&
Taler
J.
, eds.).
Singapore
:
Springer
.
https://doi.org/10.1007/978-981-13-8196-6_66
.
Khassaf
S. I.
,
Aziz
L. J.
&
Elkatib
Z. A.
2015
Hydraulic behavior of piano key weir type B under free flow conditions
.
International Journal of Scientific & Technology Research
4
(
8
),
158
163
.
Komal
K. M.
,
Tiwari
N. K.
&
Ranjan
S.
2017
Aeration performance evaluation of piano key weir using linear regression and adaptive neuro-fuzzy inference system
. In:
ICET: EITM-2017, NIT Hamirpur
,
December 16–18, 2017
,
India
.
Kumar
M.
,
Tiwari
N. K.
&
Ranjan
S.
2020
Soft computing based predictive modelling of oxygen transfer performance of plunging hollow jets
.
ISH Journal of Hydraulic Engineering
28
(
sup1
),
223
233
.
Leite Ribeiro
M.
,
Boillat
J. L.
,
Schleiss
A.
,
Laugier
F.
&
Albalat
C.
2007
Rehabilitation of St-Marc Dam – Experimental Optimization of a Piano Key Weir
. In:
Proceedings of the 32nd Congress of IAHR
.
Lempérière
F.
,
Vigny
J. P.
,
Ouamane
A.
,
2011
General comments on labyrinth and piano key weirs: The past and present
. In:
Proceedings of the International Conference on Labyrinth and Piano Key Weirs (PKW 2011)
(
Erpicum
S.
,
Laugier
F.
,
Boillat
J.-L.
,
Pirotton
M.
,
Reverchon
B.
&
Schleiss
A.
, eds.).
Leiden, Netherlands
:
CRC Press, Taylor & Francis Group
.
Liang
N. Y.
,
Huang
G. B.
,
Saratchandran
P.
&
Sundararajan
N.
2006
A fast and accurate on-line sequential learning algorithm for feedforward networks
.
IEEE Trans Neural Netw
17
(
6
),
1411
1423
.
Luxmi
K. M.
,
Tiwari
N. K.
&
Ranjan
S.
2022
Application of soft computing approaches to predict gabion weir oxygen aeration efficiency
.
ISH Journal of Hydraulic Engineering
1
15
.
https://doi.org/10.1080/09715010.2022.2050311
.
Machiels
O.
2012
Experimental Study of the Hydraulic Behaviour of Piano Key Weirs
.
PhD thesis
,
Belgium
:
HECE Research Unit, University of Liège
.
Nakasone
H.
1987
Study of aeration at weirs and cascades
.
Journal of Environmental Engineering, ASCE
113
(
1
),
64
81
.
Rathinakumar
V.
,
Dhinakaran
G.
&
Suribabu
C. R.
2014
Assessment of aeration capacity of stepped cascade for selected geometry
.
International Journal of Chemical Technology Research
6
(
1
),
254
262
.
Rong
H. J.
,
Ong
Y. S.
,
Tan
A. H.
&
Zhu
Z.
2008
A fast pruned-extreme learning machine for classification problem
.
Neurocomputing
72
,
359
366
.
Sangeeta
,
Haji Seyed Asadollah
S. B.
,
Sharafati
A.
,
Sihag
P.
,
Al-Ansari
N.
&
Chau
K. W.
2021
Machine learning model development for predicting aeration efficiency through Parshall flumes
.
Engineering Applications of Computational Fluid Mechanics
15
,
889
901
.
Singh
D.
&
Kumar
M.
2022b
Energy dissipation of flow over the type-B piano key weir
.
Flow Measurement and Instrumentation
83
,
102
109
.
Singh
A.
,
Singh
B.
&
Sihag
P.
2021
Experimental investigation and modeling of aeration efficiency at labyrinth weirs
.
Journal of Soft Computing in Civil Engineering
5
(
3
),
15
31
.
Srinivas
R.
&
Tiwari
N. K.
2022
Oxygen aeration efficiency of gabion spillway by soft computing models
.
Water Quality Research Journal
57
(
3
),
215
232
.
Verma
A.
,
Ranjan
S.
,
Ghanekar
U.
&
Tiwari
N. K.
2022
Soft computing techniques for predicting aeration efficiency of gabion stepped weir
. In:
Proceedings of the International Conference on Industrial and Manufacturing Systems (CIMS2020)
.
Cham
:
Springer
, pp.
117
122
.
Wang
L.
,
Huang
Y. P.
,
Luo
X. Y.
,
Wang
Z.
&
Luo
S. W.
2011
Image deblurring with filters learned by extreme learning machine
.
Neurocomputing
74
,
2464
2474
.
Wormleaton
P. R.
&
Soufiani
E.
1998
Aeration performance of triangular platform labyrinth weirs
.
Journal of Environmental Engineering
124
(
8
),
709
719
.
Wormleaton
P.
&
Tsang
C.
2000
Aeration performance of rectangular planform labyrinth weirs
.
Journal of Environmental Engineering
126
(
5
),
456
465
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).