Accurate prediction of streamflows is crucial for managing water resources. Machine learning approaches have gained popularity for their ability to handle noisy and non-linear data and develop models that are capable of detecting relationships from the data they are provided with. This study was conducted to compare the performance of three machine learning algorithms (including extreme learning machine (ELM), random forest (RF), and gene expression programming (GEP)) and their hybrid versions in predicting the monthly streamflow of the Leaf River catchment. The models were tested with three scenarios and the most accurate scenario has been selected for the implementation of hybrid models. Results of all the models have been examined with a new evaluation index called general index (GI), which is calculated based on the three error statistical indices. Finally, the GEP model outperformed the other models, all the scenarios with GI = 11.268 in the M3 scenario and later, the ELM algorithm presented the best performance with GI = 12.811 in the M2 scenario, while the RF model had the worst overall performance. Regarding the hybrid models, using the EMD and principal components analysis (PCA) methods decreased the precisions of the models with the GI values fluctuating around 35.

  • This study was conducted to compare the performance of three machine learning algorithms (including extreme learning machine, random forest, and gene expression programming) and their hybrid versions in predicting the monthly streamflow.

  • In this study, a new index called general index is used to perform evaluations based on three other indices, root mean square error, mean absolute error, and normalized root mean square error.

RMSE

root mean square error

MAE

mean absolute error

LSSVM

least square support vector machine

ENN

emotional neural network

GP

genetic programming

ACF

autocorrelation function

PACF

partial autocorrelation function

ELM

extreme learning machine

GEP

gene expression programming

GPR

Gaussian process regression

ML

machine learning

SVM

support vector machine

M5T

M5 Tree

LSTM

long short-term memory

LWLR

local weighted linear regression

ANN

artificial neural networks

M5P

M5Prime

RF

random forest

GA

genetic algorithm

Increasing demand for water resources and increasing uncertainty in the supply are the results of climate change and human activities (Chu & Huang 2020). The availability of water resources is essential for human survival, as well as an integral part of socio-economic protection (Chu & Huang 2020). One of the most essential tasks for the planning, development, and optimal use of water resources is accurate flow forecasting, which plays an important role in preserving water resources (Tongal & Booij 2018). Consequently, it is imperative to develop methods of managing water resources, which include predicting streamflow, that is accurate and efficient (Tongal & Booij 2018). Water resource management in the past few decades has encouraged the development of streamflow simulations for river basins of various scales, and many researchers have focused on this topic (Naghibi & Pourghasemi 2015). Hydrologists are focused on researches aimed at enhancing the precision and accuracy of short-term streamflow forecasting which is a difficult undertaking (Naghibi & Pourghasemi 2015). Simulating streamflow and analyzing a hydrological system's behavior are essential functions of hydrological models (Remesan & Mathew 2015). To allocate water resources efficiently and use sustainably, it is crucial to have an accurate and detailed prediction of streamflow (Remesan & Mathew 2015). Simulating the streamflow is very difficult because it has a non-linear and non-constant behavior and depends on various factors such as weather, land type, surface vegetation, etc. (Young et al. 2017). In addition to providing valuable information for reducing natural disasters like floods and droughts, it can also facilitate the safe and economical operation of reservoirs, as well as an effective way of organizing the use of water across different sections to maximize the benefits of the water system (Young et al. 2017).

The accuracy of complicated hydrological models has been evaluated in a variety of ways using many models, and a lot of research is currently being conducted to improve those approaches (Sahoo et al. 2017). There is a growing number of watershed hydrological models based on machine learning (ML) approaches in recent years (Sahoo et al. 2017). Hydrology of catchments faces a significant problem when it comes to developing algorithms that can accurately predict streamflow (Sahoo et al. 2017). With the ability to handle noisy and nonlinearities data, the aim of ML is to produce models that can detect relationships within data and improve their performance based on the number of samples used for training (Hastie et al. 2009). It has been developed and tested a number of models for streamflow analysis by using ML: artificial neural network (ANN) (Zhou et al. 2018), extreme learning machine (ELM) (Huang et al. 2018), random forest (RF) (Li et al. 2019), least square support vector machine (LSSVM), genetic programming (GEP) (Mehr 2018), emotional neural network (ENN) (Yaseen et al. 2020), and M5 model tree (M5T)/M5Prime (M5P) (Khosravi et al. 2021; Tiwari et al. 2021). A study by Essam et al. (2022) employed ML algorithms to predict the streamflow of 11 rivers throughout Peninsular Malaysia using ANNs, long short-term memory (LSTM), and support vector machines (SVMs). Among the other methods used in the paper, they concluded that the ANN model produced the best results. Abda et al. (2022) estimated daily streamflow in the Oued Sebaou Watershed using local weighted linear regression (LWLR), ANNs, and RF. They showed the RF model produced better performances than any of the ML models used in the study, such as the ANN and the LWLR models.

Hybrid methods have been used in many research studies to date: principal components analysis (PCA) (Fan et al. 2017), rotated principal components analysis (RPCA) (Meng et al. 2019; Scherl et al. 2020) conducted a research on the Wei River located in China. In this research, the monthly flow was predicted using ANN, SVM, WA–SVM, EMD–SVM, and M-EMD–SVM methods. The EMD–SVM method has produced better results than other approaches. Noori et al. (2011) used the SVM method along with PCA, GT, and FS in their research on Alaviyan Dam located in Iran for monthly flow forecasting and finally compared the results. This research showed that the PCA_SVM method had the highest accuracy and the lowest risk compared to other methods. During the research conducted on the Arkansas River located in the USA by Chamani & Roushangar (2020) the daily and monthly discharge of this river was modeled and compared with ELM, GPR, and CEEMD methods. As a result of using the CEEMD method, the accuracy and performance of the mentioned models were improved.

In this paper, three ML algorithms including GEP, RF, and ELM were used to model and predict the monthly streamflow of Leaf River, United States of America, and three criteria compared the results. The main focus of this research was to investigate the efficiency and precision of these three algorithms with the three forcing inputs of streamflow rate (Q), precipitation (R), and evaporation (E). Today, given the advanced capabilities and performance of today's models, it is conventional to use hybrid models and functions linked to these models. Combinations may take different forms, such as optimization, influence and change of input data, and influence and change of output data. In this article, EMD and PCA algorithms that influence the input data are used. After analyzing the input data into IMFs using the EMD algorithm, PCA was used to reduce the amount of data due to the increased amount of input data.

In general, this research is a comparison between single methods and hybrid methods. Choosing the best model from a variety of options is one of the most challenging tasks for hydrologists. Different indexes are available for evaluating different topics, including model performance, model accuracy, and model error. In this study, a new index called general index (GI) is used to perform evaluations based on three other indices, RMSE, MAE, and NRMSE. In view of the fact that the other three indicators are aligned and exhibit similar behavior, the lower value of the evaluation indicates a higher level of accuracy in the modeling, the GI is calculated by taking the weighted average of these three criteria. By employing this criterion, the process of selecting the best model is simplified and more accurate and reliable. Various researches in different fields can utilize this index, which can be expanded through subsequent studies. In this research, the flow, precipitation, and evaporation data of the studied area were first collected and the process of data preparation, model selection and modeling was done, and after calculating the evaluation indices, the most efficient model was determined based on these results. The continuation of all cases is explained in the commentary.

Study area and data

The Leaf River basin with an area of approximately 1,950 km2 north of the city of Collins in the state of Mississippi, USA, was chosen for a case study. This area has a humid climate and the height of this watershed is 80 m from the lowest point, the average height is 105 m and the highest point is 145 m. In this watershed, the lowest streamflow is 1.91, the average flow rate is 14.38, and the highest is 177.54 m3/s. The temperature information of this area has also been collected, which shows that the lowest temperature is 26.1 °C, the average temperature is 28.3 °C, and the highest temperature of this watershed is 30.5 °C. Hydrometery (daily discharge) has been taken as reliable data in the related rainfall–runoff model (https://waterdata.usgs.gov). It is necessary to mention that the data of the mentioned basin has been used in many researches so far. Figure 1 illustrates the study area of the Leaf River catchment area (https://waterdata.usgs.gov).
Figure 1

(a) Location of the study area and (b) elevational situation of the study area.

Figure 1

(a) Location of the study area and (b) elevational situation of the study area.

Close modal
Figure 2

The flowchart of the stages done in this research.

Figure 2

The flowchart of the stages done in this research.

Close modal
Figure 3

Correlation plots of input data. (a) ACF plot and (b) PACF plot.

Figure 3

Correlation plots of input data. (a) ACF plot and (b) PACF plot.

Close modal

Methods

During the middle of the 20th century, ML was first proposed (Lange & Sippel 2020). The topic of artificial intelligence was discussed a little later by researchers, which originated from ML (Lange & Sippel 2020). ML is generally based on the principle that the best output is produced by analyzing the relationship between input data (Hastie et al. 2009). To predict the monthly streamflow, three ML algorithms were used. These included the RF, GEP, and ELM.

Random forest

RF is one of the smart learning methods that have had very good and successful results in different fields (Li et al. 2019). It consists of simple decision trees. The trees are generated by selecting random samples and analyzing them based on the predictor variables (Li et al. 2019). In RF algorithms, bootstrap aggregating, as well as decision trees are applied, which is also referred to as bagging (Breiman 2001). A RF model can be predicted using the following formula:
(1)

The predicted value is that (Y), Ti(x) is the prediction for tree I, X is the input value, while N represents the number of trees in the forest (Breiman 2001).

Gene expression programming

In 2001, Ferreira proposed a genetic algorithm-based ML technique known as GEP. In general, GP is characterized by linear chromosomes and the ability to use various gene functions, such as logical operators, mathematical functions, and arithmetic operators (Ferreira 2001). Numerous ML problems can be solved using the GEP algorithm, including regression, classification, and time series prediction (Mehr 2018). To develop a GEP, there are five steps (Rezaie-Balf et al. 2019). For each program (i), the following fitness function is determined in the first step:
(2)
The selection range, the first step of the GEP algorithm, is calculated using M, C(i,j) with individual chromosome i in appropriateness item j and shows the determined value. Moreover, Tj demonstrates the highest value for the appropriateness of item j. The second step entails demonstrating the function and terminals set to generate chromosomes (Rezaie-Balf et al. 2019). The third step involves the design of the chromosomal architecture. The fourth step of the process involves selecting the genetic operators and the linking function, and the last step involves specifying the case variation rates.

Extreme learning machine

A ML algorithm known as ELM developed by Huang et al. (2004) performs similarly to feedforward neural networks, the simplest type of ANNs. As a relatively new ML algorithm, ELM has received attention because of its fast learning speed and good generalization capabilities. In addition to its effectiveness at solving classification and regression problems, ELM can also solve problems that involve high-dimensional input spaces (Huang et al. 2018). Three steps are involved in the ELM algorithm, namely input weight generation, hidden node activation, and output weight computation. A uniform distribution is used to generate the input weights in the first step. In the second step, a non-linear activation function, such as the sigmoid function, is used to activate the hidden nodes, and the activation is used to derive the output matrix of the hidden layer. A linear system solver is used in the final step to calculate the output weights that minimize the error between the actual output and the desired output (Huang et al. 2018).

Generally, this algorithm has three layers of equations. A first layer represents the input vector, a second layer contains the hidden layer, which processes the input data, and the third layer is the output. This algorithm is generally explained by the following equations. As shown in the equation below, hj represents the hidden layer:
(3)
The input data were represented by . Input nodes are connected to hidden nodes by wj, which is the weight vector. As a result, bj represents the threshold of the jth hidden node as well as the inner product of and . The output matrix of the hidden layer is represented by H, which is displayed as follows:
(4)
(5)

The output weights vector β is the vector that is positioned between the output nodes and the hidden layer. As a result of the model, W represents the output of the model and the predicted discharge (Huang et al. 2018).

In this article, the optimized ELM algorithm is used. This process consists of two parts: repetition and optimization. It is mentioned in various articles that the value of the weight matrix W is considered as an average of 10, but in this optimization method, the dimensions of the weight matrix W change according to the number of input data, and modeling for all of them are done and, finally, by analyzing all the outputs, the best answer with the least error is selected.

Hybrid models

Researchers have increasingly focused on hybrid methods in recent years and they are now widely used. They are generally composed of various methods, such as statistical analysis, ML, and rule-based algorithms, which increases their accuracy and strength. The most important advantage of these methods is to use the strengths of the hybrid models and reduce their weaknesses. The use of hybrid models allows us to analyze the nonlinearity and complexity of flow data, and to make more accurate and reliable predictions about dynamics based on these data. It has scenarios. In this article, two hybrid methods, EMD and PCA, are used for flow prediction.

Empirical mode decomposition
In 1988, Huang proposed a new algorithm capable of analyzing non-linear and non-stationary data for the first time, and over time, it was welcomed by many researchers. This method is known as the main component of the Hilbert-Huang transform, which created a new algorithm for data analysis (Huang et al. 2014). There are different methods for analyzing time series, the most used of which are HTT and wavelet analysis (WA). Despite the similar performance of these two methods, they have a major difference that the WA method is based on mathematics, but the HTT is based on experiences and was practically made (Huang et al. 2014). In the EMD method, the orthogonal and band limited functions ci(t), i = 1,…,L are obtained by decomposing the time series x(t), which are called the intrinsic mode functions (IMFs), and another parameter that expresses the general trend of the original sequence is called the residual r(t). Since the time series was formed from the sum of IMFs and residuals, the original time series x(t) is calculated using the following formula:
(6)
The physics of the underlying processes that plays an important role in the analysis of non-linear and non-stationary data arises due to consistency in local changes in the data (Wang et al. 2018). IMFs are produced using an iterative process called ‘sifting’ (Wang et al. 2018). In general, it can be said that the main function of the EMD method is to remove the riding waves and make the wave profiles more symmetrical, which is the main task of this algorithm. The sifting process in this algorithm is described as follows (Wang et al. 2018):
(7)

The set can be the first IMFs if the is in the range of 0.2–0.3. According to many tests that have been done, the results showed that if the is in the range of 0.2–0.3, the obtained IMFs give the correct physical meaning (Wang et al. 2018).

Principle component analysis
In general, PCA is a multivariate statistical algorithm, the purpose of which is to reduce the complexity of the input variables, which have a large amount of information, and finally have a better interpretation of the variables (Fan et al. 2017). In order to have the least loss in providing input data information in principle components (PCs), all input data are converted into independent PCs (Fan et al. 2017). PCs are obtained as follows:
(8)

In the mentioned equation, PCs are expressed by , eigen vector is indicated by , and X represent the input variables (Fan et al. 2017).

Evaluation criteria

In this article, four evaluation criteria, namely RMSE, MAE, NRMSE, and GI are used to measure the accuracy of ELM, RF, GEP, EMD, and PCA methods. RMSE, MAE, NRMSE, and GI are calculated as follows:
(9)
(10)
(11)
(12)

One of the most common evaluation criteria is RMSE, which is used to measure the fitness of high streamflow and has been used in extensive research. The MAE evaluation index is used to measure the fitness of streamflows, with the difference that it has a more balanced performance than RMSE and is mostly used for moderate streamflows. NRMSE is a method for evaluating the accuracy of prediction, especially in regression problems, is the normalized RMSE sample, which has been used in many researches. Choosing the right index or indices to evaluate the performance and error of modeling has always been one of the researchers' concerns. By using the appropriate index, it is very efficient in choosing the best model, which is the most difficult part. Similarly, choosing the best model, which is considered one of the most difficult stages of research, is very efficient and makes this work easier and more accurate. The new criterion that is used in this article has been introduced called GI, which is defined based on three other evaluation indicators. This criterion has been proposed and used only for evaluation in this article, and according to the conditions and possibility of use, it will have the ability to be used in other articles. Since the evaluation criteria considered in this research have similar behaviors and better results are obtained by reducing the number of the index, in the GI, the effect of three other indices is considered and the weighted average is calculated. It was said earlier that this criterion does not have a specific acceptable range and is defined based on the RMSE, MAE, and NRMSE criteria, so the lower the obtained values, the better the results. Using the proposed index shows better results and makes it easier for researchers to determine the best model. Figure 2 describes the whole processes done in this research.

The flow of the Mississippi River has been the focus of many researchers and scientists to investigate the accuracy of new methods and algorithms and numerical modeling. For modeling in this article, daily data of flow, evaporation and rainfall of the Mississippi River were collected. Table 1 shows the statistical parameters of the rainfall, evaporation and flow data.

Table 1

Statistical components of observational data

SymbolUnitMeanStandard deviationVarianceSkewnessMaximumMinimum
Flow  28.28 64.48 4157.51 7.6311 1313.91 1.56 
Rainfall Mm 3.710 9.690 93.899 4.5835 124.106 
Evaporation  2.9810 1.8704 3.4982 0.4563 8.4977 0.0062 
SymbolUnitMeanStandard deviationVarianceSkewnessMaximumMinimum
Flow  28.28 64.48 4157.51 7.6311 1313.91 1.56 
Rainfall Mm 3.710 9.690 93.899 4.5835 124.106 
Evaporation  2.9810 1.8704 3.4982 0.4563 8.4977 0.0062 

To build the models and verify their accuracy and efficiency, the input data were divided into two categories of training and testing data, so that 80% of the data entered into the modeling process as training data and 20% as the test data. Based on ACF and PACF plots of flow data (Figure 3), three scenarios, namely the M1, M2, and M3 with 1, 2, and 3 time-lag, respectively, were examined. The rainfall, evaporation and flow data of the previous lags were considered as input and the flow of the next lags were considered as output data.

M1 model

As stated in result and discussion section, the train and test data in this model were considered with one time-lag. The test data produced by all three models were compared with the real data. The comparison of projected data and true observations is illustrated in Figure 4. The criteria, namely RMSE, MAE and NRMSE, have been implemented to examine the precision of the model. Table 2 presents the criteria calculated for test data.
Table 2

The evaluation criteria for the M1 model

ModelRMSE MAE NRMSEGI
ELM 41.153 12.40 3.14 14.37 
GEP 41.227 13.143 3.145 14.46 
RF 58.778 14.216 4.484 20.21 
ModelRMSE MAE NRMSEGI
ELM 41.153 12.40 3.14 14.37 
GEP 41.227 13.143 3.145 14.46 
RF 58.778 14.216 4.484 20.21 
Figure 4

The comparison of predicted data and actual data for the M1 model.

Figure 4

The comparison of predicted data and actual data for the M1 model.

Close modal

As Table 2 shows, the lowest value of the GI corresponds to the ELM algorithm with a value of 14.37. Therefore, the ELM algorithm has the best performance in scenario M1. After that, there are the GEP and RF models with values of 14.46 and 20.21, respectively. As the values of the GI show, the two algorithms, ELM and GEP, worked almost identically and acceptably, while the RF method produced unrealistic results.

M2 model

In this model, the train and test data were entered into the model with two time-lag. As before, the test data produced by three models and the real observations were compared by evaluation criteria. Figure 5 and Table 3 illustrate the comparison and evaluation criteria, respectively.
Table 3

The evaluation criteria for the M2 model

ModelRMSE MAE NRMSEGI
ELM 36.932 10.354 2.818 12.82 
GEP 36.366 11.811 2.774 12.78 
RF 56.401 12.877 4.303 19.34 
ModelRMSE MAE NRMSEGI
ELM 36.932 10.354 2.818 12.82 
GEP 36.366 11.811 2.774 12.78 
RF 56.401 12.877 4.303 19.34 
Figure 5

The comparison of predicted data and actual data for the M2 model.

Figure 5

The comparison of predicted data and actual data for the M2 model.

Close modal

Unlike scenario M1, in scenario M2, the GEP model has surpassed the ELM algorithm, and the values of the GI, 12.78 for the GEP model and 12.82 for the ELM algorithm prove this. In this scenario, the GEP and ELM models performed satisfactorily and equally; still, the RF method produces predictions far from reality. But in general, based on the comparison of calculated values of the GI, it can be said that the results of scenario M2 have improved compared to the results of scenario M1.

M3 model

The rainfall, evaporation and flow data, in this model, were used with three time-lag and divided into train and test data. Like previously, to verify the accuracy of the models, the RMSE, MAE and NRMSE criteria were implemented which is presented in Table 4. Moreover, Figure 6 illustrates the comparison of true data and the projections.
Table 4

The evaluation criteria for the M3 model

ModelRMSE MAE NRMSEGI
ELM 45.208 11.514 3.449 15.59 
GEP 31.824 11.184 2.428 11.27 
RF 56.269 13.404 4.293 19.33 
ModelRMSE MAE NRMSEGI
ELM 45.208 11.514 3.449 15.59 
GEP 31.824 11.184 2.428 11.27 
RF 56.269 13.404 4.293 19.33 
Figure 6

The comparison of predicted data and actual data for the M3 model.

Figure 6

The comparison of predicted data and actual data for the M3 model.

Close modal

In this scenario, the GEP model performed the best with the GI = 11.27, followed by the ELM and RF models with values of 15.59 and 19.33, respectively.

Overall and in the total of three modeled scenarios, the GEP model has produced the best and most accurate results and predictions in scenario M3 with the value of the GI equal to 11.27. After that, the ELM algorithm surpassed other models and scenarios with a value of 12.82 in scenario M2. In all three scenarios, the RF method had the weakest and most unrealistic performance. Figure 7 shows that the GEP algorithm has the highest convergence (R2 values). The M3 scenario in the GEP algorithm with R2 = 0.92 has the best convergence in forecasting observational data and, after that, the M2 scenario in the ELM model with R2 = 0.91 is in the next position. The lowest R2 values were related to the RF approach with R2 = 0.77. Also, based on the Taylor diagram, which compares the results of all the scenarios in the three applied algorithms, scenario M2 in the GEP algorithm has the smallest distance compared to the reference data, which shows the better performance of this model.
Figure 7

The R2 and Taylor diagram of independent model. (a) R2-independent ELM. (b) R2-independent GEP. (c) R2-independent RF. (d) Taylor diagram of independent models.

Figure 7

The R2 and Taylor diagram of independent model. (a) R2-independent ELM. (b) R2-independent GEP. (c) R2-independent RF. (d) Taylor diagram of independent models.

Close modal

Since the most accurate performance was related to the M3 scenario, the modeling related to the EMD and PCA were only applied to this scenario, and their results are presented in the following.

Hybrid models

Using the EMD method, all flow, precipitation, and evaporation data were decomposed into a number of IMFs. Since the number of input data to the model had greatly increased, based on the PCA approach, the main and influential components were separated and finally all the IMF outputs from the EMD method were reduced to 10 influential components. The modeling process was carried out using the ELM, GEP, and RF methods on these 10 components the results of which are presented as described in Table 5 and Figure 8.
Table 5

The evaluation criteria for the EMD and PCA

ModelRMSE MAE NRMSE
Hybrid ELM 85.20 40.323 6.50 31.49 
Hybrid GEP 78.439 37.807 5.984 29.09 
Hybrid RF 85.302 50.769 6.508 33.16 
ModelRMSE MAE NRMSE
Hybrid ELM 85.20 40.323 6.50 31.49 
Hybrid GEP 78.439 37.807 5.984 29.09 
Hybrid RF 85.302 50.769 6.508 33.16 
Figure 8

The comparison of predicted data and actual data for the EMD and PCA.

Figure 8

The comparison of predicted data and actual data for the EMD and PCA.

Close modal
As the GI in Table 5 shows, in general, the use of EMD and PCA algorithms has dramatically reduced the accuracy of the models' results compared to the results of the M1, M2, and M3 scenarios. Also, the RF algorithm, which presented the weakest results in three modeled scenarios, had the most accurate prediction in this part with the GI = 33.16 compared to the ELM and GEP models. As shown in Figure 9, the hybrid-GEP approach with R2 = 0.52 had the best convergence compared to other hybrid models, followed by the hybrid-RF and hybrid-ELM methods with R2 values of 0.48 and 0.44, respectively. Also, based on the Taylor diagram, the hybrid-GEP algorithm has the smallest distance to the reference data among the three reviewed hybrid algorithms, which shows that this model had a more accurate and acceptable performance.
Figure 9

(a) The R2 plot and (b) Taylor diagram of the hybrid models.

Figure 9

(a) The R2 plot and (b) Taylor diagram of the hybrid models.

Close modal

The large amount of observational data and its complexity, along with extreme fluctuations, make modeling and forecasting difficult and error-prone. The GEP algorithm, with its mathematical nature and the use of genetic algorithm and genetic programming, has a high ability to match, analyze, and optimize data modeling. This allows the algorithm to effectively model and predict complex data with extreme fluctuations. On the other hand, the RF algorithm, derived from a set of decision trees, has a weaker ability to understand complex mathematical relationships compared to the GEP algorithm. It struggles to identify complex non-linear relationships. The ELM approach, known for its simplicity and ease in training neural networks and ML, has a weakness in understanding complex and non-linear relationships. In the current research, the GEP algorithm is found to have the best performance in forecasting and modeling. All in all, it can be said that in this article, the use of the EMD and PCA algorithms had a negative effect on the accuracy of modeling results, and individual models performed better than hybrid models. The EMD and PCA were used as data preprocessing steps to enhance ML model performance, as EMD can analyze non-linear and non-stationary time series, while PCA reduces noise and redundancy through dimensionality reduction. However, the results indicated that the combined models performed worse than the individual ones. This may be due to the complexities introduced by decomposition and dimensionality reduction, which could have obscured important temporal patterns in the raw data. This outcome highlights the need to align data preprocessing with ML algorithms and underscores the limitations of these methods in hydrological applications.

Similar research has been done on this catchment area. In the research conducted by Sadeghi & Pourreza Bilondi (2015), they used four optimization methods to analyze the uncertainty of the conceptual rainfall–runoff model (HYMOD) and acceptable results were obtained. In another research that it was done by Pourreza Bilondi et al. (2015), the data of the Leaf River catchment area was used to model the support vector method to simulate the daily runoff. The results of the mentioned research are in line with this research and show the usefulness and efficiency of these algorithms in the water basin.

The performance of the models in each of the scenarios was evaluated using GI which is calculated based on the evaluation criteria of RMSE, MAE and NRMSE. The results showed that the GEP algorithm performs better than other models in all scenarios M1, M2 and M3 with GI values of 14.46, 12.78, and 11.27, respectively. GI values in ELM algorithm are 14.37, 12.82, and 15.59, respectively, in scenarios M1, M2, and M3, which is the best result related to scenario M2. In this way, RF model with GI values obtained from three scenarios M1, M2, and M3, which are 20.21, 19.34, and 19.33, respectively, has shown the weakest performance. Turning to the hybrid models, using the EMD and PCA to build the EMD-PCA-ELM, EMD-PCA-GEP, and EMD-PCA-RF in the M3 scenario considerably decreased the accuracy of the projections which the values of the GI, the average around 31, proved this claim. These findings suggest that the GEP model, individually, is a valuable tool for predicting streamflow in the Leaf River, given that compared to other ML models, it showed superior performance in streamflow forecasting. This study highlights the potential of ML approaches in hydrological modeling and emphasizes the importance of precise streamflow prediction for effective water resource management. The review of other articles and research in this area showed that this article's results align with the results and claims of other articles.

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent was obtained from all individual participants included in the study.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Abda
Z.
,
Zerouali
B.
,
Chettih
M.
,
Guimaraes Santos
C. A.
,
de Farias
C. A. S.
&
Elbeltagi
A.
(
2022
)
Assessing machine learning models for streamflow estimation: a case study in Oued Sebaou watershed (Northern Algeria)
,
Hydrological Sciences Journal
,
67
(
9
),
1328
1341
.
Breiman
L.
(
2001
)
Random forests
,
Machine Learning
,
45
,
5
32
.
Chamani
M.
&
Roushangar
K.
(
2020
)
Evaluation of CEEMD-GPR hybrid model in temporal and spatial daily stream flow forecasting
,
Water and Irrigation Management
,
9
(
2
),
277
289
.
Essam
Y.
,
Huang
Y. F.
,
Ng
J. L.
,
Birima
A. H.
,
Ahmed
A. N.
&
El-Shafie
A.
(
2022
)
Predicting streamflow in Peninsular Malaysia using support vector machine and deep learning algorithms
,
Scientific Reports
,
12
(
1
),
3883
.
Ferreira
C.
(
2001
)
Gene expression programming: a new adaptive algorithm for solving problems
,
Complex Systems
,
13
(
2
),
87
129
.
Hastie
T.
,
Tibshirani
R.
,
Friedman
J. H.
&
Friedman
J. H.
(
2009
)
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
.
New York, NY, USA
:
Springer
.
Huang
G. B.
,
Zhu
Q. Y.
&
Siew
C. K.
(
2004
). ‘
Extreme learning machine: a new learning scheme of feedforward neural networks
’,
2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541)
,
Budapest, Hungary
.
Huang
S.
,
Chang
J.
,
Huang
Q.
&
Chen
Y.
(
2014
)
Monthly streamflow prediction using modified EMD-based support vector machine
,
Journal of Hydrology
,
511
,
764
775
.
Huang
G. B.
,
Wu
J.
&
Wunsch
D. C.
(
2018
)
Hierarchical extreme learning machines
,
Neurocomputing
,
277
,
1
3
.
https://doi.org/10.1016/j.neucom.2017.07.067
.
Khosravi
K.
,
Golkarian
A.
,
Booij
M. J.
,
Barzegar
R.
,
Sun
W.
,
Yaseen
Z. M.
&
Mosavi
A.
(
2021
)
Improving daily stochastic streamflow prediction: comparison of novel hybrid data-mining algorithms
,
Hydrological Sciences Journal
,
66
(
9
),
1457
1474
.
Lange
H.
&
Sippel
S.
(
2020
)
Machine learning applications in hydrology
. In: Levia, D.F., Carlyle-Moses, D.E., Iida, S., Michalzik, B., Nanko, K. & Tischer, A. (eds) Forest-Water Interactions. Ecological Studies, 240.
Cham, Switzerland: Springer Nature.
https://doi.org/10.1007/978-3-030-26086-6_10.
Li, X., Wang, Y., Basu, S., Kumbier, K. & Yu, B. (2019) A debiased MDI feature importance measure for random forests. In: Wallach, H., Larochelle, H., Beygelzimer A., d Alché-Buc, F., Fox E., Garnett, R. (eds), Advances in Neural Information Processing Systems, 32, 8049–8059.
Meng
E.
,
Huang
S.
,
Huang
Q.
,
Fang
W.
,
Wu
L.
&
Wang
L.
(
2019
)
A robust method for non-stationary streamflow prediction based on improved EMD-SVM model
,
Journal of Hydrology
,
568
,
462
478
.
Noori
R.
,
Karbassi
A. R.
,
Moghaddamnia
A.
,
Han
D.
,
Zokaei-Ashtiani
M. H.
,
Farokhnia
A.
&
Gousheh
M. G.
(
2011
)
Assessment of input variables determination on the SVM model performance using PCA, gamma test, and forward selection techniques for monthly stream flow prediction
,
Journal of Hydrology
,
401
(
3–4
),
177
189
.
Pourreza Bilondi
M.
,
Khasheei Sivaki
A.
&
Sadeghi Tabas
S.
(
2015
)
Forecasting daily runoff with least squares model of support vector machine
,
Journal of Water and Soil Protection Research
,
21
(
6
),
293
304
.
Remesan
R.
&
Mathew
J.
(
2015
)
Hydrological data driven modelling
. In:
Earth Systems Data and Models
.
Cham, Switzerland
:
Springer
. https://doi.org/10.1007/978-3-319-09235-5.
Rezaie-Balf
M.
,
Fani Nowbandegani
S.
,
Samadi
S. Z.
,
Fallah
H.
&
Alaghmand
S.
(
2019
)
An ensemble decomposition-based artificial intelligence approach for daily streamflow prediction
,
Water
,
11
(
4
),
709
.
Sadeghi Tabas
S.
&
Pourreza Bilondi
M.
(
2015
)
Comparison of optimization and uncertainty analysis methods in hydrological modeling
,
Journal of Range and Watershed Managment
,
68
(
3
),
533
552
.
Sahoo
S.
,
Russo
T. A.
,
Elliott
J.
&
Foster
I.
(
2017
)
Machine learning algorithms for modeling groundwater level changes in agricultural regions of the US
,
Water Resources Research
,
53
(
5
),
3878
3895
.
Scherl
I.
,
Strom
B.
,
Shang
J. K.
,
Williams
O.
,
Polagye
B. L.
&
Brunton
S. L.
(
2020
)
Robust principal component analysis for modal decomposition of corrupt fluid flows
,
Physical Review Fluids
,
5
(
5
),
054401
.
Tiwari
M. K.
,
Deo
R. C.
&
Adamowski
J. F.
(
2021
)
Short-term flood forecasting using artificial neural networks, extreme learning machines, and M5 model tree
,
In: Sharma, P. & Machiwal, D. (eds)
Advances in Streamflow Forecasting
, pp.
263
279
.
Amsterdam, the Netherlands: Elsevier.
Wang, Z. Y., Qiu, J. & Li, F. F. (2018) Hybrid models combining EMD/EEMD and ARIMA for long-term streamflow forecasting, Water, 10 (7), 853.
Yaseen
Z. M.
,
Naganna
S. R.
,
Sa'adi
Z.
,
Samui
P.
,
Ghorbani
M. A.
,
Salih
S. Q.
&
Shahid
S.
(
2020
)
Hourly river flow forecasting: application of emotional neural network versus multiple machine learning paradigms
,
Water Resources Management
,
34
,
1075
1091
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).