Recognizing the differential impacts of climate change across geographical scales, this study emphasizes the importance of statistical downscaling. Using Gene Expression Programming (GEP) and Linear Genetic Programming (LGP), statistical downscaling transforms broad climate trends into region-specific insights. This allowed for detailed analyses of anticipated changes in sediment yield and discharge within a Euphrates River sub-basin in Türkiye using large-scale variables from the CanESM2 model. The dataset is divided into calibration (1970–1995) and validation (1996–2005) periods. To assess the models’ accuracy, statistical measures such as RMSE, MAE, NSE, and R were used. The analysis revealed that LGP outperformed GEP in both discharge and sediment yield during validation, with RMSE = 51.79 m3/s and 4,325.66 tons/day, MAE = 27.14 m3/s and 1,593.34 tons/day, NSE = 0.684 and 0.627, and R = 0.841 and 0.788, respectively. However, when simulating future periods based on the observed period (2006–2020), the GEP model was superior to LGP under RCP2.6, RCP4.5, and RCP 8.5 scenarios from CanESM2. In 2021–2100, models suggest a moderate decrease in discharge and sediment yield, indicating potential shifts in the basin's hydrodynamics. These changes could disrupt hydropower generation, challenge water management practices, and alter riverine ecosystems. The results necessitate a thorough assessment of potential ecological consequences.

  • Climate change effects on streamflow and sediment yield have been explored.

  • Genetic programming such as GEP and LGP have been employed in statistical downscaling technique.

  • Future projection of streamflow and sediment yield have been done under different emission scenarios.

Climate change, an emergent issue of the 21st century, has profound implications for the health and functioning of the biosphere. It is predominantly caused by human activities, specifically the burning of fossil fuels and the manufacturing of chemicals (Mampitiya et al. 2023). The hydrological cycle, a key component of the biosphere, is particularly sensitive to climate change. Changes in precipitation patterns, evaporation rates, and river flows affect both the quantity and quality of water resources (Verma et al. 2023). Water resources, in particular, are significantly affected by climate change, and hydrologic models serve as tools to study these effects. These models, often integrated with General Circulation Models (GCMs), provide insights into future hydrological scenarios under different climate change projections. Such studies are crucial for developing effective mitigation and adaptation strategies.

The projections of climate scenarios derived from GCMs for the 21st century are fundamentally important in determining potential adjustments in ecological, physical, and societal systems in response to climate change (Tabor & Williams 2010). However, the dimensions of GCM cells are not infinitely reducible; instead, they are constrained to a range of 100–300 km (Ramírez Villegas & Jarvis 2010). Techniques of downscaling provide scientists with the capability to generate localized forecasts of climatic variations, which span from the refinement and interpolation of GCM irregularities. Downscaling methodologies empower researchers to derive predictions of regional climate variations, extending from the refinement and interpolation of GCM discrepancies to the application of neural networks and regional climate simulations (Giorgi 1990). In a comprehensive perspective, downscaling can be split into two primary categories: statistical and dynamical downscaling. Statistical downscaling employs algebraic connections between large-scale and local climate factors to precisely predict future local scenarios, providing corrections for different biases and computational efficacy. On the other hand, dynamical downscaling utilizes high-resolution regional simulations based on physical laws to anticipate local conditions, delivering comprehensive and physically consistent projections, though it bears sensitivity to large-scale biases and requires extensive computational power.

Gene Expression Programming (GEP) (Ferreira 2001), is an adaptive algorithm that generates flexible programs. GEP is perceived as a learning algorithm striving to decipher the interconnections among variables within data sets. Distinct from its predecessors, Genetic Algorithm (GA) and Genetic Programming (GP), GEP codes individuals as linear strings of a fixed length, known as chromosomes, which are subsequently symbolized by expression trees, a simplified diagrammatic representation. GEP's strength lies in its distinct, multi-genic character that facilitates the evolution of intricate programs composed of multiple sub-programs. It merges the advantages of both GA and GP while mitigating some of the restrictions inherent to each of them individually (Shoaib et al. 2015).

GEP holds the benefit of creating functional correlations, enabling the exploration of intricate non-linear associations among the input parameters by delivering fast, reasonably precise, and computationally cost-effective approximations of the inherent physical/functional procedures that can be utilized beyond merely predictive applications. Another significant benefit of GEP in contrast to numerous other data-driven models, is its ability to offer an analytical representation of the correlation between input and output variables. This feature enables modelers to understand this relationship better and make adjustments as needed, demonstrating the flexibility of GEP (Billah et al. 2021). GEP is successfully applied to the forecasting of climatic data in several studies (Rahmani-Rezaeieh et al. 2020; Billah et al. 2021; Birbal et al. 2021; Esmaeili-Gisavandani et al. 2021; Muhammad et al. 2021; Guven & Pala 2022; Guven et al. 2022; Chansawang et al. 2023; Pouyanfar et al. 2023; Raheel et al. 2023; Song et al. 2023).

Linear Genetic Programming (LGP) is a variation of the GP method that develops sequences of instructions originating from an imperative programming language (such as C or C + +) or from a machine language, as opposed to expressions from a functional programming language typical of conventional tree-based GP. It is noteworthy that the term ‘linear GP’ pertains to the linear structure of genomes, and as such, the LGP method does not necessarily generate linear models for non-linear systems. In fact, LGP is frequently utilized for evolving highly non-linear models (Brameier 2004). Two types of LGP systems exist based on the executability of instructions by a computer's CPU: (i) machine-coded LGP, which executes instructions directly, reducing runtime and (ii) interpreted LGP, where instructions are executed via a higher-level virtual machine (Mehr et al. 2018).

The principal contrasts from customary tree-based GP encompass the graph-based data stream that emerges from repeated utilization of indexed variable (register) contents and the presence of structurally redundant code. Machine-coded LGP offers several advantages as a modeling tool, including its operational speed, its design against overfitting, and its capacity to generate robust solutions that execute swiftly when called by integrated software. Nevertheless, the determination of the number of registers employed in an LGP model is a critical factor, as an unfit selection could result in substantial complications in the program under evolution (Guven & Kişi 2010). LGP is applied in hydrological engineering (Guven et al. 2009; Azamathulla et al. 2011) and predicting hydro-meteorological variables (Azamathulla & Zahiri 2012; Guven & Kisi 2013; Mehr et al. 2013; Danandeh Mehr et al. 2014; Ravansalar et al. 2017). In this study, due to the aforementioned advantages, the statistical downscaling method is processed using GEP and Machine-Coded LGP models, and the linear regression model is applied for comparison. The hydrological effects of climate change at the local scale were examined, and future predictions of sediment yield and discharge were conducted.

This study introduces the application of GEP and LGP in a climatic analysis of the Euphrates River sub-basin, presenting an innovative approach to understanding the dynamics between climate change, sediment yield, and streamflow. By leveraging these advanced statistical downscaling and GP methods, not only the precision of projections are enhanced but also the intricacies of climate-river interactions in a region crucial for water resources are captured. The predictive insights derived are set to offer a valuable framework for both policymakers and hydrologists, enabling informed decision-making and strategic planning in water resource management amidst the exigencies of climate change. Our methodological advancement thereby stands as a significant stride toward more resilient water infrastructures in response to evolving climatic conditions.

Following this section, a detailed description of the study area and data are provided. This is followed by the study's methodology. The results section presents the findings, including future trends of Q and Qs under various models and scenarios. A discussion of the models' results is provided in the following section, before concluding with a summary of our key insights.

The area of study is a sub-basin of the Euphrates basin located on the Euphrates River in eastern Türkiye. The 10,247 km2 basin is located east of Türkiye, positioned 111 km east of the central city of Erzincan. The exit point of the basin, a sediment yield and discharge station named Fırat N. Kemahboğazı, is located in the westernmost part of the basin, 12 km southwest of the city center of Erzincan. The local station has a latitude (N) of 39° 40′ 57″, longitude (E) of 39° 23′ 35″ and altitude of 1,123 m. Figure 1 is a satellite image showing the location (a), center, and exit point of the basin (b) on the map of Türkiye. Figure 2 shows the flow distance contours and stream networks of the basin. Figure 3 displays the Digital Elevation Model (DEM) contours of the basin. The DEM data used in this study are derived from the Shuttle Radar Topography Mission Plus Version 3 (SRTM Plus V3). The SRTM Plus V3 has a resolution of 1 arc-second, which is approximately 30 m (Earth Resources Observation and Science (EROS) Center 2017).
Figure 1

Location of the centroid (a) and outlet of the basin (b).

Figure 1

Location of the centroid (a) and outlet of the basin (b).

Close modal
Figure 2

Flow distance contours and stream networks of the basin.

Figure 2

Flow distance contours and stream networks of the basin.

Close modal
Figure 3

Digital Elevation Model (DEM) contours of the watershed.

Figure 3

Digital Elevation Model (DEM) contours of the watershed.

Close modal

The selection of this particular sub-basin of the Euphrates River for our study was driven by a confluence of crucial factors. Foremost, the availability of long-term observed data within this basin presented a solid foundation for applying statistical downscaling techniques, thereby enhancing the reliability and precision of our analysis. Additionally, the Euphrates River is one of the most important fluvial systems in the region, with a plethora of water infrastructures arrayed along its course. This not only underscores the basin's significance but also amplifies the relevance of understanding the impacts of climate change on its sediment yield and streamflow dynamics. Through this strategic selection, our study aims to provide actionable insights that could be vital in the sustainable management and planning of water resources within this vital river system.

Two data sets are required for the statistical downscaling method. The first data set consists of monthly sediment yield and discharge data obtained from the local station located in the basin and has been used as the predictand. The second data set is the large-scale predictor variables obtained from the Second Generation Canadian Earth System Model (CanESM2) model. The Canadian Center for Climate Modeling and Analysis (CCCma) of Environment and Climate Change Canada created the fourth generation coupled global climate model known as CanESM2. It is the Canadian modeling community's contribution to the Intergovernmental Panel on Climate Change, Fifth Assessment Report (IPCC AR5). In a 128 × 64 grid based on the T42 Gaussian grid, standardized daily data are extracted for each grid cell and stored in a single-column text file for the entire global area. This grid has virtually consistent horizontal resolution along the latitude of approximately 2.8125° and 2.8125° of longitude. In files with the names BOX_iiiX_jjY, where iii stands for the longitudinal index and jj for the latitudinal index, predictions are kept for each grid cell. This organization of the predictors makes it simple to utilize them as input for statistical downscaling models. The data set is downloaded as a zip file from Canadian Climate Data and Scenarios (https://climate-scenarios.canada.ca/) by choosing the relevant grid cell. This can be done by manually putting the centroid of the watershed or simply selecting the cell in the rectangular map on the website. By clicking the retrieve data button, a zip file named ‘BOX_017X_46Y’ was downloaded regarding the specific grid cell chosen. The data set has 26 variables for three scenarios RCP2.6, RCP4.5, and RCP8.5. The daily predictor data set has been converted into monthly average values. Table 1 shows the description of the CanESM2 variables.

Table 1

Description of CanESM2 variables (predictors)

No.CanESM2 predictorsDescriptionAbbreviation
ceshmslpgl Mean sea level pressure v1 
ceshp1_fgl 1,000 hPa Wind speed v2 
ceshp1_ugl 1,000 hPa Zonal wind component v3 
ceshp1_vgl 1,000 hPa Meridional wind component v4 
ceshp1_zgl 1,000 hPa Relative vorticity of true wind v5 
ceshp1thgl 1,000 hPa Wind direction v6 
ceshp1zhgl 1,000 hPa Divergence of true wind v7 
ceshp5_fgl 500 hPa Geopotential v8 
ceshp5_ugl 500 hPa Wind speed v9 
10 ceshp5_vgl 500 hPa Zonal wind component v10 
11 ceshp5_zgl 500 hPa Meridional wind component v11 
12 ceshp5thgl 500 hPa Relative vorticity of true wind v12 
13 ceshp5zhgl 500 hPa Wind direction v13 
14 ceshp8_fgl 500 hPa Divergence of true wind v14 
15 ceshp8_ugl 850 hPa Geopotential v15 
16 ceshp8_vgl 850 hPa Wind speed v16 
17 ceshp8_zgl 850 hPa Zonal wind component v17 
18 ceshp8thgl 850 hPa Meridional wind component v18 
19 ceshp8zhgl 850 hPa Relative vorticity of true wind v19 
20 ceshp500gl 850 hPa Wind direction v20 
21 ceshp850gl 850 hPa Divergence of true wind v21 
22 ceshprcpgl Total precipitation v22 
23 ceshs500gl 500 hPa Specific humidity v23 
24 ceshs850gl 850 hPa Specific humidity v24 
25 ceshshumgl 1,000 hPa Specific humidity v25 
26 ceshtempgl Air temperature at 2 m v26 
No.CanESM2 predictorsDescriptionAbbreviation
ceshmslpgl Mean sea level pressure v1 
ceshp1_fgl 1,000 hPa Wind speed v2 
ceshp1_ugl 1,000 hPa Zonal wind component v3 
ceshp1_vgl 1,000 hPa Meridional wind component v4 
ceshp1_zgl 1,000 hPa Relative vorticity of true wind v5 
ceshp1thgl 1,000 hPa Wind direction v6 
ceshp1zhgl 1,000 hPa Divergence of true wind v7 
ceshp5_fgl 500 hPa Geopotential v8 
ceshp5_ugl 500 hPa Wind speed v9 
10 ceshp5_vgl 500 hPa Zonal wind component v10 
11 ceshp5_zgl 500 hPa Meridional wind component v11 
12 ceshp5thgl 500 hPa Relative vorticity of true wind v12 
13 ceshp5zhgl 500 hPa Wind direction v13 
14 ceshp8_fgl 500 hPa Divergence of true wind v14 
15 ceshp8_ugl 850 hPa Geopotential v15 
16 ceshp8_vgl 850 hPa Wind speed v16 
17 ceshp8_zgl 850 hPa Zonal wind component v17 
18 ceshp8thgl 850 hPa Meridional wind component v18 
19 ceshp8zhgl 850 hPa Relative vorticity of true wind v19 
20 ceshp500gl 850 hPa Wind direction v20 
21 ceshp850gl 850 hPa Divergence of true wind v21 
22 ceshprcpgl Total precipitation v22 
23 ceshs500gl 500 hPa Specific humidity v23 
24 ceshs850gl 850 hPa Specific humidity v24 
25 ceshshumgl 1,000 hPa Specific humidity v25 
26 ceshtempgl Air temperature at 2 m v26 

The selection of the most effective predictors

In order to enhance the association between the predictor variables and predictands, normalization techniques were applied to the data sets. Specifically, each data set for predictors and predictands was transformed by applying the natural logarithm function (Ln) and standardization using the z-value. The variables were modified by implementing these normalization methods to facilitate more accurate and reliable predictions. To identify the most impactful predictors and assess the linear relationship between input and output variables, a Pearson rank correlation coefficient analysis was utilized. The correlation analysis revealed that the large-scale weather factors were correlated with local data at a confidence level of 99%. Those factors with the highest correlation coefficients were selected to identify the most effective predictors for the downscaling models. The highest correlation was obtained when the natural logarithms of both data sets were taken. Finally, 10 of 26 variables have been identified as the most effective predictors. These were: mean sea level pressure (V1), 1,000 hPa relative vorticity of true wind (V5), 500 hPa geopotential (V8), 500 hPa wind speed (V9), 500 hPa zonal wind component (V10), 500 hPa relative vorticity of true wind (V12), 500 hPa divergence of true wind (V14), 850 hPa divergence of true wind (V21) and total precipitation (V22). Although total precipitation (V22) and 1,000 hPa relative vorticity of true wind (V5) variables have a direct effect on discharge and sediment yield, other eight variables have an indirect effect on discharge and sediment yield. Total precipitation (V22) predictor has the highest relative importance of predictors.

Statistical downscaling using GP

GEP involves four main steps for obtaining solutions to problems. This study emphasizes that the first step is the identification of a set of functions to be used. The second step is determining the chromosome structure, which involves specifying the number of genes and their size. The third step involves selecting the linking function. The final step is to evaluate fitness using a specific measure. These four essential steps constitute the foundation of GEP, and their careful implementation is vital for obtaining effective solutions in problem-solving applications. Main setup parameters of GEP are given in Table 2.

Table 2

Setup parameters of the GEP model for both predictands

PredictandFunctionsNumber of genes per chromosomeLinking functionFitness function
Q (+), (−), (*), (/), (−a), (1/a) 16 MSE 
Qs (+), (−), (*), (/), (√) R2 
PredictandFunctionsNumber of genes per chromosomeLinking functionFitness function
Q (+), (−), (*), (/), (−a), (1/a) 16 MSE 
Qs (+), (−), (*), (/), (√) R2 

The main goal of the GEP model is to use training data to create a mathematical equation (Fuladipanah et al. 2023). GEP model generated an equation for the calibration process, and it uses the same equation to generate the validation process and forecast future projections. The generated equation for the first predictand, discharge (Q), is shown in the following:
(1)

As observed, the model utilized the V2, V5, V12, V14, and V22 predictors to generate the Equation (1). These predictors have been selected by GEP automatically among the previously selected 10 most effective predictors.

The equation generated by the GEP model for the second predictand, sediment yield (Qs), is as follows:
(2)

With sediment yield (Qs) data being the predictand, the GEP model used only four of the 10 selected predictors in Equation (2). These are total precipitation (V22), 500 hPa wind speed (V9), 500 hPa zonal wind component (V10), and 1,000 hPa Relative vorticity of true wind (V5).

LGP is referred to as machine-coded GP because it utilizes C or C ++ directly as programming languages, unlike tree-based GEP (Guven & Kişi 2010). Main setup parameters for LGP are given in Table 3.

Table 3

Setup parameters of LGP for both predictands

PredictandInstruction setProgram sizePopulation size
Fitness calculation
InitialMaxerror measurement
Q (+),(−),(*),(/), (abs), (√), (sin), (cos), Arithmetic, condition, data transfer, comparison 500 80 512 Squared 
Qs (+),(−),(*),(/), (abs), (√) 750 80 512 Squared 
PredictandInstruction setProgram sizePopulation size
Fitness calculation
InitialMaxerror measurement
Q (+),(−),(*),(/), (abs), (√), (sin), (cos), Arithmetic, condition, data transfer, comparison 500 80 512 Squared 
Qs (+),(−),(*),(/), (abs), (√) 750 80 512 Squared 

LGP model generated a C code from the calibration data to predict the validation and future periods. The LGP model used all ten selected most efficient predictors in the analyses. Interested readers are encouraged to contact the authors directly to obtain the detailed code output.

The data sets are based on the monthly time scale, and the data were partitioned into two distinct components, namely calibration, and validation. The calibration period, spanning the years between 1970 and 1995, is utilized to train the model. The validation period includes the period between 1996 and 2005 and is essential for evaluating the model's performance. After selecting the most effective predictors, downscaling techniques, GEP and LGP were used to predict monthly discharge and sediment yield on the basin. In the next stage, the prediction results were obtained and compared with each other to demonstrate the best model performance. Evaluations of the proposed models' performance are carried out utilizing metrics such as root mean squared error (RMSE), mean absolute error (MAE), Nash–Sutcliff Efficiency (NSE), and correlation coefficient (R). These metrics are applied in several similar studies (Yang et al. 2017; Perera & Rathnayake 2019; Gupta et al. 2020; Karunanayake et al. 2020; Kidanemariam et al. 2020; Sireesha Naidu et al. 2020; Haleem et al. 2021; Xu et al. 2022; Fuladipanah et al. 2023; Mampitiya et al. 2023; Mollel et al. 2023; Tilahun et al. 2023; Verma et al. 2023)

The detailed methodology followed in this research is presented in Figure 4.
Figure 4

Flow diagram of a statistical downscaling process for this study.

Figure 4

Flow diagram of a statistical downscaling process for this study.

Close modal

In this study, the statistical downscaling method was used, and the predictor required for this method is the large-scale weather factors (26 input sets) obtained from the CanESM2. Local sediment yield and discharge data of the relevant station is used for predictand. Finally, for comparison with the non-linear models, Linear Regression Analysis is conducted.

The best-performing model for sediment yield and discharge data with both calibration and validation periods was LGP. The statistical outcomes for the validation period, derived through the formulas created by the artificial intelligence models using the data set provided for training and testing purposes, are presented in Tables 4 and 5.

Table 4

Comparison of statistical results of observed and predicted Q (m3/s) for the validation period (1996–2005)

DataMeanMinMaxStd. deviationMAERMSENSER
Observed 84.92 18.1 442.89 82.9 – – – – 
GEP 66.32 24.01 585.88 67.76 36.69 74.21 0.427 0.684 
LGP 69.75 30.14 237.34 56.72 27.14 51.79 0.684 0.841 
Linear regression 73.08 26.16 279.15 47.38 32.48 58.14 0.555 0.745 
DataMeanMinMaxStd. deviationMAERMSENSER
Observed 84.92 18.1 442.89 82.9 – – – – 
GEP 66.32 24.01 585.88 67.76 36.69 74.21 0.427 0.684 
LGP 69.75 30.14 237.34 56.72 27.14 51.79 0.684 0.841 
Linear regression 73.08 26.16 279.15 47.38 32.48 58.14 0.555 0.745 
Table 5

Comparison of statistical results of observed and predicted Qs (tons/day) for the validation period (1996–2005)

DataMeanMinMaxStd. deviationMAERMSENSER
Observed 2,890.33 35.4 102,010 10,348.66 – –  – 
GEP 2,014 52.97 80,179.9 7,760.17 2,880.9 11,871.7 0.567 0.673 
LGP 2,084.14 115.28 87534.2 8,693.8 1,593.34 4,325.66 0.627 0.788 
Linear regression 1,023.18 68.07 9,724.34 1,689.62 2,282.65 10,028.6 0.509 0.705 
DataMeanMinMaxStd. deviationMAERMSENSER
Observed 2,890.33 35.4 102,010 10,348.66 – –  – 
GEP 2,014 52.97 80,179.9 7,760.17 2,880.9 11,871.7 0.567 0.673 
LGP 2,084.14 115.28 87534.2 8,693.8 1,593.34 4,325.66 0.627 0.788 
Linear regression 1,023.18 68.07 9,724.34 1,689.62 2,282.65 10,028.6 0.509 0.705 

Table 4 compares the statistical results of observed and predicted discharge values during the validation period (1996–2005). This comparison shows that the LR model provides the best result in predicting the mean value. Although the GEP model performs poorly in predicting the mean value, it performs best in predicting the minimum value. Furthermore, all models underestimated the mean values and overestimated the minimum values. The GEP model exhibits the best performance for predicting the maximum value and standard deviation.

Table 4 also demonstrates that the LGP model performs best with the highest correlation coefficient of R = 0.841, NSE = 0.684 and the lowest MAE = 27.14 m3/s, and RMSE = 51.79 m3/s. LR and GEP models follow LGP in terms of performance, respectively, in line with the correlation factor (R) values.

Table 5 provides a comparison of the statistical results of observed and predicted sediment yield values during the validation period spanning from 1996 to 2005. Similar to previous scenarios, all models underestimated the mean value. While the minimum observed value was 35.40 tons/day, the LGP model predicted 115.28 tons/day, the GEP model predicted 52.97 tons/day, and the LR model predicted 68.07 tons/day, all above the minimum value. In predicting the maximum and standard deviation values, all models also made predictions below the observed value. The LGP model provided the closest predictions except for the minimum value. GEP model was superior to the LR model in estimating all statistical values.

In contrast to other statistics, the GEP model exhibited the weakest performance with MAE = 2,880.9 and RMSE = 11,871.73 values. The LGP model showed the best performance with MAE = 1,593.34, NSE = 0.627 and RMSE = 4,325.66 values. The LR model performed between these two models, which is consistent with the correlation factor (R) values.

Figure 5(a) illustrates the scatter plots of observed and predicted Q values of the basin for LGP, GEP, and LR for the validation period (1996–2005). LGP model performed the best with the highest R-value of 0.841, followed by the LR with a value of R = 0.745 and GEP with a value of R = 0.684.
Figure 5

Scatter plot of the observed and predicted monthly Ln Q (a) and Ln Qs (b) values for the validation period by all models.

Figure 5

Scatter plot of the observed and predicted monthly Ln Q (a) and Ln Qs (b) values for the validation period by all models.

Close modal

Figure 5(b) represents the scatter plots of observed and predicted Qs values for the models in the validation period. The LGP model exhibited the highest performance with the highest correlation of R = 0.788, followed by the LR with a value of R = 0.705 and GEP with R = 0.673.

The linear equations signifying the relationship between observed and predicted data are as follows:
(3)
(4)
(5)
(6)
(7)
(8)
where y represents the observed value and x represents the predicted value.
As observed in Figure 6, the models generally underestimated the discharge values compared to the observed data. It can be seen that the LGP model underestimated the observed in all months except January and September. All models underestimated Q in April, May, June, July, and December. In September, all models overestimated Q, but LGP has the closest prediction. The LGP model underestimated Q in February, August, and October, whereas the GEP and LR models overestimated Q values. The LGP and GEP models underestimated the discharge (Q) values in March and November, while the LR model overestimated them. LGP model showed the best performance in April, October, and December, with underestimation percentages of 7.61, 7.51, and 6.63%, respectively. The worst performance of the LGP model is in March and May, with underestimations by 35.32 and 38.14%, and in September, with overestimation by 21.92%. The GEP model provided very close estimates just below the observed values in March and November by 6.344, 2.04%, and just over to the observed in August by 2.6% but produced inaccurate estimates in April and July below the observed values by 59.394, 46.158%, and over to the observed value in September by 57.186%. The LR model showed the weakest performance among the three models in general, with the closest estimate in October just above the observed value by 2.84% and the furthest estimate with an overestimation percentage of 61.804% in September. Overall, the LGP model showed the best performance in predicting the observed values, while the LR model performed relatively inadequately in predicting the observed values.
Figure 6

Observed and predicted monthly Q for the validation period by all models in logarithmic scale.

Figure 6

Observed and predicted monthly Q for the validation period by all models in logarithmic scale.

Close modal
Figure 7 compares observed and predicted sediment yield values for all models during the validation period covering 1996–2006. It can be seen that all models have made overestimations during February, August, and September, while underestimations have been made during April, June, and December. LGP has shown the best performance among the three models in February, April, May, August, September, November, and December. The GEP model has made more accurate predictions than LGP and LR models in January, March, June, July, and October.
Figure 7

Observed and predicted monthly Qs for the validation period by all models in logarithmic scale.

Figure 7

Observed and predicted monthly Qs for the validation period by all models in logarithmic scale.

Close modal

In January, which is the closest to the model's predictions, LGP has made an overestimation by 28.29%, while GEP and LR models have made underestimations by 11.47 and 12.08%, respectively. In September, the farthest from the models' predictions, the LGP model made a high overestimation by 135.38%, the LR model made an even higher overestimation by 155.10%, and the GEP model made the highest overestimation by 158.64%.

Future projection of discharge under different downscaling models and emission scenarios

Different output results were obtained from GEP and LGP models for three different scenarios, RCP2.6, RCP4.5, and RCP8.5. The outputs of the models for all three scenarios (horizontal axis) were compared to observed values (vertical axis) for the period 2006–2020, and scatter plots were created. It was determined that the R values were 0.408 for RCP2.6, 0.398 for RCP4.5, and 0.4585 for RCP8.5 in the GEP model, and 0.278 for RCP2.6, 0.347 for RCP4.5, and 0.351 for RCP8.5 in the LGP model. Based on these values, it is seen that the GEP model provides the best correlation for discharge (Q) in the RCP8.5 scenario. The GEP model has been superior in comparison to the LGP model by providing higher correlations in all scenarios. It is also observed that RCP8.5 performs better than other scenarios in both models.

Among the data generated by the model, the z-score method was employed to identify outliers, utilizing the formula , where x is a data point, μ is the mean, and σ is the standard deviation of the dataset. Outliers were identified as values with a z-score above 3 or below −3, as such scores reflect significant deviation from the mean. Due to their disruptive impact on comparisons, they are removed from the dataset. These values constitute 1.4% of the total data.

Figure 8 shows the scatter plot of the observed and projected discharge under the CanESM2 RCP8.5 scenario by GEP and LGP models between 2006 and 2020.
Figure 8

Scatter plot of the observed and projected monthly discharge under the CanESM2 RCP8.5 scenario by GEP (a) and LGP (b) models between 2006 and 2020 period.

Figure 8

Scatter plot of the observed and projected monthly discharge under the CanESM2 RCP8.5 scenario by GEP (a) and LGP (b) models between 2006 and 2020 period.

Close modal

For all scenarios, downscaled results were divided into four time periods: the 2020s (2021–2040), 2040s (2041–2060), 2060s (2061–2080), 2080s (2081–2100), and were compared to two observed data periods which are observed calibration and validation periods combined (1970–2005) and future observed period (2006–2020).

The average values for each month of the results of the RCP8.5 scenario of the GEP model, which showed the best performance for discharge, have been separately calculated for these four different periods and compared with observed data in Figure 9. In general, discharge values tend to be higher during April and May across all periods and observed data. The difference between the observed and other periods is also the highest in these months. The increment and decrement rates between critical months provide further insight into the variations in discharge values across the periods, offering a comprehensive understanding of the water flow dynamics. For all projected periods (2006–2020, 2020s, 2040s, 2060s, and 2080s), the increment rates are critical between the months April–May (79.17, 90.56, 31.78, 11.88, 28.39%) and for the observed periods (1970–2005, 2006–2020) the increment rates are critical between March–April by 263.5 and 72.63%, respectively. Decrement rates between months are critical between May and June for all projected and observed periods by 54.11, 56.47, 45.74, 29.40, 59.84, 56.84, and 48.21%. Figure 9 also shows that in all months, in every period of the GEP model projected, discharge values are lower than the future observed (2006–2020) values. When compared with the observed data between 1970 and 2005, it has been observed that discharge values are lower in the GEP model for April, May, June, July, November, and December. Conversely, the opposite situation is observed in January, February, August, September, and October, with the 1970–2005 observed period staying lower than the values of the GEP model for all periods. Comparing the projected 2006–2020 period with the 2006–2020 observed data, we see that the predicted values are always lower than the observed values, particularly in April, May, and June. When comparing the periods of the GEP model under the RCP8.5 scenario, the 2020s tend to have higher discharge values, followed by the 2060s. The 2040 and 2080 s generally show lower discharge values. The most significant average difference between the observed period of 2006–2020 and GEP models is in May, with percentage differences of 63.35, 28.75, 87.55, 98.38, and 44.5%. Comparing with the observed period of 1970–2005, percentage differences are 109.18, 57.34, 107.46, 105.64, and 53.44% for April.
Figure 9

The projection of average monthly discharge (Q) values under the CanESM2 RCP8.5 scenario for different periods by the GEP model in logarithmic scale.

Figure 9

The projection of average monthly discharge (Q) values under the CanESM2 RCP8.5 scenario for different periods by the GEP model in logarithmic scale.

Close modal
Figure 10 displays annual average discharge data between 1970 and 2100. The data in the 2021–2100 period (red line) pertains to the GEP model RCP8.5 scenario, while the observed data (blue line) is shown for the 1970–2020 period. The lowest annual average during the observed period was 49.741 m3/s, which occurred in 1994. The highest annual average during this period was 140.440 m3/s in 1988. The lowest projected annual average for the GEP model is 37.228 m3/s, which is expected to occur in 2097 and the highest is 77.737 m3/s in 2089. It is clear that the highest and lowest observed data values are higher than the projected data.
Figure 10

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 in logarithmic scale.

Figure 10

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 in logarithmic scale.

Close modal
The projection of annual discharge values under CanESM2 RCP8.5 scenario by GEP model in normal scale with trendline is given in Figure 11 to figure out the trend of projected discharge data. It is seen from Figure 11 projected discharge data have a slight decreasing trend with a value of 0.125 m3/s/year. Although the trendline is substantially similar to minimum and maximum projected data, for the period of 2084–2086, 2089 and 2098–2099 the discharge data is out of from trendline. Out of trendline data can be considered the extreme discharge data particularly in 2089.
Figure 11

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model in normal scale with a trendline.

Figure 11

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model in normal scale with a trendline.

Close modal
Figure 12 displays the annual average discharge data values for May, which has been identified as the critical month due to having a natural decreasing trend throughout the years and also have the highest values in all observed and simulated periods. Similar to Figure 10, the red line represents the data of the 2021–2100 period and the blue line shows the observed period (1970–2020). For the observed period, the highest value for May is 483.234 m3/s in 2019. The lowest value during this period is 56.918 m3/s, recorded in 1989. In the predicted period by GEP RCP8.5 scenario, the highest expected value is 328.191 m3/s in 2025, the lowest expected value is 22.281 m3/s in 2047.
Figure 12

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 for May in logarithmic scale.

Figure 12

The projection of annual average discharge (Q) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 for May in logarithmic scale.

Close modal

Future projection of sediment yield under different downscaling models and emission scenarios

Different output results were obtained from GEP and LGP models for three different scenarios, RCP2.6, RCP4.5, and RCP8.5. The outputs of the models for all three scenarios (horizontal axis) were compared to observed values (vertical axis) for the period 2006–2020, and scatter plots were created. It was determined that the R values were 0.303 for RCP2.6, 0.3795 for RCP4.5, and 0.427 for RCP8.5 in the GEP model, and 0.2915 for RCP2.6, 0.3905 for RCP4.5, and 0.377 for RCP8.5 in the LGP model. Based on these values, it is seen that the GEP model provides the best correlation for sediment yield in the RCP8.5 scenario than all scenarios in both models. For sediment yield, 2.46% of data for the GEP RCP8.5 scenario is identified as outliers with the same method used in discharge and removed from the dataset. Figure 13 shows the scatter plot of the observed and projected sediment yield under the CanESM2 RCP8.5 scenario by GEP and LGP models between 2006 and 2020.
Figure 13

Scatter plot of the observed and projected monthly sediment yield (Qs) under the CanESM2 RCP8.5 scenario by GEP (a) and LGP (b) models between 2006 and 2020 period.

Figure 13

Scatter plot of the observed and projected monthly sediment yield (Qs) under the CanESM2 RCP8.5 scenario by GEP (a) and LGP (b) models between 2006 and 2020 period.

Close modal
The average values of each month for the results of the GEP model under the RCP8.5 scenario, showing the best performance for sediment yield, have been calculated for four different periods and compared with observed data in Figure 14.
Figure 14

The projection of average monthly sediment yield (Qs) values under the CanESM2 RCP8.5 scenario for different periods by the GEP model in logarithmic scale.

Figure 14

The projection of average monthly sediment yield (Qs) values under the CanESM2 RCP8.5 scenario for different periods by the GEP model in logarithmic scale.

Close modal

It is observed that the months with the highest data values for all periods are March, April, May, and June. Sediment yield values, which increase with a mild slope from January to March, exhibit a significant rise during March–April and April–May. For the observed periods of 1970–2005 and 2006–2020, and the projected periods of 2006–2020, the 2020s, 2040s, 2060s, and 2080s, the increase percentages are as follows: for March–April, sequentially, 33.48, 65.9, 43.43, 89.59, 189.73, 51.13, and 1,658.16%; for April–May, sequentially, 208.93, 176.72, 114.99, 57.99, 126.83, 34.77% increase, and 41.9% decrease. A significant decline is also observed from May to June (83.94, 61.24, 70, 40.73, 70.09, 81.36, 80.11%). Values that decrease from June to July experience no major fluctuations in the subsequent months. The results of the sediment yield predictions, divided into 20-year periods and compared with the observed values between 2006 and 2020, are clearly below the observed values for each month, as shown in Figure 14. The difference is particularly dramatic in January and March, with the percentage difference between observed (2006–2020) and predicted values being 163.35, 85.97, 171.66, 164.73, and 79.46% for January and 130.62, 72.58, 141.24, 151.7, 82.64% for March for the projected periods, 2006–2020, 2020s, 2040s, 2060s, 2080s, respectively. When comparing the observed period of 1970–2005 with the simulated values, the differences between the average monthly values of observed values and the average monthly values of simulated data are observed to be very high in April and May. These values have been calculated as percentage differences for April, sequentially, at 180.52, 91.12, 182.73, 181.83, 90.42%, and for May at 114.43, 61.95, 142.76, 154.18, and 67.17%.

Figure 15 displays the annual average sediment yield (Qs) data of GEP RCP8.5 between 2021 and 2100 and observed data between 1970 and 2020. It is understood that the data in the observed period is well above the data projected by the GEP model under the RCP8.5 scenario. The highest annual average of the period that includes the observed data is 17,386.910 tons/day, which belongs to 1974 and differs significantly from other years. The lowest annual average value of 569,717 tons/day in the same period belongs to 2001. For the period estimated by the RCP8.5 scenario of the GEP projected model, the lowest value is expected in 2097 with the value of 293,755 tons/day, while the highest annual average belongs to the year 2089 with 2975.405 tons/day. It is obvious that the highest and lowest observed data values are higher than the projected data (Figure 15).
Figure 15

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 in logarithmic scale.

Figure 15

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 in logarithmic scale.

Close modal
Figure 16

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model in normal scale with a trendline.

Figure 16

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model in normal scale with a trendline.

Close modal

The projection of annual average sediment yield values under values under CanESM2 RCP8.5 scenario by GEP model in normal scale with trendline is given in Figure 16 to figure out the trend of projected sediment yield data. It is seen from Figure 16 projected sediment yield data has a remarkable decreasing trend with a value of 5.625 tons/days/year. Although the trendline is substantially similar to minimum and maximum projected data, for the period of 2059–2060, 2072 and 2085–2090 the sediment yield data is out of from trendline. Out of trendline data can be considered the extreme sediment yield data, especially in 2089.

Figure 17 shows the annual average values of discharge data for May, which has been considered a critical month due to having the highest values in all of the simulated periods of 2006–2020, 2020s, 2040s, 2060 and 2080 s. In May, values are much higher. The observed sediment yield reached its highest value in 1974, with 69,442.1 tons/day, and its lowest in 1989, with 399.1 tons/day. The highest value obtained by the GEP model under the RCP8.5 scenario is expected to be 15,891.163 tons/day in 2060, while the lowest is 20.822 tons/day, which is expected to be recorded in 2049. Again, a decreasing trend is observed from 1970 to 2100.
Figure 17

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 for May, in logarithmic scale.

Figure 17

The projection of annual average sediment yield (Qs) values under the CanESM2 RCP8.5 scenario by GEP model and observed period of 1970–2020 for May, in logarithmic scale.

Close modal

The divergent performances of GEP and LGP across different periods in our study could be attributed to a multitude of factors rooted in their inherent model structures and evolutionary algorithms. Initially, LGP's superior performance in the validation period (1996–2005) might be attributed to its effective learning and generalization from the historical data available until 1995, possibly offering a more precise representation of the observed relationships within the data during this period. On the other hand, GEP's hybrid structure, which combines linear chromosomes with tree-like expressions, might provide a broader exploration of the solution space, thereby potentially capturing evolving patterns in the data more comprehensively. This attribute could have contributed to GEP's superior performance when projecting into the future (2006–2020), a period that might have witnessed new or evolving patterns not apparent in the earlier data. Furthermore, GEP's adaptability, stemming from its evolutionary algorithm nature, might have enabled a more robust adjustment to changing data trends over time, which could be particularly beneficial in the scenario analysis encompassing various RCP scenarios. This adaptability might explain the higher correlation values exhibited by GEP across all RCP scenarios in the future projections. Additionally, the presence of outliers in the future projection data might have posed challenges that GEP was potentially better equipped to handle, thereby contributing to its better performance in this period. The scenario analysis revealed GEP's potentially superior capability in handling uncertainties associated with future climate projections, which is pivotal in the context of climate change impact analysis. Collectively, these factors underscore the nuanced interplay between model structure, evolutionary algorithms, and data characteristics in determining the relative performances of GEP and LGP across different periods of the study.

LGP model provides correlation (R) values of 0.841 for Q and 0.788 for Qs, while the GEP model provides correlation values of 0.684 for Q and 0.673 for Qs during the validation period. This comparison shows that the LGP model exhibits significantly better performance than the GEP model for discharge and sediment yield during validation period (1996–2005). The future predictions of sediment yield and discharge data from the models have also been evaluated under three distinct scenarios: RCP2.6, RCP4.5, and RCP8.5. The models made projections for the period of 2006–2100, and their performance between 2006 and 2020 was evaluated against observed values to measure the efficacy of the models. The GEP model showed the better correlation with the observed period with the R = 0.4585 while LGP has the R-value of 0.351. Moreover, the GEP and LGP models performed best for both sediment yield and discharge under the RCP8.5 scenario. The GEP model, under the RCP8.5 scenario, apart from the extreme values in abovementioned periods, forecasted a slight decrease in discharge and a noticeable decrease in sediment yield from 2021 to 2100. These findings, while unique to the research area, align with broader research that suggests climate change will lead to decreased discharge and sediment yield in river basins. For instance, Bozkurt et al. (2015) predict substantial decreases in mean annual discharge for the Euphrates and Tigris Rivers by the century's end, ranging from 19 to 58%. Similarly, Adamo et al. (2018) project a decrease in discharge within the Euphrates-Tigris basin. Although focused on a different region, Hirschberg et al. (2021) illustrate how climate change-induced shifts in precipitation and temperature can reduce sediment yield (−48%) and debris-flow occurrence (−23%). Finally, Tian et al. (2020) demonstrate that climate change results in decreased streamflow and sediment yield under RCP4.5 and RCP8.5 emission scenarios.

The aim of this research was to assess the performance of two artificial intelligence modeling techniques, GEP and machine-coded LGP, in the statistical downscaling method and to analyze projection of discharge and sediment yield under climate change effects in basin area to acquire trend of projected data. The sediment yield and discharge data of the basin have been estimated using the statistical downscaling method with GEP and LGP models, and these models have been compared with each other, as well as with a devised Linear Regression method.

The obtained projection results of GEP model under RCP 8.5 were divided into 20-year period, namely, 2020s (2021–2040), 2040s (2041–2060), 2060s (2061–2080), 2080s (2081–2100) and monthly average discharge and sediment values were graphically compared with each other and with the observed monthly average values from 2006 to 2020.

In order to view all observed and predicted years together, the values observed between 1970 and 2020 and the values predicted by the GEP model under the RCP8.5 scenario between 2021 and 2100 have been used. These are annual average values covering all results from 1970 to 2100. In the annual average values of the data projected by the GEP model RCP8.5 scenario, the general trend of discharge and sediment yield decreases slightly between 2021 and 2100.

These results suggest significant implications for the future of the basin. Climate change is expected to not only decrease water supply but also fundamentally alter the river's flow patterns. These changes will impact a range of activities dependent on the river, including hydropower generation and agricultural practices. The hydropower potential of the basin is likely to diminish due to reduced discharge, potentially affecting energy production. Additionally, regional water management practices may need revision to accommodate a decreasing water supply and the challenges it poses for agriculture. Furthermore, the projected decrease in sediment yield carries its own implications. Reduced sediment could lead to channel erosion, impacting riverine ecosystems and potentially infrastructure. On the other hand, lower sediment loads might decrease reservoir siltation, extending the lifespan of existing dams. These findings highlight the complex challenges posed by climate change. The development of effective adaptation and mitigation strategies for the basin necessitates careful consideration of these interrelated factors – decreased discharge, reduced sediment yield, and their cascading effects on both human activities and the natural environment.

The applicability of the proposed models is limited with the range of data used in the study area. The contribution of this study to the literature involves the examination of different GP models under three distinct climate change scenarios (RCP2.6, RCP4.5, RCP8.5) for predictions of two separate climatic data sets. The findings of this research are anticipated to serve as an informative resource for policymakers within governmental bodies. Additionally, they offer a significant reference point for hydrologists engaged in the quantification of water resources within river basins. This enhanced understanding can facilitate more effective management and strategic planning around water resource allocation.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Adamo
N.
,
Al-Ansari
N.
,
Sissakian
V.
,
Laue
J.
&
Knutsson
S.
2018
The future of the Tigris and Euphrates water resources in view of climate change
.
J. Earth Sci. Geotech. Eng.
8
,
59
74
.
Azamathulla
H. M.
&
Zahiri
A.
2012
Flow discharge prediction in compound channels using linear genetic programming
.
J. Hydrol.
454–455
,
203
207
.
https://doi.org/10.1016/j.jhydrol.2012.05.065
.
Azamathulla
H. M.
,
Guven
A.
&
Demir
Y. K.
2011
Linear genetic programming to scour below submerged pipeline
.
Ocean Eng.
38
,
995
1000
.
Billah
K.
,
Le
T. B.
&
Sharif
H. O.
2021
Data- and model-based discharge hindcasting over a subtropical river basin
.
Water
13
,
2560
.
https://doi.org/10.3390/w13182560
.
Birbal
P.
,
Azamathulla
H.
,
Leon
L.
,
Kumar
V.
&
Hosein
J.
2021
Predictive modelling of the stage–discharge relationship using gene-Expression programming
.
Water Supply
21
,
3503
3514
.
https://doi.org/10.2166/ws.2021.111
.
Bozkurt
D.
,
Sen
O. L.
&
Hagemann
S.
2015
Projected river discharge in the Euphrates-Tigris basin from a hydrological discharge model forced with RCM and GCM outputs
.
Clim. Res.
62
,
131
147
.
https://doi.org/10.3354/cr01268
.
Brameier
M.
2004
On Linear Genetic Programming
.
PhD Thesis
.
University of Dortmund. Markus Brameier
.
Chansawang
B.
,
Waqas
M.
,
Wanasing
U. H.
,
Hlaing
P. T.
,
Lin
H. A.
&
Ali
R.
2023
Temperature prediction by gene expression programming
. In:
2023 International Multi-Disciplinary Conference in Emerging Research Trends (IMCERT)
, pp.
1
5
.
https://doi.org/10.1109/IMCERT57083.2023.10075127
.
Danandeh Mehr
A.
,
Kahya
E.
&
Yerdelen
C.
2014
Linear genetic programming application for successive-station monthly streamflow prediction
.
Comput. Geosci.
70
,
63
72
.
https://doi.org/10.1016/j.cageo.2014.04.015
.
Earth Resources Observation and Science (EROS) Center
.
2017
Shuttle Radar Topography Mission (SRTM) 1 Arc-Second Global. https://doi.org/10.5066/F7PR7TFT
.
Esmaeili-Gisavandani
H.
,
Lotfirad
M.
,
Sofla
M. S. D.
&
Ashrafzadeh
A.
2021
Improving the performance of rainfall-runoff models using the gene expression programming approach
.
J. Water Clim. Change
12
,
3308
3329
.
https://doi.org/10.2166/wcc.2021.064
.
Ferreira
C.
2001
Gene expression programming: a new adaptive algorithm for solving problems. ArXiv Prepr. Cs 0102027
.
Fuladipanah
M.
,
Azamathulla
H. M.
,
Tota-Maharaj
K.
,
Mandala
V.
&
Chadee
A.
2023
Precise forecasting of scour depth downstream of flip bucket spillway through data-driven models
.
Results Eng.
20
,
101604
.
https://doi.org/10.1016/j.rineng.2023.101604
.
Gupta
S. K.
,
Singh
P. K.
,
Tyagi
J.
,
Sharma
G.
&
Jethoo
A. S.
2020
Rainstorm-generated sediment yield model based on soil moisture proxies (SMP)
.
Hydrol. Process.
34
,
3448
3463
.
https://doi.org/10.1002/hyp.13789
.
Guven
A.
,
Azamathulla
H. M.
&
Zakaria
N. A.
2009
Linear genetic programming for prediction of circular pile scour
.
Ocean Eng.
36
,
985
991
.
Haleem
K.
,
Khan
A. U.
,
Ahmad
S.
,
Khan
M.
,
Khan
F. A.
,
Khan
W.
&
Khan
J.
2021
Hydrological impacts of climate and land-use change on flow regime variations in upper Indus basin
.
J. Water Clim. Change
13
,
758
770
.
https://doi.org/10.2166/wcc.2021.238
.
Hirschberg
J.
,
Fatichi
S.
,
Bennett
G. L.
,
McArdell
B. W.
,
Peleg
N.
,
Lane
S. N.
,
Schlunegger
F.
&
Molnar
P.
2021
Climate change impacts on sediment yield and debris-flow activity in an alpine catchment
.
J. Geophys. Res. Earth Surf.
126
,
e2020JF005739
.
https://doi.org/10.1029/2020JF005739
.
Karunanayake
C.
,
Gunathilake
M. B.
&
Rathnayake
U.
2020
Inflow forecast of Iranamadu reservoir, Sri Lanka, under projected climate scenarios using artificial neural networks
.
Appl. Comput. Intell. Soft Comput.
2020
,
e8821627
.
https://doi.org/10.1155/2020/8821627
.
Kidanemariam
S.
,
Goitom
H.
&
Desta
Y.
2020
Coupled application of R and WetSpa models for assessment of climate change impact on streamflow of Werie catchment, Tigray, Ethiopia
.
J. Water Clim. Change
12
,
916
936
.
https://doi.org/10.2166/wcc.2020.238
.
Mampitiya
L.
,
Rathnayake
N.
,
Leon
L. P.
,
Mandala
V.
,
Azamathulla
H. M.
,
Shelton
S.
,
Hoshino
Y.
&
Rathnayake
U.
2023
Machine learning techniques to predict the air quality using meteorological data in two urban areas in Sri Lanka
.
Environments
10
,
141
.
https://doi.org/10.3390/environments10080141
.
Mehr
A. D.
,
Nourani
V.
,
Kahya
E.
,
Hrnjica
B.
,
Sattar
A. M.
&
Yaseen
Z. M.
2018
Genetic programming in water resources engineering: A state-of-the-art review
.
J. Hydrol.
566
,
643
667
.
Mollel
G. R.
,
Mulungu
D. M. M.
,
Nobert
J.
&
Alexander
A. C.
2023
Assessment of climate change impacts on hydrological processes in the Usangu catchment of Tanzania under CMIP6 scenarios
.
J. Water Clim. Change
14
,
4162
4182
.
https://doi.org/10.2166/wcc.2023.542
.
Muhammad
M. K. I.
,
Shahid
S.
,
Ismail
T.
,
Harun
S.
,
Kisi
O.
&
Yaseen
Z.
2021
The development of evolutionary computing model for simulating reference evapotranspiration over peninsular Malaysia
.
Theor. Appl. Climatol.
144
,
1419
1434
.
https://doi.org/10.1007/s00704-021-03606-z
.
Pouyanfar
S.
,
Nozari
H.
&
Khodamorad Pour
M.
2023
Comparison of the performances of the gene expression programming model and the RegCM model in predicting monthly runoff
.
J. Water Clim. Change
.
jwc2023439
.
https://doi.org/10.2166/wcc.2023.439
.
Raheel
M.
,
Iqbal
M.
,
Khan
R.
,
Alam
M.
,
Azab
M.
&
Eldin
S. M.
2023
Application of gene expression programming to predict the compressive strength of quaternary-blended concrete
.
Asian J. Civ. Eng.
24
,
1351
1364
.
https://doi.org/10.1007/s42107-023-00573-w
.
Rahmani-Rezaeieh
A.
,
Mohammadi
M.
&
Danandeh Mehr
A.
2020
Ensemble gene expression programming: A new approach for evolution of parsimonious streamflow forecasting model
.
Theor. Appl. Climatol.
139
,
549
564
.
https://doi.org/10.1007/s00704-019-02982-x
.
Ramírez Villegas
J.
&
Jarvis
A.
2010
Downscaling global circulation model outputs: the delta method decision and policy analysis Working Paper No. 1
.
Shoaib
M.
,
Shamseldin
A. Y.
,
Melville
B. W.
&
Khan
M. M.
2015
Runoff forecasting using hybrid wavelet gene expression programming (WGEP) approach
.
J. Hydrol.
527
,
326
344
.
Sireesha Naidu
G.
,
Pratik
M.
&
Rehana
S.
2020
Modelling hydrological responses under climate change using machine learning algorithms – semi-arid river basin of peninsular India
.
H2Open J.
3
,
481
498
.
https://doi.org/10.2166/h2oj.2020.034
.
Song
D.
,
Yuan
X.
,
Li
Q.
,
Zhang
J.
,
Sun
M.
,
Fu
X.
&
Yang
L.
2023
Intrusion detection model using gene expression programming to optimize parameters of convolutional neural network for energy internet
.
Appl. Soft Comput.
134
.
https://doi.org/10.1016/j.asoc.2022.109960
.
Tian
P.
,
Lu
H.
,
Feng
W.
,
Guan
Y.
&
Xue
Y.
2020
Large decrease in streamflow and sediment load of Qinghai–Tibetan Plateau driven by future climate change: A case study in Lhasa River Basin
.
CATENA
187
,
104340
.
https://doi.org/10.1016/j.catena.2019.104340
.
Tilahun
Z. A.
,
Bizuneh
Y. K.
&
Mekonnen
A. G.
2023
The impacts of climate change on hydrological processes of Gilgel Gibe catchment, southwest Ethiopia
.
PLOS ONE
18
,
e0287314
.
https://doi.org/10.1371/journal.pone.0287314
.
Verma
S.
,
Verma
M. K.
,
Prasad
A. D.
,
Mehta
D.
,
Azamathulla
H. M.
,
Muttil
N.
&
Rathnayake
U.
2023
Simulating the hydrological processes under multiple land Use/Land cover and climate change scenarios in the Mahanadi Reservoir Complex, Chhattisgarh, India
.
Water
15
,
3068
.
https://doi.org/10.3390/w15173068
.
Xu
C.
,
Wang
Y.
,
Fu
H.
&
Yang
J.
2022
Comprehensive analysis for long-term hydrological simulation by deep learning techniques and remote sensing
.
Front. Earth Sci.
10
,
875145
.
Yang
L.
,
Feng
Q.
,
Yin
Z.
,
Wen
X.
,
Si
J.
,
Li
C.
&
Deo
R. C.
2017
Identifying separate impacts of climate and land use/cover change on hydrological processes in upper stream of Heihe River, Northwest China
.
Hydrol. Process.
31
,
1100
1112
.
https://doi.org/10.1002/hyp.11098
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).