This study evaluates and enhances machine learning models for predicting pan evaporation under diverse climatic conditions. Five fundamental machine learning models were employed and tested across four different stations. Subsequent comparisons were made with advanced techniques, including long short-term memory (LSTM) networks. An innovative approach was introduced, combining LSTM with Binary Al-Biruni Earth Radius (BER–LSTM). This hybrid method was benchmarked against other optimization techniques. The BER–LSTM model consistently outperformed other models across all stations and time scales, achieving up to a 97.54% improvement in root mean square error (RMSE) compared to standard LSTM on daily time scales. Compared to simpler models like Multilayer Perceptron and Support Vector Regressor, BER–LSTM showed even more substantial improvements, with up to a 99.03% reduction in RMSE. The BER–LSTM model demonstrates superior predictive capabilities for pan evaporation across varied climatic conditions, offering significant improvements over both traditional and advanced machine learning techniques. This approach shows promise for enhancing evaporation forecasting in diverse environmental contexts.

  • Novel hybrid data-driven method for accurate evaporation prediction in Algeria's distinct climatic zones.

  • Importance of model selection based on performance and computational resources highlighted.

  • Proposed BER–LSTM model enhances performance and surpasses existing techniques in accuracy.

  • Results provide insights into developing advanced evaporation forecasting methods applicable to water-scarce regions globally.

Evaporation, the process by which water is converted from liquid to vapor, is a crucial component of the water cycle and plays a significant role in the availability of water resources. Accurate prediction of evaporation rates is essential for managing water resources, particularly in regions with limited water supplies, such as Northern African countries (Algeria, Tunisia, Libya, and Egypt) that suffer from climate change and water scarcity (Benzaghta et al. 2012; Alazard et al. 2015; Latif 2024). Inadequate water resources can lead to water scarcity and negatively impact agriculture, industry, and the overall socioeconomic well-being of the region (Kherbache 2020; Frih et al. 2021; Bouznit et al. 2022; Ogunbode et al. 2024). Accurate prediction of evaporation rates can help optimize irrigation and water allocation and aid in managing droughts (Mouatadid et al. 2018). Furthermore, evaporation prediction plays a key role in climate modeling, as it is a key component of the water and energy balance of the earth (Mouatadid et al. 2018). Predicting evaporation rates can improve our understanding of the earth's climate system, thus providing valuable information for mitigating the effects of climate change (El-Mahdy et al. 2021).

Predicting evaporation is challenging because it is affected by various factors, including temperature, humidity, wind speed, and solar radiation. Numerous methods have been developed over the years to predict evaporation rates, such as empirical, theoretical, and database models (Zheng et al. 2011; Song et al. 2022; Alam et al. 2024). However, these methods often have limitations and may not be suitable for all climates or regions. Therefore, developing a method that accurately predicts evaporation rates across different climates is essential. Evaporation can be estimated using direct or indirect approaches, such as the Moran method (Moran et al. 1996) and the Penman empirical equation. One direct method involves measuring evaporation from a Class pan that is 1.22 m in diameter and 0.25 m deep, placed 0.15 m above the ground. This technique is simple, cost-effective, and provides accurate results over time but is limited to specific climatic regions.

In contrast, indirect methods use meteorological variables like sunshine duration, wind speed, relative humidity, and temperature to estimate evaporation. These indirect approaches are often used in empirical and semi-empirical models but can be problematic due to the dynamic nature of the variables involved (Patel & Majmundar 2016). To overcome this, developing reliable and robust intelligent prediction models for evaporation has become an important focus in water resources management and engineering.

Recently, numerous studies have focused on implementing machine learning models for evaporation estimation (Wang et al. 2017; Mohamadi et al. 2020; Rajput et al. 2024). Several models have emerged, including evolutionary computing (Ehteram et al. 2022), deep learning (Lakmini Prarthana Jayasinghe et al. 2022), Fuzzy Logic (Adnan Ikram et al. 2022), Random Forest (Adnan Ikram et al. 2022), and eXtreme Gradient Boosting (XGBoost) (Malik et al. 2022; Karbasi et al. 2024). These models have demonstrated impressive predictive accuracy (El-kenawy et al. 2022). The application of machine learning extends beyond evaporation prediction to other environmental factors. Kumar et al. (2023) reviewed deep learning applications in flood forecasting, while Mampitiya et al. (2023) demonstrated the effectiveness of models like LightBGM in predicting urban air quality. These studies underscore the potential of advanced machine learning techniques in addressing various environmental challenges.

Machine learning models like Support Vector Machines (SVMs), Gene Expression Programming (GEP), and Cascade Correlation Neural Networks (CCNNs) have significantly advanced hydrologic modeling (Thakur & Manekar 2022; Hussein et al. 2024). Hybrid models, which combine multiple approaches, have gained popularity for their superior accuracy in predicting evaporation rates under various conditions. For instance, at Iranshahr station and Chahnimeh Zabol reservoirs in Iran, Keshtegar et al. (2016) utilized nonlinear mathematical models of daily pan evaporation. They found that a conjugate line search method outperformed ANFIS and decision tree models. Similarly, Arunkumar et al. (2017) compared artificial neural networks (ANNs), genetic programming, and decision tree models with different meteorological inputs, concluding that genetic programming was the most precise.

In another study, Wang et al. (2017) discovered that the least squares SVM and fuzzy genetics were better predictors than decision tree models. At the Talesh Meteorological station in Iran, Ali Ghorbani et al. (2018) developed an MLP–FFA model by combining the Firefly algorithm with ANNs. This model outperformed both the stand-alone MLP and support vector machine models. Further research by Sebbar et al. (2019) employed an Online Sequential Extreme Learning Machine (OSELM) and an Optimal Pruned Extreme Learning Machine (OPELM) to predict evaporation with high accuracy at various dam reservoirs in Algeria's Mediterranean regions. Their findings showed that OSELM outperformed OPELM in estimating daily evaporation.

Additionally, Keshtegar et al. (2019) combined the response surface method with SVM (RSM-SVM) and found it more effective than using SVM, ANN, or RSM models alone. Guan et al. (2020) improved evaporation prediction accuracy by combining the SVM model with the Krill algorithm. The new SVM–Krill model proved suitable not only for evaporation prediction but also for other hydrologic variables.

Overall, hybrid models have several advantages over conventional models, such as integrating data from multiple sources, considering complex physical and statistical processes simultaneously, and processing data more efficiently. They are also more reliable because they can handle outliers and extreme values better. The application of these methods has led to improved evaporation prediction. However, more research is needed to understand the complex processes involved in evaporation fully and to develop even more accurate prediction models. Managing Algeria's water resources is important as the country's water demand increases due to population growth and the expansion of its industrial activities. In addition, Algeria is a semi-arid desert country, so the country's water resources are limited. Accurate prediction of evaporation rates is essential for successfully managing available water resources. However, each of the country's different climatic zones has characteristics that affect evaporation rates. Therefore, developing a technique capable of estimating evaporation rates in each climate zone is paramount.

This research explores a novel hybrid deep learning approach to predicting pan evaporation across Algeria's various climatic zones. The approach integrates long short-term memory (LSTM) networks with the BER algorithm. A binary version of the BER algorithm will be employed for feature selection, refining the dataset by identifying the most relevant variables for prediction. The proposed approach will be tested using historical data, with results compared against those from other machine learning techniques used for evaporation prediction.

Algeria is a country of remarkable geographical diversity, stretching from the Mediterranean coast to the vast expanses of the Sahara Desert, which accounts for over 80% of its landmass (El-Mahdy et al. 2021). This desert environment subjects the region to extreme conditions, including scorching summers, harsh droughts, frigid winters, and strong winds. The cities of Adrar (ADR), Bechar (BCH), and Tamanrasset (TAM) lie within this hot desert climate, where intense heat and minimal rainfall dominate. In contrast, Tizi Ouzou (TIZ), located in the Sebaou River basin in northern Algeria, experiences a subhumid bioclimatic zone. This area typically endures two distinct seasons: a dry period from May to September and a wet season from October to May.

The temperature and humidity data for this study were sourced from four stations (refer to Figure 1 and Table A1, Supplementary Material) provided by the Algerian National Agency of Water Resources (ANRH). Table A1 (Supplementary Material) outlines comprehensive details on these weather monitoring stations, including their location, identification codes, time scales of measurements, latitude, longitude, measurement periods, and Köppen climate classifications (Peel et al. 2007). For example, the station in Adrar records data daily or monthly, depending on the measurement scale. The geographical coordinates are given in degrees north or east, with measurement periods ranging from 1996 to 2015. These stations are spread across different climatic zones, with the predominant classifications being BWh (hot desert climate) and Csa (Mediterranean climate).
Figure 1

Digital elevation model of Algeria and localization of the studied meteorological stations.

Figure 1

Digital elevation model of Algeria and localization of the studied meteorological stations.

Close modal

Water management is a critical concern in a country with such diverse climates. Saggai et al. (2016) observed that the Algerian government's strategy to boost water reserves involves several initiatives, including the construction of dams and reservoirs. However, this approach encounters significant challenges, particularly due to meteorological conditions that lead to substantial evaporation, especially in arid regions. These regions were severely affected by prolonged droughts during the 1980s and 1990s (Khezazna et al. 2017; Elbeltagi et al. 2022; Habibi et al. 2024; Mihi et al. 2024; Oubadi et al. 2024; Zerouali et al. 2024). A study by Remini & Hallouche (2005) on evaporation losses in 39 dam lakes in arid and semi-arid regions of Algeria found that the volume of water lost to evaporation significantly exceeds that lost to siltation. On average, these losses amount to 250 million cubic meters annually across the 350 square kilometers of the dams studied, representing 6.5% of their total capacity. To address this issue, Saggai et al. (2016) and Saggaï & Bachi (2018) proposed a strategy to reduce evaporation in dams by using long-chain substances that form at the air–water interface, thereby enhancing water storage in arid regions.

The statistical data from the TAM, TIZ, ADR, and BCH stations under study are summarized in Table A2 (Supplementary Material). These data are presented on two time scales: daily and monthly. The TAM and TIZ stations provide daily data, while the ADR and BCH stations offer monthly reports. This variation in reporting frequency enables both detailed and broader trend analysis across different climatic regions. For the monthly data, the ADR station records a mean temperature of 25.9 °C, while the BCH station has a mean of 20.7 °C. The maximum temperature (Tmax) at ADR is 33.9 °C, compared to 26.6 °C at BCH. The minimum temperature (Tmin) at ADR is 17.3 °C, whereas at BCH, it is 14.4 °C. Maximum pan evaporation at ADR is 22.1 mm, while at BCH, it is 23.9 mm. These figures highlight the generally warmer and more extreme climate of ADR compared to BCH, which aligns with their respective geographic locations and climate classifications, as shown in Table A1 (Supplementary Material).

The daily data reveal a different pattern. At the TAM station, the mean temperature is 23.6 °C, whereas at the TIZ station, it is 18.8 °C. TAM's maximum temperature (Tmax) reaches 48.4 °C, while TIZ records a slightly higher Tmax of 49 °C. The minimum temperature (Tmin) at TAM is −1.6 °C, compared to 0.0 °C at TIZ. Maximum pan evaporation is 21.4 mm at TAM and 14.5 mm at TIZ. These daily data indicate more extreme temperature ranges compared to the monthly averages, particularly for TAM and TIZ, which is essential for understanding short-term climate variations and their potential impacts on water resources.

Figure 2 illustrates the relative frequency percentages of pan evaporation across bin-center ranges for the four stations: ADR, BCH, TAM, and TIZ. This figure complements the summary statistics in Table A2 (Supplementary Material) by providing a detailed distribution of pan evaporation rates, offering insights into the prevalence of different pan evaporation levels at each station. At the ADR station, the highest relative frequency, 10.2%, occurs at a bin-center of 4.45 mm, indicating that low pan evaporation amounts around this range are the most common. Frequencies fluctuate between 12% and 16.7% across subsequent bin centers, ranging from 6.75 to 20.55 mm. Very low (0% at 2.15 mm) and very high (0.9% at 22.85 mm) bin-center values are rare. This pattern aligns with the monthly maximum pan evaporation of 22.1 mm reported in Table A2 (Supplementary Material) for ADR, suggesting that while moderate pan evaporation amounts are relatively common, extreme values are infrequent at this site.
Figure 2

The relative frequency percentage of pan evaporation is based on bin-center ranges for the ADR, BCH, TAM, and TIZ stations.

Figure 2

The relative frequency percentage of pan evaporation is based on bin-center ranges for the ADR, BCH, TAM, and TIZ stations.

Close modal
Figure 3

Demonstration of the exploration and exploitation activities.

Figure 3

Demonstration of the exploration and exploitation activities.

Close modal

In contrast, the BCH station displays a different distribution pattern. The peak relative frequency is 22.5% at 4.45 mm, representing the most common low pan evaporation range for this location. As bin-center values increase, frequencies decline sharply, indicating fewer instances of higher pan evaporation amounts. Notably, there are two secondary peaks at 6.75 mm (14.1%) and 20.55 mm (12.7%). Similar to ADR, extremely small (0% at 2.15 mm) or large (2.8% at 22.85 mm) pan evaporation amounts are rarely recorded. This distribution corroborates the slightly higher maximum pan evaporation of 23.9 mm for BCH reported in Table A2 (Supplementary Material), revealing a more complex pan evaporation pattern than what the summary statistics alone suggest.

At the TAM station, the distribution peaks at 20.5% in the moderately low 4.45 mm bin, representing the most common pan evaporation range. Substantial frequencies, ranging from 18.4 to 25.1%, persist across bin centers up to 11.35 mm. However, as pan evaporation amounts increase toward higher bins, frequencies drop, with minimal occurrences above 15.95 mm (3.4% or lower). This pattern is consistent with the daily maximum pan evaporation of 21.4 mm reported for TAM in Table A2 (Supplementary Material), highlighting a focus on typically low to moderate pan evaporation levels, with values rarely exceeding 18 mm.

Conversely, the TIZ station exhibits a strong skew toward lower pan evaporation amounts, with a significant peak frequency of 28.5% at the 4.45 mm bin. Frequencies decrease sharply to 12.8 and 5.2% for the next few bins, up to 11.35 mm, and taper off to 0% from 13.65 mm onwards. This distribution aligns with the lower maximum pan evaporation of 14.5 mm reported for TIZ in Table A2 (Supplementary Material), reflecting its location in a more humid climate as indicated by its classification in Table A1 (Supplementary Material).

The LSTM network is an advanced neural architecture designed to handle sequential data while preserving long-term dependencies. Initially developed by Hochreiter & Schmidhuber (1997), the LSTM algorithm is particularly effective in managing non-stationary data and predicting outcomes based on time series analysis (Adnan Ikram et al. 2022; Jean et al. 2023). The network consists of memory cells, input gates, output gates, and forget gates (Figure A1 (Supplementary Material)), all of which work in unison to retain crucial information and make informed decisions (Fan et al. 2020; Ruiz et al. 2023). LSTM technology is extensively utilized in natural language processing tasks, such as text classification and language translation. Additionally, it finds application in various engineering domains, including fault diagnosis, predictive maintenance, and process control (Ersöz et al. 2022; Yahya et al. 2024). Its capability to learn from time series data and process static information makes the LSTM algorithm a valuable tool across fields like energy, engineering, and robotics (Khandan et al. 2024). The primary computational challenges associated with the LSTM algorithm involve matrix-vector products, as represented by Equations (11)–(16).
(1)
(2)
(3)
(4)
(5)
(6)

The LSTM model makes predictions by considering the outputs from its four gates – input gate, forget gate (), output gate (), and update gate () – at each time step t. These gates process the previous time step's model output and the current time step's inputs to make decisions. The weights and biases for the gates (, , , and ) and (, , , and ) are continually adjusted during the learning phase. The memory cell and the activation function σ also plays a crucial role in the LSTM's computations.

The Al-Biruni Earth Radius (BER) technique draws inspiration from swarm intelligence, particularly how insects like ants and bees communicate and collaborate to achieve their goals. The first was proposed by El-Kenawy et al. (2023), this cooperative optimization algorithm divides the population into two sub-groups, each focused on either exploration or exploitation. This approach simulates how swarms naturally split into sub-groups to complete different activities efficiently (El-kenawy et al. 2023). BER addresses the problem of local optima stagnation by maintaining a diverse collection of search agents that constantly explore different areas of the search space. It also dynamically adjusts the number of individuals investigating the space if performance stagnates. This approach leads to a more thorough investigation of the search space and a higher probability of finding the optimal solution to the optimization problem.

The BER algorithm has proven its effectiveness in several domains. It has optimized a deep convolutional neural network for classifying Monkeypox Disease (Khafaga et al. 2022), enhanced predictions for a hybrid solar desalination system (Ibrahim et al. 2023), and improved renewable energy forecasting through a stacking ensemble model (Alghamdi et al. 2023). This innovative approach demonstrates significant potential across various applications.

Balancing exploration and exploitation

To maintain a balance between exploitation and exploration, the BER algorithm adjusts the size of its two sub-groups dynamically (Alazard et al. 2015). Initially, 70% of the population is assigned to exploration, while the remaining 30% focuses on exploitation. As the algorithm advances, this ratio shifts, with 70% of individuals eventually dedicated to exploitation and only 30% to exploration. This transition enables a more concentrated effort to enhance global fitness.

To ensure convergence, the algorithm employs an elitism strategy by retaining the best solution if a better one is not found. If a solution's fitness does not improve significantly for three consecutive iterations, potentially indicating a local optimum, the algorithm generates a new exploration individual through mutation.

Avoiding local optima stagnation

In the BER algorithm, the exploration phase plays a crucial role in avoiding stagnation in local optima. This is achieved by constantly exploring new regions in the search space to discover the best solution. The algorithm ensures that individuals in the exploration group continuously move toward better solutions by searching nearby feasible areas. The equations used in the BER exploration process are as follows:
(7)
(8)
(9)
where is a number randomly chosen from the range [0,2], and are coefficient vectors with values calculated by Equation (3), is the solution vector at iteration t, and is the circle's diameter in which the search agent will look for attractive regions. Figure 3 demonstrates these exploration activities.

Improving solutions with exploitation

The exploitation operation in the BER optimization algorithm improves existing solutions. The BER calculates all individuals' fitness values at each cycle and identifies the best individual. The algorithm employs two approaches to achieve exploitation, including heading toward the best solution. The following equations move the search agent toward the best solution in this approach.
(10)
(11)

The goal of the exploitation group is to enhance existing solutions. BER assesses the fitness values of all individuals at each iteration and identifies the optimal solution. BER implements two techniques to accomplish exploitation, which are explained in the following sections.

Investigating area around the best solution

The area surrounding the best solution, referred to as the leader, is considered the most promising region. Consequently, some individuals explore this area to potentially find an even better solution. This operation is executed using the following equation:
(12)
(13)
where (S*) is the best solution, z is a random number in the range [0, 1], t is the iteration number, and N is the total number of iterations. Figure 3 illustrates the exploration and exploitation activities.

Mutation operation

The mutation operation is used in the exploitation team to generate new solutions. Mutation helps the optimization process escape from local optima, leading to more extensive search space exploration. The BER optimization algorithm uses a mutation operation in the following way:

(14)

Finding the optimal solution

The BER algorithm is designed to consistently select the best solution to guide the search process. While the elitism method is effective, it can sometimes lead to premature convergence when solving multimodal functions. To address this issue, BER combines a mutation operation with a search around individuals in the exploration group, maintaining population diversity. This feature makes BER less susceptible to early convergence and enables it to explore more promising solutions.

The BER pseudo-code (Algorithm 1) requires inputs for iteration number, population size, and mutation rate. The population is divided into exploration and exploitation groups, with the number of individuals in each group dynamically adjusted throughout the search process. To maintain diversity, solutions are randomly reordered after each iteration. The elitism method ensures that the best solution is preserved. Consequently, a solution from the exploration group in one iteration may become part of the exploitation group in the next, demonstrating the dynamic nature of the BER algorithm.

Evaluating a model's performance is crucial for understanding its predictive accuracy. This involves analyzing how well the estimated values match the actual measurements. Statistical techniques such as the coefficient of determination (R2), mean bias error (MBE), and root mean square error (RMSE) are commonly used for this purpose. Each method offers unique insights into the model's effectiveness, helping to pinpoint areas for improvement (Mostafa & Bahgaat 2017; Bailek et al. 2020; Keshtegar et al. 2022; Maraj et al. 2022; Ahmed et al. 2023; Saab et al. 2023; El-kenawy et al. 2024; Yadav et al. 2024; Yuzer & Bozkurt 2024; Zerouali et al. 2024). RMSE, calculated by the formula:
(15)

RMSE provides a measure of the average magnitude of prediction errors. By taking the square root of the average squared differences, RMSE gives a straightforward indication of how close predictions are to actual values. A lower RMSE suggests that the model's predictions are closer to the true measurements, thereby enhancing the reliability of the model.

MBE is another essential metric, given by:
(16)
MBE highlights whether the model tends to overestimate or underestimate the values. This average bias calculation helps in understanding the direction of errors, providing valuable feedback on the model's systematic inaccuracies. Thus, MBE can reveal whether adjustments are needed to correct consistent biases. Mean square error (MSE) is also used to gauge model accuracy:
(17)
The coefficient of determination is calculated using:
(18)

indicates the proportion of variance in the dependent variable that is predictable from the independent variable. A higher value reflects a better fit of the model to the data, showing that the model accounts for a larger portion of the variability.

First, five fundamental machine learning models were employed – Multilayer Perceptron (MLP), Support Vector Regressor (SVR), Decision Tree Regressor (DTR), Random Forest Regressor (RFR), and Nearest Neighbor Estimation (NNE) – at each station. The dataset was divided into two sets, with 80% allocated for training and 20% for testing. Min–max scaling was applied to enhance model accuracy.

Table 1 highlights the performance of various machine learning models across four stations, evaluated using the R2 coefficient of determination on both monthly and daily time scales. The MLP model outperforms others in monthly predictions, achieving R2 values of 0.9167 at the ADR station and 0.9573 at the BCH station. Following closely, the SVR model shows R2 values of 0.8992 at ADR and 0.9520 at BCH. For daily predictions, the R2 values generally decline. The DTR model performs best at the TAM station, with an R2 of 0.2320, while the NNE model achieves the highest R2 value of 0.8224 at the TIZ station. Despite the R2 metric providing a useful measure of model performance, relying on it alone may not yield the most accurate model selection. Other metrics like MBE, RMSE, and bias should also be considered.

Table 1

Performance evaluation of classical machine learning models across four stations using R2 on monthly and daily time scales

ModelsADRBCHTAMTIZ
Monthly
Daily
MLP 0.9167 0.9573 0.0857 0.7618 
SVR 0.8992 0.9520 0.1711 0.7566 
DTR 0.8802 0.9475 0.2320 0.7117 
RFR 0.8801 0.9000 0.0980 0.7760 
NNE 0.8768 0.9431 0.0987 0.8224 
ModelsADRBCHTAMTIZ
Monthly
Daily
MLP 0.9167 0.9573 0.0857 0.7618 
SVR 0.8992 0.9520 0.1711 0.7566 
DTR 0.8802 0.9475 0.2320 0.7117 
RFR 0.8801 0.9000 0.0980 0.7760 
NNE 0.8768 0.9431 0.0987 0.8224 

Table 2 presents a comparative analysis of machine learning models across four stations. The data reveals intriguing performance variations. RMSE values span from 0.0628 to 0.1958, indicating diverse prediction accuracies. MBE ranges from −0.0558 to 0.0603. This suggests varying degrees of model bias. The R2 values exhibit the widest range, spanning from 0.0857 to 0.9573. For the ADR and BCH stations, the models generally exhibit low MBE, low RMSE, and high R2 scores, indicating a good fit to the data. The MLP model performs particularly well at ADR, with MBE, RMSE, and R2 values of 0.0341, 0.0758, and 0.9167, respectively. In contrast, the models for the TAM and TIZ stations show higher MBE and RMSE values coupled with lower R2 scores. For TAM, the DTR model achieves the best performance, with MBE, RMSE, and R2 values of −0.0396, 0.1795, and 0.2320, respectively. For TIZ, the NNE model excels with values of 0.0252, 0.0628, and 0.8224.

Table 2

RMSE, MBE, and R2 (normalized values) range for classical machine learning models at four study sites

ModelsStationsMBERMSER2StationsMBERMSER2
MLP ADR 0.0341 0.0758 0.9167 TAM −0.0443 0.1958 0.0857 
SVR 0.0528 0.0834 0.8992 −0.0319 0.1864 0.1711 
DTR 0.0481 0.0909 0.8802 −0.0396 0.1795 0.2320 
RFR 0.0470 0.0910 0.8801 −0.0558 0.1945 0.0980 
NNE 0.0603 0.0922 0.8768 −0.0524 0.1944 0.0987 
MLP BCH 0.0070 0.0635 0.9573 TIZ 0.0203 0.0727 0.7618 
SVR 0.0006 0.0673 0.9520 0.0254 0.0735 0.7566 
DTR −0.0025 0.0704 0.9475 0.0300 0.0800 0.7117 
RFR −0.0185 0.0972 0.9000 0.0288 0.0705 0.7760 
NNE 0.0030 0.0733 0.9431 0.0252 0.0628 0.8224 
ModelsStationsMBERMSER2StationsMBERMSER2
MLP ADR 0.0341 0.0758 0.9167 TAM −0.0443 0.1958 0.0857 
SVR 0.0528 0.0834 0.8992 −0.0319 0.1864 0.1711 
DTR 0.0481 0.0909 0.8802 −0.0396 0.1795 0.2320 
RFR 0.0470 0.0910 0.8801 −0.0558 0.1945 0.0980 
NNE 0.0603 0.0922 0.8768 −0.0524 0.1944 0.0987 
MLP BCH 0.0070 0.0635 0.9573 TIZ 0.0203 0.0727 0.7618 
SVR 0.0006 0.0673 0.9520 0.0254 0.0735 0.7566 
DTR −0.0025 0.0704 0.9475 0.0300 0.0800 0.7117 
RFR −0.0185 0.0972 0.9000 0.0288 0.0705 0.7760 
NNE 0.0030 0.0733 0.9431 0.0252 0.0628 0.8224 

The variation in model performance across stations can be partly attributed to differences in pan evaporation patterns observed in Figure 2. For example, the complex distribution of pan evaporation at the TAM station, with substantial frequencies across a wider range of bin centers, may contribute to the lower R2 scores and higher RMSE values at this station. Conversely, the strong skew toward lower pan evaporation amounts at the TIZ station, as shown in Figure 2, might explain the better performance of the NNE model there.

MLP, SVR, and NNE are the top-performing models for daily time scales, while MLP and SVR show the best results for monthly time scales. Interestingly, the model with the lowest RMSE and MBE values does not always correspond to the highest R2 value, suggesting that different criteria may be necessary to determine the best-performing models. Minimizing RMSE and MBE generally results in the MLP model performing best across all stations and time scales. However, maximizing R2 occasionally leads to models like SVR and NNE outperforming MLP. All models show relatively low R2 values for the TAM station, indicating that they may not be well-suited for capturing the temporal variability of the data at this location. This observation aligns with the more extreme temperature ranges and higher maximum pan evaporation reported for TAM in Table A2 (Supplementary Material), suggesting that the complex climatic conditions at this station pose a particular challenge for these models. Exploring more complex models, such as deep learning techniques like LSTM, may be worthwhile for this type of data.

Second, we further explore the performance of advanced models, specifically comparing LSTM with MLP and SVR in Figure 4. The data in Figure 4 show that LSTM outperforms MLP and SVR in terms of RMSE values at most stations, demonstrating a significant improvement in accuracy. For instance, at the ADR station, the RMSE for MLP is 0.0758, while LSTM achieves a much lower RMSE of 0.0286, reflecting a 62% improvement. Similarly, at the BCH station, MLP's RMSE is 0.0635, compared to LSTM's 0.0231, indicating a 64% enhancement. These results are consistent across all stations, with LSTM consistently achieving lower RMSE values. At the TAM station, the RMSE for LSTM (0.0246) is substantially lower than those for MLP (0.1958) and SVR (0.1864). Even at the TIZ station, where the difference is less pronounced, LSTM (0.0650) still outperforms both MLP (0.0727) and SVR (0.0735). This significant improvement in RMSE values aligns with the earlier observation about the potential benefits of using more complex models like LSTM, especially for stations like TAM, where simpler models struggled to capture temporal variability. Numerous studies support the efficiency of the LSTM algorithm for evaporation modeling and forecasting. For instance, Majhi et al. (2019) found that deep-LSTM outperformed MLP and empirical methods such as Hargreaves and Blaney–Criddle. Similarly, Abed et al. (2022) reported that LSTM provided the most accurate monthly evaporation estimates across all stations studied in Malaysia. Zhao et al. (2021) compared LSTM's RMSE with other EDH models, including Naval-Postgraduate-School (NPS) and Babin–Young–Carton (BYC), finding a significant reduction in RMSE with LSTM. Although LSTM consistently outperforms MLP and SVR in the cases presented, it is important to recognize that selecting the best model depends on the specific application and the balance between performance and computational resources.
Figure 4

Comparison assessment of MLP, SVR, and LSTM models on evaporation data based on RMSE metric (normalized values).

Figure 4

Comparison assessment of MLP, SVR, and LSTM models on evaporation data based on RMSE metric (normalized values).

Close modal

To further enhance the accuracy of the LSTM model, particularly given its superior performance compared to simpler models, as shown in Figure 4, we plan to implement a new step involving improving BER. This approach could potentially enhance the model's ability to capture the temporal variability of the data, especially for challenging stations like TAM.

Figure 5 displays the results of two models, LSTM and BER–LSTM, applied to evaporation data across various stations and time scales. The results demonstrate that BER–LSTM consistently outperforms LSTM in terms of RMSE across all stations and time scales, with substantial improvements observed. On the monthly time scale at the ADR station, the BER–LSTM RMSE of 0.0058 represents a 79.72% improvement over LSTM's RMSE of 0.0286. Similarly, at the BCH station, BER–LSTM achieves an RMSE of 0.0065, reflecting a 71.86% enhancement over LSTM's RMSE of 0.0231. On daily time scales, BER–LSTM also surpasses LSTM, showing significantly lower RMSE values. For instance, at the TAM station, the BER–LSTM RMSE is 0.0019, compared to LSTM's RMSE of 0.0246, marking a 92.28% improvement. Likewise, at the TIZ station, BER–LSTM achieves an RMSE of 0.0016, while LSTM's RMSE is 0.0650, indicating a 97.54% enhancement.
Figure 5

Comparison assessment of LSTM and BER–LSTM models on pan evaporation data based on RMSE metric (normalized values).

Figure 5

Comparison assessment of LSTM and BER–LSTM models on pan evaporation data based on RMSE metric (normalized values).

Close modal
Comparing the results in Figure 5 with those in Figure 4 reveals that the BER–LSTM model significantly outperforms MLP and SVR models in terms of RMSE. At the TAM station on a daily time scale, the BER–LSTM RMSE of 0.0019 represents improvements of approximately 99.03 and 98.98% compared to MLP's RMSE of 0.1958 and SVR's RMSE of 0.1864, respectively. These findings demonstrate that incorporating the BER enhancement into the LSTM model can significantly boost its performance in capturing the temporal variability of the data. Figure 6 illustrates the comparison between actual and predicted evaporation values for the monthly (ADR and BCH) and daily (TAM and TIZ) stations using BER–LSTM. The results indicate a strong agreement between observed and simulated values.
Figure 6

Comparison between actual and predicted pan evaporation of the monthly (ADR and BCH) and daily (TAM and TIZI) stations using BER–LSTM.

Figure 6

Comparison between actual and predicted pan evaporation of the monthly (ADR and BCH) and daily (TAM and TIZI) stations using BER–LSTM.

Close modal
Figure 7

Comparison of MSE performance (normalized values) for BER–LSTM, PSO–LSTM, GWO–LSTM, GA–LSTM, and WOA–LSTM across multiple runs.

Figure 7

Comparison of MSE performance (normalized values) for BER–LSTM, PSO–LSTM, GWO–LSTM, GA–LSTM, and WOA–LSTM across multiple runs.

Close modal

Next, the model was tested against PSO–LSTM, GWO–LSTM, GA–LSTM, and WOA–LSTM across multiple stations and time scales. Table 3 summarizes the parameters and values of these optimization algorithms, highlighting their roles in enhancing model accuracy. The GWO algorithm employs 20 wolves over 19 iterations, with the parameter ‘a’ decreasing from 2 to 0, facilitating both exploration and exploitation phases. PSO operates with inertia weights ranging from 0.9 to 0.6, acceleration constants fixed at 2, and 20 particles over the same number of iterations, ensuring a well-balanced optimization process. The WOA algorithm similarly uses 20 whales and 19 iterations, with ‘a’ also ranging from 2 to 0, and introduces randomness through a variable ‘r’ within the [0, 1] interval. Finally, the GA algorithm features a mutation ratio of 0.1 and a crossover rate of 0.9, and it employs a roulette wheel selection mechanism with a population of 20 and 19 generations. Finally, the BER algorithm uses a population size of 30 over 500 iterations, with a mutation probability of 0.5, an exploration percentage of 70%, a K value decreasing from 2 to 0, and is run 19 times.

Table 3

Summary of algorithm parameters and values for optimization methods

AlgorithmsParameter(s)Value(s)
GWO  2–0 
 #Wolves 20 
 #Iterations 19 
PSO Inertia ,  [0.9, 0.6] 
 Acceleration constants ,  [2, 2] 
 #Particles 20 
 #Iterations 19 
WOA  2 to 0 
  Random [0, 1] 
 #Whales 20 
 #Iterations 19 
GA Mutation ratio 0.1 
 Crossover 0.9 
 Selection mechanism Roulette wheel 
 #Population 20 
 #Generations 19 
BER Size of population 30 
 Iterations count 500 
 Mutation probability 0.5 
 Exploration percentage 70 
 K (decreases from 2 to 0) 
 Number of runs 19 
AlgorithmsParameter(s)Value(s)
GWO  2–0 
 #Wolves 20 
 #Iterations 19 
PSO Inertia ,  [0.9, 0.6] 
 Acceleration constants ,  [2, 2] 
 #Particles 20 
 #Iterations 19 
WOA  2 to 0 
  Random [0, 1] 
 #Whales 20 
 #Iterations 19 
GA Mutation ratio 0.1 
 Crossover 0.9 
 Selection mechanism Roulette wheel 
 #Population 20 
 #Generations 19 
BER Size of population 30 
 Iterations count 500 
 Mutation probability 0.5 
 Exploration percentage 70 
 K (decreases from 2 to 0) 
 Number of runs 19 

Figure 7 illustrates the MSE performance results across all independent runs for the five models: BER–LSTM, PSO–LSTM, GWO–LSTM, GA–LSTM, and WOA–LSTM. Additionally, Figure 8 shows the MSE values obtained for each model in these runs. The model with the lowest RMSE value in each run is identified as the best-performing model for that specific run. The findings reveal that the BER–LSTM model consistently achieves the lowest MSE values across all independent runs, demonstrating robust performance. In contrast, the other models exhibit more variability, with some runs yielding better results than others. Among these, GA–LSTM and GWO–LSTM emerge as the next best performers after BER–LSTM, showing the lowest MSE values in several instances. However, it is crucial to recognize that model performance can vary depending on the specific problem or dataset, suggesting that further testing may be necessary to confirm their relative effectiveness (Malik et al. 2022).
Figure 8

MSE values (normalized values) across independent runs for BER–LSTM and comparative models.

Figure 8

MSE values (normalized values) across independent runs for BER–LSTM and comparative models.

Close modal

Finally, the analysis of Table A3 (Supplementary Material) underscores the clear superiority of the BER–LSTM model introduced in this study for pan evaporation prediction. Particularly in daily predictions, the model achieves an RMSE of 0.02341, marking a significant improvement over other models. For instance, compared to the study by Ghorbani et al. (2017), who reported an RMSE of 1.007 using MLP–FFA in northern Iran, the BER–LSTM model demonstrates a remarkable 97.67% reduction in error. Even when compared to the SVM model by BENCHAIBA et al. (2022), which achieved an RMSE of 0.224 in New Delhi, the BER–LSTM still shows an impressive 89.55% improvement. For monthly predictions, the BER–LSTM model's RMSE of 3.9303 also stands out. It is 86.74% lower than the 29.64 reported by Elbeltagi et al. (2023) using the AR-M5P model in Mosul, Iraq, and 91.03% lower than the 43.84 RMSE reported for the ARIMA model by (Benchaiba et al. 2022) at Ain Zada Dam, Algeria.

Overall, these comparisons clearly highlight the BER–LSTM model's superiority, particularly in its ability to significantly reduce prediction errors, reinforcing its effectiveness for both daily and monthly pan evaporation predictions.

This study aimed to develop an innovative hybrid approach combining LSTM networks with the BER technique and comparing them with various machine learning models for predicting pan evaporation across different stations and time scales in Algeria.

The findings revealed that MLP achieved the highest R2 values for monthly predictions, particularly at the ADR and BCH stations. On a daily scale, models generally displayed lower R2 values, with DTR and NNE demonstrating superior performance at the TAM and TIZ stations, respectively. This variability is likely due to differing climatic conditions and pan evaporation patterns at each station. While R2 values provided useful insights, incorporating additional metrics such as MBE and RMSE offered a more comprehensive assessment of model performance.

Advanced models like LSTM showed improved accuracy compared to simpler models such as MLP and SVR. The BER–LSTM model, in particular, demonstrated significant enhancements in prediction accuracy across various stations and time scales. This enhanced version consistently outperformed not only the standard LSTM but also other optimization methods like PSO and GWO across multiple independent runs. The BER–LSTM model exhibited remarkable reductions in RMSE, with improvements ranging from 71.86 to 97.54% compared to the standard LSTM across different stations and time scales. These substantial reductions in RMSE underscore its effectiveness in capturing the temporal variability of pan evaporation data.

The results highlight the potential of advanced techniques and optimization strategies in improving predictive performance. The BER–LSTM model, with its superior accuracy, offers a valuable tool for enhancing the understanding and forecasting of pan evaporation in diverse environments. Future research could explore the applicability of these models to other geographical regions and assess their performance in long-term climate projections.

Princess Nourah-bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R120), Princess Nourah-bint Abdulrahman University, Riyadh, Saudi Arabia.

B.Z., N.B., and E.-S.M.E.-k. conceptualized the study; B.Z., N.B., K.B., D.S.K., A.H.A., and E.-S.M.E.-k. performed the methodology and did software validation; B.Z., K.B., G.M., A.H.A., and E.-S.M.E.-k. did formal analysis and wrote the original draft; N.B., G.M., A.K., D.S.K. wrote, reviewed and edited the article. All authors have read and agreed to the published version of the manuscript.

This research was funded by Princess Nourah-bint Abdulrahman University Researchers Supporting – Project number – (PNURSP2024R120), Princess Nourah-bint Abdulrahman University, Riyadh, Saudi Arabia.

We confirm that this manuscript is original, has not been published before, and is not currently being considered for publication elsewhere.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Abed
M.
,
Imteaz
M. A.
,
Ahmed
A. N.
&
Huang
Y. F.
(
2022
)
Modelling monthly pan evaporation utilising random forest and deep learning algorithms
,
Sci. Rep.
,
12
,
13132
.
https://doi.org/10.1038/s41598-022-17263-3
.
Adnan Ikram
R. M.
,
Jaafari
A.
,
Milan
S. G.
,
Kisi
O.
,
Heddam
S.
&
Zounemat-Kermani
M.
(
2022
)
Hybridized adaptive neuro-Fuzzy inference system with metaheuristic algorithms for modeling monthly pan evaporation
,
Water
,
14
,
3549
.
https://doi.org/10.3390/w14213549
.
Ahmed
H. K.
,
Tuncay
R. N.
&
Alkhafaji
M. H.
(
2023
)
Improving the performance of a realistic distribution network in Kirkuk by integrating a distributed hybrid system
,
Int. J. Renew. Energy Res.
,
13
,
998
1014
.
Alam
M. M.
,
Akter
M. Y.
,
Islam
A. R. M. T.
,
Mallick
J.
,
Kabir
Z.
,
Chu
R.
,
Arabameri
A.
,
Pal
S. C.
,
Al Masud
M. A.
&
Costache
R.
(
2024
)
A review of recent advances and future prospects in calculation of reference evapotranspiration in Bangladesh using soft computing models
,
J. Environ. Manage.
,
351
,
119714
.
Alazard
M.
,
Leduc
C.
,
Travi
Y.
,
Boulet
G.
&
Ben Salem
A.
(
2015
)
Estimating evaporation in semi-arid areas facing data scarcity: Example of the El Haouareb dam (Merguellil catchment, Central Tunisia)
,
J. Hydrol. Reg. Stud.
,
3
,
265
284
.
https://doi.org/10.1016/j.ejrh.2014.11.007
.
Alghamdi
A. A.
,
Ibrahim
A.
,
El-Kenawy
E.-S. M.
&
Abdelhamid
A. A.
(
2023
)
Renewable energy forecasting based on stacking ensemble model and Al-Biruni earth radius optimization algorithm
,
Energies
,
16
,
1370
.
https://doi.org/10.3390/en16031370
.
Ali Ghorbani
M.
,
Kazempour
R.
,
Chau
K.-W.
,
Shamshirband
S.
&
Taherei Ghazvinei
P.
(
2018
)
Forecasting pan evaporation with an integrated artificial neural network quantum-behaved particle swarm optimization model: A case study in Talesh, Northern Iran
,
Eng. Appl. Comput. Fluid Mech.
,
12
,
724
737
.
https://doi.org/10.1080/19942060.2018.1517052
.
Arunkumar
R.
,
Jothiprakash
V.
&
Sharma
K.
(
2017
)
Artificial intelligence techniques for predicting and mapping daily pan evaporation
,
J. Inst. Eng. Ser. A
,
98
,
219
231
.
https://doi.org/10.1007/s40030-017-0215-1
.
Bailek
N.
,
Bouchouicha
K.
,
Hassan
M. A.
,
Slimani
A.
&
Jamil
B.
(
2020
)
Implicit regression-based correlations to predict the back temperature of PV modules in the arid region of south Algeria
,
Renew. Energy
,
156
,
57
67
.
https://doi.org/10.1016/j.renene.2020.04.073
.
Benchaiba
L.
,
Houichi
L.
&
Amarchi
H.
(
2022
)
Stochastic alternatives to models based on the Dalton's approach for estimating monthly pan evaporation from reservoir dams in Algeria
,
LARHYSS J.
,
19 (4), 7–17. P-ISSN 1112-3680/E-ISSN 2521-9782
,
49
59
.
Benzaghta
M. A.
,
Mohammed
T. A.
,
Ghazali
A. H.
&
Soom
M. A. M.
(
2012
)
Validation of selected models for evaporation estimation from reservoirs located in arid and semi-Arid regions
,
Arab. J. Sci. Eng.
,
37
,
521
534
.
https://doi.org/10.1007/s13369-012-0194-5
.
Bouznit
M.
,
Elaguab
M.
,
Selt
M. M.
,
Himrane
M.
&
Aïssaoui
R.
(
2022
)
Climate change and agricultural production in Algeria
, In: Filho, W. L. & Manolas, E. (eds.):
Climate Change in the Mediterranean and Middle Eastern Region
.
Cham: Springer. https://doi.org/10.1007/978-3-030-78566-6_12
.
Ehteram
M.
,
Panahi
F.
,
Ahmed
A. N.
,
Mosavi
A. H.
&
El-Shafie
A.
(
2022
)
Inclusive multiple model using hybrid artificial neural networks for predicting evaporation
,
Front. Environ. Sci.
,
9
,
1
21
.
https://doi.org/10.3389/fenvs.2021.789995
.
Elbeltagi
A.
,
Zerouali
B.
,
Bailek
N.
,
Bouchouicha
K.
,
Pande
C.
,
Santos
C. A. G.
,
Towfiqul Islam
A. R. M.
,
Al-Ansari
N.
&
El-kenawy
E.-S. M.
(
2022
)
Optimizing hyperparameters of deep hybrid learning for rainfall prediction: A case study of a Mediterranean basin
,
Arab. J. Geosci.
,
15
,
933
.
https://doi.org/10.1007/s12517-022-10098-2
.
Elbeltagi
A.
,
Al-Mukhtar
M.
,
Kushwaha
N. L.
,
Al-Ansari
N.
&
Vishwakarma
D. K.
(
2023
)
Forecasting monthly pan evaporation using hybrid additive regression and data-driven models in a semi-arid environment
,
Appl. Water Sci.
,
13
,
42
.
El-kenawy
E.-S. M.
,
Zerouali
B.
,
Bailek
N.
,
Bouchouich
K.
,
Hassan
M. A.
,
Almorox
J.
,
Kuriqi
A.
,
Eid
M.
&
Ibrahim
A.
(
2022
)
Improved weighted ensemble learning for predicting the daily reference evapotranspiration under the semi-arid climate conditions
,
Environ. Sci. Pollut. Res.
,
29
,
81279
81299
.
https://doi.org/10.1007/s11356-022-21410-8
.
El-kenawy
E.-S. M.
,
Abdelhamid
A. A.
,
Ibrahim
A.
,
Mirjalili
S.
,
Khodadad
N.
,
Al duailij
M. A.
,
Ali Alhussan
A.
&
Sami Khafaga
D.
(
2023
)
Al-Biruni Earth Radius (BER) metaheuristic search optimization algorithm
,
Comput. Syst. Sci. Eng.
,
45
,
1917
1934
.
https://doi.org/10.32604/csse.2023.032497
.
El-kenawy
E.-S. M.
,
Bailek
N.
,
Bouchouicha
K.
,
Zerouali
B.
,
Hassan
M. A.
,
Kuriqi
A.
,
Jamil
B.
,
Colak
I.
,
Khalil
A.
&
Ibrahim
A.
(
2024
)
Global scale solar energy harnessing: An advanced intra-hourly diffuse solar irradiance predicting framework for solar energy projects
,
Neural Comput. Appl.
,
36
,
10585
10598
.
https://doi.org/10.1007/s00521-024-09608-y
.
El-Mahdy
M. E.-S.
,
El-Abd
W. A.
&
Morsi
F. I.
(
2021
)
Forecasting lake evaporation under a changing climate with an integrated artificial neural network model: A case study Lake Nasser, Egypt
,
J. African Earth Sci.
,
179
,
104191
.
https://doi.org/10.1016/j.jafrearsci.2021.104191
.
Ersöz
B.
,
Sağıroğlu
Ş.
&
Bülbül
H. İ
. (
2022
)
Methods of explainable artificial intelligence, trustworthy artificial intelligence and interpretable machine learning in renewable energy
,
Int. J. Smart Grid-IjSmartGrid
,
12
.
Frih
B.
,
Oulmi
A.
&
Guendouz
A.
(
2021
)
Study of drought tolerance of some durum wheat (Triticum durum Desf.) genotypes growing under semi-arid conditions in Algeria
,
Int. J. Bio-Resource Stress Manag.
,
12
,
137
141
.
https://doi.org/10.23910/1.2021.2171a
.
Ghorbani
M. A.
,
Deo
R. C.
,
Yaseen
Z. M.
,
Kashani
M. H.
&
Mohammadi
B.
(
2017
)
Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: Case study in North Iran
,
Theor. Appl. Climatol.
,
133
,
1119
1131
.
https://doi.org/10.1007/s00704-017-2244-0
.
Guan
Y.
,
Mohammadi
B.
,
Pham
Q. B.
,
Adarsh
S.
,
Balkhair
K. S.
,
Rahman
K. U.
,
Linh
N. T. T.
&
Tri
D. Q.
(
2020
)
A novel approach for predicting daily pan evaporation in the coastal regions of Iran using support vector regression coupled with krill herd algorithm model
,
Theor. Appl. Climatol.
,
142
,
349
367
.
https://doi.org/10.1007/s00704-020-03283-4
.
Habibi
B.
,
Meddi
M.
,
Emre
T.
,
Boucefiane
A.
&
Rahmouni
A.
(
2024
)
Drought assessment and characterization using SPI, EDI and DEPI indices in northern Algeria
,
Nat. Hazards
,
120
,
5201
5231
.
Hochreiter
S.
&
Schmidhuber
J.
(
1997
)
Long Short-Term Memory
,
Neural Comput.
,
9
,
1735
1780
.
https://doi.org/10.1162/neco.1997.9.8.1735
.
Hussein
E. E.
,
Derdour
A.
,
Zerouali
B.
,
Almaliki
A.
,
Wong
Y. J.
,
Ballesta-de los Santos
M.
,
Minh Ngoc
P.
,
Hashim
M. A.
&
Elbeltagi
A.
(
2024
)
Groundwater quality assessment and irrigation water quality index prediction using machine learning algorithms
,
Water
,
16
,
264
.
Ibrahim
A.
,
El-kenawy
E.-S. M.
,
Kabeel
A. E.
,
Karim
F. K.
,
Eid
M. M.
,
Abdelhamid
A. A.
,
Ward
S. A.
,
El-Said
E. M. S.
,
El-Said
M.
&
Khafaga
D. S.
(
2023
)
Al-Biruni earth radius optimization based algorithm for improving prediction of hybrid solar desalination system
,
Energies
,
16
,
1185
.
https://doi.org/10.3390/en16031185
.
Jean
R.
,
Molu
J.
,
Raoul
S.
,
Dzonde
N.
,
Patrice
W.
&
Tsobze
S. K.
(
2023
)
Solar irradiance forecasting based on deep learning for sustainable electrical energy in Cameroon
,
Int. J. Smart Grid
,
7
,
1
8
.
https://doi.org/10.20508/ijsmartgrid.v7i2.279.g261
.
Keshtegar
B.
,
Piri
J.
&
Kisi
O.
(
2016
)
A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method
,
Comput. Electron. Agric.
,
127
,
120
130
.
https://doi.org/10.1016/j.compag.2016.05.018
.
Keshtegar
B.
,
Heddam
S.
,
Sebbar
A.
,
Zhu
S.-P.
&
Trung
N.-T.
(
2019
)
SVR-RSM: A hybrid heuristic method for modeling monthly pan evaporation
,
Environ. Sci. Pollut. Res.
,
26
,
35807
35826
.
https://doi.org/10.1007/s11356-019-06596-8
.
Keshtegar
B.
,
Bouchouicha
K.
,
Bailek
N.
,
Hassan
M. A.
,
Kolahchi
R.
&
Despotovic
M.
(
2022
)
Solar irradiance short-term prediction under meteorological uncertainties: Survey hybrid artificial intelligent basis music-inspired optimization models
,
Eur. Phys. J. Plus
,
137
,
362
.
https://doi.org/10.1140/epjp/s13360-022-02371-w
.
Khafaga
D. S.
,
Ibrahim
A.
,
El-Kenawy
E.-S. M.
,
Abdelhamid
A. A.
,
Karim
F. K.
,
Mirjalili
S.
,
Khodadadi
N.
,
Lim
W. H.
,
Eid
M. M.
&
Ghoneim
M. E.
(
2022
)
An Al-Biruni earth radius optimization-based deep convolutional neural network for classifying monkeypox disease
,
Diagnostics (Basel, Switzerland)
,
12
,
2892
.
https://doi.org/10.3390/diagnostics12112892
.
Khandan
K. L.
,
Kumar
N. M. G.
,
Krishnasami
U.
,
Kirubakaran
K.
,
Vinoth
K.
,
Kaliappan
S.
,
Maguluri
L. P.
&
Rajaram
A.
(
2024
)
Enhanced predictive model for grid stability using hybrid GBM-LSTM approach
,
Int. J. Renew. Energy Res.
,
14
,
111
126
.
https://doi.org/10.20508/ijrer.v14i1.14591.g8880
.
Kherbache
N.
(
2020
)
Water policy in Algeria: Limits of supply model and perspectives of water demand management (WDM)
,
Desalin. WATER Treat.
,
180
,
141
155
.
https://doi.org/10.5004/dwt.2020.25009
.
Khezazna
A.
,
Amarchi
H.
,
Derdous
O.
&
Bousakhria
F.
(
2017
)
Drought monitoring in the seybouse basin (Algeria) over the last decades
,
J. Water L. Dev.
,
33
,
79
.
Lakmini Prarthana Jayasinghe
W. J. M.
,
Deo
R. C.
,
Ghahramani
A.
,
Ghimire
S.
&
Raj
N.
(
2022
)
Development and evaluation of hybrid deep learning long short-term memory network model for pan evaporation estimation trained with satellite and ground-based data
,
J. Hydrol.
,
607
,
127534
.
https://doi.org/10.1016/j.jhydrol.2022.127534
.
Majhi
B.
,
Naidu
D.
,
Mishra
A. P.
&
Satapathy
S. C.
(
2019
)
Improved prediction of daily pan evaporation using Deep-LSTM model
,
Neural Comput. Appl.
,
32
,
7823
7838
.
https://doi.org/10.1007/s00521-019-04127-7
.
Malik
A.
,
Saggi
M. K.
,
Rehman
S.
,
Sajjad
H.
,
Inyurt
S.
,
Bhatia
A. S.
,
Farooque
A. A.
,
Oudah
A. Y.
&
Yaseen
Z. M.
(
2022
)
Deep learning versus gradient boosting machine for pan evaporation prediction
,
Eng. Appl. Comput. Fluid Mech.
,
16
,
570
587
.
https://doi.org/10.1080/19942060.2022.2027273
.
Mampitiya
L.
,
Rathnayake
N.
,
Leon
L. P.
,
Mandala
V.
,
Azamathulla
H. M.
,
Shelton
S.
,
Hoshino
Y.
&
Rathnayake
U.
(
2023
)
Machine learning techniques to predict the air quality using meteorological data in two urban areas in Sri Lanka
,
Environments
,
10
,
141
.
Maraj
A.
,
Firat
C.
&
Gebremedhin
A.
(
2022
)
Modelling and the experimental validation of the long-term averaged solar radiation on a tilted solar collector facing the south-east direction
,
Int. J. Renew. Energy Res.
,
12
,
1404
1413
.
Mihi
A.
,
Zerroug
K.
,
Kouachi
M. E.
&
Benarfa
N.
(
2024
)
Spatiotemporal changes of desertification degree in the Algerian green barrier over the last four decades (1984–2020)
,
Arid L. Res. Manag
,
38
(
3
),
362
381
.
Mohamadi
S.
,
Ehteram
M.
&
El-Shafie
A.
(
2020
)
Accuracy enhancement for monthly evaporation predicting model utilizing evolutionary machine learning methods
,
Int. J. Environ. Sci. Technol.
,
17
,
3373
3396
.
https://doi.org/10.1007/s13762-019-02619-6
.
Moran
M. S.
,
Rahman
A. F.
,
Washburne
J. C.
,
Goodrich
D. C.
,
Weltz
M. A.
&
Kustas
W. P.
(
1996
)
Combining the Penman-Monteith equation with measurements of surface temperature and reflectance to estimate evaporation rates of semiarid grassland
,
Agric. For. Meteorol.
,
80
,
87
109
.
https://doi.org/10.1016/0168-1923(95)02292-9
.
Mostafa
E.
&
Bahgaat
N. K.
(
2017
)
A comparison between using A firefly algorithm and A modified PSO technique for stability analysis of a PV system connected to grid
,
Int. J. Smart Grid-IjSmartGrid
,
1
,
1
8
.
Mouatadid
S.
,
Raj
N.
,
Deo
R. C.
&
Adamowski
J. F.
(
2018
)
Input selection and data-driven model performance optimization to predict the standardized precipitation and evaporation index in a drought-prone region
,
Atmos. Res.
,
212
,
130
149
.
https://doi.org/10.1016/j.atmosres.2018.05.012
.
Ogunbode
T. O.
,
Esan
V. I.
&
Akande
J. A.
(
2024
)
Water resources endowment and the challenge of underutilization in a tropical community in Nigeria
,
Sustain. Water Resour. Manag.
,
10
,
72
.
Oubadi
M.
,
Faci
M.
&
Pham
Q. B.
(
2024
)
Drought and aridity trends on the Algerian steppe
,
Theor. Appl. Climatol.
,
155
,
1541
1551
.
Patel
J. N.
&
Majmundar
B. P.
(
2016
)
Development of evaporation estimation methods for a reservoir in Gujarat, India
,
J. AWWA
,
108
(
9
),
E489
E500
.
https://doi.org/10.5942/jawwa.2016.108.0113
.
Peel
M. C.
,
Finlayson
B. L.
&
McMahon
T. A.
(
2007
)
Updated world map of the Köppen-Geiger climate classification
,
Hydrol. Earth Syst. Sci.
,
11
,
1633
1644
.
Rajput
J.
,
Kushwaha
N. L.
,
Srivastava
A.
,
Pande
C. B.
,
Suna
T.
,
Sena
D. R.
,
Singh
D. K.
,
Mishra
A. K.
,
Sahoo
P. K.
&
Elbeltagi
A.
(
2024
)
Development of machine learning models for estimation of daily evaporation and mean temperature: A case study in New Delhi, India
,
Water Pract. Technol.
,
19
(
7
),
1
18
.
Remini
B.
&
Hallouche
O.
(
2005
)
L'alluvionnement des retenues. Blida, Algérie Ed. Impr. Madani
.
Ruiz
S.
,
Hernandez
J.
&
Borge-Diez
D.
(
2023
)
V2g ancillary services management strategy for EVs with solar powered charging stations based on artificial intelligence algorithms
,
Int. J. Smart Grid-IjSmartGrid
,
7
,
218
226
.
Saab
K.
,
Tarnini
M.
&
El Ghaly
A.
(
2023
)
A power electronic controller based algorithm for output power prediction of a PV panel
,
Int. J. Renew. Energy Res.
,
13
,
1194
1199
.
Sebbar
A.
,
Heddam
S.
&
Djemili
L.
(
2019
)
Predicting daily pan evaporation (Epan) from Dam reservoirs in the Mediterranean regions of Algeria: OPELM vs OSELM
,
Environ. Process.
,
6
,
309
319
.
https://doi.org/10.1007/s40710-019-00353-2
.
Song
Z.
,
Tiraferri
A.
,
Yuan
R.
,
Cao
J.
,
Tang
P.
,
Xie
W.
,
Crittenden
J. C.
&
Liu
B.
(
2022
)
Theoretical evaluation of the evaporation rate of 2D solar-driven interfacial evaporation and of its large-scale application potential
,
Desalination
,
537
,
115891
.
https://doi.org/10.1016/j.desal.2022.115891
.
Thakur
R.
&
Manekar
V. L.
(
2022
)
Artificial intelligence-Based image classification techniques for hydrologic applications
,
Appl. Artif. Intell.
,
36
,
2014185
.
https://doi.org/10.1080/08839514.2021.2014185
.
Wang
L.
,
Kisi
O.
,
Hu
B.
,
Bilal
M.
,
Zounemat-Kermani
M.
&
Li
H.
(
2017
)
Evaporation modelling using different machine learning techniques
,
Int. J. Climatol.
,
37
,
1076
1092
.
https://doi.org/10.1002/joc.5064
.
Yadav
S.
,
Bailek
N.
,
Kumari
P.
,
Nuţă
A. C.
,
Yonar
A.
,
Plocoste
T.
,
Ray
S.
,
Kumari
B.
,
Abotaleb
M.
,
Alharbi
A. H.
,
Khafaga
D. S.
&
El-Kenawy
E.-S. M.
(
2024
)
State of the art in energy consumption using deep learning models
,
AIP Adv.
,
14
,
65306
.
https://doi.org/10.1063/5.0213366
.
Yahya
K.
,
Al-mayyahi
S.
,
Aldababsa
M.
,
Yahya
A. E. M.
&
Sarreb
R. D.
(
2024
)
Machine learning techniques for solar power output predicting
,
Int. J. Smart Grid-IjSmartGrid
,
8
,
98
107
.
Yuzer
E. O.
&
Bozkurt
A.
(
2024
)
Comparison of regional empirical models based on sunshine duration for determining solar irradiance
,
Int. J. Renew. Energy Res.
,
14
,
82
92
.
https://doi.org/10.20508/ijrer.v14i1.14295.g8860
.
Zerouali
B.
,
Bailek
N.
,
Islam
A. R. M. T.
,
Katipoğlu
O. M.
,
Ayek
A. A. E.
,
Santos
C. A. G.
,
Rajput
J.
,
Wong
Y. J.
,
Abda
Z.
,
Chettih
M.
&
Elbeltagi
A.
(
2024
)
Enhancing groundwater potential zone mapping with a hybrid analytical method: The case of semiarid basin
,
Groundw. Sustain. Dev.
,
26
,
101261
.
https://doi.org/10.1016/j.gsd.2024.101261
.
Zhao
W.
,
Zhao
J.
,
Li
J.
,
Zhao
D.
,
Huang
L.
,
Zhu
J.
,
Lu
J.
&
Wang
X.
(
2021
)
An evaporation duct height prediction model based on a long short-term memory neural network
,
IEEE Trans. Antennas Propag.
,
69
,
7795
7804
.
Zheng
D.-J.
,
Cheng
Y.-Q.
,
Liu
H.-J.
&
Li
L.-T.
(
2011
)
Investigation of EHD-Enhanced water evaporation and a novel empirical model
,
Int. J. Food Eng.
,
7
.
https://doi.org/10.2202/1556-3758.1796
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data