In study, meteorological drought ​​was used to estimate the potential hydrological drought that may occur in the Gediz Basin of Turkey. For this purpose, the most effective stream flow gauging station was determined by looking at the correlation values ​​between the meteorological data obtained from the Uşak meteorological station. SPEI values ​​for meteorological drought and SRI values ​​for hydrological drought are calculated for 3-, 6-, 9-, and 12-month periods. Correlation matrices were created between meteorological drought inputs from SPEI(t) to SPEI(t-12) and SRI(t) for use in hydrological drought models for 3-, 6-, 9- and 12-month periods. ML models were developed considering correlation matrices and it was seen that ML model results were not sufficient. For this reason, W-ML models were developed by applying DWT and Optuna hyperparameter analysis. It has been observed that the performance of W-ML models increases. Random oversampling (ROS), which has never been used in drought modeling, was then applied to W-ML models. W-ML-ROS model obtained an R2 value of 0.999 for testing set in 12-month period. Similarly, R2 values ​​for SRI3, SRI6 and SRI9 were obtained as 0.893, 0.851, and 0.940, respectively. Results showed that W-ML-ROS hybrid models can be used to predict hydrological drought from meteorological drought.

  • The aim is to predict hydrological drought from meteorological drought.

  • Machine learning (ML) models were developed.

  • W-ML models were developed with discrete wavelet transform (DWT).

  • Random oversampling (ROS) was applied to improve the performances of models.

The hydrological cycle of water, which is vital for living things on Earth, is directly related to meteorological, hydrological variables, and anthropogenic effects. Unfortunately, due to human interventions such as excessive carbon dioxide emissions, deforestation, and industrialization, the quantity and quality of water is damaged and climate changes due to global warming occur. This situation leads to global disasters such as increasing storms, increasing floods and droughts, melting glaciers, rising sea levels, desertification, decrease in biodiversity, soil, water, and air pollution (Özcan 2017; Büyükşahin 2018; Kale 2021). Among these disasters, drought is frequently encountered in Turkey (Çoban et al. 2023; Gul et al. 2023; Kara & Orhan 2023; Khorrami et al. 2023). According to the 2023 Climate Assessment Report prepared by the General Directorate of Meteorology in Turkey, the average temperature between 1991 and 2020 increased from 13.9 to 15.1 °C, an increase of 1.2 °C. Between 1991 and 2020, the areal annual mean precipitation across Turkey was 573.4 mm, and in 2023, it was 641.5 mm, an increase of 11.9% compared with mean (CA 2023). Drought, which can be defined as a decrease in precipitation over an extended period, affects surface and groundwater resources, leading to a reduction in water quality, quantity, and agricultural productivity, resulting in economic losses (Riebsame et al. 1991; Mishra & Singh 2010).

Following meteorological drought, which is characterized by a drop in precipitation and an increase in evapotranspiration, hydrological drought happens as surface and groundwater resources are decreased. Subsequently, agricultural drought arises with reduced soil moisture (Choi et al. 2013; Dabanlı et al. 2017). Finally, socio-economic drought in which water supply cannot meet water demand occurs (Rajsekhar et al. 2013; Şen & Almazroui 2021).

The standardized precipitation evapotranspiration index (SPEI), one of the monitoring methods for meteorological drought, was proposed by Vicente-Serrano et al. (2010,). Unlike the standardized precipitation index (SPI), SPEI uses precipitation and air temperature data to examine the impact of air temperature on drought (McKee et al. 1993). SPEI calculates the difference between precipitation and potential evapotranspiration (PET) at different time scales using air temperature data, allowing for a more sensitive assessment of climate change (Beguería et al. 2014). Many researchers have used SPEI in drought studies (Stagge et al. 2015; Um et al. 2017; Tirivarombo et al. 2018; Manzano et al. 2019; Li et al. 2020; Liu et al. 2021; Ojha et al. 2021). Stagge et al. (2014) compared SPI and SPEI using five different PET formulas in their European drought analysis. Ogunrinde et al. (2020) conducted a hydrological drought analysis in Nigeria for other agricultural areas using PET values calculated with Hargreaves, Penman–Monteith, and Thornthwaite methods, stating that Hargreaves and Penman–Monteith outperformed Thornthwaite. Öney & Anlı (2023) used the SPEI in their meteorological drought analysis in the Gediz Basin. Kumanlioglu (2020) preferred SPI and SPEI for meteorological drought and standardized runoff index (SRI) for hydrological drought.

The SRI is one of the preferred methods for assessing hydrological drought (Shukla & Wood 2008; Salimian et al. 2021). Like the SPI proposed by McKee et al. (1993), SRI expresses the standard deviation of accumulated surface runoff for a given period. Mercado et al. (2016) preferred SRI to determine the temporal and spatial distribution of hydrological drought in southeastern Mexico. Wu et al. (2018) used non-parametric SRI to examine the temporal changes in hydrological drought in 17 basins on the Loess Plateau. Gan et al. (2023) used non-parametric SRI to calculate hydrological drought in the Shaying River Basin from 1956 to 2023, stating an increasing trend in drought.

For the purpose of predicting droughts, researchers have recently begun to use machine learning (ML) systems. The abundance of measured data for hydrological parameters has made ML an inevitable tool in this field (Brunton et al. 2020; Shen 2018). ML algorithms, such as support vector machine (SVM), artificial neural network (ANN), and k-nearest neighbor (kNN), have been used to develop drought prediction models for different regions (Mohamadi et al. 2020; Mokhtar et al. 2021). Khan et al. (2020) attempted to develop drought prediction models for Pakistan using SVM, ANN, and kNN. Mohamadi et al. (2020) used adaptive neural-based fuzzy inference system (ANFIS), multi-layer perceptron (MLP), radial basis function neural network (RBFNN), and SVM models to predict meteorological drought in Iran from 1980 to 2014. Mokhtar et al. (2021) used random forest (RF), extreme gradient boost (XGB), convolutional neural network (CNN), and long-term short memory (LSTM) algorithms for drought prediction in Tibet, stating that XGB and RF models outperformed others in different scenarios. However, researchers have used deep learning methods such as LSTM as ensemble and hybrid models in hydrology to improve the prediction accuracy of ML algorithms. Widiasari et al. (2018) used an LSTM-based deep learning model for the streamflow prediction of the East Canal River in Semarang Province, Indonesia, and achieved good success with a mean absolute percentage error of 3.6%. Yuan et al. (2018) used LSTM parameters to model the monthly streamflow of the Astor River by calibrating them with ant lion optimization (ALO). Fu et al. (2020) used an LSTM-based deep learning model to predict the streamflow of the Kelantan River in northeastern Malaysia. They compared the performance of the model with classical backpropagation (BP). They stated that LSTM had better success than BP in both dry and wet periods. Adnan et al. (2021) used a novel hybrid adaptive neuro-fuzzy inference system (ANFIS-WCAMFO) calibrated with moth-flame optimization algorithm (MFO) and water cycle optimization algorithm (WCA) to predict reference evapotranspiration for Dhaka and Mymensing stations in Bangladesh. They stated that this algorithm, a new hybrid neuro-fuzzy method, was very successful. Adnan et al. (2022) used the SVM method improved by a hybrid firefly algorithm–particle swarm optimization (FFAPSO) method to model dissolved oxygen concentration values for station number 07106000 selected for the Fountain Creek basin in Colorado. They stated that the SVM-FFAPSO method achieved good success with an R2 value of 0.8234. Adnan et al. (2023a) similarly proposed extreme learning machine (ELM) methods tuned with jellyfish search optimizer (JFO) to model groundwater level fluctuations. Ikram et al. (2023) used convolutional neural networks (CNNs) and LSTM deep learning methods calibrated with reptile search algorithm (RSA) and vector weighted average optimization (INFO) to predict water temperatures for Bailong River. Adnan et al. (2023b) evaluated the performance of relevance vector machine tuned (RVM) with gray wolf optimizer (GWO), whale optimization algorithm (WOA), manta ray foraging optimization (MRFO), and improved manta ray foraging optimization (IMRFO) methods to predict monthly pan evaporation values using monthly minimum, maximum temperature, Ta, and periodicity component values for Jingzhou station in Hubei province and Yueyang station in Hunan province between 1962 and 2000 in China. They stated that the RVM-IMRFO method has higher prediction accuracy. Mostafa et al. (2023) proposed the RVM model with quantum-based avian navigation (QANA) optimization to predict PET values for Faisalabad and Peshawar stations with two semi-arid climatic conditions in Pakistan.

Wavelet transform (WT), one of the data decomposition techniques, is used to ensure that ML models provide more sensitive and reliable results in the prediction of hydrological variables that have unstable and non-linear relationships (Yaseen et al. 2020; Baker et al. 2022). It is a method that allows the data to be divided into different frequency bands and their frequency content analyzed (Nooruldeen et al. 2022). In the first known study on water resources, precipitation analysis was performed with W used by Kumar & Foufoula-Georgiou (1993). Lafreniere & Sharp (2003) used W in the analysis of river flows to Bowl Lake in Alberta. For drought prediction, Belayneh & Adamowski (2013) used ANN and support vector regression (SVR) models together with W. It has been stated that W-ANN models used in 3-month and 6-month SPI predictions are more successful. Achite et al. (2023) modeled meteorological, agricultural, and hydrological droughts for the Wadi Ouahrane Basin in Algeria using various ML models and W at different time scales and stated that WT hybrid models were successful in drought prediction.

ML techniques are combined with data processing techniques such as random oversampling (ROS) to address the imbalanced distribution in datasets, often supporting diversity during the learning phase (Snieder et al. 2021). Hybrid modeling studies that use ROS together with ML techniques are limited in the literature (Galar et al. 2013).

According to the literature research mentioned above, the use of ML algorithms in predicting hydrological drought from meteorological drought is quite limited. Thus, an attempt was made to study. In this context, meteorological drought values occurring in Uşak were used to predict the hydrological drought in the Gediz Basin of Turkey. The Gediz Basin is one of the biggest basins in Turkey, discharging its waters into the Aegean Sea and covering a vast area with varying climatic conditions. This geographical diversity allows for a comprehensive analysis of drought patterns and their impacts across different regions within the basin. While the study is focused on the Gediz Basin, the methodologies and findings can be adapted to other similar regions, making the results more widely applicable. In order to decide which stream gauging station should be selected in the basin, correlation analysis was created using each streamflow gauging station and Uşak precipitation (P) and air temperature data (Ta). SRI values are calculated for 3-, 6-, 9-, and 12-month periods using the flow data of the streamflow gauging station selected to define hydrological drought. SPEI values of Uşak meteorology station are also calculated for 3-, 6-, 9-, and 12-month periods. Later, correlation matrices were created between meteorological drought inputs from SPEI(t) to SPEI(t − 12) and SRI(t) for use in hydrological drought models for 3-, 6-, 9-, and 12-month periods. ML models were developed for SRI predictions. Then, discrete wavelet transform (DWT) was applied to increase the performance of the models. It was observed that the W-ML models developed with the obtained subseries and applied hyperparameter analysis have higher performance. By applying ROS to the most successful W-ML models, the predictive power was further improved. With this study, W-ML-ROS was used for the first time in hydrological studies to determine hydrological drought from meteorological drought for long-term effective water resources planning in the region.

Data decomposition by DWT has been used in many studies to increase accuracy in modeling studies in hydrology and other fields. However, the effect of the DWT method on the performance of different ML algorithms has not been compared in studies. At the same time, improving the performance in drought studies by eliminating irregularities in the input datasets with ROS is another innovation offered by the study. The main aim of this study is to develop a completely new hybrid model processed with ROS by determining the right combination of ML models and DWT, thus predicting hydrological drought from meteorological drought. W-ML-ROS estimates were evaluated using different statistical measures and Taylor plots. The results obtained from the study are directly relevant to stakeholders and decision makers involved in combating drought. By using DWT and ROS together, the study provides information that can be used by stakeholders through current models in real hydrological applications. The study is also pioneering in its capacity to influence future research studies. The study has the potential to combine theoretical innovations with practical applications with its comprehensive methodology. In brief, the study's novelty lies in its integration of advanced data decomposition, ML, and data balancing techniques to develop highly accurate and robust models for predicting hydrological drought from meteorological data. The application of these models to the Gediz Basin and the comprehensive methodological approach further enhance the study's originality and impact.

Study area

The study area is located within the borders of the Gediz Basin. Uşak meteorological observation station and three streamflow gauging stations are located close to each other in the eastern part of the basin. The Gediz Basin is in the western part of the Aegean Region in Turkey, discharging its waters into the Aegean Sea through the Gediz River and its tributaries, covering the area between the Northern Aegean, Susurluk, and Küçük Menderes Basins. Geographically, the basin is situated between the 38°04′–39°13′ northern latitudes and the 26°42′–29°45′ eastern longitudes. The annual mean precipitation values in the basin appear to be lowest in the southern regions, with a maximum recorded mean precipitation value of 622.5 mm. Throughout the basin, the highest monthly mean precipitation is observed in December–January, while the lowest mean precipitation is recorded in July–August (GBDMP 2019). The geographical location of the study area is given in Figure 1.
Figure 1

Gediz Basin boundaries and locations of streamflow gauging stations and Uşak meteorological station.

Figure 1

Gediz Basin boundaries and locations of streamflow gauging stations and Uşak meteorological station.

Close modal

The precipitation (P) and air temperature (Ta) data for the Uşak meteorological observation station located east of the basin were obtained from the Turkish State Meteorological Service. The data for the selected streamflow gauging stations in the study were obtained from the General Directorate of State Hydraulic Works. Streamflow gauging stations (E05A014 (Dereköy), E05A022 (Borlu), and E05A024 (Sazköy)) and Uşak meteorological observation station are located close to each other in the basin and have streamflow, P and Ta data in the same time period (1985–2015). Table 1 provides descriptive statistics of streamflow, P and Ta parameters used in this study.

Table 1

The descriptive statistics for used parameters

ParametersStationMeanMaxMinTime interval
Streamflow (m3/s) E05A014 1.70 17.40 0.00 1985–2015 
E05A022 2.53 31.00 0.00 1985–2015 
E05A024 1.56 17.20 0.00 1985–2015 
Precipitation (mm) Uşak 45.71 211.70 0.00 1985–2015 
Air temperature (°C) Uşak 12.70 27.00 −2.10 1985–2015 
ParametersStationMeanMaxMinTime interval
Streamflow (m3/s) E05A014 1.70 17.40 0.00 1985–2015 
E05A022 2.53 31.00 0.00 1985–2015 
E05A024 1.56 17.20 0.00 1985–2015 
Precipitation (mm) Uşak 45.71 211.70 0.00 1985–2015 
Air temperature (°C) Uşak 12.70 27.00 −2.10 1985–2015 

Standardized precipitation evapotranspiration index (SPEI)

Vicente-Serrano et al. (2010,) proposed this method based on the calculation principle of the SPI. In contrast to SPI, which utilizes precipitation data, the SPEI uses the difference between precipitation data and PET. This difference corresponds to a simple climatic water balance (Thornthwaite 1948) calculated over different periods to obtain the SPEI. Detailed information about SPEI calculations can be found in various sources in the literature (Vicente-Serrano et al. 2011, 2012; Beguería et al. 2014). The difference between precipitation and PET is calculated by Equation (1):
(1)
where is the difference between monthly precipitation and monthly potential evapotranspiration . On different time scales, Equation (1) can be written as given in Equation (2):
(2)
where k is the time scale in months and n is the length of the time series. Vicente-Serrano et al. (2010 ) found that there is a good relationship between the calculated D series and the log-logistic distribution. According to the log-logistic distribution, which can take negative values into account, the probability distribution function of the D series is expressed in Equation (3):
(3)
where α, β, and γ are the scale, shape, and origin parameters, respectively. These parameters can be calculated according to the L-moment method with the help of Equations (4)–(6):
(4)
(5)
(6)
where is the gamma function of value and is the probability weighted moment (PWM) value of order s (s = 0, 1, 2,…) and is calculated with Equation (7):
(7)
In Equation (7), n is the number of data points and j is the observation interval in increasing order. From here, SPEI is calculated as the standardized values of given in Equation (8):
(8)
where W is defined as
(9)
where the P value indicates the probability of exceeding a certain value D, P = 1 − . If P > 0.5, 1 − P is written instead of the P value and the sign of the SPEI value is reversed. The coefficients used in the SPEI equation are taken as = 2.515517, = 0.802853, = 0.010328, = 1.432788, = 0.189269, and = 0.001308 (Vicente-Serrano et al. 2010).

Values collected at different time scales are transformed into standard deviations based on averages (Vicente-Serrano et al. 2017). SPEI values can be obtained from the empirical Hargreaves equation suggested by Hargreaves & Samani (1985), using precipitation and PET data. In this study, PET values were calculated using the monthly mean air temperature data with the equation suggested by Thornthwaite (1948).

Standardized runoff index (SRI)

For a specific timeframe, SRI denotes the standard deviation of accumulated surface runoff, resembling the SPI introduced by McKee et al. (1993). When calculating SRI values, the first step involves determining the probability distribution function of the time series for runoff data. This process allows the generation of cumulative probabilities for runoff data. Subsequently, the cumulative probability is transformed into standard normal deviation (Shukla & Wood 2008). McKee et al. (1993) noted in the SPI that precipitation data conforms to the gamma probability distribution function, and they indicated that this method can also be applied to different hydrological variables. However, it has been observed in various studies that other distributions, such as log-normal and exponential, can also be used (Madsen et al. 1998; Lloyd-Hughes & Saunders 2002). The SRI values express the deviation from the mean in terms of standard deviation units, and this calculation is performed based on Equation (10):
(10)
where is the transformed sum of streamflow data, is the mean of the normalized index X, and is the standard deviation of index X (Kubiak-Wójcicka & Juśkiewicz 2020).

The obtained SRI values, similar to those provided by SPI, are categorized into drought classifications given in Table 2. The index assumes that the values are negative during drought periods and remain less than or equal to −1.0 for at least 1 month. Drought ends when the index value rises above 0.

Table 2

Drought categories defined for values of the SRI (McKee et al. 1993)

SRI valuesDrought category
−0.99 to 0 Mild drought 
−1.49 to −1.00 Moderate drought 
−1.99 to −1.50 Severe drought 
≤−2.00 Extreme drought 
SRI valuesDrought category
−0.99 to 0 Mild drought 
−1.49 to −1.00 Moderate drought 
−1.99 to −1.50 Severe drought 
≤−2.00 Extreme drought 

ML algorithms

ML, which uses different computational techniques to obtain unknown outputs from existing data, is a popular field of artificial intelligence with various application areas such as robotics, computer games, emotion tracking, natural language processing, and modeling. It is frequently used in flow, drought, flood modeling, classification, and prediction studies in hydrology (Petty & Dhingra 2018; Kim et al. 2019; Gharib & Davies 2021). Supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, transduction, and learning to discover are the most common algorithm types (Ayodele 2010). The different ML algorithms used in this study are given in Figure 2.
Figure 2

ML structure used in the study (Ağbulut et al. 2021).

Figure 2

ML structure used in the study (Ağbulut et al. 2021).

Close modal

Multi-layer perceptron (MLP)

MLP, a class of artificial neural networks, consists of input, hidden, and output layers. Between consecutive layers, neurons are connected to each other by weighted connections. Minimizing the error between the known output and the output in the training set is essential until the optimal weight values are obtained. Another goal is to get the optimum number of neurons in the hidden layer (Haykin & Network 2004).

k-nearest neighbor (kNN)

kNN, proposed by Shakhnarovich et al. (2008), is a non-parametric ML algorithm based on data training to predict outputs. Using the training data kept in memory, the term closest to the output, i.e., the term with the most similarity, is measured according to a distance. The values of the kNN are averaged to predict the unknown output. The most used Euclidean function to find the best neighborhoods in the kNN model is defined according to the distance between two points in Euclidean space. Considering the diversity of the neighborhoods, the number of neighbors is chosen between 1 and 5 and the Euclidean distance is selected (Rajput et al. 2023). An example of the kNN structure is given in Figure 3.
Figure 3

An example of the kNN structure.

Figure 3

An example of the kNN structure.

Close modal

Support vector regression (SVR)

SVR, proposed by Cortes & Vapnik (1995), is a statistical learning algorithm that allows for reaching a universal solution with few samples. It is a powerful regression approach for estimating linear and non-linear systems using Vapnik's ε-insensitive loss function. In the training and prediction processes of the SVR method, Kernel functions are used. Gaussian kernel function, one of the widely used kernel functions, is given in Equation (11):
(11)
where is training input, σ denotes the Gaussian spread, and gamma (γ) denotes the Gaussian parameter (Tan et al. 2015).

Linear regression (LR)

Linear regression (LR) is a statistical method used to reveal the cause-and-effect relationship between a dependent variable and one or more independent variables. LR relates the dependent variable to the independent variable or variables with the help of a function. When multiple independent variables are used in the LR method, it is called multiple linear regression (MLR) (Luu et al. 2021).

AdaBoost

AdaBoost, one of the ensemble algorithms, is obtained by combining weak learning algorithms such as decision trees (DTs), to create a stronger learning algorithm. While training the model, the algorithm assigns a weight to each example, and this weight is recalculated with iterations. At each iteration, AdaBoost changes the weight to arrive at training samples with higher error in the prediction value so that each subsequent regression tree has a reweighted training dataset (Drucker 1997). To represent n-number of samples, AdaBoost initially assigns equal weights of (1/n) to all samples and AdaBoost includes a learning_rate hyperparameter defined as ‘learning_rate floating point, default = 1’. Thus, a higher learning rate increases the contribution of each classifier. There is a trade-off between the learning_rate and n_estimators parameters.

Random forest (RF)

RF regression is a tree-based technique that involves segmenting the predictor space into a series of simple regions (Wu et al. 2017). In simpler terms, it is a non-parametric ensemble learning method that aggregates results from numerous individual DTs. Each tree is grown using a bootstrap sample to prevent overfitting, and variable subsets are randomly selected at each split. Observations not included in a tree's sample due to the bootstrapping process are referred to as out-of-bag samples, comprising approximately 36% on average. These samples serve as the test set for the tree, aiding in measuring prediction error. The importance of a particular predictor is assessed through permutation, wherein the values of the predictor are randomly shuffled in the out-of-bag samples. The final prediction error is then compared with the error obtained before the shuffle. This permutation-based importance estimate, known as the variable importance measure (VIMP), considers the effects of all interactions by eliminating the predictor's effect on the choice of other variables deeper in the tree (Van der Meer et al. 2017).

Light gradient boosting machine (LGBM)

Light gradient boosting machine (LGBM) is a fast, distributed, open-source, and high-performance gradient boosting ML algorithm. It utilizes histogram-based algorithms to expedite training and reduce memory usage. LGBM operates efficiently and accurately, supporting parallel learning and compatibility with large datasets. Also, LGBM has been found to converge fastest among other gradient boosting algorithms (Sibindi et al. 2023). In the optimization process, the objective function of LGBM, which uses the leaf-wise method to find suitable leaves, is given in Equation (12):
(12)
where and denote the normal and loss functions, respectively, c is the extra parameter, and t is the sampling time. The c parameter prevents overfitting and optimizes the depth of the tree (Meng et al. 2016).

Ridge regression

In regression analysis, multi-linear connections between independent variables indicate a strong relationship between variables. In this case, although estimates made by the method of least squares are generally accurate, significant variances can be obtained. That is, the predicted values tend to deviate significantly from the actual values. The ridge regression method proposed by Hoerl & Kennard (1970) is used to obtain more stable estimates, especially when multicollinearity problems are encountered. This method focuses on reducing uncertainty between highly correlated independent variables, providing more reliable results in regression analysis.

Gradient boosting

Gradient boosting (GBoost), developed by Friedman (2001), uses the boosting algorithm by extending it to solve a proposed regression problem. It is an integrated model with higher performance and stability. It solves for the minimum value using negative gradients of the loss function such as least squares and least absolute deviation. GBoost, which has strong prediction ability for non-linear data, handles noisy and dirty data well and supports different loss functions. It stops the growth of the tree as soon as possible to solve the problem of overfitting in the DT learning process (Li et al. 2018).

Category boosting

Category boosting (CatBoost) is a gradient DT algorithm that can better handle categorical features by using the entire dataset for training. It obtains categorical features with minimum information loss by using the entire dataset and can combine all categorical features into a single new feature (Prokhorenkova et al. 2018). In addition, it provides faster results with its sequential strengthening feature and deviations in the solution are eliminated (Huang et al. 2019).

Decision tree

DT is one of the frequently used prediction models in binary classification because it simplifies the classification processes (Anyanwu & Shiva 2009). A DT consists of an input dataset, a few test rules, and root, branches, and leaf nodes. Since a node has only one master node, one-way data flow is ensured and loops are prevented (Friedl & Brodley 1997).

Wavelet transform

Wavelets (W) are mathematical functions that work on components with a resolution identical to their own scale after separating the data into different frequency components (Terzi & Barak 2015). WT, on the other hand, has a flexible window function and is a revision of the short-time Fourier transform regarding signal compactness and shape (Hernández & Weiss 1996). There are two types of WT: continuous (CWT) and discrete (DWT).

The CWT of a time series is a function of the variables and s for the wave:
(13)

Here, τ indicates the position of the wavelet in the time domain and s indicates its position in the frequency domain. The CWT provides information about time and frequency by mapping the x(t) series to the function τ and s (Aguiar-Conraria & Soares 2014).

In DWT, the scale is based on powers of two so that results can be calculated accurately in less time. DWT is written in Equation (14):
(14)
where N is the length of the series x(t). By applying DWT to a signal, the signal is decomposed into two new auxiliary signals, called approximate (A) and detail (D) components, to represent the low and high frequencies of the original signal, respectively. The sum of components Ad and Dd for decomposition level d is equal to component A for decomposition level (d − 1) (Ad−1 = Ad + Dd) (Araghi et al. 2020).

A wavelet is not just a function, but a whole family of functions that meet certain requirements. The most popular functions include Daubechies, Coiflet, Symlet, Haar, Morlet, Gaussian, Shannon, Meyer, and Mexican Hat (Oteng Mensah et al. 2024).

Random oversampling

ROS can provide high-fit models by generating new examples for classes that are few in a population (Chawla et al. 2004). Chawla et al. (2004) used LR of a minority sample with randomly selected kNN when generating new samples. Most class examples are not taken into account when developing recent examples. Therefore, overlaps between minority and majority class samples are inevitable. However, it is frequently used because it is a simple application (Moreo et al. 2016; Mohammed et al. 2020).

Hyperparameter tuning

Default hyperparameter values given in ML models do not guarantee the best performance by default (Schratz et al. 2019). Therefore, tuning hyperparameter values can significantly impact the performance of a model. The numerous hyperparameters of models make manual adjustment of these values nearly impossible. Hence, many hyperparameter tuning methods are available and used in the literature. There are different frameworks for hyperparameter optimization, such as manual search, random search, grid search, and Optuna. In this study, due to its scalability, versatile design, and usability in different areas, Optuna is used as a framework suggested by Akiba et al. (2019). Optuna, an open-source hyperparameter tuning optimization, aims to find the best values of parameters used to improve the performance of a ML model (Agrawal 2021). Optuna aims to make this process more efficient and effective by automating it. Optuna offers different options to choose the optimization algorithm that will best solve the problem. It can use multiple processes or processes on different machines to speed up the optimization process. This helps perform optimization faster and more efficiently. It also provides a user-friendly application programming interface (API), making operations such as defining the hyperparameter space, selecting the optimization algorithm, and analyzing the results simple and intuitive (Srinivas & Katarya 2022; Can 2024).

This study aims to predict the future hydrological drought events around the Uşak, located in the Gediz Basin, Turkey. It utilized precipitation (P) and air temperature (Ta) parameters obtained from Uşak station to determine meteorological drought values and streamflow parameters using data from various streamflow gauging stations to determine hydrological drought values. Then, the relationship between meteorological and hydrological drought was examined. To determine the most effective streamflow gauging station, correlation matrices (Figure 4) and monthly correlation coefficients (Table 3) were used between P and Ta data separately and streamflow data.
Table 3

Correlation coefficients between streamflow data and meteorological parameters

StationE05A024
E05A022
E05A014
PTaPTaPTa
Oct 0.213 0.070 0.162 0.016 0.126 0.039 
Nov 0.106 0.019 0.217 0.015 0.439 0.051 
Dec 0.507 0.016 0.375 0.001 0.480 0.001 
Jan 0.248 0.000 0.312 0.001 0.535 0.001 
Feb 0.044 0.110 0.789 0.005 0.668 0.005 
Mar 0.001 0.037 0.148 0.059 0.115 0.133 
Apr 0.024 0.023 0.279 0.300 0.297 0.186 
May 0.129 0.024 0.411 0.026 0.341 0.013 
Jun 0.078 0.021 0.146 0.061 0.318 0.055 
July 0.082 0.030 0.070 0.299 0.061 0.023 
Aug 0.052 0.115 0.252 0.195 0.299 0.009 
Sep 0.169 0.026 0.182 0.055 0.231 0.054 
StationE05A024
E05A022
E05A014
PTaPTaPTa
Oct 0.213 0.070 0.162 0.016 0.126 0.039 
Nov 0.106 0.019 0.217 0.015 0.439 0.051 
Dec 0.507 0.016 0.375 0.001 0.480 0.001 
Jan 0.248 0.000 0.312 0.001 0.535 0.001 
Feb 0.044 0.110 0.789 0.005 0.668 0.005 
Mar 0.001 0.037 0.148 0.059 0.115 0.133 
Apr 0.024 0.023 0.279 0.300 0.297 0.186 
May 0.129 0.024 0.411 0.026 0.341 0.013 
Jun 0.078 0.021 0.146 0.061 0.318 0.055 
July 0.082 0.030 0.070 0.299 0.061 0.023 
Aug 0.052 0.115 0.252 0.195 0.299 0.009 
Sep 0.169 0.026 0.182 0.055 0.231 0.054 
Figure 4

Correlation matrices between (a) P and streamflow and (b) Ta and streamflow.

Figure 4

Correlation matrices between (a) P and streamflow and (b) Ta and streamflow.

Close modal

As seen in Figure 4, it has been determined that the streamflow gauging station with the highest correlation to P and Ta meteorological parameters is E05A014 station. In addition to the correlation matrices given in Figure 4, the correlation coefficients between meteorological parameters and streamflow values for each month were calculated and given in Table 3, as done by Adib & Tavancheh (2019). According to Table 3, the relationship between streamflow data of E05A014 station and meteorological parameters is better than other stations. While the relationship between E05A014 station and P was found to be high throughout the year, the relationship with Ta was especially higher in autumn and winter. E05A024 station has the lowest correlation coefficient values in all months. This situation also supports the result obtained from the correlation matrices given in Figure 4.

Meteorological and hydrological drought analysis

Using monthly P and Ta data for Uşak station, meteorological droughts were calculated with SPEI, and hydrological droughts were calculated with SRI using streamflow data for E05A014 station in 3-, 6-, 9-, and 12-month periods. The time series of these drought values is given in Figure 5.
Figure 5

Time series of SPEI and SRI values.

Figure 5

Time series of SPEI and SRI values.

Close modal
It could be observed from the time series that there was a delay between the SPEI and SRI series in both rainy and dry periods in 3-, 6-, 9-, and 12-month periods. Correlation matrices for time shift ranging up to 12 months (t, t − 1, t − 2, …, t − 12) are given for 3-, 6-, 9-, and 12-month periods in Figures 69.
Figure 6

Correlation matrix between SPEI3 and SRI3 for the 3-month period.

Figure 6

Correlation matrix between SPEI3 and SRI3 for the 3-month period.

Close modal
Figure 7

Correlation matrix between SPEI6 and SRI6 for the 6-month period.

Figure 7

Correlation matrix between SPEI6 and SRI6 for the 6-month period.

Close modal
Figure 8

Correlation matrix between SPEI9 and SRI9 for the 9-month period.

Figure 8

Correlation matrix between SPEI9 and SRI9 for the 9-month period.

Close modal
Figure 9

Correlation matrix between SPEI12 and SRI12 for the 12-month period.

Figure 9

Correlation matrix between SPEI12 and SRI12 for the 12-month period.

Close modal

As seen from the correlation matrices, which visually reveal how the correlation values ​​change over time, there was a positive and decreasing relationship between the SPEI values of months t, t − 1, t − 2,…, t − 12 and the SRI values of month t. Dark colors represent higher correlations and light colors represent lower correlations. This color scale emphasizes that correlations decrease over time, but there is a relationship in every time period. Hence, in this study, SPEI values from (t) month to (t − 12) month were used as input parameters for the prediction of hydrological drought from meteorological drought. There are studies in the literature that use the TOPSIS method during the model selection phase (Farajpanah et al. 2020), but since the input parameters were selected instead of the method in this study, correlation matrices were used instead of the TOPSIS method.

Modeling using ML algorithms

ML algorithms were tested to predict hydrological drought from meteorological drought using all one month shift up to t − 12 months (Tables 47). Models were constructed employing various algorithms (MLP, kNN, SVR, LR, AdaBoost, RF, LGBM, Ridge, DT, CatBoost, and GradientBoosting) within ML algorithms. While developing the models, 20% of the monthly data between 1985 and 2015 was randomly selected as the testing set. The remaining data was used to train the models. The performance of these models was assessed using determination coefficients (R2) and root mean squared error (RMSE) as defined in Equations (15) and (16):
(15)
(16)
where N is the number of real data, and are SRI values and ML model results, respectively, and is the mean of .
Table 4

Models developed for SRI3 prediction

Input setModelTesting R2Testing RMSE
SPEI(t − 12)3 to SPEI(t)3 MLP 0.420 0.726 
SPEI(t − 11)3 to SPEI(t)3 kNN 0.410 0.729 
SPEI(t − 10)3 to SPEI(t)3 SVR 0.400 0.734 
SPEI(t − 9)3 to SPEI(t)3 SVR 0.410 0.730 
SPEI(t − 8)3 to SPEI(t)3 SVR 0.430 0.717 
SPEI(t − 7)3 to SPEI(t)3 SVR 0.450 0.705 
SPEI(t − 6)3 to SPEI(t)3 SVR 0.430 0.717 
SPEI(t − 5)3 to SPEI(t)3 SVR 0.420 0.724 
SPEI(t − 4)3 to SPEI(t)3 SVR 0.380 0.749 
SPEI(t − 3)3 to SPEI(t)3 MLP 0.340 0.768 
SPEI(t − 2)3 to SPEI(t)3 MLP 0.270 0.812 
SPEI(t − 1)3 to SPEI(t)3 MLP 0.250 0.821 
SPEI(t)3 LR 0.240 0.827 
Input setModelTesting R2Testing RMSE
SPEI(t − 12)3 to SPEI(t)3 MLP 0.420 0.726 
SPEI(t − 11)3 to SPEI(t)3 kNN 0.410 0.729 
SPEI(t − 10)3 to SPEI(t)3 SVR 0.400 0.734 
SPEI(t − 9)3 to SPEI(t)3 SVR 0.410 0.730 
SPEI(t − 8)3 to SPEI(t)3 SVR 0.430 0.717 
SPEI(t − 7)3 to SPEI(t)3 SVR 0.450 0.705 
SPEI(t − 6)3 to SPEI(t)3 SVR 0.430 0.717 
SPEI(t − 5)3 to SPEI(t)3 SVR 0.420 0.724 
SPEI(t − 4)3 to SPEI(t)3 SVR 0.380 0.749 
SPEI(t − 3)3 to SPEI(t)3 MLP 0.340 0.768 
SPEI(t − 2)3 to SPEI(t)3 MLP 0.270 0.812 
SPEI(t − 1)3 to SPEI(t)3 MLP 0.250 0.821 
SPEI(t)3 LR 0.240 0.827 

The bold values indicate the model that gives the best result among all models.

Table 5

Models developed for SRI6 prediction

Input setModelTesting R2Testing RMSE
SPEI(t − 12)6 to SPEI(t)6 ADABOOST 0.570 0.678 
SPEI(t − 11)6 to SPEI(t)6 ADABOOST 0.550 0.694 
SPEI(t − 10)6 to SPEI(t)6 ADABOOST 0.530 0.706 
SPEI(t − 9)6 to SPEI(t)6 ADABOOST 0.530 0.710 
SPEI(t − 8)6 to SPEI(t)6 ADABOOST 0.520 0.714 
SPEI(t − 7)6 to SPEI(t)6 ADABOOST 0.560 0.685 
SPEI(t − 6)6 to SPEI(t)6 ADABOOST 0.540 0.704 
SPEI(t − 5)6 to SPEI(t)6 ADABOOST 0.550 0.693 
SPEI(t − 4)6 to SPEI(t)6 ADABOOST 0.540 0.700 
SPEI(t − 3)6 to SPEI(t)6 kNN 0.500 0.732 
SPEI(t − 2)6 to SPEI(t)6 RF 0.480 0.745 
SPEI(t − 1)6 to SPEI(t)6 MLP 0.460 0.758 
SPEI(t)6 MLP 0.370 0.822 
Input setModelTesting R2Testing RMSE
SPEI(t − 12)6 to SPEI(t)6 ADABOOST 0.570 0.678 
SPEI(t − 11)6 to SPEI(t)6 ADABOOST 0.550 0.694 
SPEI(t − 10)6 to SPEI(t)6 ADABOOST 0.530 0.706 
SPEI(t − 9)6 to SPEI(t)6 ADABOOST 0.530 0.710 
SPEI(t − 8)6 to SPEI(t)6 ADABOOST 0.520 0.714 
SPEI(t − 7)6 to SPEI(t)6 ADABOOST 0.560 0.685 
SPEI(t − 6)6 to SPEI(t)6 ADABOOST 0.540 0.704 
SPEI(t − 5)6 to SPEI(t)6 ADABOOST 0.550 0.693 
SPEI(t − 4)6 to SPEI(t)6 ADABOOST 0.540 0.700 
SPEI(t − 3)6 to SPEI(t)6 kNN 0.500 0.732 
SPEI(t − 2)6 to SPEI(t)6 RF 0.480 0.745 
SPEI(t − 1)6 to SPEI(t)6 MLP 0.460 0.758 
SPEI(t)6 MLP 0.370 0.822 

The bold values indicate the model that gives the best result among all models.

Table 6

Models developed for SRI9 prediction

Input setModelTesting R2Testing RMSE
SPEI(t − 12)9 to SPEI(t)9 LGBM 0.530 0.725 
SPEI(t − 11)9 to SPEI(t)9 RIDGE 0.480 0.762 
SPEI(t − 10)9 to SPEI(t)9 ADABOOST 0.480 0.759 
SPEI(t − 9)9 to SPEI(t)9 RIDGE 0.470 0.766 
SPEI(t − 8)9 to SPEI(t)9 RIDGE 0.470 0.766 
SPEI(t − 7)9 to SPEI(t)9 RIDGE 0.470 0.767 
SPEI(t − 6)9 to SPEI(t)9 RIDGE 0.470 0.770 
SPEI(t − 5)9 to SPEI(t)9 RIDGE 0.470 0.771 
SPEI(t − 4)9 to SPEI(t)9 LR 0.490 0.757 
SPEI(t − 3)9 to SPEI(t)9 RIDGE 0.480 0.761 
SPEI(t − 2)9 to SPEI(t)9 RIDGE 0.480 0.761 
SPEI(t − 1)9 to SPEI(t)9 LR 0.480 0.758 
SPEI(t)9 LR 0.430 0.799 
Input setModelTesting R2Testing RMSE
SPEI(t − 12)9 to SPEI(t)9 LGBM 0.530 0.725 
SPEI(t − 11)9 to SPEI(t)9 RIDGE 0.480 0.762 
SPEI(t − 10)9 to SPEI(t)9 ADABOOST 0.480 0.759 
SPEI(t − 9)9 to SPEI(t)9 RIDGE 0.470 0.766 
SPEI(t − 8)9 to SPEI(t)9 RIDGE 0.470 0.766 
SPEI(t − 7)9 to SPEI(t)9 RIDGE 0.470 0.767 
SPEI(t − 6)9 to SPEI(t)9 RIDGE 0.470 0.770 
SPEI(t − 5)9 to SPEI(t)9 RIDGE 0.470 0.771 
SPEI(t − 4)9 to SPEI(t)9 LR 0.490 0.757 
SPEI(t − 3)9 to SPEI(t)9 RIDGE 0.480 0.761 
SPEI(t − 2)9 to SPEI(t)9 RIDGE 0.480 0.761 
SPEI(t − 1)9 to SPEI(t)9 LR 0.480 0.758 
SPEI(t)9 LR 0.430 0.799 

The bold values indicate the model that gives the best result among all models.

Table 7

Models developed for SRI12 prediction

Input setModelTesting R2Testing RMSE
SPEI(t − 12)12 to SPEI(t)12 kNN 0.540 0.732 
SPEI(t − 11)12 to SPEI(t)12 kNN 0.590 0.692 
SPEI(t − 10)12 to SPEI(t)12 kNN 0.570 0.708 
SPEI(t − 9)12 to SPEI(t)12 kNN 0.550 0.727 
SPEI(t − 8)12 to SPEI(t)12 SVR 0.500 0.765 
SPEI(t − 7)12 to SPEI(t)12 SVR 0.510 0.750 
SPEI(t − 6)12 to SPEI(t)12 LR 0.480 0.780 
SPEI(t − 5)12 to SPEI(t)12 SVR 0.490 0.773 
SPEI(t − 4)12 to SPEI(t)12 LR 0.480 0.779 
SPEI(t − 3)12 to SPEI(t)12 RIDGE 0.490 0.775 
SPEI(t − 2)12 to SPEI(t)12 RIDGE 0.480 0.783 
SPEI(t − 1)12 to SPEI(t)12 LR 0.450 0.803 
SPEI(t)12 RF 0.430 0.816 
Input setModelTesting R2Testing RMSE
SPEI(t − 12)12 to SPEI(t)12 kNN 0.540 0.732 
SPEI(t − 11)12 to SPEI(t)12 kNN 0.590 0.692 
SPEI(t − 10)12 to SPEI(t)12 kNN 0.570 0.708 
SPEI(t − 9)12 to SPEI(t)12 kNN 0.550 0.727 
SPEI(t − 8)12 to SPEI(t)12 SVR 0.500 0.765 
SPEI(t − 7)12 to SPEI(t)12 SVR 0.510 0.750 
SPEI(t − 6)12 to SPEI(t)12 LR 0.480 0.780 
SPEI(t − 5)12 to SPEI(t)12 SVR 0.490 0.773 
SPEI(t − 4)12 to SPEI(t)12 LR 0.480 0.779 
SPEI(t − 3)12 to SPEI(t)12 RIDGE 0.490 0.775 
SPEI(t − 2)12 to SPEI(t)12 RIDGE 0.480 0.783 
SPEI(t − 1)12 to SPEI(t)12 LR 0.450 0.803 
SPEI(t)12 RF 0.430 0.816 

The bold values indicate the model that gives the best result among all models.

The R2 value indicates model's overall explanatory power and accuracy. A higher R2 value signifies better model performance. RMSE indicates the magnitude of the model's prediction error and quantifies the dispersion between predicted and real data. A lower RMSE value indicates better model performance. Therefore, it is common to use these two metrics together when evaluating the performance of regression models. When used together, these metrics provide a comprehensive evaluation to understand how well the model is performing and the magnitude of prediction errors (Legates & McCabe 1999; Chai & Draxler 2014).

As seen from Table 4, for the 3-month period, the SVR model generally shows the best performance. With the input set, SPEI(t − 7)3 to SPEI(t)3, the SVR model achieved a testing R2 value of 0.450 and an RMSE value of 0.705, the best results overall. The LR model was only tested with the input set SPEI(t)3 and showed the lowest performance with a testing R2 value of 0.240 and an RMSE value of 0.827. Based on this information, it seems reasonable to prefer the SVR model. For the 6-month period, the AdaBoost algorithm, using SPEI3 values up to the (t − 12)th month as inputs, demonstrated the most optimal performance among the constructed SRI3 prediction models at (t)th month (Table 5).

In Table 6, similar to the 6-month period, for the 9-month period, the LGBM algorithm exhibited successful performance when using an input set composed of SPEI9 values up to the (t − 12)th month, outperforming other models for prediction of SRI9 at (t)th month.

For the 12-month period, the kNN model showed the best performance with the input set SPEI(t − 11)12 to SPEI(t)12, achieving a testing R2 value of 0.590 and an RMSE value of 0.692. Overall, the kNN model obtained high R2 and low RMSE values with different input sets, indicating that the model performed consistently well. The RF model, on the other hand, showed the lowest performance (Table 7).

Moreover, stacking regressor approach has been used for kNN-SVR, kNN-RIDGE, and kNN-RF hybrid ML models, which gave the best results for SRI12 and R2 values were obtained as 0.560, 0.560, and 0.540, respectively. Similarly, for SRI9, R2 values of LGBM-ADABOOST, LGBM-RIDGE, and RIDGE-ADABOOST hybrid ML models were obtained as 0.450, 0.470, and 0.480, respectively. For SRI6, R2 values of ADABOOST-kNN and ADABOOST-RF hybrid ML models were 0.480 and 0.540, respectively. For SRI3, R2 values for MLP-kNN, MLP-SVR, and kNN-SVR were found to be 0.390, 0.440, and 0.450, respectively. Since the success rate is higher in ML models developed with DWT, W-ML models were continued in the study.

Modeling using W-ML algorithms

In this part of the study, W-ML hybrid models were developed. The DWT was applied to 3-, 6-, 9-, and 12-month periods of SPEI values. These SPEI series were decomposed into eight detail components (2-4-8-16-32-64-128-256) (D1, D2, D3, D4, D5, D6, D7, and D8). In this study, the Daubechies wavelet was chosen due to its characteristic orthogonality and compact support, which are considered suitable for hydrometeorological time series in the creation of subseries (Nourani et al. 2018). The correlations between the subseries obtained by DWT and 3-, 6-, 9-, and 12-month periods of SRI series were calculated and given in Table 8.

Table 8

Correlation between subseries and 3-, 6-, 9-, and 12-month periods of SRI values

SubseriesCorrelation
SPEI(t)3SPEI(t)6SPEI(t)9SPEI(t)12
D1 0.0400 0.0001 0.0040 0.0400 
D2 0.1800 0.0200 0.0040 0.0600 
D3 0.2100 0.0400 0.0080 0.0002 
D4 0.1100 0.2300 0.2500 0.2700 
D5 0.0400 0.1900 0.2300 0.2700 
D6 0.0100 0.0400 0.0600 0.0900 
D7 0.0005 0.1600 0.0080 0.0080 
D8 0.0700 0.0400 0.0300 0.0300 
SubseriesCorrelation
SPEI(t)3SPEI(t)6SPEI(t)9SPEI(t)12
D1 0.0400 0.0001 0.0040 0.0400 
D2 0.1800 0.0200 0.0040 0.0600 
D3 0.2100 0.0400 0.0080 0.0002 
D4 0.1100 0.2300 0.2500 0.2700 
D5 0.0400 0.1900 0.2300 0.2700 
D6 0.0100 0.0400 0.0600 0.0900 
D7 0.0005 0.1600 0.0080 0.0080 
D8 0.0700 0.0400 0.0300 0.0300 

From Table 8, it can be seen that the highest correlation values for SRI(t)3 were obtained between the D2,D3 wavelet subseries of the SPEI(t)3 input variables. Similarly, the highest correlation values were obtained with D4,D5 wavelet subseries of SPEI(t)6 input variables for SRI(t)6, D4,D5 wavelet subseries of SPEI(t)9 input variables for SRI(t)9, and D4,D5 wavelet subseries of SPEI(t)12 input variables for SRI(t)12. Various combinations of these subseries, individually and by adding them together, have been tried. An increase was observed in the performance of W-ML hybrid models in which subseries were used separately as input parameters for each period. Therefore, the number of inputs in the W-ML hybrid models developed to estimate the SRI(t) value was twice as much as in the ML models. The input sets and R2–RMSE values of the W-ML hybrid models developed to estimate 3-, 6-, 9-, and 12-month SRI(t) values are given in Tables 912.

Table 9

W-ML hybrid models developed for SRI(t)3 prediction

Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D2,D3)3 W-GBOOST 0.484 0.652 
SPEI(t-11 to t)(D2,D3)3 W-MLP 0.447 0.674 
SPEI(t-10 to t)(D2,D3)3 W-LINEAR 0.400 0.703 
SPEI(t-9 to t)(D2,D3)3 W-CATBOOST 0.374 0.718 
SPEI(t-8 to t)(D2,D3)3 W-SVR 0.378 0.716 
SPEI(t-7 to t)(D2,D3)3 W-CATBOOST 0.419 0.692 
SPEI(t-6 to t)(D2,D3)3 W-LINEAR 0.398 0.704 
SPEI(t-5 to t)(D2,D3)3 W-LINEAR 0.376 0.717 
SPEI(t-4 to t)(D2,D3)3 W-LINEAR 0.404 0.700 
SPEI(t-3 to t)(D2,D3)3 W-SVR 0.392 0.708 
SPEI(t-2 to t)(D2,D3)3 W-LINEAR 0.378 0.716 
SPEI(t-1 to t)(D2,D3)3 W-LINEAR 0.374 0.718 
SPEI(t)(D2,D3)3 W-SVR 0.407 0.699 
Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D2,D3)3 W-GBOOST 0.484 0.652 
SPEI(t-11 to t)(D2,D3)3 W-MLP 0.447 0.674 
SPEI(t-10 to t)(D2,D3)3 W-LINEAR 0.400 0.703 
SPEI(t-9 to t)(D2,D3)3 W-CATBOOST 0.374 0.718 
SPEI(t-8 to t)(D2,D3)3 W-SVR 0.378 0.716 
SPEI(t-7 to t)(D2,D3)3 W-CATBOOST 0.419 0.692 
SPEI(t-6 to t)(D2,D3)3 W-LINEAR 0.398 0.704 
SPEI(t-5 to t)(D2,D3)3 W-LINEAR 0.376 0.717 
SPEI(t-4 to t)(D2,D3)3 W-LINEAR 0.404 0.700 
SPEI(t-3 to t)(D2,D3)3 W-SVR 0.392 0.708 
SPEI(t-2 to t)(D2,D3)3 W-LINEAR 0.378 0.716 
SPEI(t-1 to t)(D2,D3)3 W-LINEAR 0.374 0.718 
SPEI(t)(D2,D3)3 W-SVR 0.407 0.699 

The bold values indicate the model that gives the best result among all models.

Table 10

W-ML hybrid models developed for SRI(t)6 prediction

Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D4,D5)6 W-SVR 0.753 0.463 
SPEI(t-11 to t)(D4,D5)6 W-MLP 0.724 0.489 
SPEI(t-10 to t)(D4,D5)6 W-SVR 0.728 0.486 
SPEI(t-9 to t)(D4,D5)6 W-CATBOOST 0.692 0.585 
SPEI(t-8 to t)(D4,D5)6 W-CATBOOST 0.665 0.539 
SPEI(t-7 to t)(D4,D5)6 W-SVR 0.763 0.453 
SPEI(t-6 to t)(D4,D5)6 W-MLP 0.674 0.532 
SPEI(t-5 to t)(D4,D5)6 W-CATBOOST 0.621 0.574 
SPEI(t-4 to t)(D4,D5)6 W-CATBOOST 0.595 0.593 
SPEI(t-3 to t)(D4,D5)6 W-GBOOST 0.588 0.598 
SPEI(t-2 to t)(D4,D5)6 W-RF 0.577 0.606 
SPEI(t-1 to t)(D4,D5)6 W-CATBOOST 0.547 0.627 
SPEI(t)(D4,D5)6 W-CATBOOST 0.580 0.604 
Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D4,D5)6 W-SVR 0.753 0.463 
SPEI(t-11 to t)(D4,D5)6 W-MLP 0.724 0.489 
SPEI(t-10 to t)(D4,D5)6 W-SVR 0.728 0.486 
SPEI(t-9 to t)(D4,D5)6 W-CATBOOST 0.692 0.585 
SPEI(t-8 to t)(D4,D5)6 W-CATBOOST 0.665 0.539 
SPEI(t-7 to t)(D4,D5)6 W-SVR 0.763 0.453 
SPEI(t-6 to t)(D4,D5)6 W-MLP 0.674 0.532 
SPEI(t-5 to t)(D4,D5)6 W-CATBOOST 0.621 0.574 
SPEI(t-4 to t)(D4,D5)6 W-CATBOOST 0.595 0.593 
SPEI(t-3 to t)(D4,D5)6 W-GBOOST 0.588 0.598 
SPEI(t-2 to t)(D4,D5)6 W-RF 0.577 0.606 
SPEI(t-1 to t)(D4,D5)6 W-CATBOOST 0.547 0.627 
SPEI(t)(D4,D5)6 W-CATBOOST 0.580 0.604 

The bold values indicate the model that gives the best result among all models.

Table 11

W-ML hybrid models developed for SRI(t)9 prediction

Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D4,D5)9 W-SVR 0.874 0.349 
SPEI(t-11 to t)(D4,D5)9 W-SVR 0.889 0.329 
SPEI(t-10 to t)(D4,D5)9 W-SVR 0.854 0.375 
SPEI(t-9 to t)(D4,D5)9 W-MLP 0.782 0.460 
SPEI(t-8 to t)(D4,D5)9 W-MLP 0.841 0.393 
SPEI(t-7 to t)(D4,D5)9 W-SVR 0.766 0.476 
SPEI(t-6 to t)(D4,D5)9 W-SVR 0.739 0.503 
SPEI(t-5 to t)(D4,D5)9 W-MLP 0.823 0.414 
SPEI(t-4 to t)(D4,D5)9 W-CATBOOST 0.673 0.563 
SPEI(t-3 to t)(D4,D5)9 W-MLP 0.702 0.537 
SPEI(t-2 to t)(D4,D5)9 W-RF 0.622 0.605 
SPEI(t-1 to t)(D4,D5)9 W-RF 0.572 0.644 
SPEI(t)(D4,D5)9 W-RF 0.636 0.594 
Input setModelTesting R2Testing RMSE
SPEI(t-12 to t)(D4,D5)9 W-SVR 0.874 0.349 
SPEI(t-11 to t)(D4,D5)9 W-SVR 0.889 0.329 
SPEI(t-10 to t)(D4,D5)9 W-SVR 0.854 0.375 
SPEI(t-9 to t)(D4,D5)9 W-MLP 0.782 0.460 
SPEI(t-8 to t)(D4,D5)9 W-MLP 0.841 0.393 
SPEI(t-7 to t)(D4,D5)9 W-SVR 0.766 0.476 
SPEI(t-6 to t)(D4,D5)9 W-SVR 0.739 0.503 
SPEI(t-5 to t)(D4,D5)9 W-MLP 0.823 0.414 
SPEI(t-4 to t)(D4,D5)9 W-CATBOOST 0.673 0.563 
SPEI(t-3 to t)(D4,D5)9 W-MLP 0.702 0.537 
SPEI(t-2 to t)(D4,D5)9 W-RF 0.622 0.605 
SPEI(t-1 to t)(D4,D5)9 W-RF 0.572 0.644 
SPEI(t)(D4,D5)9 W-RF 0.636 0.594 

The bold values indicate the model that gives the best result among all models.

Table 12

W-ML hybrid models developed for SRI(t)12 prediction

Input setModelTesting R2Testing RMSE
SPEI(t − 12 to t)(D4,D5)12 W-MLP 0.907 0.304 
SPEI(t − 11 to t)(D4,D5)12 W-MLP 0.921 0.279 
SPEI(t − 10 to t)(D4,D5)12 W-MLP 0.935 0.272 
SPEI(t − 9 to t)(D4,D5)12 W-MLP 0.890 0.316 
SPEI(t − 8 to t)(D4,D5)12 W-MLP 0.896 0.321 
SPEI(t − 7 to t)(D4,D5)12 W-SVR 0.857 0.375 
SPEI(t − 6 to t)(D4,D5)12 W-GBOOST 0.795 0.450 
SPEI(t − 5 to t)(D4,D5)12 W-CATBOOST 0.739 0.507 
SPEI(t − 4 to t)(D4,D5)12 W-CATBOOST 0.709 0.536 
SPEI(t − 3 to t)(D4,D5)12 W-CATBOOST 0.662 0.577 
SPEI(t − 2 to t)(D4,D5)12 W-CATBOOST 0.635 0.599 
SPEI(t − 1 to t)(D4,D5)12 W-MLP 0.619 0.613 
SPEI(t)(D4,D5)12 W-LGBM 0.600 0.628 
Input setModelTesting R2Testing RMSE
SPEI(t − 12 to t)(D4,D5)12 W-MLP 0.907 0.304 
SPEI(t − 11 to t)(D4,D5)12 W-MLP 0.921 0.279 
SPEI(t − 10 to t)(D4,D5)12 W-MLP 0.935 0.272 
SPEI(t − 9 to t)(D4,D5)12 W-MLP 0.890 0.316 
SPEI(t − 8 to t)(D4,D5)12 W-MLP 0.896 0.321 
SPEI(t − 7 to t)(D4,D5)12 W-SVR 0.857 0.375 
SPEI(t − 6 to t)(D4,D5)12 W-GBOOST 0.795 0.450 
SPEI(t − 5 to t)(D4,D5)12 W-CATBOOST 0.739 0.507 
SPEI(t − 4 to t)(D4,D5)12 W-CATBOOST 0.709 0.536 
SPEI(t − 3 to t)(D4,D5)12 W-CATBOOST 0.662 0.577 
SPEI(t − 2 to t)(D4,D5)12 W-CATBOOST 0.635 0.599 
SPEI(t − 1 to t)(D4,D5)12 W-MLP 0.619 0.613 
SPEI(t)(D4,D5)12 W-LGBM 0.600 0.628 

The bold values indicate the model that gives the best result among all models.

According to Table 9, for the 3-month period, the W-GBOOST hybrid model, using the SPEI(t − 12 to t)(D2,D3)3 wavelet subseries as inputs for predicting SRI3 values in month t, provided the highest R2 and the lowest RMSE values. For the 6-month period, the W-SVR hybrid model, using SPEI(t − 7 to t)(D4,D5)6 wavelet subseries as inputs, demonstrated the best performance among SRI6 prediction models (Table 10).

In Table 11, for the 9-month period, the W-SVR hybrid model showed the best performance when using set of SPEI(t − 11 to t)(D4,D5)9 wavelet subseries as input. For the 12-month period, the W-MLP hybrid model gave the best results with an input set consisting of SPEI(t − 10 to t)(D4,D5)12 subseries in Table 12.

Hyperparameter tuning is also performed using the Optuna framework for W-ML hybrid models. The hyperparameters selected for the best models are given in Table 13.

Table 13

Hyperparameters tuning

ModelsHyperparametersSelection
W-GBOOST SRI3 learning_rate 0.009 
n_estimators 1,423 
subsample 0.943 
min_samples_split 
min_samples_leaf 
max_depth 33 
max_features 0.631 
W-SVR SRI6 value −0.483 
Params: C: 23.267 
epsilon 0.002 
kernel rbf 
W-SVR SRI9 value −0.198 
Params: C: 42.233 
epsilon 0.008 
kernel rbf 
W-MLP SRI12 n_layers 
n_units_l0 92 
n_units_l1 97 
activation tanh 
solver lbfgs 
alpha 0.036 
learning_rate adaptive 
ModelsHyperparametersSelection
W-GBOOST SRI3 learning_rate 0.009 
n_estimators 1,423 
subsample 0.943 
min_samples_split 
min_samples_leaf 
max_depth 33 
max_features 0.631 
W-SVR SRI6 value −0.483 
Params: C: 23.267 
epsilon 0.002 
kernel rbf 
W-SVR SRI9 value −0.198 
Params: C: 42.233 
epsilon 0.008 
kernel rbf 
W-MLP SRI12 n_layers 
n_units_l0 92 
n_units_l1 97 
activation tanh 
solver lbfgs 
alpha 0.036 
learning_rate adaptive 

The Taylor plots, which visualize the accuracy and precision of the W-ML models by displaying multiple statistical metrics (such as correlation coefficient, standard deviation, and mean squared error) on a single graph, are presented in Figure 10.
Figure 10

Taylor plots of test sets for SRI(t) models. (a) W-GBOOST SRI3, (b) W-SVR SRI6, (c) W-SVR SRI9, and (d) W-MLP SRI12.

Figure 10

Taylor plots of test sets for SRI(t) models. (a) W-GBOOST SRI3, (b) W-SVR SRI6, (c) W-SVR SRI9, and (d) W-MLP SRI12.

Close modal

The performance of the 3-, 6-, 9-, and 12-month period W-ML hybrid models is detailed in Figure 10 through Taylor plots. Figure 10(d) and 10(c) shows that the RMSE value for W-MLP and W-SVR models for SRI12 and SRI9, respectively, is close to 0.300, and the correlation between the predictions and the SRI12 and SRI9 values is around 0.900. This indicates that the standard deviations of the predictions are close to the actual values. Therefore, it is understood that the W-MLP and W-SVR models for SRI12 and SRI9 have very good performance, respectively. Similarly, the Taylor plots for SRI6 show that the W-SVR model also has reasonable performance (Figure 10(b)). However, the W-GBOOST model for SRI3 (Figure 10(a)) has poor performance compared with 6-, 9-, and 12-month periods.

Then, the improvement stage is performed to enhance the success of the obtained W-ML hybrid models. Thus, the W-ML-ROS hybrid models were developed by resampling with ROS for 3-, 6-, 9-, and 12-month periods. This approach resulted in a more homogeneously distributed dataset. The R2 and RMSE values for testing sets obtained W-ML-ROS hybrid models are summarized in Table 14.

Table 14

RMSE and R2 values of the W-ML-ROS models

ModelInputsTesting set
R2RMSE
W-GBOOST-ROS (SRI)3 SPEI(t − 12 to t)(D2,D3)3 0.893 0.350 
W-SVR-ROS (SRI)6 SPEI(t − 7 to t)(D4,D5)6 0.851 0.365 
W-SVR-ROS (SRI)9 SPEI(t − 11 to t)(D4,D5)9 0.940 0.261 
W-MLP-ROS (SRI)12 SPEI(t − 10 to t)(D4,D5)12 0.999 0.031 
ModelInputsTesting set
R2RMSE
W-GBOOST-ROS (SRI)3 SPEI(t − 12 to t)(D2,D3)3 0.893 0.350 
W-SVR-ROS (SRI)6 SPEI(t − 7 to t)(D4,D5)6 0.851 0.365 
W-SVR-ROS (SRI)9 SPEI(t − 11 to t)(D4,D5)9 0.940 0.261 
W-MLP-ROS (SRI)12 SPEI(t − 10 to t)(D4,D5)12 0.999 0.031 

With ROS application, it has been observed that W-ML-ROS hybrid models have higher R2 and lower RMSE values than W-ML models. The scatter plots for the enhanced models are given in Figure 11.
Figure 11

Scatter plots for testing sets of W-ML-ROS models.

Figure 11

Scatter plots for testing sets of W-ML-ROS models.

Close modal

Scatter diagrams provide a good visual tool for comparing the performance of models. In general, the concentration of points on or near the 45-degree inclined line indicates that the model is making predictions with high accuracy. It is observed that the model in the 12-month period W-MLP-ROS shows the best performance and the point clustering is the densest. The model in the 9-month period W-SVR-ROS also performs quite well.

When all models were examined, it was seen that the best performance in predicting SRI3 in the 3-month period was obtained by using the W-GBOOST-ROS model for inputs from SPEI(t)3 to SPEI(t − 12)3. Similarly, in the 6- and 9-month periods, with inputs from SPEI(t)6 to SPEI(t − 7)6 and SPEI(t)9 to SPEI(t − 11)9 good SRI6 and SRI9 estimations are provided by W-SVR-ROS, respectively. Moreover, in the 12-month period, the best SRI12 model belongs to the W-MLP-ROS model with inputs from SPEI(t)12 to SPEI(t − 10)12. After observing any meteorological drought level occurring in the region, the value of a possible hydrological drought can be predicted with great accuracy in periods of 3-, 6-, 9-, and 12-months with a delay of approximately 7 − 11 months.

In general, it is observed from the results that both DWT and ROS algorithms significantly improve ML models in the prediction of hydrological drought. The R2 values obtained for the test set in the models developed with ML algorithms for 3-, 6-, 9-, and 12-month periods are found as 0.450, 0.570, 0.530, and 0.590, respectively. The R2 values for the test set in the W-ML models developed with DWT process are obtained as 0.484, 0.763, 0.889, and 0.935, respectively. With the application of ROS to the W-ML models, the R2 values increased to 0.893, 0.851, 0.940, and 0.999, respectively. The W-ML-ROS models provided better efficiency than ML and W-ML models.

From the literature on prediction of hydrological drought from meteorological drought, Achite et al. (2022) developed DT, ANFIS, ANN, and SVM models using SPI values for predicting hydrological drought (SRI) in Wadi Ouahrane Basin in Algeria. They obtained the highest R2 value of 0.95 for the SVM model at the 12-month periods and stated that R2 values decreased at smaller periods. Hao et al. (2023) preferred LSTM, CNN, and CATBOOST methods while determining agricultural and hydrological droughts from meteorological droughts. Among the developed models, the CNN model obtained the highest accuracy with a Nash–Sutcliffe coefficient of 0.70. They found a delay between meteorological drought and hydrological drought for 8–11 months.

According to the literature given above, different ML and deep learning methods have been tried in the prediction of hydrological drought. However, in this study, in addition to these ML algorithms, DWT and ROS were applied, and then hybrid ML models (W-ML-ROS) with higher accuracy were obtained. Also in this and similar studies, high accuracy prediction values were mostly found in models for the 12-month period (Keskin et al. 2009, 2011; El Ibrahimi& Baali 2018; Docheshmeh Gorgij et al. 2022).

In other study conducted in the Gediz Basin, Kumanlioglu (2020) examined the relationship between meteorological (SPEI, SPI) and hydrological droughts (SRI) and stated that a 12-month period can be considered to accurately predict the delay between meteorological and hydrological droughts since the hydrological cycle occurs in a 12-month period. Like the results obtained by Hao et al. (2023), in this study, this delay remained between 7 and 11 months in the 3-, 6-, 9-, and 12-month periods in the prediction of hydrological drought in the region.

In this study, hydrological drought events around the Uşak province in the Gediz Basin of Turkey were predicted by using meteorological data obtained from the Uşak meteorological observation station and streamflow data obtained from various streamflow gauging stations. The analysis revealed that station E05A014 exhibited the highest correlation with P and Ta, making this station the most effective streamflow gauging station for determining hydrological drought. Drought indices, SPEI for meteorological drought and SRI for hydrological drought, were calculated for 3-, 6-, 9-, and 12-month periods. Time series analysis showed a noticeable delay between meteorological and hydrological drought indices over each period. Correlation matrices were created for different time shifts to determine the inputs of ML models depending on this delay, and a decreasing but positive relationship emerged between SPEI values ​​and SRI for each time shift. ML regression models were then used to predict hydrological drought from meteorological drought, taking into account a monthly shift for up to 12 months. Models covering MLP, kNN, SVR, LR, AdaBoost, RF, LGBM, ridge, DT, CatBoost, and GradientBoosting algorithms were evaluated using R2 and RMSE. Additionally, hyperparameter tuning using the Optuna framework resulted in improved R2 and RMSE values for all models. For each period, certain models demonstrated superior performance. For example, the W-GBOOST model with a 12-month time shift gives the best results in the 3-month period (t − 7). The W-SVR model with inputs up to 6 months demonstrated excellence over a 6-month period. It was also observed for the 9-month period that W-SVR algorithms performed extremely well. The best model with inputs up to (t − 10) months for a 12-month period is provided by W-MLP. To further improve model performance, hybrid models were developed by applying ROS to create a more balanced dataset. This approach resulted in improvements in prediction accuracy, as evidenced by the reduced RMSE values. In hydrological drought prediction models, R2 values show very high performance by remaining between 0.851 and 0.999 when tested in 3-, 6-, 9-, and 12-month periods.

In summary, the following results were obtained from this study:

  • In this study, it was concluded that by separating the dataset into its subseries using DWT, the prediction performance was higher than the original dataset. Planning sustainable water resources in the region and predicting hydrological drought with high accuracy are an exemplary study in terms of the need to combat drought.

  • W-ML-ROS hybrid models were successfully used to predict hydrological drought from meteorological drought for the first time.

  • The development of hybrid models and hyperparameter tuning contributed to increased prediction accuracy, emphasizing the effectiveness of these techniques in hydrological drought predicting.

  • The use of W-ML-ROS models to monitor drought, which is one of the natural disasters of meteorological origin, and to create emergency precaution plans for hydrological drought that may occur in the region, increases the attractiveness of the study as it provides successful results in a shorter time than other methods.

  • It aims to improve the results by testing new W-ML algorithms in the future.

The study is based on datasets from the years 1985–2015. This is one of the limitations in the study. Another limitation is that only data from the Gediz Basin were used. Including a broader dataset and more stations could enhance the generalizability of the results.

For future studies, the suitability of the W-ML-ROS hybrid models proposed in this study for the prediction of hydrological drought for different regions can be investigated, and the model can be updated with additional studies for future drought predictions. It is recommended to try different optimization methods such as fish swarm and artificial bee colonies that can increase the performance of the developed model. Additionally, SHAP (SHapley Additive exPlanations) analysis can be performed to examine the effect of input parameters on outputs to test other value sets in future studies.

E.D.T.: Conceptualization, Methodology, Writing – original draft, Investigation, Software, and Visualization.

The author declares that no funds, grants, or other support were received during the preparation of this manuscript.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Achite
M.
,
Jehanzaib
M.
,
Elshaboury
N.
&
Kim
T. W.
(
2022
)
Evaluation of machine learning techniques for hydrological drought modeling: A case study of the Wadi Ouahrane basin in Algeria
.
Water
14
(
3
),
431
.
https://doi.org/10.3390/w14030431
.
Achite
M.
,
Katipoglu
O. M.
,
Şenocak
S.
,
Elshaboury
N.
,
Bazrafshan
O.
&
Dalkılıç
H. Y.
(
2023
)
Modeling of meteorological, agricultural, and hydrological droughts in semi-arid environments with various machine learning and discrete wavelet transform
.
Theoretical and Applied Climatology
154
(
1
),
413
451
.
https://doi.org/10.1007/s00704-023-04564-4
.
Adib
A.
&
Tavancheh
F.
(
2019
)
Relationship between hydrologic and metrological droughts using the streamflow drought indices and standardized precipitation indices in the Dez Watershed of Iran
.
International Journal of Civil Engineering
17
,
1171
1181
.
https://doi.org/10.1007/s40999-018-0376-y
.
Adnan
R. M.
,
Mostafa
R. R.
,
Islam
A. R. M. T.
,
Kisi
O.
,
Kuriqi
A.
&
Heddam
S.
(
2021
)
Estimating reference evapotranspiration using hybrid adaptive fuzzy inferencing coupled with heuristic algorithms
.
Computers and Electronics in Agriculture
191
,
106541
.
https://doi.org/10.1016/j.compag.2021.10654
.
Adnan
R. M.
,
Dai
H. L.
,
Mostafa
R. R.
,
Parmar
K. S.
,
Heddam
S.
&
Kisi
O.
(
2022
)
Modeling multistep ahead dissolved oxygen concentration using improved support vector machines by a hybrid metaheuristic algorithm
.
Sustainability
14
(
6
),
3470
.
https://doi.org/10.3390/su14063470
.
Adnan
R. M.
,
Dai
H. L.
,
Mostafa
R. R.
,
Islam
A. R. M. T.
,
Kisi
O.
,
Heddam
S.
&
Zounemat-Kermani
M.
(
2023a
)
Modelling groundwater level fluctuations by ELM merged advanced metaheuristic algorithms using hydroclimatic data
.
Geocarto International
38
(
1
),
2158951
.
https://doi.org/10.1080/10106049.2022.2158951
.
Adnan
R. M.
,
Mostafa
R. R.
,
Dai
H. L.
,
Heddam
S.
,
Kuriqi
A.
&
Kisi
O.
(
2023b
)
Pan evaporation estimation by relevance vector machine tuned with new metaheuristic algorithms using limited climatic data
.
Engineering Applications of Computational Fluid Mechanics
17
(
1
),
2192258
.
https://doi.org/10.1080/19942060.2023.2192258
.
Ağbulut
Ü.
,
Gürel
A. E.
&
Biçen
Y.
(
2021
)
Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison
.
Renewable and Sustainable Energy Reviews
135
,
110114
.
https://doi.org/10.1016/j.rser.2020.110114
.
Agrawal
T.
(
2021
)
Optuna and AutoML
. In:
Agrawal, T. (ed.)
,
Hyperparameter Optimization in Machine Learning
.
Springer, Apress
,
Berkeley, CA
.
https://doi.org/10.1007/978-1-4842-6579-6_5
.
Aguiar-Conraria
L.
&
Soares
M. J.
(
2014
)
The continuous wavelet transform: Moving beyond uni- and bivariate analysis
.
Journal of Economic Surveys
28
(
2
),
344
375
.
https://doi.org/10.1111/joes.12012
.
Akiba
T.
,
Sano
S.
,
Yanase
T.
,
Ohta
T.
&
Koyama
M.
(
2019
)
Optuna: A next-generation hyperparameter optimization framework
. In:
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
, pp.
2623
2631
.
https://doi.org/10.1145/3292500.3330701
.
Anyanwu
M. N.
&
Shiva
S. G.
(
2009
)
Comparative analysis of serial decision tree classification algorithms
.
International Journal of Computer Science and Security
3
(
3
),
230
240
.
Araghi
A.
,
Adamowski
J.
&
Martinez
C. J.
(
2020
)
Comparison of wavelet-based hybrid models for the estimation of daily reference evapotranspiration in different climates
.
Journal of Water and Climate Change
11
(
1
),
39
53
.
https://doi.org/10.2166/wcc.2018.113
.
Ayodele
T. O.
(
2010
)
Types of machine learning algorithms
.
New Advances in Machine Learning
3
,
19
48
.
Baker
M. R.
,
Mahmood
Z. N.
&
Shaker
E. H.
(
2022
)
Ensemble learning with supervised machine learning models to predict credit card fraud transactions
.
Revue d'Intelligence Artificielle
36
(
4
),
509
518
.
https://doi.org/10.18280/ria.360401
.
Beguería
S.
,
Vicente-Serrano
S. M.
,
Reig
F.
&
Latorre
B.
(
2014
)
Standardized precipitation evapotranspiration index (SPEI) revisited: Parameter fitting, evapotranspiration models, tools, datasets and drought monitoring
.
International Journal of Climatology
34
(
10
),
3001
3023
.
https://doi.org/10.1002/joc.3887
.
Belayneh
A.
&
Adamowski
J.
(
2013
)
Drought forecasting using new machine learning methods
.
Journal of Water and Land Development
18
(
9
),
3
12
.
https://doi.org/10.2478/jwld-2013-0001
.
Brunton
S. L.
,
Noack
B. R.
&
Koumoutsakos
P.
(
2020
)
Machine learning for fluid mechanics
.
Annual Review of Fluid Mechanics
52
,
477
508
.
https://doi.org/10.1146/annurev-fluid-010719-060214
.
Büyükşahin
F.
(
2018
)
Air pollution with anthropocentric effect and climate change
.
International Journal of Human Studies
1
(
1
),
14
26
.
https://doi.org/10.35235/uicd.427397
.
CA
(
2023
)
Climate Assessment Report, Republic of Turkey Ministry of Agriculture and Forestry, General Directorate of Water Management
.
Department of Climate and Agricultural Meteorology, Republic of Türkiye
,
Ankara
, p.
27
.
Can
U.
(
2024
)
Solving the missing data problem
.
Encyclopedia of Management Information Systems (YBS Encyclopedia)
12
(
1
),
24
60
.
Chai
T.
&
Draxler
R. R.
(
2014
)
Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature
.
Geoscientific Model Development
7
(
3
),
1247
1250
.
https://doi.org/10.5194/gmd-7-1247-2014
.
Chawla
N. V.
,
Japkowicz
N.
&
Kotcz
A.
(
2004
)
Special issue on learning from imbalanced data sets
.
ACM SIGKDD Explorations Newsletter
6
(
1
),
1
6
.
https://doi.org/10.1145/1007730.1007733
.
Choi
M.
,
Jacobs
J. M.
,
Anderson
M. C.
&
Bosch
D. D.
(
2013
)
Evaluation of drought indices via remotely sensed data with hydrological variables
.
Journal of Hydrology
476
,
265
273
.
https://doi.org/10.1016/j.jhydrol.2012.10.042
.
Çoban
Ö.
,
Eşit
M.
&
Yalçın
S.
(
2023
)
ML-DPIE: Comparative evaluation of machine learning methods for drought parameter index estimation: A case study of Türkiye
.
Natural Hazards
1
33
.
https://doi.org/10.1007/s11069-023-06233-1
.
Cortes
C.
&
Vapnik
V.
(
1995
)
Support-vector networks
.
Machine Learning
20
,
273
297
.
Dabanlı
İ.
,
Mishra
A. K.
&
Şen
Z.
(
2017
)
Long-term spatio-temporal drought variability in Turkey
.
Journal of Hydrology
552
,
779
792
.
https://doi.org/10.1016/j.jhydrol.2017.07.038
.
Docheshmeh Gorgij
A.
,
Alizamir
M.
,
Kisi
O.
&
Elshafie
A.
(
2022
)
Drought modelling by standard precipitation index (SPI) in a semi-arid climate using deep learning method: Long short-term memory
.
Neural Computing and Applications
1
18
.
https://doi.org/10.1007/s00521-021-06505-6
.
Drucker
H.
(
1997
)
Improving regressors using boosting techniques
. In:
ICML
, Vol.
97
, pp.
107
115
.
El Ibrahimi
A.
&
Baali
A.
(
2018
)
Application of several artificial intelligence models for forecasting meteorological drought using the standardized precipitation index in the Saïss plain (Northern Morocco)
.
International Journal of Intelligent Engineering & Systems
11
(
1
).
https://doi.org/10.22266/ijies2018.0228.28
.
Farajpanah
H.
,
Lotfirad
M.
,
Adib
A.
,
Esmaeili-Gisavandani
H.
,
Kisi
Ö.
,
Riyahi
M. M.
&
Salehpoor
J.
(
2020
)
Ranking of hybrid wavelet-AI models by TOPSIS method for estimation of daily flow discharge
.
Water Supply
20
(
8
),
3156
3171
.
https://doi.org/10.2166/ws.2020.211
.
Friedl
M. A.
&
Brodley
C. E.
(
1997
)
Decision tree classification of land cover from remotely sensed data
.
Remote Sensing of Environment
61
(
3
),
399
409
.
https://doi.org/10.1016/S0034-4257(97)00049-7
.
Friedman
J. H.
(
2001
)
Greedy function approximation: A gradient boosting machine
.
Annals of Statistics
29
(
5
),
1189
1232
. https://www.jstor.org/stable/2699986.
Fu
M.
,
Fan
T.
,
Ding
Z. A.
,
Salih
S. Q.
,
Al-Ansari
N.
&
Yaseen
Z. M.
(
2020
)
Deep learning data-intelligence model based on adjusted forecasting window scale: Application in daily streamflow simulation
.
IEEE Access
8
,
32632
32651
.
Galar
M.
,
Fernández
A.
,
Barrenechea
E.
&
Herrera
F.
(
2013
)
EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling
.
Pattern Recognition
46
(
12
),
3460
3471
.
https://doi.org/10.1016/j.patcog.2013.05.006
.
Gan
R.
,
Gu
S.
,
Tong
X.
,
Lu
J.
&
Tang
H.
(
2023
)
A nonparametric standardized runoff index for characterizing hydrological drought in the Shaying River Basin, China
.
Natural Hazards
1
21
.
https://doi.org/10.1007/s11069-023-06179-4
.
GBDMP
(
2019
)
Gediz Basin Drought Management Plan, Republic of Turkey Ministry of Agriculture and Forestry, General Directorate of Water Management
.
Republic of Türkiye
,
Ankara
, p.
484
.
Gharib
A.
&
Davies
E. G.
(
2021
)
A workflow to address pitfalls and challenges in applying machine learning models to hydrology
.
Advances in Water Resources
152
,
103920
.
https://doi.org/10.1016/j.advwatres.2021.103920
.
Gul
E.
,
Staiou
E.
,
Safari
M. J. S.
&
Vaheddoost
B.
(
2023
)
Enhancing meteorological drought modeling accuracy using hybrid boost regression models: A case study from the Aegean region, Türkiye
.
Sustainability
15
(
15
),
11568
.
https://doi.org/10.3390/su151511568
.
Hargreaves
G. H.
&
Samani
Z. A.
(
1985
)
Reference crop evapotranspiration from temperature
.
Applied Engineering in Agriculture
1
(
2
),
96
99
.
https://doi.org/10.13031/2013.26773
.
Haykin
S.
&
Network
N.
(
2004
)
A comprehensive foundation
.
Neural Networks
2
(
2004
),
41
.
Hernández
E.
&
Weiss
G.
(
1996
)
A First Course on Wavelets
, 1st edn.
CRC Press
,
Boca Raton, FL
,
489
pp.
Hoerl
A. E.
&
Kennard
R. W.
(
1970
)
Ridge regression: Biased estimation for nonorthogonal problems
.
Technometrics
12
(
1
),
55
67
.
doi:10.1080/00401706.1970.10488634
.
Huang
G.
,
Wu
L.
,
Ma
X.
,
Zhang
W.
,
Fan
J.
,
Yu
X.
,
Zeng
W.
&
Zhou
H.
(
2019
)
Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions
.
Journal of Hydrology
574
,
1029
1041
.
https://doi.org/10.1016/j.jhydrol.2019.04.085
.
Ikram
R. M. A.
,
Mostafa
R. R.
,
Chen
Z.
,
Parmar
K. S.
,
Kisi
O.
&
Zounemat-Kermani
M.
(
2023
)
Water temperature prediction using improved deep learning methods through reptile search algorithm and weighted mean of vectors optimizer
.
Journal of Marine Science and Engineering
11
(
2
),
259
.
https://doi.org/10.3390/jmse11020259
.
Kale
M. M.
(
2021
)
Hydrological drought analysis for the Akarçay closed basin
.
Journal of Geography
42
,
165
180
.
https://doi.org/10.26650/JGEOG2021-892360
.
Keskin
M. E.
,
Terzi
O.
,
Taylan
E. D.
&
Küçükyaman
D.
(
2009
)
Meteorological drought analysis using data-driven models for the Lakes District, Turkey
.
Hydrological Sciences Journal
54
(
6
),
1114
1124
.
https://doi.org/10.1623/hysj.54.6.1114
.
Keskin
M. E.
,
Terzi
O.
,
Taylan
E. D.
&
Küçükyaman
D.
(
2011
)
Meteorological drought analysis using artificial neural networks
.
Scientific Research and Essays
6
(
21
),
4469
4477
.
https://doi.org/10.5897/SRE10.1022
.
Khan
N.
,
Sachindra
D. A.
,
Shahid
S.
,
Ahmed
K.
,
Shiru
M. S.
&
Nawaz
N.
(
2020
)
Prediction of droughts over Pakistan using machine learning algorithms
.
Advances in Water Resources
139
,
103562
.
https://doi.org/10.1016/j.advwatres.2020.103562
.
Khorrami
B.
,
Ali
S.
,
Abadi
L. H.
&
Jehanzaib
M.
(
2023
)
Spatio-temporal variations in characteristics of terrestrial water storage and associated drought over different geographic regions of Türkiye
.
Earth Science Informatics
16
(
1
),
717
731
.
https://doi.org/10.1007/s12145-022-00907-3
.
Kim
J.
,
Han
H.
,
Johnson
L. E.
,
Lim
S.
&
Cifelli
R.
(
2019
)
Hybrid machine learning framework for hydrological assessment
.
Journal of Hydrology
577
,
123913
.
https://doi.org/10.1016/j.jhydrol.2019.123913
.
Kumanlioglu
A. A.
(
2020
)
Characterizing meteorological and hydrological droughts: A case study of the Gediz River Basin, Turkey
.
Meteorological Applications
27
(
1
),
e1857
.
https://doi.org/10.1002/met.1857
.
Kumar
P.
&
Foufoula-Georgiou
E.
(
1993
)
A multicomponent decomposition of spatial rainfall fields: 1. Segregation of large-and small-scale features using wavelet transforms
.
Water Resources Research
29
(
8
),
2515
2532
.
https://doi.org/10.1029/93WR00548
.
Lafreniere
M.
&
Sharp
M.
(
2003
)
Wavelet analysis of inter-annual variability in the runoff regimes of glacial and nival stream catchments, Bow Lake, Alberta
.
Hydrological Processes
17
(
6
),
1093
1118
.
https://doi.org/10.1002/hyp.1187
.
Legates
D. R.
&
McCabe
G. J.
Jr.
(
1999
)
Evaluating the use of ‘goodness-of-fit’ measures in hydrologic and hydroclimatic model validation
.
Water Resources Research
35
(
1
),
233
241
.
https://doi.org/10.1029/1998WR900018
.
Li
X.
,
Li
W.
&
Xu
Y.
(
2018
)
Human age prediction based on DNA methylation using a gradient boosting regressor
.
Genes
9
(
9
),
424
.
https://doi.org/10.3390/genes9090424
.
Li
L.
,
She
D.
,
Zheng
H.
,
Lin
P.
&
Yang
Z. L.
(
2020
)
Elucidating diverse drought characteristics from two meteorological drought indices (SPI and SPEI) in China
.
Journal of Hydrometeorology
21
(
7
),
1513
1530
.
https://doi.org/10.1175/JHM-D-19-0290.1
.
Lloyd-Hughes
B.
&
Saunders
M. A.
(
2002
)
A drought climatology for Europe
.
International Journal of Climatology: A Journal of the Royal Meteorological Society
22
(
13
),
1571
1592
.
https://doi.org/10.1002/joc.846
.
Luu
Q. H.
,
Lau
M. F.
,
Ng
S. P.
&
Chen
T. Y.
(
2021
)
Testing multiple linear regression systems with metamorphic testing
.
Journal of Systems and Software
182
,
111062
.
https://doi.org/10.1016/j.jss.2021.111062
.
Madsen
H.
,
Mikkelsen
P. S.
,
Rosbjerg
D.
&
Harremoës
P.
(
1998
)
Estimation of regional intensity-duration-frequency curves for extreme precipitation
.
Water Science and Technology
37
(
11
),
29
36
.
https://doi.org/10.1016/S0273-1223(98)00313-8
.
Manzano
A.
,
Clemente
M. A.
,
Morata
A.
,
Luna
M. Y.
,
Beguería
S.
,
Vicente-Serrano
S. M.
&
Martín
M. L.
(
2019
)
Analysis of the atmospheric circulation pattern effects over SPEI drought index in Spain
.
Atmospheric Research
230
,
104630
.
https://doi.org/10.1016/j.atmosres.2019.104630
.
McKee
T. B.
,
Doesken
N. J.
&
Kleist
J.
(
1993
)
The relationship of drought frequency and duration to time scales
. In
Proceedings of the 8th Conference on Applied Climatology
,
Vol. 17
,
No. 22
, pp.
179
183
.
Meng
Q.
,
Ke
G.
,
Wang
T.
,
Chen
W.
,
Ye
Q.
,
Ma
Z. M.
&
Liu
T. Y.
(
2016
)
A communication-efficient parallel algorithm for decision tree
.
Advances in Neural Information Processing Systems
29
.
https://doi.org/10.1016/j.energy.2020.118874
.
Mercado
V. D.
,
Perez
G. C.
,
Solomatine
D.
&
Van Lanen
H. A.
(
2016
)
Spatio-temporal analysis of hydrological drought at catchment scale using a spatially-distributed hydrological model
.
Procedia Engineering
154
,
738
744
.
https://doi.org/10.1016/j.proeng.2016.07.577
.
Mishra
A. K.
&
Singh
V. P.
(
2010
)
A review of drought concepts
.
Journal of Hydrology
391
(
1–2
),
202
216
.
https://doi.org/10.1016/j.jhydrol.2010.07.012
.
Mohamadi
S.
,
Sammen
S. S.
,
Panahi
F.
,
Ehteram
M.
,
Kisi
O.
,
Mosavi
A.
,
Ahmed
A. N.
,
El Shaive
A.
&
Al-Ansari
N.
(
2020
)
Zoning map for drought prediction using integrated machine learning models with a nomadic people optimization algorithm
.
Natural Hazards
104
,
537
579
.
https://doi.org/10.1007/s11069-020-04180-9
.
Mohammed
R.
,
Rawashdeh
J.
&
Abdullah
M.
(
2020
)
Machine learning with oversampling and undersampling techniques: Overview study and experimental results
. In:
2020 11th International Conference on Information and Communication Systems (ICICS)
.
IEEE
, pp.
243
248
.
https://doi.org/10.1109/ICICS49469.2020.239556
.
Mokhtar
A.
,
Jalali
M.
,
He
H.
,
Al-Ansari
N.
,
Elbeltagi
A.
,
Alsafadi
K.
,
Abdo
H. G.
,
Sh. Sammen
S.
,
Gyasi-Agyei
Y.
&
Rodrigo-Comino
J.
(
2021
)
Estimation of SPEI meteorological drought using machine learning algorithms
.
IEEE Access
9
,
65503
65523
.
https://doi.org/10.1109/ACCESS.2021.3074305
.
Moreo
A.
,
Esuli
A.
&
Sebastiani
F.
(
2016
)
Distributional random oversampling for imbalanced text classification
. In:
Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp.
805
808
.
https://doi.org/10.1145/2911451.2914722
.
Mostafa
R. R.
,
Kisi
O.
,
Adnan
R. M.
,
Sadeghifar
T.
&
Kuriqi
A.
(
2023
)
Modeling potential evapotranspiration by improved machine learning methods using limited climatic data
.
Water
15
(
3
),
486
.
https://doi.org/10.3390/w15030486
.
Nooruldeen
O.
,
Alturki
S.
,
Baker
M. R.
&
Ghareeb
A.
(
2022
)
Time series forecasting for decision making on city-wide energy demand: A comparative study
. In
2022 International Conference on Decision Aid Sciences and Applications (DASA)
.
IEEE
, pp.
1706
1710
.
https://doi.org/10.1109/DASA54658.2022.9765193
.
Nourani
V.
,
Danandeh Mehr
A.
&
Azad
N.
(
2018
)
Trend analysis of hydroclimatological variables in Urmia Lake basin using hybrid wavelet Mann–Kendall and Şen tests
.
Environmental Earth Sciences
77
,
1
18
.
https://doi.org/10.1007/s12665-018-7390-x
.
Ogunrinde
A. T.
,
Olasehinde
D. A.
&
Olotu
Y.
(
2020
)
Assessing the sensitivity of standardized precipitation evapotranspiration index to three potential evapotranspiration models in Nigeria
.
Scientific African
8
,
e00431
.
https://doi.org/10.1016/j.sciaf.2020.e00431
.
Ojha
S. S.
,
Singh
V.
&
Roshni
T.
(
2021
)
Comparison of meteorological drought using SPI and SPEI
.
International Journal of Civil Engineering
7
,
2130
2149
.
https://doi.org/10.28991/cej-2021-03091783
.
Öney
M.
&
Anlı
A. S.
(
2023
)
Regional drought analysis with standardized precipitation evapotranspiration index (SPEI): Gediz Basin, Turkey
.
Journal of Agricultural Sciences
29
(
4
),
1032
1049
.
https://doi.org/10.15832/ankutbd.1030782
.
Özcan
O.
(
2017
)
Evaluation of flood determination with various methods: Case study of Ayamama Creek
.
Journal of Natural Hazards and Environment
3
(
1
),
9
27
.
https://doi.org/10.21324/dacd.267200
.
Petty
T. R.
&
Dhingra
P.
(
2018
)
Streamflow hydrology estimate using machine learning (SHEM)
.
JAWRA Journal of the American Water Resources Association
54
(
1
),
55
68
.
https://doi.org/10.1111/1752-1688.12555
.
Prokhorenkova
L.
,
Gusev
G.
,
Vorobev
A.
,
Dorogush
A. V.
&
Gulin
A.
(
2018
)
CatBoost: Unbiased boosting with categorical features
. In:
Proceedings of the 32nd International Conference on Neural Information Processing Systems
, pp.
6639
6649
. .
Rajput
J.
,
Singh
M.
,
Lal
K.
,
Khanna
M.
,
Sarangi
A.
,
Mukherjee
J.
&
Singh
S.
(
2023
)
Performance evaluation of soft computing techniques for forecasting daily reference evapotranspiration
.
Journal of Water and Climate Change
14
(
1
),
350
368
.
https://doi.org/10.2166/wcc.2022.385
.
Rajsekhar
D.
,
Mishra
A. K.
&
Singh
V. P.
(
2013
)
Regionalization of drought characteristics using an entropy approach
.
Journal of Hydrologic Engineering
18
(
7
),
870
887
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0000683
.
Riebsame
W. E.
,
Changnon
S. A.
&
Karl
T. R.
(
1991
)
Drought and Natural Resource Management in the United States: Impacts and Implications of the 1987–1989 Drought
.
Westview Press
,
Boulder, CO
, p.
174
.
Salimian
N.
,
Nazari
S.
&
Ahmadi
A.
(
2021
)
Assessment of the uncertainties of global climate models in the evaluation of standardized precipitation and runoff indices: A case study
.
Hydrological Sciences Journal
66
(
9
),
1419
1436
.
https://doi.org/10.1080/02626667.2021.1937178
.
Schratz
P.
,
Muenchow
J.
,
Iturritxa
E.
,
Richter
J.
&
Brenning
A.
(
2019
)
Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data
.
Ecological Modelling
406
,
109
120
.
https://doi.org/10.1016/j.ecolmodel.2019.06.002
.
Şen
Z.
&
Almazroui
M.
(
2021
)
Actual precipitation index (API) for drought classification
.
Earth Systems and Environment
5
,
59
70
.
https://doi.org/10.1007/s41748-021-00201-0
.
Shakhnarovich
G.
,
Darrell
T.
&
Indyk
P.
(
2008
)
Nearest-neighbor Methods in Learning and Vision: Theory and Practice
,
Vol. 3
.
MIT Press
,
Cambridge, MA, USA
.
Shen
C.
(
2018
)
A transdisciplinary review of deep learning research and its relevance for water resources scientists
.
Water Resources Research
54
(
11
),
8558
8593
.
https://doi.org/10.1029/2018WR022643
.
Shukla
S.
&
Wood
A. W.
(
2008
)
Use of a standardized runoff index for characterizing hydrologic drought
.
Geophysical Research Letters
35
(
2
).
https://doi.org/10.1029/2007GL032487
.
Sibindi
R.
,
Mwangi
R. W.
&
Waititu
A. G.
(
2023
)
A boosting ensemble learning based hybrid light gradient boosting machine and extreme gradient boosting model for predicting house prices
.
Engineering Reports
5
(
4
),
e12599
.
https://doi.org/10.1002/eng2.12599
.
Snieder
E.
,
Abogadil
K.
&
Khan
U. T.
(
2021
)
Resampling and ensemble techniques for improving ANN-based high-flow forecast accuracy
.
Hydrology and Earth System Sciences
25
(
5
),
2543
2566
.
https://doi.org/10.5194/hess-25-2543-2021
.
Stagge
J. H.
,
Tallaksen
L. M.
,
Xu
C. Y.
&
Van Lanen
H. A.
(
2014
)
Standardized precipitation-evapotranspiration index (SPEI): Sensitivity to potential evapotranspiration model and parameters
. In:
Hydrology in a changing world (Vol. 363, pp. 367–373)
.
7th World Flow Regimes from International and Experimental Network Data-Water Conference
,
FRIEND-Water 2014, Montpellier, 7–10 October 2014
.
Stagge
J. H.
,
Tallaksen
L. M.
,
Gudmundsson
L.
,
Van Loon
A. F.
&
Stahl
K.
(
2015
)
Candidate distributions for climatological drought indices (SPI and SPEI)
.
International Journal of Climatology
35
(
13
),
4027
4040
.
https://doi.org/10.1002/joc.4267
.
Tan
M.
,
Song
X.
,
Yang
X.
&
Wu
Q.
(
2015
)
Support-vector-regression machine technology for total organic carbon content prediction from wireline logs in organic shale: A comparative study
.
Journal of Natural Gas Science and Engineering
26
,
792
802
.
https://doi.org/10.1016/j.jngse.2015.07.008
.
Terzi
Ö.
&
Barak
M.
(
2015
)
Runoff forecasting with wavelet-neural network approach: A case study of Kızılırmak River
.
Journal of Agricultural Sciences
21
(
4
).
https://doi.org/10.1501/Tarimbil_0000001356
.
Thornthwaite
C. W.
(
1948
)
An approach toward a rational classification of climate
.
Geographical Review
38
(
1
),
55
94
.
https://doi.org/10.2307/210739
.
Tirivarombo
S.
,
Osupile
D.
&
Eliasson
P.
(
2018
)
Drought monitoring and analysis: Standardised precipitation evapotranspiration index (SPEI) and standardised precipitation index (SPI)
.
Physics and Chemistry of the Earth, Parts A/B/C
106
,
1
10
.
https://doi.org/10.1016/j.pce.2018.07.001
.
Um
M. J.
,
Kim
Y.
,
Park
D.
&
Kim
J.
(
2017
)
Effects of different reference periods on drought index (SPEI) estimations from 1901 to 2014
.
Hydrology and Earth System Sciences
21
(
10
),
4989
5007
.
https://doi.org/10.5194/hess-21-4989-2017
.
Van der Meer
D.
,
Hoekstra
P. J.
,
Van Donkelaar
M.
,
Bralten
J.
,
Oosterlaan
J.
,
Heslenfeld
D.
,
Faraone
S. V.
,
Franke
B.
,
Buitelaar
J. K.
&
Hartman
C. A.
(
2017
)
Predicting attention-deficit/hyperactivity disorder severity from psychosocial stress and stress-response genes: A random forest regression approach
.
Translational Psychiatry
7
(
6
),
e1145
e1145
.
https://doi.org/10.1038/tp.2017.114
.
Vicente-Serrano
S. M.
,
Beguería
S.
&
López-Moreno
J. I.
(
2010
)
A multiscalar drought index sensitive to global warming: The standardized precipitation evapotranspiration index
.
Journal of Climate
23
(
7
),
1696
1718
.
https://doi.org/10.1175/2009JCLI2909.1
.
Vicente-Serrano
S. M.
,
Beguería
S.
&
Lopez-Moreno
J. I.
(
2011
)
Comment on ‘Characteristics and trends in various forms of the Palmer Drought Severity Index (PDSI) during 1900–2008’ by Aiguo Dai
.
https://doi.org/10.1029/2011JD016410
.
Vicente-Serrano
S. M.
,
Beguería
S.
,
Lorenzo-Lacruz
J.
,
Camarero
J. J.
,
López-Moreno
J. I.
,
Azorin-Molina
C.
,
Revuelto
J.
,
Morán-Tejeda
E.
&
Sanchez-Lorenzo
A.
(
2012
)
Performance of drought indices for ecological, agricultural, and hydrological applications
.
Earth Interactions
16
(
10
),
1
27
.
https://doi.org/10.1175/2012EI000434.1
.
Vicente-Serrano
S. M.
,
Aguilar
E.
,
Martínez
R.
,
Martín-Hernández
N.
,
Azorin-Molina
C.
,
Sánchez-Lorenzo
A.
,
El Kenawy
A.
,
Tomás-Burguera
M.
,
Moran-Tejeda
E.
,
López-Moreno
J. I.
,
Revuelto
J.
,
Beguería
S.
,
Nieto
J. J.
,
Drumond
A.
,
Gimeno
L.
&
Nieto
R.
(
2017
)
The complex influence of ENSO on droughts in Ecuador
.
Climate Dynamics
48
,
405
427
.
https://doi.org/10.1007/s00382-016-3082-y
.
Widiasari
I. R.
,
Nugoho
L. E.
&
Efendi
R.
(
2018
)
Context-based hydrology time series data for a flood prediction model using LSTM
. In:
2018 5th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE)
.
IEEE
, pp.
385
390
.
Wu
H.
,
Cai
Y.
,
Wu
Y.
,
Zhong
R.
,
Li
Q.
,
Zheng
J.
,
Lin
D.
&
Li
Y.
(
2017
)
Time series analysis of weekly influenza-like illness rate using a one-year period of factors in random forest regression
.
Bioscience Trends
11
(
3
),
292
296
.
https://doi.org/10.5582/bst.2017.01035
.
Wu
J.
,
Miao
C.
,
Tang
X.
,
Duan
Q.
&
He
X.
(
2018
)
A nonparametric standardized runoff index for characterizing hydrological drought on the Loess Plateau, China
.
Global and Planetary Change
161
,
53
65
.
https://doi.org/10.1016/j.gloplacha.2017.12.006
.
Yaseen
Z. M.
,
Naganna
S. R.
,
Sa'adi
Z.
,
Samui
P.
,
Ghorbani
M. A.
,
Salih
S. Q.
&
Shahid
S.
(
2020
)
Hourly river flow forecasting: Application of emotional neural network versus multiple machine learning paradigms
.
Water Resources Management
34
,
1075
1091
.
https://doi.org/10.1007/s11269-020-02484-w
.
Yuan
X.
,
Chen
C.
,
Lei
X.
,
Yuan
Y.
&
Muhammad Adnan
R.
(
2018
)
Monthly runoff forecasting based on LSTM–ALO model
.
Stochastic Environmental Research and Risk Assessment
32
,
2199
2212
.
https://doi.org/10.1007/s00477-018-1560-y
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).