This study proposes a hybrid discrete wavelet transform (DWT) and support vector machine (SVM) model (W-SVM) to enhance drought prediction accuracy compared to traditional SVM and autoregressive integrated moving average (ARIMA) models using the Standardized Precipitation Index (SPI). Fifty years of historical average monthly precipitation data from the Kabul province, Afghanistan, are analysed. The data are split into training and testing sets at an 80/20 ratio. Daubechies order 2 (db2) is selected as the mother wavelet function with three decomposition levels for SPI-3, -6, -9, and -12. Each decomposed element, including detail components (D1, D2, D3) and approximation (A3), is forecasted with an SVM model, and the final forecast is obtained by summing all levels. Statistical metrics such as root mean square error (RMSE), mean absolute error (MAE), and correlation coefficient (R2) are used for evaluation. Results demonstrate that W-SVM outperforms benchmark models, achieving lower RMSE and MAE values and higher R2 across all SPI values. These findings highlight DWT's effectiveness in enhancing SVM's capability for SPI pattern recognition. The proposed model provides accurate and timely drought predictions, making it a valuable tool for real-time drought monitoring and management, enabling informed decision-making to reduce drought impacts.

  • Hybrid W-SVM model improves drought prediction accuracy over traditional SVM and ARIMA models.

  • Uses 50 years of precipitation data from Kabul, Afghanistan, based on SPI indicators.

  • Wavelet decomposition (db2) enhances SVM's ability to capture complex drought patterns.

  • W-SVM shows lower RMSE, MAE, and higher R2 across all SPI values.

  • Contributes to better early drought warning systems for climate adaptation.

Drought, a natural phenomenon resulting from lower-than-average precipitation, directly and indirectly affects human and animal life. Natural disasters, including drought and floods, are among the costliest, accounting for 22% of economic damage (Wilhite et al. 2007). As a slow-developing event, drought has garnered significant attention from researchers and policymakers in recent decades due to its profound impact. Early warning, monitoring, and forecasting of drought in advance can mitigate its potential damage. Drought, categorized into meteorological, hydrological, agricultural, and socioeconomic types (Mishra & Singh 2011; Kikon & Deka 2022), highly depends on several variables. Factors such as soil moisture, temperature, low humidity, and rare precipitation lead to drought in specific regions (Quan et al. 2021). Among the various types of droughts, meteorological drought is considered a significant reduction in precipitation levels in a particular region (Deepa et al. 2022), as it can significantly increase evaporation rates and temperature and reduce soil moisture, leading to other types of droughts, such as agricultural drought (Faghmous & Kumar 2014; Luo et al. 2017).

Due to the complex nature of determining drought, several drought indices, including the Standardized Precipitation Index (SPI), Standardized Precipitation Evaporation Index (SPEI), and Palmer Drought Severity Index (PDSI) (McKee et al. 1993a, b; Vicente-Serrano et al. 2010; Dai 2011; Eslamian et al. 2017) have been developed for identifying drought. Among them, the SPI is widely used for meteorological drought identification because of its unique advantages, such as a simple calculation procedure that depends only on precipitation and its normalized characteristic, which makes it possible to apply in various geographical areas (McKee et al. 1993a, b; Karavitis et al. 2011).

In addition to autoregressive integrated moving average (ARIMA) and seasonal ARIMA (SARIMA) models and regression-based models which are still popular in hydrological time series forecasting, machine learning methods had gained more attention in various field such as engineering, finance, agriculture, medicine and environmental sciences. The rapid widespread adoption of machine learning methods can be attributed to their high computational power, speed, availability of sensor data, and advancements in big data analysis (Yaseen et al. 2021; Çoban et al. 2024). Among the machine learning methods used in this area we can name artificial neural network (ANN), support vector machine (SVM), adaptive neuro-fuzzy inference system (ANFIS), long short-term memory (LSTM) (Chiang & Tsai 2013; Han et al. 2013; Mokhtarzad et al. 2017; Poornima & Pushpalatha 2019; Rezaiy & Shabri 2024b).

In a study by Moghimi et al. (2020), various ARIMA models were used to forecast the 3-month Reconnaissance Drought Index (RDI) for South Iran between 1980 and 2010. They determined that the moving average (MA) model was the most accurate, based on Akaike's Information Criterion Corrected (AICC). In another study, Zhang et al. (2020) evaluated ARIMA, wavelet neural network (WNN), and SVM models for predicting the 12-month SPEI in Sanjiang Plain, China. The ARIMA model proved to be the most effective, achieving high R2 and Nash–Sutcliffe Efficiency Coefficient (NSE) values above 0.9, with lower mean squared error (MSE) and Kolmogorov–Smirnov (K–S) distance. This underscores ARIMA's effectiveness in managing stationary data and capturing key features of hydrological time series.

Many researchers like Chen et al. (2020) used ordinary least square regression to develop meteorological composite drought index in Hubei Province, China. Li et al. (2020a, b) used ridge regression and lasso regression, two types of penalized linear regression, to forecast the SPEI over 3-, 6-, 12-, and 24-month timescales in Northeast China. They compared these models to ordinary least squares regression and three ensemble-based models: decision tree (DT), adaptive boosting, and random forest (RF). Among these, lasso regression demonstrated the highest accuracy for forecasting SPEI for both short- and long-term droughts. In their 2021 study, Ghasemi et al. (2021) used Gaussian process regression (GPR), multi-layer perceptron, and general regression neural network models to predict the SPEI for 12 months in advance, with lead times of 1–3 months, in Iran. They assessed the models' performance using mean absolute error (MAE), root mean square error (RMSE), sum of squared errors (SSE), and the coefficient of determination (R2). The GPR model stood out as the most accurate, achieving the lowest RMSE and highest R2 values, although its accuracy declined with longer lead times. Regression-based models for drought forecasting have become less popular in recent years because of their limitations. These models tend to fall short when it comes to accurately predicting outcomes in the presence of nonlinear relationships between hydrological variables (Fung et al. 2020).

In a study aimed at improving drought forecasting in eastern Australia, researchers Deo et al. (2017) utilized three models – multivariate adaptive regression splines (MARS), least square support vector machine (LSSVM), and M5Tree – to predict the SPI. The MARS model, which incorporated rainfall and climate indices, proved to be the most effective at three out of five stations, significantly reducing forecasting errors. At the other stations, the M5Tree model outperformed both MARS and LSSVM, while LSSVM performed best for Bathurst. Introducing periodicity into the models enhanced their accuracy by up to 8.1% and significantly reduced errors. These findings highlight the importance of considering periodicity and reveal that model performance can vary based on geographic location and seasonal factors, reflecting the complexity of drought dynamics. Zhang et al. (2017) used three different models – ARIMA, ANN, and wavelet neural networks (WANN) – to forecast drought in the North Haihe River Basin, looking at SPI over two periods (SPI-6 and SPI-12). They evaluated how well each model worked using various tests and metrics such as the Kolmogorov–Smirnov test, Kendall rank correlation, and R2 values. The findings showed that WANN was the best model for predicting SPI-6 and SPI-12 in this region. Essam et al. (2022) tested various machine learning models, including SVM, ANN, and LSTM, to forecast streamflow for 11 rivers using data from Malaysia's Department of Irrigation and Drainage. The ANN3 model, which uses streamflow data from the previous 3 days, turned out to be the most effective. It performed the best in 4 of the 11 datasets and was consistently reliable. This makes it the top choice for predicting streamflow in the region. Different pre-processing methods like variational mode decomposition (VMD), bagging, boosting, bagging-VMD, and boosting-VMD were tested to predict river water levels, which is key for flood management and planning. The study used daily rainfall data from the Dungun River Basin in Malaysia. Boosting-VMD made a big difference in accuracy when paired with ANN and support vector regression (SVR). Among these, the Boosting-ANN model performed the best, with the lowest errors and highest accuracy (Tiu et al. 2022). Mokhtar et al. (2021) used several machine learning models – RF, Extreme Gradient Boost (XGB), convolutional neural network (CNN), and LSTM – to forecast drought on the Tibetan Plateau between 1980 and 2019, using the SPEI as a measure. Among them, the XGB and RF models showed the best performance, accurately predicting drought conditions for both 3-month (SPEI-3) and 6-month (SPEI-6) periods using a variety of climate data. To predict hydrological drought in the Gidra River, researchers used ANNs and SVM on 58 years of daily river discharge data. These models achieved perfect accuracy, highlighting their strong potential for early drought detection and effective water management strategies (Almikaeel et al. 2022).

Achite et al. (2023) applied four machine learning models – ANN, ANFIS, SVM, and DT – to predict hydrological drought in Algeria's Wadi Ouahrane basin. The models were assessed using metrics like RMSE, MAE, NSE, and R2. All models performed well, but the SVM model stood out, with an average RMSE of 0.28, MAE of 0.19, NSE of 0.86, and R2 of 0.90. For forecasting the Standardized Runoff Index (SRI) over a 12-month period, SVM achieved an R2 of 0.95, although its accuracy decreased with shorter timescales. Researchers also evaluated different machine learning models like ANN, SVR, SVR-PSO (particle swarm optimization), and SVR-RSM (response surface method) to predict monthly drought indices like SPI, normal precipitation (PN), Effective Drought Index (EDI), and Modified China-Z Index (MCZI). The SVR-RSM model, with its new calibration method, proved to be the most accurate. SPI had the best match with MCZI, while EDI forecasts were less precise. For short-term drought predictions in dry areas, SVR-RSM is the recommended model due to its strong performance (Piri et al. 2023).

Machine learning methods often find it challenging to forecast non-stationary drought time series (Belayneh et al. 2014). To tackle this issue, data preprocessing techniques such as wavelet transform and empirical mode decomposition can be helpful. Wavelet transform is particularly effective for breaking down complex data into more manageable pieces, making it easier to handle nonlinear and non-stationary time series (Hinge et al. 2022; Karbasi et al. 2022; Piri et al. 2023). By analyzing data at different resolution levels, wavelet transform can improve the performance of machine learning models, especially when using decomposed data instead of raw data. Belayneh et al. (2016) investigated how SVR and ANN models for drought prediction in the Awash River Basin of Ethiopia by combining wavelet transforms with bootstrap and boosting approaches. For SPI values (SPI-3, SPI-12, SPI-24) that represent varying drought durations, wavelet analysis increased the accuracy of drought predictions. RMSE, MAE, and R2 were used to assess the models' performance; the boosting ensemble technique improved the correlation between the observed and predicted SPIs. In terms of prediction accuracy, the wavelet boosting-ANN (WBS-ANN) and wavelet boosting SVR (WBS-SVR) models fared better than other model types overall. ANN models were used for drought forecasting in the Algerois basin, Algeria, and compared with ARIMA and SARIMA models. In order to improve accuracy, the study applied wavelet preprocessing (WANN) to the SPI across a range of temporal scales. Due to their greater accuracy and simplicity, the WANN models are highly recommended for drought early warning, regularly outperforming the other models in predicting SPI levels (Djerbouai & Souag-Gamane 2016). In addition to that, wavelet transform and Ensemble Empirical Mode Decomposition (EEMD) as two powerful data preprocessing techniques are used with ARIMA and SARIMA modeling to predict drought in Kabul and Herat provinces in Afghanistan. The combined models W-ARIMA (Rezaiy & Shabri 2023a) and EEMD-ARIMA (Rezaiy & Shabri 2024a, b) showed excellent results with lower RMSE, MAE, and higher R2 compared to traditional ARIMA/SARIMA.

Although preprocessing techniques like wavelet transform are commonly used in combination with both traditional and machine learning methods for hydrological time series prediction, their application in drought forecasting – especially with advanced machine learning models like SVM – has been relatively limited. Most existing hybrid models combining wavelet transforms with machine learning, such as WANN, WBS-ANN, or WBS-SVR, have focused on specific aspects of prediction accuracy, often neglecting the integration of computational efficiency and adaptability across varying timescales of SPI.

The novelty of the W-SVM model is that it can effectively combine wavelet transform with SVM to improve the prediction accuracy with less computational cost. Unlike previous models which often use ensemble method or limited to specific SPI timescales, the W-SVM model uses the strength of wavelet decomposition to preprocess and decompose the raw precipitation data into meaningful components. Each component is modeled using SVM and then combined to provide highly accurate prediction of SPI at multiple timescales (SPI-3, SPI-6, SPI-9, and SPI-12). This structured approach can capture the nonlinearity and nonstationarity of drought time series. Moreover, the study applies this hybrid model to Kabul province, Afghanistan where limited studies have applied such advanced method for drought forecasting. The W-SVM model fills the gaps and provides a new framework for drought forecasting with better performance than individual models such as ARIMA and SVM. The performance of proposed model is examined through statistical evaluation metrices like RMSE, MAE, and R2 on forecasting SPI series.

Study area and data

Afghanistan, a country in South Asia, has an arid and semi-arid climate. Water resources and agricultural watering are highly dependent on winter precipitation, which typically occurs as snowfall. This country has experienced several drought periods since 1970 that have adversely affected the lives of millions of people, directly or indirectly. The most recent significant drought reported in 2018 affected 10.5 to 17 million people, leading to difficulties in drinking water, lack of enough water for land irrigation, unemployment, and consequently migration both within and outside the country (Mayar 2021). Therefore, an early and accurate drought prediction system is vital to mitigate the negative impact of drought. It can help policymakers, water resource managers, and people to take precautionary measures.

Kabul, the most significant and populated city, is the capital of Afghanistan with the latitude and longitude coordinates of 34.543896 and 69.160652, respectively. It is home to around five million people and is located in Kabul Province in the eastern-central region of the country. The province is situated near the Hindu Kush Mountain range and has an arid to semi-arid climate. The Darya-e-Kabul (the Kabul River) crosses it from east to west. The Kabul province has many rivers and mountains that are the source of agricultural irrigation. In this study, the monthly average precipitation data from January 1970 to December 2019 was obtained from the World Bank Climate Change Knowledge Portal (https://climateknowledgeportal.worldbank.org). Among these 50 years of historical data, the first 80% is used for the training period, and the remaining 20% is used for the testing period. Figure 1 shows the case study area, Kabul province, Afghanistan.
Figure 1

The location map of the Kabul province, the case study area.

Figure 1

The location map of the Kabul province, the case study area.

Close modal

Standardized Precipitation Index

The SPI, initially proposed by McKee et al. (1995), is one of the most popular and frequently used drought indices. Its dependence only on precipitation data and simple calculation procedure have placed it among popular indices, and the World Meteorological Organization (WMO) recognized it as a standard drought index that can be applied in all climate regions. In contrast, other drought indices might require a large number of different types of data, such as rainfall, temperature, and soil moisture, which make the calculation process more sophisticated, and all types of data might not be easily available in all regions (Yihdego et al. 2019).

The applicability of the SPI for use in arid and semi-arid climates was carefully evaluated in this study. Since SPI relies on the gamma distribution for probability fitting, its performance can sometimes be affected by deviations from this distribution, particularly in regions with low and variable precipitation. A goodness-of-fit test was conducted to assess whether the precipitation data from Kabul province adhered to the gamma distribution. The results indicated slight deviations (p-value = 0.0062), which are commonly observed in real-world precipitation data from arid regions. Such deviations are consistent with findings in similar studies (Wu et al. 2007; Mahmoudi et al. 2022). Despite this, SPI has been widely recognized as a reliable tool for drought analysis, especially when used alongside suitable pre-processing methods.

To address these deviations, wavelet decomposition was applied to preprocess the precipitation data. This technique allowed multi-temporal variability to be captured effectively, reducing the impact of the observed deviations on SPI calculations. The combination of wavelet decomposition and the SPI's established reliability ensures that it remains a robust tool for drought assessment in the study area. Similar approaches have been successfully applied in other arid and semi-arid regions, as demonstrated in studies by (Belayneh et al. 2016) and (Zhang et al. 2020), further supporting the methodology employed in this research

For SPI computation, the gamma distribution function is fitted to the rainfall data, and then it is transformed to a normal distribution. The following equations show how the SPI is calculated. Suppose x the precipitation data in a specific period, the gamma distribution is defined as:
(1)
where the scale, shape variables, precipitation amount, and gamma function are represented by the letters , , x, and respectively. The best estimates of α and β borders are represented by Equations (2) and (3) (Guttman 1999).
(2)
In Equation (2), the A is defined as follow and the Probability Density Function (PDF) is derived from Equation (1) is expressed as
(3)
(4)
Since zero precipitation is possible in the dataset, the cumulative probability of precipitation data, whether the data includes zero or non-zero precipitation, can be determined as shown in Equation (5)
(5)
where is computed using m/n (The total number of days is n, and the number of days without precipitation is m (x = 0)), and q represents the likelihood of no precipitation and the number of zeros that occur in a rainfall time series.
If the cumulative probability is transformed into a standard normal distribution (Sönmez et al. 2005), the SPI will have a mean of zero and a variance of one, as indicated by Equations (6) and (7).
(6)
(7)

The constant values in Equation (6) and Equation (7) are C0 = 2.515517, C1 = 0.802853, C2 = 0.010328, D1 = 1.432788, D2 = 0.189269, and D3 = 0.001308.

To have a better insight into SPI values and their interpretation, Table 1 illustrates the SPI categorization.

Table 1

Drought and moisture classification using the SPI (Guttman 1999)

SPI valuesCategory
 Extremely wet 
 Very wet 
 Moderately wet 
 Nearly normal 
 Moderate drought 
 Severe drought 
 Extreme drought 
SPI valuesCategory
 Extremely wet 
 Very wet 
 Moderately wet 
 Nearly normal 
 Moderate drought 
 Severe drought 
 Extreme drought 

ARIMA

ARIMA, first introduced by Box & Jenkins (1976), is a popular and widely used model for time series forecasting. It consists of three key components: AR(p), I(d), and MA(q). The autoregressive part (AR) captures the relationship between past and current values, while the integration part (I) is used to convert non-stationary data into a stationary form. The MA component accounts for the relationship between past errors and current values. The non-seasonal ARIMA model presented in the Equations (8), (9), and (10):
(8)
(9)
(10)

In these equations, represents the observed time series, while and are polynomials of order p and q, respectively. The random errors, , are assumed to be independently and identically distributed with a mean of zero and constant variance. The differencing operation, , is used to make the data series stationary, where d indicates the number of regular differences applied.

When we introduce seasonal periodic fluctuations to a standard ARIMA model, it evolves into a SARIMA model. The SARIMA model is denoted by ARIMA(p, d, q)(P, D, Q)S, where (p, d, q) represents the non-seasonal components: the AR, differencing (I), and MA parts, respectively. Meanwhile, (P, D, Q)S captures the seasonal components of the model with a specific seasonal period S. The general formulation for the SARIMA model can be expressed as follows:
(11)

In the SARIMA model, P denotes the order seasonal AR part, and Q indicates the order of the seasonal MA part. D is the number of seasonal differencing needed to make the series stationary, and S specifies the length of the seasonal cycle. On the other hand, p represents the order of the non-seasonal AR component, q is the order of the non-seasonal MA component, and d is the number of non-seasonal differencing required to stabilize the series. For more information on how to implement ARIMA and SARIMA models, check out (Rezaiy & Shabri 2023b). It offers a clear guide on using these models, including steps for applying them and tips for getting the best results.

Support vector machine

SVM is a popular machine learning technique used for tasks like regression, classification, and pattern recognition. The core aim of SVM is to minimize errors and enhance the model's generalization capabilities. It's grounded in statistical learning theory, emphasizing structural risk minimization to improve performance. Due to its proven effectiveness and robustness, SVM has become a widely used tool across various fields (Vapnik 1995; Deka 2014).

Training data are represented as where is the input vector for the ith sample, and is the corresponding output value. The goal is to create a linear regression function based on these samples, which captures the relationship between the inputs and outputs. The formula for this function is:
(12)
In the Equation (12), w is the weight vector in the feature space, b is the bias term, and f is the nonlinear mapping function. To identify the best model, w is minimized using a method known as convex quadratic programming.
(13)
With the following constraints:
(14)

In Equations (13) and (14), and are slack variables that set limits on the output values. The model is optimized using the Lagrange function and includes a kernel function to handle non-linearities. The parameter C controls the trade-off between the model's complexity and its ability to generalize. Meanwhile, defines the acceptable error level, helping to keep the solution simple and sparse.

Kernel functions can be used to handle the nonlinearity in incoming data. By projecting the data into a higher-dimensional space, these functions control the intricate, nonlinear character of urban water demand. Next, the nonlinear SVM function model is obtained by means of the subsequent procedures:
(15)

In this context, is the kernel function, which defines the inner product between and in a higher-dimensional space. The variables and are Lagrange multipliers that emerge from solving the optimization problem.

Radial Basis Function (RBF) and polynomial kernel functions are used in this study as they have been shown to work well with nonlinear and complex relationships in hydrological time series data such as SPI. RBF kernel is good at capturing intricate nonlinear relationships by mapping the input data into a higher-dimensional feature space. This is important for drought prediction as drought patterns often have complex interactions with climate and temporal factors. Moreover, RBF kernel is flexible and makes minimal assumptions about the data distribution and is a robust choice for different datasets. Its hyperparameter gamma controls the influence of individual training samples and balances bias and variance (Cristianini & Shawe-Taylor 2000).

On the other hand, the polynomial kernel excels in modeling structured relationships through its ability to capture polynomial trends of varying complexity via its degree parameter. This flexibility is particularly beneficial for datasets with periodic or quasi-periodic trends, which are common in climatic time series. By adjusting the kernel's degree and coefficients, it can represent both linear and nonlinear relationships effectively. Previous studies, including those by Vapnik (1995), Cristianini & Shawe-Taylor (2000), and Smola & Schölkopf (2004) have highlighted the strengths of these kernels in applications involving nonlinear and structured data.

By leveraging the strengths of both kernels, this study ensures a comprehensive approach to modeling SPI data, where the RBF kernel addresses nonlinearity, and the polynomial kernel captures underlying structured relationships. This combination enhances the robustness and accuracy of the proposed drought forecasting framework across multiple timescales.

Discrete wavelet transform

Wavelet transform, which has emerged as a powerful data decomposition technique over the past two decades, is categorized into two types: discrete wavelet transform (DWT) and CWT (continuous wavelet transform) and the result from wavelet analysis appeared more accurate than Fourier transform (FT) in addressing non-stationary data set (Kisi 2011; Feng et al. 2015). DWT is used in this study for decomposing the original SPI because its computational procedure is more straightforward than that of CWT, and it is widely used in time series forecasting when combined with traditional and machine learning methods. The DWT formula can be written as follow:
(16)
In Equation (16), the controller of time and scale is illustrated by n and m, and is the mother wavelet. The parameters and are commonly used values. Mallat's theory suggests that the inverse DWT can decompose an original discrete time series into a series of linearly independent approximation and detail components. Mallat (1989) represented the inverse DWT using the following equation:
(17)

Proposed hybrid W-SVM model

The goal of the study is to introduce a combined wavelet and SVM (W-SVM) approach to improve drought prediction compared to the benchmarks of ARIMA and SVM. To achieve this goal, in the first step, the level of decomposition should be determined using the Equation (18) (Dawson et al. 2007):
(18)
where N is the number of SPI data and L is indicating the number of decomposed levels. Since there are 600 observations for each SPI, this study considers three levels of decomposition, which include three detail elements (D1, D2, and D3) and one approximate level (A3). Once the decomposition level is determined using a suitable mother wavelet function such as Daubechies, Symlet, Meyer, or Morlet (Benaouda et al. 2006; Nury Hasan & Alam 2017), each decomposed level is forecasted with the SVM model, and the final forecast is obtained by summing all the decomposed layers.
Daubechies with order 2 (db2) was chosen as the mother wavelet for this study because of its good properties and proven track record in hydrological time series analysis. db2 has compact support and 2 vanishing moments, which makes it well suited for signals with sharp transitions like precipitation data. This allows db2 to capture low frequency trends and high frequency variations in drought data (Daubechies 1992; Kisi 2010). Also, db2 has been used in meteorological and hydrological studies. For example, Rubio et al. (2023) and Nason & Wei (2024) showed its superiority in decomposing drought indices and improved forecasting. Preliminary tests comparing db2 with other mother wavelets such as Haar and Symlets showed that db2 had the best trade-off between temporal localization and frequency resolution and higher statistical metrics like RMSE and R2 for SPI data. This ensures that the proposed W-SVM model can handle nonlinear and non-stationary characteristics of SPI time series data across different time scales. Figure 2 shows the schematic overview of constructing the W-SVM model.
Figure 2

Construction of the W-SVM framework.

Figure 2

Construction of the W-SVM framework.

Close modal

Model evaluation metrics

Several statistical measures are used to evaluate the performance of the proposed model against its counterpart benchmarks. The mathematical formulas for the RMSE, MAE and coefficient of determination (R2) are presented in Equations (19), (20), and (21) respectively:
(19)
(20)
(21)
where the number of observations is indicated with n, the observation with Yi, mean of observation with , and the predicted values with . The RMSE looks at how much the predicted values deviate from the actual ones, showing the standard deviation of these predictions. MAE measures the average difference between the real and predicted values. Meanwhile, the R2 value reveals how well the model explains the variance in the observed data. For detailed information about the model evaluation metrics please refer to (Moriasi et al. 2007; Tongal 2013; Kontopoulou et al. 2023).
This study is conducted to improve drought prediction accuracy through integration of DWT and SVM (W-SVM) based on SPI. The first 80% of 50 years of historical precipitation data from Kabul province, Afghanistan, is used for training purposes, and the remaining 20% is used for testing the model. SPI values are calculated from the original average monthly precipitation data, with Figure 3 showing SPI-3, -6, -9, and -12 to provide a clearer understanding of the SPI trends in the region.
Figure 3

SPI values for monthly average precipitation data obtained from the Kabul province (1970–2019).

Figure 3

SPI values for monthly average precipitation data obtained from the Kabul province (1970–2019).

Close modal

The ARIMA model consists of three main stages: model identification, parameter estimation, and diagnostic checking. The best model is determined by the minimum Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), while the maximum likelihood approach is the most common method for model parameter estimation. The diagnostic checking process ensures that the residuals follow white noise and that the model is sufficient. For more information on how the ARIMA model can be used for drought forecasting, refer to (Rezaiy & Shabri 2023b).

The RBF kernel was utilized for SVM drought modeling on the original SPI data. For the decomposed elements (A3, D1, D2, D3), both RBF and polynomial kernels of order 5 were used because, in some cases, the RBF kernel and, in other cases, the SVM with a polynomial kernel of order 5 showed better accuracy for SPI prediction. All the analyses for SVM and W-SVM were done using the ‘fitrsvm’ function in MATLAB. The reconstruction series for the W-SVM model is done by first fitting a suitable SVM model to each decomposed component (A3, D1, D2, D3). Then, the forecast values for each of these elements are summed to achieve the W-SVM final forecast. Figure 4 shows the decomposition levels of SPI-12, and Table 2 compares the ability of the proposed W-SVM against the benchmark traditional ARIMA and SVM models.
Table 2

Accuracy comparison of W-SVM, SVM, and ARIMA models for drought prediction using statistical evaluation metrics

NameModelTraining
Testing
RMSEMAER2RMSMAER2
SPI-3 ARIMA 0.7100 0.5534 0.4922 0.6380 0.4734 0.5732 
SVM 0.6883 0.5024 0.5207 0.6550 0.5051 0.5476 
W-SVM 0.3002 0.2102 0.9124 0.3857 0.2567 0.8383 
SPI-6 ARIMA 0.4895 0.3779 0.7445 0.4399 0.3256 0.7942 
SVM 0.5126 0.3863 0.7130 0.5477 0.4052 0.6991 
W-SVM 0.2338 0.1584 0.9406 0.2153 0.1574 0.9529 
SPI-9 ARIMA 0.3789 0.2866 0.8402 0.3550 0.2561 0.8674 
SVM 0.4107 0.2974 0.8095 0.4277 0.3185 0.8385 
W-SVM 0.1898 0.1344 0.9594 0.1924 0.1401 0.9629 
SPI-12 ARIMA 0.2743 0.2010 0.9191 0.2468 0.1846 0.9411 
SVM 0.3368 0.2403 0.8739 0.3603 0.2380 0.8759 
W-SVM 0.1651 0.1127 0.9696 0.1949 0.1428 0.9640 
NameModelTraining
Testing
RMSEMAER2RMSMAER2
SPI-3 ARIMA 0.7100 0.5534 0.4922 0.6380 0.4734 0.5732 
SVM 0.6883 0.5024 0.5207 0.6550 0.5051 0.5476 
W-SVM 0.3002 0.2102 0.9124 0.3857 0.2567 0.8383 
SPI-6 ARIMA 0.4895 0.3779 0.7445 0.4399 0.3256 0.7942 
SVM 0.5126 0.3863 0.7130 0.5477 0.4052 0.6991 
W-SVM 0.2338 0.1584 0.9406 0.2153 0.1574 0.9529 
SPI-9 ARIMA 0.3789 0.2866 0.8402 0.3550 0.2561 0.8674 
SVM 0.4107 0.2974 0.8095 0.4277 0.3185 0.8385 
W-SVM 0.1898 0.1344 0.9594 0.1924 0.1401 0.9629 
SPI-12 ARIMA 0.2743 0.2010 0.9191 0.2468 0.1846 0.9411 
SVM 0.3368 0.2403 0.8739 0.3603 0.2380 0.8759 
W-SVM 0.1651 0.1127 0.9696 0.1949 0.1428 0.9640 
Figure 4

Decomposition of SPI-12 components using DWT.

Figure 4

Decomposition of SPI-12 components using DWT.

Close modal

From Table 2, it can be clearly observed that the proposed W-SVM model outperformed both the ARIMA and SVM models for all SPI values in terms of lower RMSE and MAE, and higher R2 values. A general overview of the results in the table shows that all three models – ARIMA, SVM, and W-SVM – improved prediction accuracy from short-term drought (SPI-3) to medium (SPI-6) and long-term (SPI-9, -12) drought scales. However, W-SVM outperformed the other benchmarks in short, mid, and long-term drought prediction.

Looking more closely at the results in the table, for example, in SPI-3 during the testing period, the ARIMA model had an RMSE of 0.6380, MAE of 0.4734, and R2 of 0.5732. The SVM model's RMSE, MAE, and R2 were 0.6550, 0.5051, and 0.5476, respectively. The evaluation performance criteria for the proposed W-SVM were RMSE: 0.3857, MAE: 0.2567, and R2: 0.8383. These numbers indicate that the proposed model improved drought prediction accuracy compared to ARIMA by 39.55% and compared to SVM by 41.11%, based on RMSE. The improvement for MAE compared to ARIMA and SVM is 45.76 and 49.18%, respectively. The W-SVM improved R2 by 38.78 and 34.68% compared to the traditional ARIMA and SVM models.

These promising results continued for mid and longer-period SPI. For example, in the testing period of SPI-12, the W-SVM demonstrated improved drought prediction accuracy compared to ARIMA, with reductions in RMSE by 21.03%, MAE by 22.64%, and an increase in R2 by 2.38%. The RMSE, MAE, and R2 for the SVM model in SPI-12 are 0.3603, 0.2380, and 0.8759, respectively, while these measures for W-SVM are 0.1949, 0.1428, and 0.9640, respectively, for RMSE, MAE, and R2. This means that the proposed W-SVM improved drought prediction accuracy. The improvement percentage is 45.91% for RMSE, 40% for MAE, and 9.14% for R2.

To gain better insight into the results of Table 2, a comparison of the proposed W-SVM and benchmark models versus observed data is conducted across all SPI values. Figure 5 illustrates that the W-SVM outperformed the benchmark ARIMA and SVM models with a closer fit to the original data. In addition to these time series plots, a Taylor diagram is also presented to support the results of Table 2 and Figure 5. The Taylor diagram (Taylor 2001) is a useful graphical method for summarizing how different models match the observed values based on correlation, centered RMSE, and the amplitude of variation indicated by standard deviation. For SPI-3, -6, -9, and -12, it is presented in Figure 6. It can be clearly seen that the proposed model closely captures the observed data compared to the ARIMA and SVM models.
Figure 5

Comparison of W-SVM, SVM, and ARIMA models with observed values across different SPI levels.

Figure 5

Comparison of W-SVM, SVM, and ARIMA models with observed values across different SPI levels.

Close modal
Figure 6

Model comparison for each SPI data using Taylor diagrams.

Figure 6

Model comparison for each SPI data using Taylor diagrams.

Close modal

Limitations and future work

While the proposed W-SVM model demonstrates significant improvements in drought forecasting accuracy, it is not without limitations. A key challenge lies in the reliance on the SPI, which, as highlighted by Wu et al. (2007), can introduce inaccuracies in arid and semi-arid regions due to its dependence on a gamma distribution. Mahmoudi et al. (2022) suggested adjustments to SPI for better representation of drought conditions in these regions, while Isia et al. (2022) emphasized its limitations in capturing drought severity, particularly under irregular rainfall patterns. These studies underscore the need for careful consideration of SPI's assumptions when applied to such climates.

The integration of wavelet transform with SVM in this study aimed to address some of SPI's shortcomings by decomposing multi-temporal precipitation patterns. This approach enhances SPI's utility in arid environments, but its performance in temperate regions, where SPI is inherently more robust, has yet to be explored. Extending this model to temperate zones may offer additional insights into its adaptability and reliability across diverse climatic conditions.

Another limitation of the W-SVM model lies in its validation within a single semi-arid case study region. While the results indicate the model's potential for generalization, further research is required to confirm its applicability in other semi-arid and arid regions with varying climatic and hydrological characteristics. Testing the model in regions such as the Middle East, North Africa, and Central Asia would help establish its robustness. Similarly, adapting the model for use with other drought indices, such as the PDSI or RDI, could further broaden its applicability.

The computational complexity of wavelet decomposition and SVM modeling may limit its applicability to real-world problems. Future research should focus on developing computationally efficient variants of the model or exploring hybrid approaches that integrate W-SVM with advanced deep learning techniques.

Therefore, the following areas are proposed for future research:

  • (i) Refining SPI or integrating alternative drought indices to address limitations in arid and semi-arid regions.

  • (ii) Testing the model's generalizability across diverse climatic zones, including temperate regions.

  • (iii) Developing computationally efficient methodologies to ensure broader accessibility and usability.

  • (iv) Exploring hybrid models that integrate DWT with deep learning methods such as LSTM and CNNs to further enhance prediction accuracy.

  • (v) Applying different wavelet mother functions and comparing the prediction performance of other preprocessing techniques like EMD and EEMD with DWT in creating combined models for drought forecasting.

Time series drought prediction is vital for better water supply management and precautionary actions against the potential negative effects of drought. Drought, which can be categorized through drought indices like SPI, may exhibit complex patterns such as nonstationarity and nonlinearity, making accurate predictions more complex. This is because statistical and machine learning methods have limitations in capturing these non-stationary and nonlinear characteristics. This study aimed to develop a new approach for drought prediction that combines DWT with SVM (W-SVM) using 50 years of historical data from Kabul province, Afghanistan, which has been considered a drought-prone area in recent decades.

The findings of the study revealed that the proposed model outperformed the benchmark models of SVM and ARIMA across short, mid, and long-term droughts. In SPI-3, the W-SVM improved drought prediction accuracy by about 39.55 and 41.11% compared to ARIMA and SVM according to RMSE, and by 45.76 and 49.18% according to MAE, respectively. The R2 for the W-SVM showed an improvement of 38.78 and 34.68% compared to the benchmark ARIMA and SVM models. This promising performance is also valid for SPI-6, SPI-9, and SPI-12. For instance, the W-SVM improved drought prediction accuracy by 21.03% (RMSE), 22.64% (MAE), and 2.38% (R2) compared to the ARIMA model during the testing period. Meanwhile, these statistics against the SVM models show increased accuracy of 45.91% (RMSE), 40% (MAE), and 9.14% (R2).

Another interesting result is that all three models (ARIMA, SVM, and W-SVM) demonstrated greater capability in capturing longer drought prediction patterns. As we move from SPI-3 (short-term) to SPI-6 (mid-term) and SPI-9 and 12 (long-term) droughts, the RMSE and MAE of the models decrease, while the R2 increases. This indicates that W-SVM, as well as traditional ARIMA and SVM, have a better ability to capture longer drought patterns, with the proposed model surpassing the other two benchmarks across all SPI values.

The first author would like to express gratitude to the Kabul Education University (KEU) for the study leave and Afghanistan's Ministry of Higher Education for the scholarship.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Achite
M.
,
Elshaboury
N.
,
Jehanzaib
M.
,
Vishwakarma
D. K.
,
Pham
Q. B.
,
Anh
D. T.
,
Abdelkader
E. M.
&
Elbeltagi
A
. (
2023
)
Performance of machine learning techniques for meteorological drought forecasting in the wadi mina basin, Algeria
,
Water
,
15
(
4
),
765
.
https://doi.org/10.3390/w15040765
.
Almikaeel
W.
,
Čubanová
L.
&
Šoltész
A.
(
2022
)
Hydrological drought forecasting using machine learning – gidra river case study
,
Water
,
14
(
3
),
387
.
https://doi.org/10.3390/w14030387
.
Belayneh
A.
,
Adamowski
J.
,
Khalil
B.
&
Ozga-Zielinski
B. J. J. O. H
. (
2014
)
Long-term SPI drought forecasting in the awash river basin in Ethiopia using wavelet neural network and wavelet support vector regression models
,
Journal of Hydrology
,
508
, 418–429.
https://doi.org/10.1016/j.jhydrol.2013.10.052
.
Belayneh
A.
,
Adamowski
J.
,
Khalil
B.
&
Quilty
J
. (
2016
)
Coupling machine learning methods with wavelet transforms and the bootstrap and boosting ensemble approaches for drought prediction
,
Atmospheric Research
,
172–173, 37–47. https://doi.org/10.1016/j.atmosres.2015.12.017
.
Benaouda
D.
,
Murtagh
F.
,
Starck
J. L.
&
Renaud
O
. (
2006
)
Wavelet-based nonlinear multiscale decomposition model for electricity load forecasting
,
Neurocomputing
,
70
(
1 -3
),
139
-
154
.
https://doi.org/10.1016/j.neucom.2006.04.005
.
Box
G. E. P.
&
Jenkins
G. M.
(
1976
)
Time Series Analysis: Forecasting and Control
.
San Francisco
:
Holden-Day
.
Chen
S.
,
Zhong
W.
,
Pan
S.
,
Xie
Q.
&
Kim
T. W
. (
2020
)
Comprehensive drought assessment using a modified composite drought index: a case study in Hubei province, China
,
Water
,
12
(
2
),
462
.
https://doi.org/10.3390/w12020462
.
Chiang
J. L.
&
Tsai
Y. S.
(
2013
)
Reservoir drought prediction using Two-Stage SVM
,
Applied Mechanics and Materials
,
1473–1477
,
284
287
.
https://doi.org/10.4028/www.scientific.net/AMM.284-287.1473
.
Çoban
Ö.
,
Eşit
M.
&
Yalçın
S.
(
2024
)
ML-DPIE: comparative evaluation of machine learning methods for drought parameter index estimation: a case study of türkiye
,
Natural Hazards
,
120
(
2
),
989
1021
.
https://doi.org/10.1007/s11069-023-06233-1
.
Cristianini
N.
&
Shawe-Taylor
J.
(
2000
)
An introduction to support vector machines and other kernel-based learning methods
,
An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
[Preprint], https://doi.org/10.1017/CBO9780511801389
.
Dai
A.
(
2011
)
Characteristics and trends in various forms of the palmer drought severity index during 1900–2008
,
Journal of Geophysical Research
,
116
(
D12
),
D12115
.
https://doi.org/10.1029/2010JD015541
.
Daubechies
I.
(
1992
)
Ten Lectures on Wavelets
.
Society for industrial and applied mathematics
,
Philadelphia, PA, USA
.
Dawson
C. W.
,
Abrahart
R. J.
&
See
L. M.
(
2007
)
Hydrotest: a web-based toolbox of evaluation metrics for the standardised assessment of hydrological forecasts
,
Environmental Modelling & Software
,
22
(
7
),
1034
1052
.
https://doi.org/10.1016/j.envsoft.2006.06.008
.
Deepa
D.
,
Prakash
B. V. S.
,
Neeraj
C. S.
,
Akhil
K.
&
Sasireka
K
. (
2022
)
Evaluation of meteorological drought impact using drought indices
,
145
156
.
https://doi.org/10.1007/978-981-16-5839-6_13
.
Deka
P. C.
(
2014
)
Support vector machine applications in the field of hydrology: a review
,
Applied Soft Computing
,
19
,
372
386
.
https://doi.org/10.1016/j.asoc.2014.02.002
.
Deo
R. C.
,
Kisi
O.
&
Singh
V. P.
(
2017
)
Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and m5tree model
,
Atmospheric Research
,
184
,
149
175
.
https://doi.org/10.1016/j.atmosres.2016.10.004
.
Djerbouai
S.
&
Souag-Gamane
D.
(
2016
)
Drought forecasting using neural networks, wavelet neural networks, and stochastic models: case of the algerois basin in north Algeria
,
Water Resources Management
,
30
(
7
),
2445
2464
.
https://doi.org/10.1007/s11269-016-1298-6
.
Eslamian
S.
,
Ostad-Ali-Askari
K.
,
Singh
V. P.
,
Dalezios
N. R.
,
Ghane
M.
,
Yihdego
Y.
&
Matouq
M
. (
2017
)
A review of drought indices
,
International Journal of Constructive Research in Civil Engineering
,
3
(
4
), 48–66.
https://doi.org/10.20431/2454-8693.0304005
.
Essam
Y.
,
Huang
Y. F.
,
Ng
J. L.
,
Birima
A. H.
,
Ahmed
A. N.
&
El-Shafie
A
. (
2022
)
Predicting streamflow in peninsular Malaysia using support vector machine and deep learning algorithms
,
Scientific Reports
,
12
(
1
),
3883
.
https://doi.org/10.1038/s41598-022-07693-4
.
Faghmous
J. H.
&
Kumar
V.
(
2014
)
A Big data guide to understanding climate change: the case for theory-Guided data science
,
Big Data
,
2
(
3
),
155
163
.
https://doi.org/10.1089/big.2014.0026
.
Feng
Q.
,
Wen
X.
&
Li
J.
(
2015
)
Wavelet analysis-Support vector machine coupled models for monthly rainfall forecasting in arid regions
,
Water Resources Management
,
29
(
4
),
1049
1065
.
https://doi.org/10.1007/s11269-014-0860-3
.
Fung
K. F.
,
Huang
Y. F.
,
Koo
C. H.
&
Soh
Y. W
. (
2020
)
Drought forecasting: a review of modelling approaches 2007 -2017
,
Journal of Water and Climate Change
,
11
(
3
),
771
799
.
https://doi.org/10.2166/wcc.2019.236
.
Ghasemi
P.
,
Karbasi
M.
,
Nouri
A. Z.
,
Tabrizi
M. S.
&
Azamathulla
H. M
. (
2021
)
Application of Gaussian process regression to forecast multi-step ahead SPEI drought index
,
Alexandria Engineering Journal
,
60
(
6
),
5375
5392
.
https://doi.org/10.1016/j.aej.2021.04.022
.
Guttman
N. B.
(
1999
)
Accepting the standardized precipitation index: a calculation algorithm 1
,
JAWRA Journal of the American Water Resources Association
,
35
(
2
),
311
322
.
https://doi.org/10.1111/j.1752-1688.1999.tb03592.x
.
Han
P.
,
Wang
P.
,
Tian
M.
,
Zhang
S.
,
Liu
J.
&
Zhu
D
. (
2013
)
Application of the ARIMA models in drought forecasting using the standardized precipitation index
.
In Computer and Computing Technologies in Agriculture VI: 6th IFIP WG 5.14 International Conference, CCTA 2012, Zhangjiajie, China, October 19–21, 2012, Revised Selected Papers, Part I (pp. 352–358). Springer Berlin Heidelberg
.
Hinge
G.
,
Piplodiya
J.
,
Sharma
A.
,
Hamouda
M. A.
&
Mohamed
M. M
. (
2022
)
Evaluation of hybrid wavelet models for regional drought forecasting
,
Remote Sensing
,
14
(
24
),
6381
.
https://doi.org/10.3390/RS14246381/S1
.
Isia
I.
,
Hadibarata
T.
,
Jusoh
M. N. H.
,
Bhattacharjya
R. K.
,
Shahedan
N. F.
,
Bouaissi
A.
,
Fitriyani
N. L.
&
Syafrudin
M
. (
2022
)
Drought analysis based on standardized precipitation evapotranspiration index and standardized precipitation index in Sarawak, Malaysia
,
Sustainability
2023
,
15
(1), 734. https://doi.org/10.3390/SU15010734
.
Karavitis
C. A.
,
Alexandris
S.
,
Tsesmelis
D. E.
&
Athanasopoulos
G
. (
2011
)
Application of the standardized precipitation index (SPI) in Greece
,
Water
,
3
(
3
),
787
805
.
https://doi.org/10.3390/w3030787
.
Karbasi
M.
,
Karbasi
M.
,
Jamei
M.
,
Malik
A.
&
Azamathulla
H. M
. (
2022
)
Development of a new wavelet-based hybrid model to forecast multi-scalar SPEI drought index (case study: Zanjan city, Iran)
,
Theoretical and Applied Climatology
,
147
(
1–2
),
499
522
.
https://doi.org/10.1007/S00704-021-03825-4/FIGURES/18
.
Kikon
A.
&
Deka
P. C.
(
2022
)
Artificial intelligence application in drought assessment, monitoring and forecasting: a review
,
Stochastic Environmental Research and Risk Assessment
,
36
(
5
),
1197
1214
.
https://doi.org/10.1007/s00477-021-02129-3
.
Kisi
O.
(
2010
)
Wavelet regression model for short-term streamflow forecasting
,
Journal of Hydrology
,
389
(
3–4
),
344
353
.
https://doi.org/10.1016/j.jhydrol.2010.06.013
.
Kisi
O.
(
2011
)
Wavelet regression model as an alternative to neural networks for river stage forecasting
,
Water Resources Management
,
25
(
2
),
579
600
.
https://doi.org/10.1007/s11269-010-9715-8
.
Kontopoulou
V. I.
,
Panagopoulos
A. D.
,
Kakkos
I.
&
Matsopoulos
G. K
. (
2023
)
A review of ARIMA vs. machine learning approaches for time series forecasting in data driven networks
,
Future Internet
,
15
(
8
),
255
.
https://doi.org/10.3390/fi15080255
.
Li
J.
,
Zhang
S.
,
Huang
L.
,
Zhang
T.
&
Feng
P
. (
2020a
)
Drought prediction models driven by meteorological and remote sensing data in guanzhong area, China
,
Hydrology Research
,
51
(
5
),
942
958
.
https://doi.org/10.2166/nh.2020.184
.
Li
Z.
,
Chen
T.
,
Wu
Q.
,
Xia
G.
&
Chi
D
. (
2020b
)
Application of penalized linear regression and ensemble methods for drought forecasting in northeast China
,
Meteorology and Atmospheric Physics
,
132
(
1
),
113
130
.
https://doi.org/10.1007/s00703-019-00675-8
.
Luo
L.
,
Apps
D.
,
Arcand
S.
,
Xu
H.
,
Pan
M.
&
Hoerling
M
. (
2017
)
Contribution of temperature and precipitation anomalies to the California drought during 2012 -2015
,
Geophysical Research Letters
,
44
(
7
),
3184
3192
.
https://doi.org/10.1002/2016GL072027
.
Mahmoudi
P.
,
Ghaemi
A.
,
Rigi
A.
&
Amir Jahanshahi
S. M
. (
2022
)
Retraction note to: recommendations for modifying the standardized precipitation index (SPI) for drought monitoring in arid and semi-arid regions
,
Water Resources Management
,
36
(
15
),
6223
.
https://doi.org/10.1007/S11269-022-03379-8/METRICS
.
Mallat
S. G.
(
1989
)
A theory for multiresolution signal decomposition: the wavelet representation
,
IEEE Transactions on Pattern Analysis and Machine Intelligence
,
11
(
7
),
674
693
.
https://doi.org/10.1109/34.192463
.
Mayar
M. A.
(
2021
)
Droughts on the Horizon: can Afghanistan Manage This Risk?
Afghanistan-analysts.org
,
Kabul, Afghanistan
.
McKee
T. B.
,
Doesken
N. J.
&
Kleist
J.
(
1993a
)
Analysis of standardized precipitation index (SPI) data for drought assessment
,
Water (Switzerland)
,
26
(
2
),
1
72
.
https://doi.org/10.1088/1755-1315/5
.
McKee
T. B.
,
Doesken
N. J.
&
Kleist
J.
(
1993b
). '
The Relationship of Drought Frequency and Duration to Time Scales
’,
8th Conference on Applied Climatology
.
Anaheim
.
McKee
T. B.
,
Doesken
N. J.
&
Kleist
J.
(
1995
). '
Drought monitoring with multiple time scales
’,
Ninth Conference on Applied Climatology
.
American Meteorological Society
, pp.
233
236
.
Mishra
A. K.
&
Singh
V. P.
(
2011
)
Drought modeling – A review
,
Journal of Hydrology
,
403
,
157
175
.
Elsevier. https://doi.org/10.1016/j.jhydrol.2011.03.049
.
Moghimi
M. M.
,
Zarei
A. R.
&
Mahmoudi
M. R.
(
2020
)
Seasonal drought forecasting in arid regions, using different time series models and rdi index
,
Journal of Water and Climate Change
,
11
(
3
),
633
654
.
https://doi.org/10.2166/wcc.2019.009
.
Mokhtar
A.
,
Jalali
M.
,
He
H.
,
Al-Ansari
N.
,
Elbeltagi
A.
,
Alsafadi
K.
,
Abdo
H. G.
,
Sammen
S. S.
,
Gyasi-Agyei
Y.
&
Rodrigo-Comino
J
. (
2021
)
Estimation of SPEI meteorological drought using machine learning algorithms
,
IEEE Access
,
9
,
65503
65523
.
https://doi.org/10.1109/ACCESS.2021.3074305
.
Mokhtarzad
M.
,
Eskandari
F.
,
Vanjani
N. J.
&
Arabasadi
A.
(
2017
)
Drought forecasting by ANN, ANFIS, and SVM and comparison of the models
,
Environmental Earth Sciences
,
76
(
21
),
729
.
https://doi.org/10.1007/s12665-017-7064-0
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
(
2007
)
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
,
Transactions of the ASABE
,
50
(
3
),
885
900
.
https://doi.org/10.13031/2013.23153
.
Nason
G. P.
&
Wei
J. L.
(
2024
)
‘Leveraging Non-Decimated Wavelet Packet Features and Transformer Models for Time Series Forecasting’, arXiv.org [Preprint]. https://doi.org/10.48550/ARXIV.2403.08630
.
Nury
A. H.
,
Hasan
K.
&
Alam
M. J. B.
(
2017
)
Comparative study of wavelet-ARIMA and wavelet-ANN models for temperature time series data in northeastern Bangladesh
,
Journal of King Saud University – Science
,
29
(
1
),
47
61
.
https://doi.org/10.1016/j.jksus.2015.12.002
.
Piri
J.
,
Abdolahipour
M.
&
Keshtegar
B.
(
2023
)
Advanced machine learning model for prediction of drought indices using hybrid SVR-RSM
,
Water Resources Management
,
37
(
2
),
683
712
.
https://doi.org/10.1007/s11269-022-03395-8
.
Poornima
S.
&
Pushpalatha
M.
(
2019
)
Drought prediction based on SPI and SPEI with varying timescales using LSTM recurrent neural network
,
Soft Computing
,
23
(
18
),
8399
8412
.
https://doi.org/10.1007/s00500-019-04120-1
.
Quan
Q.
,
Gao
S.
,
Shang
Y.
&
Wang
B
. (
2021
)
Assessment of the sustainability of Gymnocypris eckloni habitat under river damming in the source region of the Yellow River
,
Science of The Total Environment
,
778
,
146312
.
https://doi.org/10.1016/j.scitotenv.2021.146312
.
Rezaiy
R.
&
Shabri
A.
(
2023a
)
Drought forecasting using W-ARIMA model with standardized precipitation index
,
Journal of Water and Climate Change
,
14
(
9
),
3345
3367
.
https://doi.org/10.2166/wcc.2023.431
.
Rezaiy
R.
&
Shabri
A.
(
2023b
)
‘Using the ARIMA/SARIMA model for Afghanistan's drought forecasting based on standardized precipitation index’
,
MATEMATIKA
,
39
(
3
),
239
261
.
https://doi.org/10.11113/matematika.v39.n3.1478
.
Rezaiy
R.
&
Shabri
A.
(
2024a
)
Enhancing drought prediction precision with EEMD-ARIMA modeling based on standardized precipitation index
,
Water Science & Technology
,
89
(
3
),
745
770
.
https://doi.org/10.2166/wst.2024.028
.
Rezaiy
R.
&
Shabri
A.
(
2024b
)
Improving drought prediction accuracy: a hybrid EEMD and support vector machine approach with standardized precipitation index
,
Water Resources Management
38, 5255–5277. [Preprint], https://doi.org/10.1007/s11269-024-03912-x
.
Rubio
L.
,
Palacio Pinedo
A.
,
Mejía Castaño
A.
&
Ramos
F
. (
2023
)
Forecasting volatility by using wavelet transform, ARIMA and GARCH models
,
Eurasian Economic Review
,
13
(
3–4
),
803
830
.
https://doi.org/10.1007/S40822-023-00243-X/FIGURES/13
.
Smola
A. J.
&
Schölkopf
B.
(
2004
)
A tutorial on support vector regression
,
Statistics and Computing
,
14
(
3
),
199
222
.
https://doi.org/10.1023/B:STCO.0000035301.49549.88/METRICS
.
Sönmez
F. K.
,
Kömüscü
A. Ü.
,
Erkan
A.
&
Turgu
E
. (
2005
)
An analysis of spatial and temporal dimension of drought vulnerability in Turkey using the standardized precipitation index
,
Natural Hazards
,
35
(
2
),
243
264
.
https://doi.org/10.1007/s11069-004-5704-7
.
Taylor
K. E.
(
2001
)
Summarizing multiple aspects of model performance in a single diagram
,
Journal of Geophysical Research: Atmospheres
,
106
(
D7
),
7183
7192
.
https://doi.org/10.1029/2000JD900719
.
Tiu
E. S. K.
,
Huang
Y. F.
,
Ng
J. L.
,
AlDahoul
N.
,
Ahmed
A. N.
&
Elshafie
A
. (
2022
)
An evaluation of various data pre-processing techniques with machine learning models for water level prediction
,
Natural Hazards
,
110
(
1
),
121
153
.
https://doi.org/10.1007/s11069-021-04939-8
.
Tongal
H.
(
2013
)
Nonlinear forecasting of stream flows using a chaotic approach and artificial neural networks
,
Earth Sciences Research Journal
,
17
(
2
),
119
126
.
Vapnik
V. N.
(
1995
)
The Nature of Statistical Learning Theory
.
New York
:
Springer-Verlag
.
Vicente-Serrano
S. M.
,
Beguería
S.
&
López-Moreno
J. I.
(
2010
)
A multiscalar drought index sensitive to global warming: the standardized precipitation evapotranspiration index
,
Journal of Climate
,
23
(
7
),
1696
1718
.
https://doi.org/10.1175/2009JCLI2909.1
.
Wilhite
D. A.
,
Svoboda
M. D.
&
Hayes
M. J.
(
2007
)
Understanding the complex impacts of drought: a key to enhancing drought mitigation and preparedness
,
Water Resources Management
,
21
(
5
),
763
774
.
https://doi.org/10.1007/s11269-006-9076-5
.
Wu
H.
,
Svoboda
M. D.
,
Hayes
M. J.
,
Wilhite
D. A.
&
Wen
F
. (
2007
)
Appropriate application of the standardized precipitation index in arid locations and dry seasons
,
International Journal of Climatology
,
27
(
1
),
65
79
.
https://doi.org/10.1002/joc.1371
.
Yaseen
Z. M.
,
Ali
M.
,
Sharafati
A.
,
Al-Ansari
N.
&
Shahid
S
. (
2021
)
Forecasting standardized precipitation index using data intelligence models: regional investigation of Bangladesh
,
Scientific Reports
,
11
(
1
),
3435
.
https://doi.org/10.1038/s41598-021-82977-9
.
Yihdego
Y.
,
Vaheddoost
B.
&
Al-Weshah
R. A.
(
2019
)
Drought indices and indicators revisited
,
Arabian Journal of Geosciences
,
12
(
3
),
1
12
.
https://doi.org/10.1007/s12517-019-4237-z
.
Zhang
Y.
,
Li
W.
,
Chen
Q.
,
Pu
X.
&
Xiang
L
. (
2017
)
Multi-models for SPI drought forecasting in the north of Haihe River Basin, China
,
Stochastic Environmental Research and Risk Assessment
,
31
(
10
),
2471
2481
.
https://doi.org/10.1007/s00477-017-1437-5
.
Zhang
Y.
,
Yang
H.
,
Cui
H.
&
Chen
Q
. (
2020
)
Comparison of the ability of ARIMA, WNN and SVM models for drought forecasting in the Sanjiang Plain, China
,
Natural Resources Research
,
29
(
2
).
https://doi.org/10.1007/s11053-019-09512-6
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).