Using the existing measures for training numerical (non-categorical) prediction models can cause misclassification of droughts. Thus, developing a drought category-based measure is critical. Moreover, the existing fixed drought category thresholds need to be improved. The objective of this research is to develop a category-based scoring support vector regression (CBS-SVR) model based on an improved drought categorization method to overcome misclassification in drought prediction. To derive variable threshold levels for drought categorization, K-means (KM) and Gaussian mixture (GM) clustering are compared with the traditional drought categorization. For drought prediction, CBS-SVR is performed by using the best categorization method. The new drought model was applied to the Red River of the North Basin (RRB) in the USA. In the model training and testing, precipitation, temperature, and actual evapotranspiration were selected as the predictors, and the target variables consisted of multivariate drought indices, as well as bivariate and univariate standardized drought indices. Results indicated that the drought categorization method, variable threshold levels, and the type of drought index were the major factors that influenced the accuracy of drought prediction. The CBS-SVR outperformed the support vector classification and traditional SVR by avoiding overfitting and miscategorization in drought prediction.

  • Developed a category-based scoring support vector regression (CBS-SVR) method for drought prediction.

  • Improved drought prediction by avoiding overfitting and miscategorization.

  • The CBS-SVR outperformed the traditional SVR.

  • The new method outperformed the support vector classification.

Drought prediction provides early warnings of drought development and valuable drought mitigation information for stakeholders. The data-driven methods widely used for drought prediction include time-series analysis (Rao & Padmanabhan 1984; Mishra & Desai 2005; Modarres 2007; Fernández et al. 2009; Durdu 2010), linear regression (Liu & Juárez 2001; Panu & Sharma 2002; Barros & Bowden 2008; Sun & Kim 2016) and nonlinear regression (Hwang & Carbone 2009; Liu & Hwang 2015), artificial neural network (ANN) (Mishra & Desai 2006; Mishra et al. 2007; Morid et al. 2007; Barua et al. 2012; Santos et al. 2014; Yang et al. 2015), Markov chain analysis (Lohani & Loganathan 1997; Cancelliere et al. 2007; Paulo & Pereira 2007, 2008; Sharma & Panu 2012), and probabilistic forecasting (Madadgar & Moradkhani 2013; Hao & Singh 2016).

Regression models are used if the predictand (drought index) is continuous. For drought prediction, support vector machine (SVM) for regression or support vector regression (SVR) has been suggested since it has some advantages over other prediction models. For example, the SVR can overcome some limitations of ANN such as local maxima and overfitting, and outperform ANN to some extent for drought prediction (Ganguli & Reddy 2014). It also has the ability to learn from a much smaller dataset for training and is capable of handling a large number of variables (Hao et al. 2018). The success or failure of a prediction model depends on various factors such as the selection of predictors. Shamshirband et al. (2020) showed a relatively worse performance of SVR than other drought prediction models when meteorological drought indices were used to predict hydrological drought. Mohamadi et al. (2020) compared different drought prediction models and found that a hybrid SVM showed a relatively lower capability in predicting droughts among the models tested in their study.

There are various hyperparameters in an SVR model. Penalty factor determines the trade-off between the model complexity and the training error (Joachims 2002). Parameter epsilon controls the width of the epsilon-insensitive zone, which is used to fit the training data. The optimal value of epsilon scales linearly with gamma (Schölkopf & Smola 2002), a Kernel parameter used to reduce the model space and control the complexity of the solution (Kisi & Cimen 2011). Therefore, tuning these hyperparameters is crucial in defining decision boundaries and the success of an SVR model. These hyperparameters have been selected by trial and error in some studies for drought prediction (Belayneh et al. 2014; Feng et al. 2019). Cross-validation methods have been also used for tuning these hyperparameters. Grid search (Larochelle et al. 2007) is one of the well-accepted cross-validation methods in SVR (Bergstra & Bengio 2012) for drought prediction (Deo et al. 2016a, 2016b; Deo et al. 2017).

To assess the prediction skill of a regression model, the regression results are commonly compared with a reference (i.e., actual values). Mean absolute error (MAE), mean square error (MSE), root mean square error (RMSE), and coefficient of determination (R2) are among the most commonly-used performance measures (Sundararajan et al. 2021). Although these performance metrics enhance the preciseness and predictive skills of models, they may cause misclassification (hereafter miscategorization). For instance, the objective in a cross-validation effort for tuning hyperparameters in SVR is to select the values of hyperparameters so that lower MAE, MSE, and RMSE or higher R2 are achieved. Although it can reduce the difference between the actual and predicted values of drought indices and increase the mathematical preciseness of the prediction model, it may also lead to miscategorization of the predicted drought. This provides misinformation for stakeholders and causes waste of budgets and efforts, while they take inappropriate actions for preparedness and mitigation for the predicted drought category.

Drought categorization converts a large volume of data (drought indices) into a category that represents a measure of severity and facilitates ‘apple to apple’ comparisons over time, according to the National Center for Environmental Information (NCEI 2020). Compared with drought indices values, interpretation of drought categories is simple and can easily be understood by stakeholders. Thus, stakeholders such as decision makers mostly care about drought categories rather than the values of drought indices. Any miscategorization in drought studies can mislead the stakeholders. Thus, defining a threshold level for drought categorization is a crucial process (Bazrkar et al. 2020; Hao et al. 2016a).

One can avoid drought miscategorization by using the classification framework such as support vector classification (SVC) and logistic regression. A logistic regression model works for a binary drought category (drought or wet). Since most of the existing drought information systems are based on multiple drought categories, the logistic regression model needs to be modified to be employed for drought prediction (Regonda et al. 2006; Hao et al. 2016b, 2016c). However, the potential limitation is the large number of parameters for the prediction of multiple drought categories (Hao et al. 2018).

Accurate predictions of drought categories are crucial for stakeholders to decide on how to be prepared for potential upcoming droughts. However, using the existing standard performance metrics such as RMSE for training SVR-based drought prediction models potentially leads to miscategorization of droughts since such metrics minimize the difference between the actual and predicted values by accounting for the numerical differences only without considering the categorical discrepancies. Depending on whether the predicted value is lower or greater than the actual value, the predicted value can be closer to the actual value in two directions. The ideal condition is to remove or minimize the numerical differences in a direction so that the predicted value is located in the same category as the actual value. However, this ideal condition is less achievable or can be achieved through certain improvements in the concept behind the prediction models (e.g., the overfitting issue in ANN can be resolved in SVR by applying structural risk). This study focuses on improving the capability of a specific drought prediction model by avoiding miscategorization and aims to develop a new category-based scoring approach for tuning hyperparameters and to improve the prediction model training by accounting for the differences in drought categories, rather than the values of drought indices. Specifically, the objective of this study is to use the concept of classification in cross-validation of an SVR model to avoid miscategorization in drought prediction. A novel category-based scoring SVR (CBS-SVR) method is developed and compared with the traditional SVR and SVC since these models share some conceptual similarities. SVR is a numerical model with a potential of miscategorization. Thus, a comparison of SVR and CBS-SVR can shed light on the improvement achieved by the category-based scoring employed in this study. SVC has a classification nature and its comparison with CBS-SVR helps assess their performances.

Introduction to SVR

SVR was developed by Vapnik (1995) to reduce the generalization error by using the structural risk minimization. A regression function for a set of sampled points from input and target vectors is estimated by tuning hyperparameters. There exist different Kernel functions such as linear, polynomial, sigmoid, and nonlinear radial basis function (RBF) that can be used to increase the dimensional space in SVR. Depending on the nature of the input data, different Kernel functions can be employed, and thus different hyperparameters need to be tuned. In the current study, various methods were tested, and the nonlinear RBF was selected. Thus, the hyperparameters included gamma (γ), cost (C), and epsilon (ε). Parameter C determines the trade-off between the model complexity and the training error. Parameter ε controls the width of the ε-insensitive zone and can affect the number of support vectors used to construct the regression function. Parameter γ reduces the model space and controls the complexity of the solution (Kisi & Cimen 2011). These three hyperparameters in SVR (i.e., γ, C, and ε) need to be tuned. Cross-validation has been widely used to identify the values of these hyperparameters. In this method, a combination of hyperparameters is selected based on a specific type of scoring. In this study, a novel category-based scoring method was developed and used in CBS-SVR. Grid search cross-validation was also used for tuning these hyperparameters. A wider range was tested for each hyperparameter and then narrowed down based on the results. The narrower range was set as the initial range of the hyperparameter in grid search cross-validation. C values varied among 1, 5, 10, 15, 20, and 25; ε values included 1, 0.1, 0.01, 0.001, and 0.0001; and γ values were 0.1, 0.5, 1, 1.5, 2, and 5. The best combination was eventually selected based on the new category-based R scoring.

Development of CBS-SVR

Training, validation, and testing are three phases in the development of an SVR-based prediction model. Training is conducted on the training dataset. If the evaluation of performance on the validation dataset is successfully proceeded, the final evaluation can be done on the testing dataset. There are two major problems if the dataset is divided into three parts: (1) a decrease in the number of samples by wasting too much data and (2) dependence of the results on the random choice for the training and validation datasets. k-fold cross-validation solves these problems by dividing the dataset into training and testing sets. The training set is further divided into k-folds and the validation process is repeated k times so that k−1-folds are used for training and 1-fold is used for the primary evaluation. The averaged value of performance measures in the k loops is the performance measure of k-fold cross-validation. The minimum value of k is 2, by which the dataset is divided into two subsets for training and test. No maximum value is determined for k. The higher the k value, the more training data and the more complex the problem. The optimum k value (equal to 5) was selected in this study, as shown in Figure 1. Among a number of combinations of hyperparameters that go through the cross-validation, the value with the lowest error is selected and set for the testing period of the model (Figure 1).

Figure 1

5-fold cross-validation in CBS-SVR and SVR.

Figure 1

5-fold cross-validation in CBS-SVR and SVR.

Close modal
In the traditional cross-validation for SVR, scoring is based on numerical statistical measures (RMSE in Figure 1). In this way, the hyperparameters are selected if they have the lowest difference between the actual and predicted values of drought indices. However, in the CBS-SVR, scoring is based on categorical discrepancies. Therefore, the risk of miscategorization is potentially lower. Risk or R scoring is defined by the following equation:
(1)
where is the actual category at time i and in fold j; is the predicted category at time i and in fold j; is the total number of time steps; and k is the total number of folds.

The selected combination of hyperparameters is used in SVR to define the weights and intercept. Figure 2 shows two cases of miscategorization by using RMSE and how CBS-SVR can avoid this issue. D4, D3, D2, D1, and D0 are exceptional, extreme, severe, moderate, and abnormal droughts, respectively; N indicates a normal condition; and W0, W1, W2, W3, and W4 are representatives of abnormal, moderate, severe, extreme, and exceptional wet conditions, respectively. The thresholds are selected based on the traditional drought categorization. In case 1, the value predicted by SVR using the RMSE scoring is closer to the actual value but it is in a different drought category. The value predicted by CBS-SVR has a higher RMSE but it is in the same category as that of the actual value. In case 2, as a result of overfitting in other cases (e.g., case 1), the difference between the predicted value by SVR and the actual value is larger than that of the value predicted by CBS-SVR. Thus, the RMSE scoring causes miscategorization, while CBS-SVR successfully avoids this problem.

Figure 2

Risk of miscategorization in SVR and more accurate categorization by CBS-SVR.

Figure 2

Risk of miscategorization in SVR and more accurate categorization by CBS-SVR.

Close modal

Drought categorization

Drought categorization plays a crucial role in this study since it determines the hyperparameters for SVR and thus impacts drought prediction. Traditional categorization is based on fixed threshold levels, which need to be varied by time and location (Mishra & Singh 2010; Bazrkar et al. 2020). Bazrkar et al. (2020) proposed a customized drought categorization (CDC) and employed a cell-by-cell-based analysis to incorporate both spatial and temporal distributions of drought. A joint probability distribution and conditional expectation were used to estimate the average probability of occurrence of each drought category. K-means (KM) clustering was further used in the CDC to derive variable thresholds. Two normal distributions (one for dry and another for wet) were observed in their study. A Gaussian mixture (GM) model is suggested when two normal distributions are combined. Therefore, GM can be employed for drought categorization in the CDC. Traditional drought categorization (McKee et al. 1993) is based on the normal distribution (N). Svoboda et al. (2002) employed the percentile method to define the categories thresholds. The same categories were used in this study, including exceptional, extreme, severe, moderate, and abnormal dry conditions as well as the normal condition. In the traditional drought categorization, drought indices values of −3.00, −2.00, −1.00, and −0.50 are, respectively, the thresholds for extreme, severe, moderate, and abnormal drought categories. 0.50, 1.00, 2.00, and 3.00 are used for abnormal, moderate, severe, and extreme wet categories, respectively. The values between −0.50 and 0.50 are categorized as a normal category. The threshold values for exceptional dry and wet categories cannot be defined unless a long study period is chosen. Since the study period in this research was limited, the threshold values for exceptional dry and wet categories were predefined as −3.50 and 3.5, respectively. These values were chosen to be in accordance with the values for extreme dry and wet categories. The variable threshold levels for drought categories in the CDC were developed to vary by time and geographic locations and were determined by KM or GM for these 11 categories. In this study, the performance of KM was compared with the traditional categorization (McKee et al. 1993) and GM.

Assessment of CBS-SVR

To assess the performance of CBS-SVR, it was first compared with the traditional SVR with the RMSE-based scoring. Then, CBS-SVR was compared with SVC. To measure and compare the performances of the models, a confusion matrix was extended from binary-class to multiclass and then different performance metrics were used.

Comparison of the CBS-SVR with SVR and SVC

RMSE has been widely used as scoring in cross-validation for tuning hyperparameters in SVR and for the evaluation of the performances of other drought prediction models. However, there is a risk of miscategorization if hyperparameters are selected based on RMSE. The traditional RMSE-based SVR was modified to overcome the issue of miscategorization by developing CBS-SVR in this study. Due to its classification nature, SVC is potentially capable of avoiding miscategorization. To quantize the predictive skills of each model, the performance of CBS-SVR was compared with that of SVR and SVC.

There are some major differences between SVR and SVC. First, the main goal of SVR is to fit as many points as possible in the decision boundary or ε-tube. However, the primary goal of SVC is to separate the points by maximizing the margin or minimizing the dot product of the coefficients in Equation (2). Unlike SVR, in SVC, the fewer points in the boundary margin (ε-tube in SVR), the better. Therefore, the ideal is to satisfy Equation (3) for all samples (Vapnik 1995).
(2)
(3)
where is the coefficient vector of the predictor x; is the transpose of ; indicates dot product; is predictand i; b is the intercept; and is the kernel function, which implicitly maps the training vectors into a higher dimensional space. In this study, the nonlinear RBF kernel is used in the SVR and SVC.
Since the satisfaction of Equation (3) is not possible, some samples are allowed to be at a specific distance from their correct margin boundary, defined as a slack variable. However, these points are penalized by a penalty term (regularization parameter). Only one slack variable in SVC is defined so that it is greater than 1 if the point is above the hyperplane (blue line in Figure 3 with y = 0) and less than 1 if the point is below the hyperplane (Figure 3). Therefore, the objective function in SVC is given by (Vapnik 1995) the following equation:
(4)
Figure 3

(a) SVR and (b) SVC. Please refer to the online version of this paper to see this figure in color: http://dx.doi.10.2166/hydro.2022.104.

Figure 3

(a) SVR and (b) SVC. Please refer to the online version of this paper to see this figure in color: http://dx.doi.10.2166/hydro.2022.104.

Close modal
Subject to:
(5)
where C is the regularization parameter; is the slack variable i in SVC.
There are two slack variables (, ) for SVR. Depending on the location of the points, these slack variables have different values. If the points are above the ɛ-tube, is greater than zero, is equal to zero; if the points are below the ɛ-tube, is equal to zero and is greater than zero; and if the points are inside the ɛ-tube, both and are equal to zero (Figure 3). Thus, the objective function for SVR is defined as (Vapnik 1995) follows:
(6)
Subject to:
(7)
(8)
where is the radius of the ɛ-tube. The samples penalize the objective by or above or below the ɛ-tube, respectively (Figure 3). In other words, the samples whose predictions are at least away from their true targets are penalized.

The main difference between SVR and SVC is in their loss functions. The loss function in SVC just assures that the prediction is greater than 1 if the predicted value is positive and less than −1 if the predicted value is negative (Figure 3). However, instead of minimizing the observed training error, the loss function in the SVR tries to minimize the generalization error bound so that a generalized performance is achieved.

Confusion matrix and accuracy measures for the evaluation of the prediction models

The confusion matrix (Stehman 1997) and accuracy measures such as the area under the curve of a receiving operating characteristic (AUC-ROC) were originally developed for two-category or binary models. Various approaches have been used to extend the original binary models to multiclass models. One versus all (also referred to as one versus rest) and one versus one (Provost & Domingos 2000) are among the well-known approaches that use binary classification algorithms for multiclass classification. One versus all generates fewer binary classifiers and thus has relatively lower accuracy, but it is less complex. Therefore, to avoid complexity, one versus all was used in this study to evaluate the performances of the multi-category drought prediction models. Figure 4 shows how a confusion matrix for two categories (dry/wet) is extended to the one for 11 drought categories. The AUC-ROC and micro, macro, and weighted average (Fawcett 2001, 2006) of F1 scores were used to obtain overall performances of CBS-SVR, SVR, and SVC.

Figure 4

Confusion matrix for the evaluation of binary and multiclass drought prediction models (TP: true positive; FP: false positive; FN: false negative; TN: true negative. D4, D3, D2, D1, and D0 are exceptional, extreme, severe, moderate, and abnormal droughts, respectively; N indicates a normal condition; and W0, W1, W2, W3, and W4 are representatives of abnormal, moderate, severe, extreme, and exceptional wet conditions, respectively).

Figure 4

Confusion matrix for the evaluation of binary and multiclass drought prediction models (TP: true positive; FP: false positive; FN: false negative; TN: true negative. D4, D3, D2, D1, and D0 are exceptional, extreme, severe, moderate, and abnormal droughts, respectively; N indicates a normal condition; and W0, W1, W2, W3, and W4 are representatives of abnormal, moderate, severe, extreme, and exceptional wet conditions, respectively).

Close modal
If the model predicts a correct category of drought, it is ‘True Positive (TP).’ If the model predicts an incorrect category or categories, it is ‘True Negative (TN).’ Type I error occurs if the model predicts a specific category of drought but in reality, it is not right. This type of error in the confusion matrix is ‘False Positive (FP).’ Type II error occurs if the model predicts that a specific type of drought does not occur, but the prediction is false. This type of error is ‘False Negative (FN)’ in the confusion matrix (Wang & Zheng 2013). Recall (sensitivity), which indicates the correctness of the model prediction, is calculated by (Ting 2011) the following equation:
(9)
Precision (specificity) represents the number of actual positive cases out of the positive classes that a model is able to predict correctly and is given by (Ting 2011) the following equation:
(10)
Danandeh Mehr & Fathollahzadeh Attar (2021) also used precision and recall to test the performance of a drought prediction model. F1 score is used to make two models with low and high precision comparable. Therefore, to account for these cases, F1 score is used for the evaluation of the performances of the models. F1 score is calculated by (Zhang & Zhang 2009) the following equation:
(11)
AUC-ROC is also used for comparison of the models’ performances. The higher the AUC-ROC, the better the model. The ROC curve is plotted with the true positive rate (TPR) against the false positive rate (FPR). TPR is defined the same way as to recall or sensitivity. FPR is calculated by (Ting 2011) the following equation:
(12)

Precipitation, temperature, actual evapotranspiration, and a deficit variable (DFCT) were selected as the predictors. The DFCT was defined as the difference between precipitation and actual evapotranspiration. The target variables consisted of three types of drought indices at different time scales (monthly, seasonally, and semiannually) including (1) multivariate hydroclimatic aggregate drought index (HADI) (Bazrkar et al. 2020) and snow-based hydroclimatic aggregate drought index (SHADI) (Bazrkar & Chu 2022), (2) bivariate standardized drought indices based on the difference between precipitation and actual evapotranspiration (SPEI) and the summation of snowmelt and rainfall, and (3) univariate standardized drought indices based on precipitation, surface runoff, baseflow (BF), soil moisture, snowpack, and snow water equivalent (SWE). In an application to the Red River of the North Basin (RRB), the North American Land Data Assimilation System (NLDAS) (Xia et al. 2012) data in 1979–2010 and 2011–2016 were, respectively, used for the training and testing of the CBS-SVR model.

Drought indices

In this study, a CBS-SVR model was developed. Sundararajan et al. (2021) reviewed several studies on applications of machine learning for drought prediction and suggested using multiple drought indices for prediction, instead of a single index. Following this suggestion, 10 drought indices were used to evaluate the performance of CBS-SVR. The multivariate HADI and SHADI (Bazrkar et al. 2020; Bazrkar & Chu 2022) distinguished drought and snow drought in cold climate regions. The R-mode correlation-based principal component analysis (PCA) was employed to aggregate rainfall (RF), snowmelt (SM), surface runoff (R), and soil water storage (SWS) in the calculation of HADI. Precipitation and snowpack (SP) were used, instead of rainfall and snowmelt in HADI, to calculate SHADI to enhance the capability of identifying snow drought (Bazrkar & Chu 2022). Two bivariate standardized drought indices were also used. The standardized difference between precipitation and potential evapotranspiration was used to develop SPEI by Vicente-Serrano et al. (2010). Homdee et al. (2016) and Joetzjer et al. (2013) argued that the use of actual evapotranspiration showed higher consistency with hydrological drought indices and it considered land use and vegetation cover. In this study, actual evapotranspiration was used to calculate SPEI. The standardized snowmelt and rainfall index (SMRI) (Staudinger et al. 2014) was also estimated. The univariate standardized drought indices used in this study included standardized precipitation index (SPI) (McKee et al. 1993), standardized runoff index (SRI) (Shukla & Wood 2008), standardized soil moisture index (SSMI) (Xu et al. 2018), standardized snowpack index (SSPI), standardized snow water equivalent index (SSWEI) (Huning & Aghakouchak 2018), and standardized baseflow index (SBFI) (Bazrkar & Chu 2020). These drought indices cover a wide range of hydroclimatic variables and different types of drought indices, and hence using all of them can reassure the capability of the CBS-SVR model.

Predictors

Depending on the specific drought index used as the target variable in the prediction model, different types of predictor(s) were used. Table 1 shows the predictands and their associated predictors. Precipitation (P) was used as a predictor for the prediction of all drought indices. Temperature (T) was added for the prediction of SPEI, SMRI, SSPI, and SSWEI, where besides precipitation, temperature also played a key role in the identification of drought. Actual evapotranspiration (ET) and the DFCT were also added in the prediction of HADI, SHADI, SSMI, SRI, and SBFI.

Table 1

Predictands and predictors

PredictandsTypeInputMethod of calculationPredictors
HADI Multivariate RF, SM, R, SWS PCA P, T, ET, DFCT 
SHADI Multivariate P, SP, R, SWS PCA P, T, ET, DFCT 
SSMI Univariate SWS Standardized P, T, ET, DFCT 
SRI Univariate R Standardized P, T, ET, DFCT 
SBFI Univariate BF Standardized P, T, ET, DFCT 
SPEI Bivariate P, ET Standardized P, T 
SMRI Bivariate SM, RF Standardized P, T 
SSPI Univariate SP Standardized P, T 
SSWEI Univariate SWE Standardized P, T 
SPI Univariate P Standardized P 
PredictandsTypeInputMethod of calculationPredictors
HADI Multivariate RF, SM, R, SWS PCA P, T, ET, DFCT 
SHADI Multivariate P, SP, R, SWS PCA P, T, ET, DFCT 
SSMI Univariate SWS Standardized P, T, ET, DFCT 
SRI Univariate R Standardized P, T, ET, DFCT 
SBFI Univariate BF Standardized P, T, ET, DFCT 
SPEI Bivariate P, ET Standardized P, T 
SMRI Bivariate SM, RF Standardized P, T 
SSPI Univariate SP Standardized P, T 
SSWEI Univariate SWE Standardized P, T 
SPI Univariate P Standardized P 

P, precipitation; T, temperature; ET, evapotranspiration; RF, rainfall; SM, snowmelt; R, runoff; SWS, soil moisture; SP, snowpack; BF, baseflow; SWE, snow water equivalent; DFCT, difference between precipitation and evapotranspiration; PCA, principal component analysis.

Study area

The U.S. part of the RRB (Figure 5) is a typical cold climate region in the Northern Great Plain, which covers over 90,000 km2 in the states of Minnesota (MN), North Dakota (ND), and South Dakota (SD). According to the NCEI data (2021), the cold season of 2020 was the driest and the 10th warmest cold season, and March 2021 was the driest and the 4th warmest March in the last 127 years in most of the climate divisions (CDs) in ND. This sets an alarm for more severe droughts in the RRB. According to Palmer Z-index for March 2021 (Figure 5), CDs 3202, 3203, and 3205 were the driest, and CD 2101 was the wettest in the RRB. However, according to the 5-year mean of Z-index, CD 2101 was also among the driest CDs in the RRB, besides CDs 3202, 3203, and 3205.

Figure 5

RRB, its associated climate divisions, and Palmer Z-index.

Figure 5

RRB, its associated climate divisions, and Palmer Z-index.

Close modal

A great portion of the RRB is covered by cultivated croplands. Therefore, drought has significant socioeconomic impacts on the population in the RRB. One of the devastative droughts in the RRB occurred in the 1990s. That drought was referred to as a snow drought since the RRB experienced the lowest precipitation and the highest temperature in the cold seasons in that dry decade. However, the 1930s drought known as the Great Drought was due to low precipitation in the warm seasons (Bazrkar & Chu 2022). Therefore, different drought indices with the capabilities of addressing different types of drought are required. In this study, the performances of different drought indices were assessed in the CBS-SVR model in the RRB.

Selection of the best categorization method

To select the best categorization method, the traditional categorization based on the normal distribution and the CDC with KM and GM clustering methods based on the 10 drought indices were compared. The R scoring was not used in this step for the selection of the best categorization method. Instead, the RMSE scoring, a well-accepted metric, was used for cross-validation of these models. Table 2 shows the best, worst, and average of AUC-ROCs and the weighted, micro, and macro F1 scores for these models. A categorization method was selected as the best if the majority of performance measures showed the highest values for that specific categorization method. KM was selected as the best for SHADI and SMRI. For SHADI, micro AUC-ROC, weighted F1 score, and micro F1 score were 0.71, 0.38, and 0.47 for KM clustering, respectively. All metrics, except macro F1 score, were better than those of the other categorization methods. Therefore, KM was selected as the best categorization method for SHADI. For SMRI, all metrics indicated that KM had better performances than the traditional drought categorization and CDC with the GM clustering method. Micro AUC-ROC, weighted F1 score, micro F1 score, and macro F1 score were 0.89, 0.80, 0.80, and 0.43, respectively. Traditional categorization was selected as the best for SPEI, SPI, SRI, SSMI, and SBFI. For HADI, AUC-ROC and the micro F1 score suggested traditional categorization, while the weighted and macro F1 scores showed KM as the best categorization method. Unlike other indices, the best results for SSPI were obtained when GM clustering was applied. According to the results, a slight to significant improvement was achieved by changing the employed categorization method. For instance, a significant increase (0.18) in the weighted F1 score was observed when GM was switched to KM in the categorization of drought based on SRI. The most significant improvement (0.21) was obtained for the same measure and index when the traditional normal categorization was used instead of GM. There was also an improvement in the performance of the model (an increase of 0.08 in the micro F1 score) when the traditional normal drought categorization based on SSPI was changed to GM. It was concluded that depending on the selected drought index or indices, different performances and results were obtained when different drought categorization methods were employed.

Table 2

Selection of the best categorization method

 
 

Table 3 shows the variable thresholds for the drought categories based on the drought indices. Different variable thresholds were obtained for each drought index. The variable thresholds for HADI and SHADI were close. This result was in accordance with the finding by Bazrkar & Chu (2022), where the output from a grid-based hydrologic model (GHM) (Chu et al. 2019) in a different study period (2003–2007) was used. The clustering method was used to derive these variable thresholds based on the probabilities of occurrences of those values considering both spatial and temporal distributions of droughts. For example, the lower- and upper-class limits for the normal category in the traditional categorization were −0.5 and 0.5, respectively. However, the lower class limits for the normal category were as low as −1.05 for SHADI and −1.03 for SSPI, and the upper-class limit was as low as −0.33 for SSPI when GM was used for clustering. Since the range of each drought index was different, these different variable thresholds reassured that each drought category was defined based on the temporal and spatial frequencies of the drought index values.

Table 3

Variable threshold levels for all drought indices based on different categorization methods

 
 

DI, drought index; SDI, standardized drought index.

Comparison of the performances of CBS-SVR and SVR

The best categorization method for each drought index was selected and then SVR based on RMSE and CBS-SVR based on R scoring were compared for the drought indices. The results of this comparison are shown in Table 4. Unlike the results for SRI, SSMI, SSPI, and SSWEI, better results were obtained when R was used, instead of the traditional RMSE for cross-validation of SVR based on HADI, SHADI, SPEI, SMRI, SPI, and SBFI. Shamshirband et al. (2020) also found that SVR had a relatively lower capability in drought prediction than other models tested in their study.

Table 4

Comparison of the performances of RMSE (SVR) and R (CBS-SVR)

 
 

Although the overall results (areal average) for all grids showed that R scoring was better than RMSE for the drought prediction models based on most drought indices (6 out of 10), four cases occurred for different grids: (1) R scoring performed better than RMSE; (2) RMSE performed better than R scoring; (3) there was no difference in the performances of R scoring and RMSE based on AUC-ROC; and (4) there was no difference in the performances of R scoring and RMSE based on the standard performance measures (i.e., R2, MAE, MSE, and RMSE). Figure 6 shows the AUC-ROC for the 11 categories of drought in the SVR model based on SRI for grid 466, where R outperformed RMSE with the highest degree among all grids (an example for case 1). The micro-average of AUC-ROC increased from 0.66 to 0.73 when R scoring was used for cross-validation of SVR for SRI. Table 6 also shows an increase in R2 (from 0.63 for R scoring to 0.70 for RMSE) and decreases in MAE, MSE, and RMSE for grid 466. When RMSE was used (Figure 10), it caused overfitting in some months and underfitting in other months (e.g., March 2011, September 2011, and September 2015). The value of the actual SRI in February 2015 was −0.47, indicating a normal condition based on the best categorization method for SRI (i.e., traditional normal). The predicted value by the R scoring for the same month was 0.41 and still in a normal condition. However, the predicted value by using RMSE was 0.54 and in an abnormal wet condition. This was an example of miscategorization caused by using RMSE. However, the CBS-SVR model avoided this kind of miscategorization by using R scoring.

Figure 6

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 466.

Figure 6

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 466.

Close modal

The micro- and macro-averages of a specific metric are slightly different in their calculations. In the macro-average computation, the metric is computed independently for each class and then the average is taken by treating all classes equally. However, in the micro-average computation, the contributions of all classes are aggregated to compute the average metric. Therefore, the values of ‘nan’ for macro-average in Figures 69 were attributed to the severer dry/wet categories that did not occur in the study area and study period, and thus their AUC-ROC were indicted as ‘nan.’ If lower values are assigned for the threshold variables of exceptional dry and wet categories, the probabilities of occurrence of these exceptional categories are higher and fewer ‘nan’ values can be observed. Ignoring these values of nan, the AUC-ROC values for moderate wet (0.58–0.71), abnormal wet (0.53–0.70), normal (0.64–0.76), and abnormal dry (0.54–0.61) increased when R scoring was used. For moderate dry, a decrease in AUC-ROC from 0.72 to 0.64 and for severe drought a slight decrease from 0.50 to 0.49 were observed. The same value for AUC-ROC (0.5) for extreme drought was observed for both RMSE and R scoring. The highest type I and type II errors occurred for normal and abnormal wet categories, respectively (Table 5). However, the magnitudes of errors in R scoring were lower than those of RMSE. For these categories, both precision and recall increased when R scoring was used. For abnormal wet, the precision and recall values increased from 0.87 to 0.92 and from 0.76 to 0.79, respectively. For the normal category, higher increases in precision (from 0.70 to 0.79) and recall (from 0.77 to 0.86) were observed.

Table 5

Confusion matrices for different drought categories in grids 466, 399, 400, and 401

 
 
Table 6

Performance measures for predictions of SRI in grids 466, 399, 400, and 401

RMSE/RGrid No.R2MAEMSERMSE
RMSE 466 0.63 0.46 0.35 0.59 
399 0.75 0.40 0.29 0.54 
400 0.53 0.48 0.37 0.61 
401 0.63 0.45 0.37 0.61 
R 466 0.70 0.40 0.29 0.54 
399 0.64 0.48 0.42 0.65 
400 0.48 0.47 0.41 0.64 
401 0.63 0.45 0.37 0.61 
RMSE/RGrid No.R2MAEMSERMSE
RMSE 466 0.63 0.46 0.35 0.59 
399 0.75 0.40 0.29 0.54 
400 0.53 0.48 0.37 0.61 
401 0.63 0.45 0.37 0.61 
R 466 0.70 0.40 0.29 0.54 
399 0.64 0.48 0.42 0.65 
400 0.48 0.47 0.41 0.64 
401 0.63 0.45 0.37 0.61 
Figure 7

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 399.

Figure 7

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 399.

Close modal
Figure 8

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 400.

Figure 8

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 400.

Close modal
Figure 9

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 401.

Figure 9

AUC-ROC for 11 categories of drought in SVR (left) and CBS-SVR (right) based on SRI for grid 401.

Close modal
Figure 10

Comparison of actual and predicted values of SRI based on RMSE and R.

Figure 10

Comparison of actual and predicted values of SRI based on RMSE and R.

Close modal

Figure 7 shows the AUC-ROC for grid 399, where RMSE outperformed R with the highest degree among all grids (an example for case 2). The micro-average of AUC-ROC for RMSE decreased from 0.78 to 0.69 when R scoring was used. The exceptional dry and wet and extreme wet categories were not observed, and thus their associated values were depicted as nan in Figure 7. The AUC-ROC values for severe wet (0.49) and extreme dry (0.5) categories remained unchanged. Except for the abnormal dry and moderate dry categories, decreases in the AUC-ROC values were observed for all other drought categories. The highest type I and type II errors occurred for abnormal wet and normal categories, respectively (Table 4). However, the magnitudes of errors in R scoring were higher than those of RMSE. For these categories, both precision and recall decreased when R scoring was used. For abnormal wet, precision and recall decreased from 0.88 to 0.82 and from 0.88 to 0.83, respectively. For the normal category, higher decreases in precision (from 0.85 to 0.73) and recall (from 0.75 to 0.61) were observed. According to Figure 10, the major differences were related to April 2011, 2012, and 2013, as well as March 2014, when the predictions based on RMSE were closer to the actual values. Table 6 also shows that R2 increased and MAE, MSE, and RMSE decreased after the application of RMSE.

Figure 8 shows the AUC-ROC for grid 400, where the same AUC-ROC was obtained when the performances of RMSE and R were compared (an example for case 3). However, except for the normal category, the AUC-ROC varied for different drought categories. Increases in the AUC-ROC values from 0.52 to 0.58, from 0.48 to 0.69, and from 0.49 to 0.50 were observed for abnormal, moderate, and severe drought categories, respectively. In contrast, the AUC-ROC values decreased for moderate (from 0.78 to 0.68) and abnormal wet (from 0.66 to 0.62) categories. Table 5 shows that the highest type II error occurred with the same value (13 out of 72 months) in the normal category. This means that 13 out of 72 months were categorized as a normal condition but the model predicted a different category. The precision (0.83) and recall (0.69) had the same value in this category. When RMSE was used, the highest type I error occurred in the abnormal wet/dry and moderate dry categories with a value of 7 out of 72. However, when R scoring was used, the highest type I error occurred in the abnormal and moderate wet categories with a value of 8. Based on Figure 10, the major difference was related to March 2014, when R scoring performed better than RMSE. Table 5 also shows that based on R2, MAE, MSE, and RMSE, SVR was better than CBS-SVR.

Figure 9 shows the AUC-ROC for grid 401, where R2, MAE, MSE, and RMSE were equal when the performances of RMSE and R scoring were compared (an example for case 4). Although the R2, MAE, MSE, and RMSE values were the same, there was a slight difference between AUC-ROCs. AUC-ROC increased from 0.72 to 0.73 when R scoring was used. This difference was attributed to the slight increases in AUC-ROC of the normal and abnormal drought categories, while AUC-ROC was exactly the same for all other drought categories. The highest type I and II errors also occurred in the same categories (i.e., moderate dry and normal, respectively). Table 6 indicates that the values of R2, MAE, MSE, and RMSE were exactly the same. Figure 10 also shows the similar values of SRI in grid 401.

Comparison of the performances of CBS-SVR and SVC

Table 7 shows the results of comparison between SVC and CBS-SVR for the drought indices. According to the results, CBS-SVR outperformed SVC based on HADI, SHADI, SPEI, SMRI, SPI, SSMI, SSWEI, and SBFI. Two exceptions were observed for SRI and SSPI. Overall, CBS-SVR had better performances since a category-based scoring was used for tuning the hyperparameters for cross-validation. Although all the metrics indicated better performances of CBS-SVR over SVC when HADI, SHADI, SPEI, SMRI, SPI, SSMI, SSWEI, and SBFI were used, the differences between the weighted F1 scores were remarkable. In the multiclass classification, a weighted average is preferred if there is a class imbalance (i.e., one class has more occurrences than others). The weighted F1 scores for CBS-SVR were 0.457, 0.415, 0.651, 0.804, 0.975, 0.339, 0.543, and 0.434 for HADI, SHADI, SPEI, SMRI, SPI, SSMI, SSWEI, and SBFI, respectively. Lower weighted F1 scores were observed for SVC for all drought indices except for SRI and SSPI. The weighted F1 scores for SVC for HADI, SHADI, SPEI, SMRI, SPI, SSMI, SSWEI, and SBFI were 0.433, 0.394, 0.640, 0.757, 0.960, 0.267, 0.527, and 0.377, respectively. The superiority of CBS-SVR over SVC is related to their different loss functions. Unlike SVC, which minimizes the observed training error, the loss function in CBS-SVR tries to minimize the generalization error bound so that a generalized performance is achieved. Thus, although SVC can avoid drought miscategorization, CBS-SVR predicts drought more accurately while avoiding drought miscategorization.

Table 7

Comparison of the performances of SVC and CBS-SVR

 
 

Comparison of the performances of SVR and CBS-SVR using different drought indices

Regardless of the drought categorization methods used in the SVR model, the best performance was observed when SPI was used in the drought prediction model (Table 2). An AUC-ROC of 0.99 was achieved when SPI was used for all three types of drought categorization methods. The weighted and micro F1 scores for all cases were equal to 0.97, indicating high accuracy and precision of the SPI-based model. In contrast, when SSMI was used in the prediction model, the performance of the model was relatively poor, indicating the worst performance among the drought indices (Table 2). The values of micro AUC-ROC were 0.66, 0.64, and 0.60 for the traditional drought categorization, CDC with KM, and CDC with GM, respectively.

Similarly, for both CBS-SVR and SVC, SPI and SSMI had the best and worst performances, respectively. For SPI and SSMI, the values of micro AUC-ROC for CBS-SVR with traditional drought categorization were 0.986 and 0.656, respectively (Table 4). The values of micro AUC-ROC of SVC for SPI and SSMI were 0.980 and 0.653, respectively (Table 7). Thus, regardless of the type of the drought prediction models, the employed drought index is influential on the results, which justifies the application of multiple drought indices, as suggested by Sundararajan et al. (2021), for testing and comparing the performances of different prediction models and eventually assessing the performance of CBS-SVR developed in this study.

The poor model performance for SSMI can be attributed to the complexity of the hydrological processes associated with soil moisture and its relationships with other hydroclimatic variables that were not included in the employed predictors. Inclusion of more predictors other than precipitation, evapotranspiration, and temperature or consideration of other large-scale climate variables such as sea surface temperature (e.g., Lima & AghaKouchak 2017) and ocean oscillation indices (e.g., Deo et al. 2017) can improve the modeling results. Moreover, the RRB is a cold climate region and the frozen ground considerably impacts soil moisture. However, the frozen ground process is not fully considered in the NLDAS model. The better performance of the SPI-based prediction model can be attributed to a less complex relationship between precipitation (predictor) and SPI. The methodology proposed in this research is a feasible solution for the models with poor performances, and the results demonstrate how a category-based scoring method used for tuning hyperparameters can effectively improve drought prediction.

Using the standard performance metrics such as RMSE for scoring in cross-validation of non-categorical prediction models can result in overfitting and miscategorization of droughts. In this study, a category-based scoring method (CBS-SVR) was proposed. Due to the crucial role of the thresholds for defining drought categories, KM and GM clustering methods were tested and compared with the traditional categorization method based on AUC-ROC and F1 score. The traditional drought categorization and CDC with KM clustering showed better results. The traditional categorization was selected as the best method for most of the standardized drought indices including SPEI, SPI, SRI, SSMI, and SBFI. In addition to the drought categorization method, the employed drought indices played critical roles in the determination of the thresholds for drought categories and hence their accurate predictions.

The CBS-SVR with R scoring outperformed the SVR with RMSE scoring in most cases, specifically if the target variables were multivariate and bivariate drought indices. A grid-based analysis also confirmed that R was a better scoring method since it avoided overfitting and miscategorization. CBS-SVR also performed better than SVC for eight out of ten drought indices employed in this study. The results suggested that using the proposed category-based scoring method for tuning hyperparameters in cross-validation of SVR successfully avoided overfitting and drought miscategorization.

There are some limitations that can be addressed in the future research. Since this study focused on comparing the performances of a new category-based scoring method and the traditional standard approaches in a numerical prediction model (i.e., SVR), specific ranges of the hyperparameters were selected for cross-validation. Further analyses can be performed for a wider range of each hyperparameter and other numerical prediction models. In this study, KM and GM were adopted to derive variable threshold levels for drought categorization. Performances of other clustering methods can be tested in the future studies. Other clustering methods can potentially affect the results and lead to different variable thresholds, and thus the performances of drought prediction models can also be explored. In this study, local climate variables such as precipitation, temperature, actual evapotranspiration, and a deficit variable were adopted as the predictors. The accessibility and availability of the data were the primary reason for choosing these variables. The other large-scale climate indices that reflect the atmosphere-ocean circulation pattern (e.g., sea surface temperature, southern oscillation index, and North Atlantic oscillation) can be tested in the future studies.

This material is based upon work supported by the National Science Foundation under Grant No. NSF EPSCoR Award IIA-1355466. The North Dakota Water Resources Research Institute also provided partial financial support in the form of a graduate fellowship for the first author.

The authors declare that they have no conflict of interest.

All relevant data are included in the paper or its Supplementary Information. Some or all data, models, or code generated or used during the study are available in a repository online in accordance with funder data retention policies. The datasets used for this research are available at the National Center for Atmospheric Research Staff (Eds) (2019), The Climate Data Guide: NLDAS: North American Land Data Assimilation System: https://climatedataguide.ucar.edu/climate-data/nldas-north-american-land-data-assimilation-system. The modeling data generated from this study will be eventually uploaded to the UND Scholarly Commons and will be available for any interested readers.

Barros
A. P.
&
Bowden
G. J.
2008
Toward long-lead operational forecasts of drought: an experimental study in the Murray-Darling River basin
.
Journal of Hydrology
357
,
349
367
.
https://doi.org/10.1016/j.jhydrol.2008.05.0
.
Barua
S.
,
Ng
A. W. M.
&
Perera
B. J. C.
2012
Artificial neural network-based drought forecasting using a nonlinear aggregated drought index
.
Journal of Hydrologic Engineering
17
,
1408
1413
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0000574
.
Bazrkar
M. H.
&
Chu
X.
2020
A new standardized baseflow index for identification of hydrologic drought in the Red River of the North Basin
.
Natural Hazards Review
21
(
4
),
05020011
.
1–8, doi:10.1061/(ASCE)NH.1527-6996.0000414
.
Bazrkar
M. H.
&
Chu
X.
2022
A snow-based hydroclimatic aggregate drought index (SHADI) for identification and lead prediction of droughts in cold climate regions
.
Theoretical and Applied Climatology
.
Under review
.
Bazrkar
M. H.
,
Zhang
J.
&
Chu
X.
2020
Hydroclimatic aggregate drought index (HADI): a new approach for identification and categorization of drought in cold climate regions
.
Stochastic Environmental Research and Risk Assessment
34
(
11
),
1847
1870
.
doi:10.1007/s00477-020-01870-5
.
Belayneh
A.
,
Adamowski
J.
,
Khalil
B.
&
Ozga-Zielinski
B.
2014
Long-term SPI drought forecasting in the Awash River basin in Ethiopia using wavelet neural networks and wavelet support vector regression models
.
Journal of Hydrology
508
,
418
429
.
https://doi.org/10.1016/j.jhydrol.2013.10.052
.
Bergstra
J.
&
Bengio
Y.
2012
Random search for hyper-parameter optimization
.
The Journal of Machine Learning Research
13
,
281
305
.
Cancelliere
A.
,
Di Mauro
G.
,
Bonaccorso
B.
&
Rossi
G.
2007
Drought forecasting using the standardized precipitation index
.
Water Resources Management
21
(
5
),
801
819
.
https://doi.org/10.1007/s11269-006-9062-y
.
Chu
X.
,
Lin
Z.
,
Tahmasebi Nasab
M.
,
Zeng
L.
,
Grimm
K.
,
Bazrkar
M. H.
,
Wang
N.
,
Liu
X.
,
Zhang
X.
&
Zheng
H.
2019
Macro-scale grid-based and subbasin-based hydrologic modeling: joint simulation and cross-calibration
.
Journal of Hydroinformatics
21
(
1
),
77
91
.
https://doi.org/10.2166/hydro.2018.026
.
Danandeh Mehr
A.
&
Fathollahzadeh Attar
N.
2021
A gradient boosting tree approach for SPEI classification and prediction in Turkey
.
Hydrological Sciences Journal
66
(
11
),
1653
1663
.
doi:10.1080/02626667.2021.1962884
.
Deo
R. C.
,
Tiwari
M. K.
,
Adamowski
J.
&
Quilty
J.
2016a
Forecasting effective drought index using a wavelet extreme learning machine (W-ELM) model
.
Stochastic Environmental Research and Risk Assessment
.
http://dx.doi.org/10.1007/s00477-00016-01265-z
.
Deo
R. C.
,
Kisi
O.
&
Singh
V. P.
2017
Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and m5tree model
.
Atmospheric Research
184
,
149
175
.
https://doi.org/10.1016/j.atmosres.2016.10.004
.
Durdu
Ö. F
, .
2010
Application of linear stochastic models for drought forecasting in the Büyük Menderes River basin, western Turkey
.
Stochastic Environmental Research and Risk Assessment
24
,
1145
1162
.
https://doi.org/10.1007/s00477-010-0366-3
.
Fawcett
T.
2001
Using rule sets to maximize ROC performance in data mining, 2001
. In
Proceedings IEEE International Conference
. pp.
131
138
.
Fawcett
T.
2006
An introduction to ROC analysis
.
Pattern Recognition Letters
27
(
8
),
861
874
.
Feng
P.
,
Wang
B.
,
Liu
D. L.
&
Yu
Q.
2019
Machine learning-based integration of remotely-sensed drought factors can improve the estimation of agricultural drought in South-Eastern Australia
.
Agricultural Systems
173
,
303
316
.
https://doi.org/10.1016/j.agsy.2019.03.015
.
Fernández
C.
,
Vega
J. A.
,
Fonturbel
T.
&
Jiménez
E.
2009
Streamflow drought time series forecasting: a case study in a small watershed in North West Spain
.
Stochastic Environmental Research and Risk Assessment
23
,
1063
1070
.
https://doi.org/10.1007/s00477-008-0277-8
.
Ganguli
P.
&
Reddy
M. J.
2014
Ensemble prediction of regional droughts using climate inputs and the SVM–copula approach
.
Hydrological Processes
28
(
19
),
4989
5009
.
https://doi.org/10.1002/hyp.9966
.
Hao
Z.
&
Singh
V. P.
2016
Review of dependence modeling in hydrology and water resources
.
Progress in Physical Geography
40
(
4
),
549
578
.
https://doi.org/10.1177/0309133316632460
.
Hao
Z.
,
Hao
F.
,
Singh
V. P.
,
Xia
Y.
,
Ouyang
W.
&
Shen
X.
2016a
A theoretical drought classification method for the multivariate drought index based on distribution properties of standardized drought indices
.
Advances in Water Resources
92
,
240
247
.
https://doi.org/10.1016/j.advwatres.2016.04.010
.
Hao
Z.
,
Hao
F.
,
Xia
Y.
,
Singh
V. P.
,
Hong
Y.
,
Shen
X.
&
Ouyang
W.
2016b
A statistical method for categorical drought prediction based on NLDAS-2
.
Journal of Applied Meteorology and Climatology
55
(
4
),
1049
1061
.
https://doi.org/10.1175/JAMC-D-15-0200.1
.
Hao
Z.
,
Hong
Y.
,
Xia
Y.
,
Singh
V. P.
,
Hao
F.
&
Cheng
H.
2016c
Probabilistic drought characterization in the categorical form using ordinal regression
.
Journal of Hydrology
535
,
331
339
.
https://doi.org/10.1016/j.jhydrol.2016.01.074
.
Hao
Z.
,
Singh
V. P.
&
Xia
Y.
2018
Seasonal drought prediction: advances, challenges, and future prospects
.
Reviews of Geophysics
56
(
1
),
108
141
.
doi:10.1002/2016RG000549
.
Homdee
T.
,
Pongput
K.
&
Kanae
S.
2016
A comparative performance analysis of three standardized climatic drought indices in the Chi River basin, Thailand
.
Agriculture and Natural Resources
50
,
211
219
.
http://dx.doi.org/10.1016/j.anres.2016.02.002
.
Huning
L. S.
&
AghaKouchak
A.
2018
Mountain snowpack response to different levels of warming
.
Proceedings of the National Academy of Sciences of the United States of America
115
(
43
),
10932
10937
.
doi:10.1073/pnas.1805953115
.
Hwang
Y.
&
Carbone
G. J.
2009
Ensemble forecasts of drought indices using a conditional residual resampling technique
.
Journal of Applied Meteorology and Climatology
48
,
1289
1301
.
https://doi.org/10.1175/2009JAMC2071.1
.
Joachims
T.
2002
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Springer, Boston, MA, USA
.
Joetzjer
E.
,
Douville
H.
,
Delire
C.
,
Ciais
P.
,
Decharme
B.
&
Tyteca
S.
2013
Hydrologic benchmarking of meteorological drought indices at interannual to climate change timescales: a case study over the Amazon and Mississippi river basins
.
Hydrology and Earth System Sciences
17
,
4885
4895
.
Larochelle
H.
,
Erhan
D.
,
Courville
A.
,
Bergstra
J.
&
Bengio
Y.
2007
An empirical evaluation of deep architectures on problems with many factors of variation
. In:
Proceedings of the Twenty-Fourth International Conference on Machine Learning (ICML'07)
(
Ghahramani
Z.
, ed.).
ACM
, pp.
473
480
. Corvalis, OR, USA.
Lima
C. H. R.
&
AghaKouchak
A.
2017
Droughts in Amazonia: spatiotemporal variability, teleconnections, and seasonal predictions
.
Water Resources Research
53
,
10,824
10,840
.
https://doi.org/10.1002/2016WR020086
.
Liu
W.
&
Juárez
R. N.
2001
ENSO drought onset prediction in northeast Brazil using NDVI
.
International Journal of Remote Sensing
22
(
17
),
3483
3501
.
https://doi.org/10.1080/014311600100064
.
Liu
Y.
&
Hwang
Y.
2015
Improving drought predictability in Arkansas using the ensemble PDSI forecast technique
.
Stochastic Environmental Research and Risk Assessment
29
(
1
),
79
91
.
https://doi.org/10.1007/s00477-014-0930-3
.
Lohani
V. K.
&
Loganathan
G.
1997
An early warning system for drought management using the Palmer drought index
.
Journal of the American Water Resources Association
33
(
6
),
1375
1386
.
https://doi.org/10.1111/j.1752-1688.1997.tb03560.x
.
Madadgar
S.
&
Moradkhani
H.
2013
A Bayesian framework for probabilistic seasonal drought forecasting
.
Journal of Hydrometeorology
14
(
6
),
1685
1705
.
https://doi.org/10.1175/JHM-D-13-010.1
.
McKee
T. B.
,
Doesken
N. J.
&
Kleist
J.
1993
The relationship of drought frequency and duration to time scales
. In
Paper Presented at Eighth Conference on Applied Climatology
.
American Meteorological Society
,
Anaheim, CA
.
Mishra
A.
&
Desai
V.
2005
Drought forecasting using stochastic models
.
Stochastic Environmental Research and Risk Assessment
19
,
326
339
.
https://doi.org/10.1007/s00477-005-0238-4
.
Mishra
A.
&
Desai
V.
2006
Drought forecasting using feed-forward recursive neural network
.
Ecological Modelling
198
,
127
138
.
https://doi.org/10.1016/j.ecolmodel.2006.04.017
.
Mishra
A. K.
&
Singh
V. P.
2010
A review of drought concepts
.
Journal of Hydrology
391
,
202
216
.
https://doi.org/10.1016/j.jhydrol.2010.07.012
.
Mishra
A.
,
Desai
V.
&
Singh
V. P.
2007
Drought forecasting using a hybrid stochastic and neural network model
.
Journal of Hydrologic Engineering
12
,
626
638
.
https://doi.org/10.1061/(ASCE)1084-0699(2007)12:6(626)
.
Modarres
R.
2007
Streamflow drought time series forecasting
.
Stochastic Environmental Research and Risk Assessment
21
,
223
233
.
https://doi.org/10.1007/s00477-006-0058-1
.
Mohamadi, S., Sammen, S. S., Panahi, F., Ehteram, M., Kisi, O., Mosavi, A., Ahmed, A. N., El-Shafie, A. & Al-Ansari, N.
2020
Zoning map for drought prediction using integrated machine learning models with a nomadic people optimization algorithm
.
Natural Hazards
104
,
537
579
.
https://doi.org/10.1007/s11069-020-04180-9
.
Morid
S.
,
Smakhtin
V.
&
Bagherzadeh
K.
2007
Drought forecasting using artificial neural networks and time series of drought indices
.
International Journal of Climatology
27
,
2103
2111
.
https://doi.org/10.1002/joc.1498
.
National Center for Atmospheric Research Staff (Eds). Last modified 10 Dec 2019. The Climate Data Guide: NLDAS: North American Land Data Assimilation System. Available from: https://climatedataguide.ucar.edu/climate-data/nldas-north-american-land-data-assimilation-system
NCEI (National Center for Environmental Information)
2020
DROUGHT: Degrees of Drought Reveal the True Picture
. .
NCEI (National Center for Environmental Information)
2021
Climate at a Glance: Divisional Mapping
.
Available from: https://www.ncdc.noaa.gov/cag/ (accessed 30 April 2021)
.
Panu
U.
&
Sharma
T.
2002
Challenges in drought research: some perspectives and future directions
.
Hydrological Sciences Journal
47
,
S19
S30
.
https://doi.org/10.1080/02626660209493019
.
Paulo
A. A.
&
Pereira
L. S.
2007
Prediction of SPI drought class transitions using Markov chains
.
Water Resources Management
21
,
1813
1827
.
https://doi.org/10.1007/s11269-006-9129-9
.
Paulo
A. A.
&
Pereira
L. S.
2008
Stochastic prediction of drought class transitions
.
Water Resources Management
22
,
1277
1296
.
https://doi.org/10.1007/s11269-007-9225-5
.
Provost
F.
&
Domingos
P.
2000
Well-Trained PETs: Improving Probability Estimation Trees (Section 6.2). CeDER Working Paper #IS-00-04, Stern School of Business, New York University
.
Rao
A. R.
&
Padmanabhan
G.
1984
Analysis and modeling of Palmer's drought index series
.
Journal of Hydrology
68
(
1–4
),
211
229
.
https:// doi.org/10.1016/0022-1694(84)90212-9
.
Regonda
S. K.
,
Rajagopalan
B.
&
Clark
M.
2006
A new method to produce categorical streamflow forecasts
.
Water Resources Research
42
,
W09501
.
https://doi.org/10.1029/2006WR004984
.
Santos
J. F.
,
Portela
M. M.
&
Pulido-Calvo
I.
2014
Spring drought prediction based on winter NAO and global SST in Portugal
.
Hydrological Processes
28
(
3
),
1009
1024
.
https://doi.org/10.1002/hyp.9641
.
Schölkopf
B.
&
Smola
A. J.
2002
Learning with kernels: support vector machines, regularization
.
Optimization, and Beyond
. MIT Press: https://doi.org/10.7551/mitpress/4175.001.0001.
Shamshirband
S.
,
Hashemi
S.
,
Salimi
H.
,
Samadianfard
S.
,
Asadi
E.
,
Shadkani
S.
,
Kargar
K.
,
Mosavi
A.
,
Nabipour
N.
&
Chau
K.
2020
Predicting Standardized Streamflow index for hydrological drought using machine learning models
.
Engineering Applications of Computational Fluid Mechanics
14
(
1
),
339
350
.
https://doi.org/10.1080/19942060.2020.1715844
.
Sharma
T. C.
&
Panu
U. S.
2012
Prediction of hydrological drought durations based on Markov chains: case of the Canadian prairies
.
Hydrological Sciences Journal
57
,
705
722
.
https://doi.org/10.1080/02626667.2012.672741
.
Shukla
S.
&
Wood
A. W.
2008
Use of a standardized runoff index for characterizing hydrologic drought
.
Geophysical Research Letters
35
(
2
),
1
7
.
https://doi.org/10.1029/2007GL032487
.
Staudinger
M.
,
Stahl
K.
&
Seibert
J.
2014
A drought index accounting for snow
.
Water Resources Research
50
(
10
),
7861
7872
.
https://doi.org/10.1002/2013WR015143
.
Stehman
S. V.
1997
Selecting and interpreting measures of thematic classification accuracy
.
Remote Sensing of Environment
62
(
1
),
77
89
.
doi:10.1016/S0034-4257(97)00083-7
.
Sun
M.
&
Kim
G.
2016
Quantitative monthly precipitation forecasting using cyclostationary empirical orthogonal function and canonical correlation analysis
.
Journal of Hydrologic Engineering
21
(
1
),
1
13
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0001244
.
Sundararajan
K.
,
Garg
L.
,
Srinivasan
K.
,
Bashir
A. K.
,
Kaliappan
J.
,
Ganapathy
G. P.
,
Selvaraj
S. K.
&
Meena
T.
2021
A contemporary review on drought modeling using machine learning approaches
.
Computer Modeling in Engineering & Sciences
128
(
2
),
447
487
.
https://doi.org/10.32604/cmes.2021.015528
.
Svoboda
M.
,
LeComte
D.
,
Hayes
M.
,
Heim
R.
,
Gleason
K.
,
Angel
J.
&
Stooksbury
D.
2002
The drought monitor
.
Bulletin of the American Meteorological Society
83
(
8
),
1181
1190
.
Ting
K. M.
2011
Precision and recall
. In:
Encyclopedia of Machine Learning
(
Sammut
C.
&
Webb
G. I.
, eds).
Springer
,
Boston, MA
.
https://doi.org/10.1007/978-0-387-30164-8_652.
Vapnik
V.
1995
The Nature of Statistical Learning Theory
.
Springer
,
New York
.
Vicente-Serrano Sergio
M.
,
Juan
S. B.
&
López-Moreno
I.
2010
A multiscalar drought index sensitive to global warming, the standardized precipitation evapotranspiration index
.
Journal of Climate
23
,
1696
1718
.
https://doi.org/10.1175/2009JCLI2
.
Wang
H.
,
Zheng
H.
2013
True positive rate
. In:
Encyclopedia of Systems Biology
(
Dubitzky
W.
,
Wolkenhauer
O.
,
Cho
K. H.
&
Yokota
H.
, eds).
Springer
,
New York, NY
.
https://doi.org/10.1007/978-1-4419-9863-7_255
.
Xia, Y., Mitchell, K., Ek, M., Cosgrove, B., Sheffield, J., Luo, L., Alonge, C., Wei, H., Meng, J., Livneh, B., Duan, Q. & Lohmann D.
2012
Continental-scale water and energy flux analysis and validation for the North American Land Data Assimilation System project phase 2 (NLDAS-2): 1. Intercomparison and application of model products
.
Journal of Geophysical Research
117
,
D03109
.
https://doi.org/10.1029/2011JD016051
.
Yang
T.
,
Zhou
X.
,
Yu
Z.
,
Krysanova
V.
&
Wang
B.
2015
Drought projection based on a hybrid drought index using artificial neural networks
.
Hydrological Processes
29
(
11
),
2635
2648
.
https://doi.org/10.1002/hyp.10394
.
Zhang
E.
,
Zhang
Y.
2009
F-measure
. In:
Encyclopedia of Database Systems
(
Liu
L.
&
Özsu
M. T.
, eds).
Springer
,
Boston, MA
.
https://doi.org/10.1007/978-0-387-39940-9_483
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).