This study aimed to predict monthly flows using an adaptive neuro-fuzzy inference system (ANFIS) and wavelet-ANFIS (W-ANFIS) and to determine the effect of wavelet transformation on the success of the machine learning model. For this purpose, the model inputs are divided into three subcomponents with Daubechies 10 mother wavelets. Subcomponents with the highest correlation were chosen as inputs. The most suitable models were selected by dividing the inputs into 3–7 sub-sets, using 11 different lagged input combinations, and testing various membership functions. In establishing the ANFIS model, 75% of the data were used for training and 25% for testing. The performance of ANFIS models was evaluated with root mean square error, Pearson correlation coefficient, determination coefficients, and Taylor diagram. A model with two sub-sets, a hybrid learning algorithm, a Gbellmf membership function, and 400 iterations was selected as the most suitable. It was concluded that the W-ANFIS model used with the wavelet transform method increased the success of the established ANFIS model. Moreover, it was suggested that the W-ANFIS hybrid machine learning model established in the study can be used effectively in similar climatic regions fed by snowmelt and dominated by a semi-arid climate.

  • A hybrid approach is designed by combining the wavelet transform ANFIS model to estimate monthly stream flows.

  • The wavelet-ANFIS model is quite successful compared to the single ANFIS model in semi-arid climates fed by snowmelt.

  • The hybrid wavelet-ANFIS model, established using streamflow values with a delay of up to 6 months in estimating monthly stream flows, has the highest accuracy.

Graphical Abstract

Graphical Abstract
Graphical Abstract

Streamflow is one of the most critical components of the hydrological cycle. The successful prediction of monthly streamflow is vital for the safe design of water structures and the effective management of water resources. It also dramatically impacts many water-related sectors, such as construction, energy, agriculture, industry, and tourism. Therefore, having comprehensive knowledge about the behavior of water resources is of great significance for the design of water structures, water management, managing meteorological disaster risks such as drought and flood, determining the amount of drinking and utility water for the future, revealing the hydroelectric potential of the region, and increasing the economic and social welfare of the country.

There are generally two possible approaches to streamflow prediction. The first is the rainfall–runoff model, which depends on weather conditions, land use, the structure of the drainage basin, vegetation, ground humidity, infiltration, and evapotranspiration parameters (Kuchment et al. 1996; Mehr et al. 2013). The second is the pattern recognition methodology based on previous streamflows. However, this approach does not require a thorough understanding of the physical laws, and the data requirement is not as comprehensive as the rainfall–runoff model (Nourani et al. 2011).

Wavelet transform (WT) has recently become a popular tool with its ability to explain both spectral and temporal information in the signal. Wavelets are mathematical functions that divide the considered data into different frequency components and work on features with a resolution matched to their scales. Wavelet analysis is a helpful method for analyzing nonstationary variance to varying frequencies in a time series. The discrete wavelet transform (DWT) method divides a time series into various sub-signals and provides useful information. Thus, the artificial intelligence (AI) model's predictive power can be improved (Kim & Valdés 2003; Nourani et al. 2011).

Researchers' interest in hybrid models created by WT and AI techniques has constantly increased. Some of the highlights of these studies; Partal (2007) divided the precipitation data into sub-series by WT and used it as input to the adaptive network-based fuzzy inference system (ANFIS) model to estimate the daily precipitation data. It was determined that the wavelet-ANFIS (W-ANFIS) model used with the WT is much superior to the ANFIS model. Adamowski & Sun (2010) used discrete WT and artificial neural network (ANN) models for streamflow prediction. The performance of wavelet-neural network (WNN) models is compared with ANN models for streamflow predictions in Cyprus 1 and 3 days later. It was demonstrated that the WNN model provides more successful streamflow predictions than the ANN models. Yarar (2014) used ANFIS and W-ANFIS methods to model the monthly flow values of five different flow observation stations in the Sakarya Basin. It was determined that the results of the W-ANFIS model were more successful than the ANFIS model within five stations. Seo et al. (2015) applied ANN and ANFIS models and wavelet-based W-ANN and W-ANFIS models for daily water level estimation. As a result, it was seen that the results of the W-ANN and W-ANFIS models were better than single models. Seo & Kim (2016) used ANN, ANFIS, wavelet packet-based ANN (WP-ANN), and wavelet packet-based ANFIS (WP-ANFIS) methods to predict daily river stages in the Gam Stream Watershed, South Korea. As a result, the WP-ANN and WP-ANFIS models are superior to the ANN and ANFIS models. Taylan (2018) integrated the DWT technique and ANFIS techniques to predict the streamflows in the Dalaman river. The results of the study were compared with AR-ANFIS, an integrated auto-regressive (AR) process. In conclusion, it is noteworthy that the W-ANFIS model is superior to AR-ANFIS. Choubin et al. (2019) found that streamflow estimates generated from pseudo-unmetered basins, the procedure applied, and the semi-distributed precipitation-runoff model is useful techniques for estimating streamflow in unmetered basins. Kisi et al. (2019) applied various machine learning models to predict streamflow in the Mediterranean region of Turkey. According to the analysis results, by including the climate signal information in the streamflow prediction, LSSVM gave superior results than other models. Khazaee Poul et al. (2019) employed ANN and ANFIS and K-nearest neighbors (KNN) and multi-linear regression (MLR) methods and their wavelet-based combination to estimate the monthly streamflows in the St. Clair River. It was proven that the prediction performance of machine learning models increases with WT. Mosavi et al. (2021) propose a regionalization method for flow estimation of the Gharehsoo Hydrometry Station in Ardabil Province in northern Iran. For this purpose, it was applied the fuzzy c-means clustering method. As a result of the study, reasonable flow estimates were obtained. Abda et al. (2021) used W-ANFIS to predict daily flow rates in the Tizi Ouzou region. As a result, it was proposed to establish the W-ANFIS model using the db7 mother wavelet to estimate the daily flow rate. Katipoğlu (2022b) applied the ANN technique to predict the Karasu river's monthly flow in the Euphrates basin. According to the results, it was revealed that potential evapotranspiration values improved the performance of the flow estimation model. Katipoğlu (2022a) estimated monthly streamflow in the Amasya, Turkey, with a hybrid model that combines ANN and DWT. The performance of various mother wavelets was compared, and it identified that the Coiflet 5 mother wavelet gave the best results. The ANFIS model combines the learning capability of the neural network and the decision-making capability of fuzzy logic (FL), thus performing both learning and decision-making using the information extracted from the dataset. In this study, the ANFIS model was preferred because it is a powerful model that combines these two approaches. In addition, Guimarães Santos & Silva (2014); Seo et al. (2015); Kim et al. (2016); Nourani et al. (2019) studies, suggested to use the db10 wavelet in hydrological and meteorological forecasts due to being one of the best wavelets and improving the performance of the machine learning model. When the studies were examined, it was revealed that the studies on the W-ANFIS method with db10 wavelet in flow estimation have limited information. For this reason, the presented research is divided into various subcomponents by db10 WT for assessing the flow data in the Bitlis River. The components with high correlation are collected and presented to the ANFIS model.

This study aimed to increase the success of the ANFIS model by preprocessing the data with DWT in estimating the monthly stream flows of the Bitlis River, Turkey. For this purpose, inputs were decomposed to sub-signals by the db10 wavelet. Obtained sub-signals are presented as input to AI models for flow prediction. The main purpose of flow estimation is to prepare for natural disasters such as floods and droughts in the planning and operation of water resources. Furthermore, the streamflow estimation model was applied in the Bitlis River, which has a semi-arid climate, and the flow values depend on snowmelt. Therefore, evaluating the performance of the W-ANFIS model for regions with similar flow and climate regimes is crucial.

Study area and data

The location map of the Bitlis River, Baykan station numbered 2610 in the Tigris Basin is shown in Figure 1. The monthly streamflow data used in the study were taken from the General Directorate of Electrical Power Resources Survey and Development Administration of Turkey. Since the study area has an important geopolitical position within the scope of Turkey's agricultural production and transboundary waters, estimating the flows is of great importance.
Figure 1

Location map of the Baykan station on the Bitlis River in the Tigris Basin (EIEI 2009).

Figure 1

Location map of the Baykan station on the Bitlis River in the Tigris Basin (EIEI 2009).

Close modal

The Tigris Basin is the second-largest basin in Western Asia and borders Turkey, Iraq, Iran, and Syria (Muratoglu & Yuce 2016). Although the Tigris Basin does not have a very mountainous structure, the high mountains in the north significantly affect the basin's climate. The region is hot during the summer months because the mountains prevent the air mass from the north from descending to the south. The high-pressure area, effective in the winter, causes the winter months to be cold in the basin. The summer months are very hot because the basin is under the desert climate in the south, and the influence of the high mountains in the north prevents the air mass from entering the basin. The drainage area of the basin is 51,500 km2, and the average temperature is 16–18 °C. In addition, the annual precipitation average reaches 450–700 mm in plains and 800–1,500 mm in mountainous lands (Alashan 2010).

Adaptive network-based fuzzy inference system

ANFIS consists of a combination of ANN and FL. In the FL method, it is critical to specify the membership degrees of the input and output variables. While ANFIS determines fuzzy sets and rules from the training dataset, the learning ability of ANN is applied. Therefore, ANFIS can be evaluated as a three-layer feed-forward ANN model. The first layer is where the inputs are presented in this network structure. The second layer is where the input variables and membership functions are determined. Rules are created according to the third layer's Sugeno FL inference system. In the fourth layer, it calculates the normalized value of each rule. In the fifth layer, the weighted result values of a given rule are calculated. Finally, in the sixth layer, the actual value of the ANFIS model is acquired by summing the output value of each node in the previous layer (Jang et al. 1997). The network structure of the ANFIS model is given in Figure 2.
Figure 2

An ANFIS model network structure (Kaya 2018).

Figure 2

An ANFIS model network structure (Kaya 2018).

Close modal

Wavelet analysis

A wavelet can be defined as a small part of the wave and is a time-limited vibration signal (Partal 2007). Wavelets are mathematical functions that split data into different frequency components. The main idea behind the WT is to analyze the signals according to a specified scale (Graps 1995). Wavelet analysis; analyzes using the time-scale domain. The essential feature of wavelet analysis is that it can perform local analysis. However, compared to traditional methods, wavelet analysis can be done by compressing or purifying a signal without destroying the original signal. WTs are divided into continuous wavelet transform (CWT) and DWT. In the CWT, the scaling and wavelet function parameters are constantly changing. Therefore, the calculation of wavelet coefficients is complex and takes time. In the DWT, the signal is split into several scales. Thus, the DWT is used more frequently (Uyar et al. 2007). In the DWT, binary scale and time step are used, so each component is divided into time series of scale values as two and multiples as two. In Equation (1), the wavelet function of the DWT is given (Grossmann & Morlet 1984).
(1)
Here, ψ shows the mother wavelet; m and n are the translation parameters of the wavelet in the scale and time axis, respectively. s0 indicates a dilation step whose value is unchanged, and τ0 symbolizes the location variable. The wavelet function created by using a multiple of two can be defined as in Equation (2).
(2)
The most critical step of wavelet analysis is determining the wavelet decomposition level and type. The most widely used wavelet types are Daubechies and Morlet wavelets (Benaouda et al. 2006). It is also suggested to use Daubechies (db10) wavelet (Seo et al. 2015). It was recommended to use the db10 wavelet, which better depicts time series data (Nalley et al. 2012; Santos & da Silva 2014). Therefore, this study divided the data into sub-series with db10 mother wavelet. Equation (3) is used to characterize the optimal decomposition level of the wavelets used (Nourani et al. 2009).
(3)
where L is the decomposition level, N is the number of time series data, and int[·] is the integer-part function. Three decomposition levels were selected in this study due to the 476 data used.

W-ANFIS model

The following steps are followed for the establishment of this model:

  • Input and output data are separated as training and test sets.

  • Each input data are decomposed into detailed (D) and approximate (A) components via DWT.

  • For each input, the effective one of each sub-series is selected with the help of correlation analysis.

  • The new input of ANFIS will be the most effective sub-time series component for each input variable, and the original output time series is the output of ANFIS.

  • The accuracy of the W-ANFIS model is evaluated with various statistical criteria obtained using the test set (Abdourahmane 2019).

Performance evaluation criteria

This study evaluated the estimation efficiency of ANFIS and W-ANFIS models using statistical indicators such as root mean square error (RMSE), determination coefficient (R2), and Taylor diagram. The RMSE values close to 0 and the R, R2 values close to 1 indicate the high success of the model. The statistical indicators used were calculated as in Equations (4)–(6). In addition, a detailed visual comparison of the results was made using the Taylor diagram. The Taylor diagram (2001) expresses how closely a set of patterns matches observations. The model similarity is calculated by standard deviation, correlation coefficient, and mean square root center difference.
(4)
(5)
(6)
where bi is the observed value of models, yi is the output produced by models, bi–yi is the value of the errors, bavg is the average of b values, yavg is the average of y values, and n is the number of data.

In the study, ANFIS and W-ANFIS methods were used to estimate the monthly streamflow data of the Bitlis River.

In order to establish the model, the partition ratio of the data must first be selected. In the literature, there is no definite information about the rate of training and testing used in modeling AI techniques. However, in studies in general, the value is between 70 and 90% of the entire data length for education (Partal & Cigizoglu 2008). In this study, 75% of the data for the training of the AI model and the rest was reserved as tests. 1970–1999 and 2000–2009 were used for training and testing, respectively. The inputs were divided into various sub-series to select the most suitable models, and various membership functions and iterations were tried. Establishing the W-ANFIS model, the lagged streamflow data were divided into sub-series by DWT. In the design of the ANFIS model, eleven different scenarios were planned according to autocorrelation and partial autocorrelation (Figure 3). These scenarios consist of various combinations of streamflow data with a delay between 1 and 11 days (Table 1). While developing W-ANFIS models, db10 wavelets and three decomposition levels were used according to Equation (3). It was chosen according to the correlation values of the stream flows separated into sub-signals by the WT to determine the input combinations of the W-ANFIS model (Table 2). Correlation values greater than 0.2 (Partal 2007), greater than 0.1 (Tiwari & Chatterjee 2011), and greater than 0.3 were used as the limit value for selecting effective subcomponents. According to the literature, the sum of the D1, D2, D3, and A1 components with the highest correlation was presented as an input to the W-ANFIS model. In the selection of the inputs, the sum of the subcomponents with a correlation coefficient above 0.1 with the output according to the literature was included in the model.
Table 1

Comparison of designed ANFIS scenarios

ModelInputOutputIterationSub-setsMembership functionTraining RMSETest RMSETraining R2Test R2
Qt-1 Qt 1,000 Gauss2mf 15.03 12.97 0.48 0.40 
Qt-1, Qt-2 Qt 2,000 7-7 Gauss2mf 10.87 10.91 0.73 0.62 
Qt-1, Qt-2, Qt-3 Qt 100 4-4-4 Gaussmf 12.34 10.11 0.65 0.67 
4 Qt-1, Qt-2, Qt-3, Qt-4 Qt 5,000 2-2-2-2 Trimf 12.01 10.07 0.67 0.68 
Qt-1, Qt-2, Qt-3, Qt-4, Qt-5 Qt 100 3-3-3-3-3 Gbellmf 11.90 11.23 0.68 0.60 
Qt-1, Qt-2, Qt-3, Qt-4, Qt-5, Qt-6 Qt 2,000 All 2 Trimf 11.82 10.64 0.68 0.64 
Qt-1, Qt-2, …, Qt-7 Qt 300 All 2 Trimf 10,94 10.80 0.73 0.65 
Qt-1, Qt-2, …., Qt-8 Qt 50 All 2 Trimf 10.95 12.19 0.76 0.58 
Qt-1, Qt-2, …, Qt-9 Qt 50 All 2 Trimf 9.52 12.64 0.79 0.58 
10 Qt-1, Qt-2, …, Qt-10 Qt 10 All 2 Gaussmf 10.85 11.21 0.73 0.62 
11 Qt-1, Qt-2, …, Qt-11 Qt 20 All 2 Trimf 6.38 12.59 0.90 0.57 
ModelInputOutputIterationSub-setsMembership functionTraining RMSETest RMSETraining R2Test R2
Qt-1 Qt 1,000 Gauss2mf 15.03 12.97 0.48 0.40 
Qt-1, Qt-2 Qt 2,000 7-7 Gauss2mf 10.87 10.91 0.73 0.62 
Qt-1, Qt-2, Qt-3 Qt 100 4-4-4 Gaussmf 12.34 10.11 0.65 0.67 
4 Qt-1, Qt-2, Qt-3, Qt-4 Qt 5,000 2-2-2-2 Trimf 12.01 10.07 0.67 0.68 
Qt-1, Qt-2, Qt-3, Qt-4, Qt-5 Qt 100 3-3-3-3-3 Gbellmf 11.90 11.23 0.68 0.60 
Qt-1, Qt-2, Qt-3, Qt-4, Qt-5, Qt-6 Qt 2,000 All 2 Trimf 11.82 10.64 0.68 0.64 
Qt-1, Qt-2, …, Qt-7 Qt 300 All 2 Trimf 10,94 10.80 0.73 0.65 
Qt-1, Qt-2, …., Qt-8 Qt 50 All 2 Trimf 10.95 12.19 0.76 0.58 
Qt-1, Qt-2, …, Qt-9 Qt 50 All 2 Trimf 9.52 12.64 0.79 0.58 
10 Qt-1, Qt-2, …, Qt-10 Qt 10 All 2 Gaussmf 10.85 11.21 0.73 0.62 
11 Qt-1, Qt-2, …, Qt-11 Qt 20 All 2 Trimf 6.38 12.59 0.90 0.57 

Note: Bold characters indicate the most successful ANFIS model.

Table 2

Correlation coefficients of sub-series

Sub-seriesD1D2D3A3
Qt-1 −0.17 0.16 0.64 0.35 
Qt-2 0.21 0.34 0.42 0.32 
Qt-3 0.10 0.41 0.02 0.27 
Qt-4 0.15 0.17 0.37 0.30 
Qt-5 0.08 0.25 0.61 0.16 
Qt-6 0.12 0.35 0.70 0.11 
Qt-7 0.04 0.10 0.60 0.07 
Qt-8 0.14 0.18 0.32 0.03 
Qt-9 0.00 0.28 0.01 0.02 
Qt-10 0.17 0.08 0.29 0.01 
Qt-11 0.03 0.20 0.56 0.01 
Sub-seriesD1D2D3A3
Qt-1 −0.17 0.16 0.64 0.35 
Qt-2 0.21 0.34 0.42 0.32 
Qt-3 0.10 0.41 0.02 0.27 
Qt-4 0.15 0.17 0.37 0.30 
Qt-5 0.08 0.25 0.61 0.16 
Qt-6 0.12 0.35 0.70 0.11 
Qt-7 0.04 0.10 0.60 0.07 
Qt-8 0.14 0.18 0.32 0.03 
Qt-9 0.00 0.28 0.01 0.02 
Qt-10 0.17 0.08 0.29 0.01 
Qt-11 0.03 0.20 0.56 0.01 

Note: Bold characters indicate the sub-signals selected as input to the model.

Figure 3

Autocorrelation and partial autocorrelation functions of streamflow values.

Figure 3

Autocorrelation and partial autocorrelation functions of streamflow values.

Close modal

W-ANFIS model results

The ANFIS model was established by testing various model parameters such as iteration number, sub-sets number, and membership function type. The comparison of the statistical criteria of the based model is indicated in Table 1. When the R2 and error values of the training and test data were compared, the model 4 combination was chosen as the most appropriate model. Accordingly, the lowest estimation error was obtained using the streamflow values with a delay of 1–4 months as input to predict the streamflow data. In addition, when the effect of lagged streamflow values on the estimation performance of a single ANFIS model is evaluated, lagged values from 1 to 4 increase the estimation power. However, it was seen that the success of estimating from 5 to 11 decreases in general (Table 1).

The time-scale variation of the observed and predicted streamflow data of model 4 and the scattering diagrams are presented in Figure 4. The predicted and observed streamflow time series overlap to a large extent. However, it was seen that the estimation error increases at the maximum values. For this reason, to improve the model's success, it was decided to establish the W-ANFIS model by separating the inputs into subcomponents with the WT.
Figure 4

Results from the ANFIS model 4.

Figure 4

Results from the ANFIS model 4.

Close modal

W-ANFIS model results

In designing the W-ANFIS model, the input data are first divided into subcomponents with the db10 wavelet. Then, decomposed detail and approximate components of Qt-4, Qt-3, Qt-2, Qt-1 inputs selected as examples by WT are given in Figure 5. The W-ANFIS model was established according to the correlation of the obtained subcomponents with the output value (Qt).
Figure 5

Sub-series of Qt-1, Qt-2, Qt-3, Qt-4, time series separated by wavelet analysis.

Figure 5

Sub-series of Qt-1, Qt-2, Qt-3, Qt-4, time series separated by wavelet analysis.

Close modal

The selection of input variables is one of the most critical processes in optimizing ANFIS models. To develop the ANFIS models used in this study, four input combinations consisting of streamflow values with a lag time of up to 11 months were evaluated. These inputs used are divided into sub-signals with db10 DWT. The correlation coefficients between these inputs and outputs are presented in Table 2.

Table 2 shows the correlation coefficients of the approximate and detail components of the lagged streamflow values used in the setup of the W-ANFIS model with the output variable. As a result of the correlation analysis, the sum of the inputs with a correlation greater than 0.1 was used as an input to the W-ANFIS model. In WT, the approximation and detail components represent two different parts of the transformed signal. The approximation component, the low-frequency component, describes the general bias or soft part of the original signal. It shows the important properties of the signal to interpret the basic structure of the signal. The detail component, also known as the high-frequency component, represents the fine-scale or oscillating nature of the original signal. It can reveal signal features that the proximity component cannot detect. It is thought that the variability of the results obtained is due to the differences in the frequencies and delay values of the details and approximate components used.

The comparison of the statistical criteria of the most suitable W-ANFIS models is presented in Table 3. When W-ANFIS models are compared, it is noteworthy that model 6 is the most successful. Accordingly, it is seen that the best estimation model can be established by separating the 400 iterations, the Gbellmf membership function, and the inputs into two sub-series.

Table 3

Comparison of designed W-ANFIS scenarios

ModelInputOutputIterationSub-setsMembership functionTraining RMSETest RMSETraining R2Test R2
Q(t–1)(D2+D3) Q(t) 1,000 Gbellmf 15.68 13.61 0.44 0.41 
Q(t–1)(D2+D3), Q(t–2)(D3+A3) Q(t) 1,000 5 5 Gauss2mf 13.70 14.57 0.57 0.35 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4) Q(t) 800 4 4 4 Gbellmf 9.92 12.09 0.78 0.53 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4), Q(t–4)(D1) Q(t) 100 3-3-3-3 Gaussmf 9.09 9.26 0.81 0.73 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4), Q(t–4)(D1), Q(t–5)(D2+A3) Qt 200 3-3-3-3-3 Trimf 6.59 10.35 0.90 0.66 
Q(t1)(D2+D3), Q(t2)(D3+A3), Q(t3)(A4), Q(t4)(D1), Q(t5)(D2+A3), Q(t6)(D2+A3) Qt 400 All 2 Gbellmf 7.49 9.13 0.87 0.73 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–7)(D2) Qt 100 All 2 Gaussmf 6.58 9.58 0.90 0.71 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …., Q(t–8)(D1) Qt 50 All 2 Gaussmf 5.40 10.19 0.93 0.69 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–9) Qt 20 All 2 Gbellmf 4.67 11.63 0.95 0.63 
10 Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–10)(D3) Qt 10 All 2 Trimf 38.72 31.07 0.10 0.02 
11 Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–11)(D2+D3) Qt 20 All 2 Trimf 37.52 30.89 0.06 0.01 
ModelInputOutputIterationSub-setsMembership functionTraining RMSETest RMSETraining R2Test R2
Q(t–1)(D2+D3) Q(t) 1,000 Gbellmf 15.68 13.61 0.44 0.41 
Q(t–1)(D2+D3), Q(t–2)(D3+A3) Q(t) 1,000 5 5 Gauss2mf 13.70 14.57 0.57 0.35 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4) Q(t) 800 4 4 4 Gbellmf 9.92 12.09 0.78 0.53 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4), Q(t–4)(D1) Q(t) 100 3-3-3-3 Gaussmf 9.09 9.26 0.81 0.73 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), Q(t–3)(A4), Q(t–4)(D1), Q(t–5)(D2+A3) Qt 200 3-3-3-3-3 Trimf 6.59 10.35 0.90 0.66 
Q(t1)(D2+D3), Q(t2)(D3+A3), Q(t3)(A4), Q(t4)(D1), Q(t5)(D2+A3), Q(t6)(D2+A3) Qt 400 All 2 Gbellmf 7.49 9.13 0.87 0.73 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–7)(D2) Qt 100 All 2 Gaussmf 6.58 9.58 0.90 0.71 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …., Q(t–8)(D1) Qt 50 All 2 Gaussmf 5.40 10.19 0.93 0.69 
Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–9) Qt 20 All 2 Gbellmf 4.67 11.63 0.95 0.63 
10 Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–10)(D3) Qt 10 All 2 Trimf 38.72 31.07 0.10 0.02 
11 Q(t–1)(D2+D3), Q(t–2)(D3+A3), …, Q(t–11)(D2+D3) Qt 20 All 2 Trimf 37.52 30.89 0.06 0.01 

Note: Bold characters indicate the most successful model.

When the effect of the lagged streamflow values on the prediction performance of the W-ANFIS model was evaluated, the model's prediction accuracy increased when the lagged values from 1 to 6 were used as inputs. However, when the monthly lag values from 7 to 11 were used, it was seen that the estimation success generally decreased. As a result, the most successful streamflow estimation was obtained with the W-ANFIS model (model 6), which was preprocessed by separating the most successful model inputs into sub-signals (Table 3). When the statistical parameters of the ANFIS and W-ANFIS models were compared, it was determined that the W-ANFIS model was more successful than the ANFIS model, except for models 2, 3, 10, and 11.

A Taylor diagram visually revealed the agreement between the modeled and observed behavior. This diagram evaluates model success by comparing statistical criteria such as correlation coefficient, standard deviation, and RMSE. Accordingly, the statistical results of eleven different ANFIS models for which the flow was estimated were compared during the testing and training phases. Although M1 was the best model in training the model, the M4 model was determined as the best model in the testing phase because it had the smallest error and the highest R2 value and was closest to the x-axis (observed data) (Figure 6).
Figure 6

Taylor's diagrams for the prediction of streamflows by the ANFIS model.

Figure 6

Taylor's diagrams for the prediction of streamflows by the ANFIS model.

Close modal
In Figure 7, Taylor diagrams of the training and testing phases of W-ANFIS models are shown. According to the statistical indicators obtained with this diagram, it was seen that the M9 model in the training phase and the M6 model in the testing phase show the best performance. However, the M4 model showed the results closest to the observed values, with the lowest error rate and most relative to the x-axis. For this reason, the M4 was chosen as the top model.
Figure 7

Taylor's diagrams for the prediction of streamflows by the W-ANFIS model.

Figure 7

Taylor's diagrams for the prediction of streamflows by the W-ANFIS model.

Close modal
The structure of the most successful W-ANFIS model is shown in Figure 8. The established model consists of six inputs and an output variable. In addition, the temporal variation and scattering diagrams of the predicted and observed streamflow data of the established model 6 are shown in Figure 9. The estimated and observed streamflow data overlap to a large extent. In addition, as seen from the statistical indicators and distribution of the model, the W-ANFIS model showed more successful results than the ANFIS model. It was proven that the ANFIS model could be improved by collecting the highly correlated entries separated into sub-series by WT.
Figure 8

Structure of the W-ANFIS model 6.

Figure 8

Structure of the W-ANFIS model 6.

Close modal
Figure 9

Results of the W-ANFIS model 6.

Figure 9

Results of the W-ANFIS model 6.

Close modal

In this study, ANFIS and W-ANFIS methods were used to estimate the monthly flows of the Bitlis River. It was revealed that the W-ANFIS model, which was decomposed into subcomponents with db10 wavelet and three decomposition levels, is more successful than the ANFIS model. Terzi & Barak (2015) used ANN and W-ANN methods to predict Söğütlühan flows. It is noteworthy that the W-ANN models obtained by applying the WT are more successful than the ANN models obtained with the original series. Yarar (2014) used ANFIS and W-ANFIS methods to estimate monthly flow values. The data were divided into sub-bands at different frequencies by the WT method, and all bands were collected by evaluating within themselves. The W-ANFIS method made more successful predictions than single ANFIS. Badrzadeh et al. (2018) applied ANFIS and wavelet neuro-fuzzy methods to estimate daily, weekly, and monthly flows at the Australian Railway Parade station. It was determined that the success of the ANFIS model increases when the Haar, Coiflet, and Daubechies main wavelets are used in estimation by separating the inputs into subcomponents. Yabar & Aydin (2020) estimated the daily flow series with W-ANN in the Bitlis River. Although W-ANN and ANN models gave successful results, the ANN model was insufficient to predict extreme peak flow rates. As a result, it was determined that W-ANN models are very successful in estimating flow series in Rivers and can be used in modeling hydrological processes. Dalkiliç & Hashimi (2020) applied the ANN, ANFIS, and WNN to predict the daily streamflow. As a result, it was concluded that the WNN model obtained the most successful predictions. Based on the literature, it was revealed that the use of WT significantly increases the prediction success of the models, even if the time-scale (monthly or daily) or input variables of the data used are different. This indicates that the presented study largely overlaps with the literature. It was deduced that the historical current data, which is decomposed into sub-signals by WT in different regions and time periods, when used with machine learning models, significantly improves the performance of the model. Khazaee Poul et al. (2019) combined WT of multiple linear regression (MLR), KNN, ANN, and ANFIS techniques for the estimation of river flows in the St. Clair River of the US and Canada. As a result of the study, it was revealed that the WT significantly increased the performance of the single models, and the W-ANFIS hybrid model showed successful results for some data scenarios. Khazaee Poul et al. (2019) supports the current study in terms of its satisfactory performance in streamflow prediction. Yilmaz et al. (2022) combined ANN and DWT techniques to predict streamflow data in four stations on the Çoruh Basin. It has been determined that the hybrid model built by ANN and DWT shows more successful results than single ANN with RMSE: 14.29 and R2 0.817 values. When the obtained outputs are compared, it has been revealed that the DWT-ANN hybrid model is slightly successful in the DWT-ANFIS model. In addition, Saraiva et al. (2021); Güneş et al. (2021); Momeneh & Nourani (2022); Wang et al. (2022) studies suggested using the hybrid model by separating the model into sub-signals with the WT to increase the performance of the AI technique. The mentioned studies align with the work done in improving the performance of streamflow estimation models in monthly and daily time periods with WT. Reliable streamflow estimates can help water managers assess the availability of sufficient water to meet the needs of agriculture, industry, and communities during dry periods. In addition, accurate streamflow estimates are essential for evaluating the need for water to sustain aquatic ecosystems and their dependent plants and animals.

This study estimated Bitlis River's monthly average streamflow values by ANFIS and W-ANFIS methods. Models were established using 476 monthly flow rates. While the models were being built, about 75% of the data were used for training and 25% for testing. In addition, for the W-ANFIS model, which requires a preprocessing process, the data are divided into four bands by WT. The prominent results of the study can be listed as follows:

  • The ANFIS model obtained using input values with a delay of up to 4 months is superior to the single ANFIS models.

  • The most effective model was the W-ANFIS model obtained using input values lagged up to 6 months. It was observed that the success of the model and effectiveness of the W-ANFIS model increased with the number of inputs.

  • The W-ANFIS model was established by combining the WT and the ANFIS model, which can be used effectively in improving the prediction performance of the ANFIS models.

  • The most appropriate W-ANFIS model, created by collecting the sub-signals related to the output values, increased the R2 values while reducing the training and testing errors in the streamflow estimation.

  • Trimf and Gbellmf were the most effective membership functions in obtaining the best estimates in the ANFIS and W-ANFIS models, respectively.

  • Db10 WT can be used effectively to predict monthly average streamflow.

This study was carried out at the Baykan station 2610, in streams fed by snowmelt and the semi-arid climate zone. The study results contribute to using the W-ANFIS model in regions with similar climate structures. In addition, the study's results can help water resource managers, engineers, and planners make informed decisions about water management and control and can also help mitigate the effects of floods and droughts.

For future studies, it is thought that the prediction success may be increased a little more by using climatic factors such as precipitation, evaporation, and transpiration, which have a high relationship with output values as inputs to the model.

The author thanks the General Directorate for State Hydraulic Works of Turkey for the streamflow data provided; the Editor and the anonymous reviewers for their contributions to the content and development of this paper.

All relevant data are available from an online repository or repositories. The data is available at the link below (https://www.dsi.gov.tr/Sayfa/Detay/744).

The authors declare there is no conflict.

Abda
Z.
,
Chettih
M.
&
Zerouali
B.
2021
Assessment of neuro-fuzzy approach based different wavelet families for daily flow rates forecasting
.
Modeling Earth Systems Environment
7
(
3
),
1523
1538
.
https://doi.org/10.1007/s40808-020-00855-1.
Abdourahmane
Z. S.
2019
Analysis and Forecast of Meteorological Drought Associated with Climate Indices in Niger: A Combined Wavelet-Anfis Approach
.
Ph.D. Thesis
,
Atatürk University Graduate School of Natural and Applied Sciences Department of Civil Engineering
.
Adamowski
J.
&
Sun
K.
2010
Development of a coupled wavelet transform and neural network method for flow forecasting of non-perennial rivers in semi-arid watersheds
.
Journal of Hydrology
390
(
1–2
),
85
91
.
https://doi.org/10.1016/j.jhydrol.2010.06.033.
Alashan
S.
2010
Determination of Flow Characteristics of the Dicle Basin (Turkish: Dicle Nehri Akış Karakteristiklerinin Tayin Edilmesi)
.
Master's thesis
,
Fırat University Institute of Sciences
.
Badrzadeh, H., Sarukkalige, R., & Jayawardena, A. W. 2018. Intermittent stream flow forecasting and modelling with hybrid wavelet neuro-fuzzy model. Hydrology Research, 49 (1), 27-40. https://doi.org/10.2166/nh.2017.163
Benaouda
D.
,
Murtagh
F.
,
Starck
J. L.
&
Renaud
O.
2006
Wavelet-based nonlinear multiscale decomposition model for electricity load forecasting
.
Neurocomputing
70
,
139
154
.
https://doi.org/10.1016/j.neucom.2006.04.005.
Choubin
B.
,
Solaimani
K.
,
Rezanezhad
F.
,
Roshan
M. H.
,
Malekian
A.
&
Shamshirband
S.
2019
Streamflow regionalization using a similarity approach in ungauged basins: application of the geo-environmental signatures in the Karkheh River Basin, Iran
.
Catena
182
,
104128
.
https://doi.org/10.1016/j.catena.2019.104128.
EİEİ
2009
General Directorate of Electrical Power Sources Survey and Development Administration, Stream Flow Observation Annals
.
Graps
A.
1995
An introduction to wavelets
.
J IEEE Computational Science Engineering
2
(
2
),
50
61
.
Grossmann
A.
&
Morlet
J.
1984
Decomposition of Hardy functions into square integrable wavelets of constant shape
.
SIAM Journal on Mathematical Analysis
15
,
723
736
.
https://doi.org/10.1137/0515056.
Guimarães Santos
C. A.
&
Silva
G. B. L. D.
2014
Daily streamflow forecasting using a wavelet transform and artificial neural network hybrid models
.
Hydrological Sciences Journal
59
(
2
),
312
324
.
https://doi.org/10.1080/02626667.2013.800944.
Güneş
M.
,
Parim
C.
,
Yıldız
D.
&
Büyüklü
A.
2021
Predicting monthly streamflow using a hybrid wavelet neural network: case study of the Çoruh River Basin
.
Polish Journal of Environmental Studies
30
(
4
).
https://doi.org/10.15244/pjoes/130767.
Jang
J. S. R.
,
Sun
C. T.
&
Mizutani
E.
1997
Neuro-fuzzy and soft computing-a computational approach to learning and machine intelligence [Book review]
.
IEEE Transactions on Automatic Control
42
(
10
),
1482
1484
.
Katipoğlu
O. M.
2022a
Monthly streamflow prediction in Amasya, Türkiye, using an integrated approach of a feedforward backpropagation neural network and discrete wavelet transform
.
Modeling Earth Systems and Environment
1
13
.
https://doi.org/10.1007/s40808-022-01629-7
.
Katipoğlu
O. M.
2022b
Monthly stream flows estimation in the Karasu river of Euphrates basin with artificial neural networks approach
.
Journal of Engineering Sciences and Design
10
(
3
),
917
928
.
https://doi.org/10.21923/jesd.982868.
Kaya
M.
2018
The Completion with ANFIS of the Missing Streamflow Data
.
MSc Thesis
,
Süleyman Demirel University Graduate School of Natural and Applied Sciences Department of Civil Engineering
,
Isparta, Turkey
.
Khazaee Poul
A.
,
Shourian
M.
&
Ebrahimi
H.
2019
A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly stream flow prediction
.
Water Resources Management
33
(
8
),
2907
2923
.
https://doi.org/10.1007/s11269-019-02273-0.
Kim
T. W.
&
Valdés
J. B.
2003
Nonlinear model for drought forecasting based on a conjunction of wavelet transforms and neural networks
.
Journal of Hydrologic Engineering
8
(
6
),
319
328
.
https://doi.org/10.1061/(ASCE)1084-0699(2003)8:6(319).
Kim
S.
,
Kisi
O.
,
Seo
Y.
,
Singh
V. P.
&
Lee
C. J.
2016
Assessment of rainfall aggregation and disaggregation using data-driven models and wavelet decomposition
.
Hydrology Research
48
(
1
),
99
116
.
https://doi.org/10.2166/nh.2016.314.
Kisi
O.
,
Choubin
B.
,
Deo
R. C.
&
Yaseen
Z. M.
2019
Incorporating synoptic-scale climate signals for streamflow modelling over the Mediterranean region using machine learning models
.
Hydrological Sciences Journal
64
(
10
),
1240
1252
.
https://doi.org/10.1080/02626667.2019.1632460.
Kuchment
L.
,
Demidov
V.
,
Naden
P.
,
Cooper
D.
&
Broadhurst
P.
1996
Rainfall-runoff modelling of the Ouse basin, North Yorkshire: an application of a physically based distributed model
.
Journal of Hydrology
181
(
1–4
),
323
342
.
https://doi.org/10.1016/0022-1694(95)02916-8.
Mehr
A. D.
,
Kahya
E.
&
Olyaie
E.
2013
Streamflow prediction using linear genetic programming in comparison with a neuro-wavelet technique
.
Journal of Hydrology
505
,
240
249
.
https://doi.org/10.1016/j.jhydrol.2013.10.003.
Momeneh
S.
&
Nourani
V.
2022
Application of a novel technique of the multi-discrete wavelet transforms in hybrid with artificial neural network to forecast the daily and monthly streamflow
.
Modeling Earth Systems and Environment
1
20
.
https://doi.org/10.1007/s40808-022-01387-6.
Mosavi
A.
,
Golshan
M.
,
Choubin
B.
,
Ziegler
A. D.
,
Sigaroodi
S. K.
,
Zhang
F.
&
Dineva
A. A.
2021
Fuzzy clustering and distributed model for streamflow estimation in ungauged watersheds
.
Scientific Reports
11
(
1
),
1
14
.
https://doi.org/10.1038/s41598-021-87691-0.
Muratoglu
A.
&
Yuce
M. I.
2016
Estimating the long term average flow rates of Tigris Basin using machine learning methods
.
Batman University Journal of Life Sciences
6
(
2/2
),
32
40
.
Nalley
D.
,
Adamowski
J.
&
Khalil
B.
2012
Using discrete wavelet transforms to analyze trends in streamflow and precipitation in Quebec and Ontario (1954–2008)
.
Journal of Hydrology
475
,
204
228
.
https://doi.org/10.1016/j.jhydrol.2012.09.049.
Nourani
V.
,
Alami
M. T.
&
Aminfar
M. H.
2009
A combined neural-wavelet model for prediction of Ligvanchai watershed precipitation
.
Engineering Applications of Artificial Intelligence
22
(
3
),
466
472
.
https://doi.org/10.1016/j.engappai.2008.09.003
Nourani
V.
,
Kisi
Ö.
&
Komasi
M.
2011
Two hybrid artificial intelligence approaches for modeling rainfall–runoff process
.
Journal of Hydrology
402
(
1–2
),
41
59
.
https://doi.org/10.1016/j.jhydrol.2011.03.002.
Nourani
V.
,
Davanlou Tajbakhsh
A.
,
Molajou
A.
&
Gokcekus
H.
2019
Hybrid wavelet-M5 model tree for rainfall-runoff modeling
.
Journal of Hydrologic Engineering
24
(
5
),
04019012
.
Partal
T.
2007
Estimation of Turkish Precipitation Data Using Artificial Neural Networks and Wavelet Transformation Methods
.
Institute of Science, Department of Civil Engineering, Istanbul Technical University
,
Istanbul, Turkey
.
Partal
T.
&
Cigizoglu
H. K.
2008
Estimation and forecasting of daily suspended sediment data using wavelet–neural networks
.
Journal of Hydrology
358
(
3–4
),
317
331
.
https://doi.org/10.1016/j.jhydrol.2008.06.013.
Santos
C. A. G.
&
da Silva
G. B. L.
2014
Daily streamflow forecasting using a wavelet transform and artificial neural network hybrid models
.
Hydrological Sciences Journal
59
,
312
324
.
https://doi.org/10.1080/02626 667.2013.80094 4
.
Saraiva
S. V.
,
de Oliveira Carvalho
F.
,
Santos
C. A. G.
,
Barreto
L. C.
&
Freire
P. K. D. M. M.
2021
Daily streamflow forecasting in Sobradinho Reservoir using machine learning models coupled with wavelet transform and bootstrapping
.
Applied Soft Computing
102
,
107081
.
https://doi.org/10.1016/j.asoc.2021.107081.
Seo
Y.
,
Kim
S.
,
Kisi
O.
&
Singh
V. P.
2015
Daily water level forecasting using wavelet decomposition and artificial intelligence techniques
.
Journal of Hydrology
520
,
224
243
.
https://doi.org/10.1016/j.jhydrol.2014.11.050.
Taylan
D.
2018
An investigation on modeling of Dalaman stream flows by using WAVET-ANFIS
.
Journal of Engineering Sciences and Design
6
(
1
),
56
63
.
Taylor
K. E.
2001
Summarizing multiple aspects of model performance in a single diagram
.
Journal of Geophysical Research: Atmospheres
106
(
D7
),
7183
7192
.
https://doi.org/10.1029/2000JD900719.
Terzi
Ö.
&
Barak
M.
2015
Rainfall-runoff forecasting with wavelet-neural network approach: a case study of Kızılırmak River
.
Journal of Agricultural Sciences
21
(
4
),
546
557
.
Tiwari
M. K.
&
Chatterjee
C.
2011
A new wavelet–bootstrap–ANN hybrid model for daily discharge forecasting
.
Journal of Hydroinformatics
13
(
3
),
500
519
.
https://doi.org/10.2166/hydro.2010.142.
Uyar
M.
,
Yıldırım
S.
&
Gençoğlu
M. T.
2007
Güç kalitesi bozulmalarının sınıflandırılmasında dalgacık dönüşümüyle enerji dağılımına dayalı özelliklerin incelenmesi (Investigation of Energy Distribution Based Properties by Wavelet Transform in Classification of Power Quality Distortions)
.
Electrical-Electronic Computer Biomedical
.
Wang
K.
,
Band
S. S.
,
Ameri
R.
,
Biyari
M.
,
Hai
T.
,
Hsu
C. C.
, Hadjouni, M., Elmannai, H., Chau, K. W. & Mosavi A.
2022
Performance improvement of machine learning models via wavelet theory in estimating monthly river streamflow
.
Engineering Applications of Computational Fluid Mechanics
16
(
1
),
1833
1848
.
https://doi.org/10.1080/19942060.2022.2119281.
Yabar
E.
&
Aydin
M. C.
2020
Dalgacık Dönüşümlü Yapay Sinir Ağlarıyla Günlük Akım Serilerinin Tahmini (Estimation of daily flow series by wavelet transformed artificial neural networks)
.
Bitlis Eren University Journal of Science
9
(
2
),
943
951
.
https://doi.org/10.17798/bitlisfen.609410.
Yarar
A.
2014
A hybrid wavelet and neuro-fuzzy model for forecasting the monthly streamflow data
.
Water Resources Management
28
(
2
),
553
565
.
https://doi.org/10.1007/s11269-013-0502-1.
Yilmaz
M.
,
Tosunoğlu
F.
,
Kaplan
N. H.
,
Üneş
F.
&
Hanay
Y. S.
2022
Predicting monthly streamflow using artificial neural networks and wavelet neural networks models
.
Modeling Earth Systems and Environment
1
17
.
https://doi.org/10.1007/s40808-022-01403-9
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).