In this research, the discharge of five karstic springs, which are located in the Nahavand Plain of west of Iran, were simulated. The monthly discharge data for the years 1994–2020 were obtained. Three intelligent models, ANN, PCA-ANN, and CANFIS, were used to simulate the relevant data. In the present study, hydraulic parameters such as porosity, hydraulic conductivity, and specific yield were measured by frequent field visits and precipitation data of five meteorological stations and elevations were used as input data. The three intelligent models used are based on the error measurement indicators. Based on the analysis of results, Levenberg-Marquardt's learning rules were preferred as the best learning rules for all three models. Bearing the highest accuracy, MBE = −1.3 (l/s) and the highest value correlation coefficient (0.654) with nRMSE equal to 9.4% were preferred as the best intelligent models for simulating the discharge of these springs. The maximum average percentage of the simulated discharge increases compared with the value observed in the test phase. This was observed for May in the period of 27 years, which has a value of 24.20%. The lowest fluctuation percentage was observed, with a value of −24.61% for December. It is concluded that the hydraulic parameters were preferred instead of spring discharge lag-time for simulation.

  • Hydrogeological investigations.

  • Data generation.

  • Intelligent models.

  • Best model selected.

Today, water supply is a topic that is increasingly being considered to meet the needs of various sectors such as industrial, agricultural, and drinking uses and requires researchers to accurately predict the flow of water to use it optimally (Gondwe et al. 2011; Adnan et al. 2017). Due to their high water quality and relatively easy extraction, karst water sources are among the most important groundwater sources extracted by many people worldwide (Smith & Hunt 2010).

The unique and complex features of the karstic areas make these features different from other aquifers (Bakalowicz 2005). Karst aquifers are highly heterogeneous and contain conduits, fractures, and pores that may indicate permeability or flow paths (An et al. 2020). Karst aquifers provide approximately 20–25% of drinking water in the world. It can be said that 9% of the world's population relies on freshwater supply of karst aquifers and karst springs (Andreo 2012; Goldscheider & Drew 2014; Stevanović 2019). Karst formations cover 11% of the geological formations in Iran. Despite the vulnerability, karst aquifers are the main source of water supply, especially in the Zagros mountain range of Iran (Nassery 1992; Drew 2017).

Spring discharge is a highly nonlinear and nonstationary natural process. It is affected by various factors, such as hydrology, meteorology, and human activities (Dudley et al. 2018; Gai et al. 2023). Karstic water sources are widely used all over the world and simulating the discharge of karstic springs still seems an important challenge (Rudolph et al. 2023). One of the reasons for the challenging nature of karst water resources modeling is that it possesses highly variable unknown channel networks.

Intelligent neural methods such as fuzzy logic and artificial neural networks (ANNs) are similar to black boxes that are less constrained by physical issues, can do this without the need to model the environmental factors and geometry affecting the surface flow and perform modeling with acceptable accuracy (El-Shafie et al. 2007). The advantage of using black box models and ANNs in the karstic areas is that detailed information about the physics of hydrological processes is not required (El-Shafie et al. 2007; Guo et al. 2011; He et al. 2014). Among the disadvantages of ANNs is that the order and sequence of input data information are omitted (Kratzert et al. 2018).

The results of studies have shown that the neural intelligent models can capture nonlinear relationships between input and output parameters and identify inherent patterns hidden in time series data (Cerqueira et al. 2019). Intelligent models, including ANNs, are considered a powerful tool to create a relationship between the factors influencing the input and output of springs (Rawat et al. 2019; Wunsch et al. 2022). An important limitation of intelligent modeling in karst is access to temporal and spatial data because climatological stations usually do not exist inside or near karst spring basins, or few stations are available in the region (Wunsch et al. 2022). Therefore, in recent years, new intelligent models such as ANNs have been adopted to simulate karst aquifers to solve these problems (Zeydalinejad et al. 2020).

Gai et al. (2023) used a graph neural network (GNN), and then applied this model to simulate and predict Niangziguan spring discharge in China. Three graph structures were used to optimize the GNN model. The performances of the ChebNet and the Graph Convolutional Network (GCN) models were compared and the results showed that the high-order ChebNet was more adaptable to simulate the karst hydrological processes with nonlinear and nonstationary behaviors than that of GCN. In one study, a model was presented to deal with the problems of unknown/illegal wells. The results showed that the Analytic Element Method (AEM) model is more efficient for solving the problems of unknown wells (Gaur et al. 2023). Wunsch et al. (2022) used convolutional neural networks (CNNs) to process gridded meteorological data directly. They used one-dimensional complex neural network to perform simulations based on artificial intelligence to overcome the problems of karst spring discharge modeling. The results showed that the model used for modeling the spring's discharge is very suitable. In the Alpine and Mediterranean regions, Gholami (2022) reconstructed the temporal changes of spring discharge in an alluvial aquifer on the southern shores of the Caspian Sea. The results indicate the high efficiency of the ANN model in the test and train phases. In a recent study of the Colorado River basin in the United States, it was shown that the flow is very sensitive to temperature and precipitation variations. Different types of methods can be used to accurately predict the flow (Towler et al. 2022). The development of a multilayer model of groundwater under transient conditions was carried out in the Bihar region of India. The results of this study showed that groundwater modeling is an important method for understanding the behavior of aquifer systems and detecting the groundwater head under different hydrological stresses (Omar et al. 2021). Di Nunno & Granata (2020) predicted the discharge of nine springs in the Umbria region using the nonlinear autoregressive exogenous (NARX) model. The results proved that this model performs well for all sources and short-term and long-term predictions. The researchers recommended using the NARX network to predict spring discharge in other areas characterized by karst aquifers. An et al. (2020) used time–frequency analysis methods, SSA and EEMD, to extract the frequency and features of the Niangziguan Springs discharge trend. The long short-term memory (LSTM) neural model was also used to simulate each frequency and trend sequence. The results show that SSA-LSTM and EEMD-LSTM are better than LSTM. The EEMD-LSTM model has the best prediction performance. Rawat et al. (2019) used an ANN to simulate two springs of Hill Campus and Fakua from Tehri Garhwal district of Uttarakhand, India. Rainfall, temperature, and relative humidity data from 1999 to 2003 were used as input to simulate spring discharge. The superiority of the intelligent neural model in simulating spring discharge was pointed out. The ability of ANN and Suport Vector Machine (SVM) methods in simulating the monthly flow was investigated. The results of ANN and SVM models were compared to evaluate the performance of the applied models. The results showed that the SVM model can be used with higher accuracy to predict the monthly flow (Adnan et al. 2017). The performance of two Vensim models and neural networks for simulating spring flow and piezometer for karst aquifer was studied by Kong-A-Siou et al. (2014), and the results showed that the ANN model is more effective in combining karst nonlinearity. The daily discharge of two karst springs on the area of Rouva, island of Crete, Greece was simulated using a multilayer perceptron (MLP) back-propagation ANN. Two models were developed for springs. The results showed that in karst environments, hydraulic behavior is influenced by local conditions, even in a few hundred meters (Paleologos et al. 2013). Population and economic growth have led to water shortages worldwide, especially in the developing countries. Due to the excessive exploitation of groundwater resources in arid and semi-arid regions such as Iran, the simulation of spring discharge is of great importance. The planning and management of surface water resources as one of the main water supply sources become more important. The karst springs on the Nahavand area are the most important water supply source in western Iran. Therefore, the present research can significantly contribute to water supply, which is in line with the management of water resources and development. It is very necessary to understand and investigate karst water sources to develop regions. The springs are bearing high discharge rate due to limestone formations and suitable hydrogeological conditions. In mountainous watersheds such as Nahavand Plain with high altitudes, it is also sometimes difficult to predict the spring's discharges due to the unavailability or shortness of hydrological and meteorological data. The present study also tries to use three neural intelligent models, the input data of the simulation was hydrogeological (measured) and hydrology data without using the lag-time of the investigated springs discharge as an input to analyze the behavior of karst springs in the Nahavand region. The detailed analysis of the spring discharge simulation in the region for the first time shows the importance of this study.

Study area

The study area is about 1,461 km2 and is located in the south of Hamedan province along the Zagros mountain ranges and the Nahavand delta plain route. The Zagros mountain range in the region has abundant groundwater resources. More than 600 springs are available in the area, irrigating the entire region and turning it into a big agricultural pole (Banejad et al. 2013). The main source of precipitation in the region is the Mediterranean air front which causes precipitation. The average annual rainfall of the plain is 425 mm (Kiyani et al. 2021; Fasihi et al. 2024). Nahavand Plain, with a catchment area of 1,902 km2, is one of the plains of the upper basin of the Karkhe River and is located in the northeast of the Green Mountains of the Zagros highlands. The flat land (plain) of Nahavand is 644 km2 and the rest of the land, which is 1,046 km2 (62%), includes the heights of the edge of the plain. The general slope of the plain is from the southeast to the northwest, and the average elevation of the catchment area is 1,890 m above sea level. Figure 1 shows the geographical coordinates of five main karstic springs, namely: Famaseb (Sp1), Faresban (Sp2), Ghalebaroodab (Sp3), Giyan (Sp4), and Gonbadkabood (Sp5), and five meteorological stations, namely: Varayneh (St1), Faresban (St2), Synoptic (St3), Giyan (St4), and Barzool (St5).
Figure 1

Location of springs and stations in the Nahavand Plain.

Figure 1

Location of springs and stations in the Nahavand Plain.

Close modal

Nahavand region is located in the cold semi-arid climate (BSK) based on Koppen's climate classification (Mané 1975; Fasihi et al. 2024). Summers are fairly mild and winters are relatively cold (Kiyani et al. 2021).

Geology

The study area is located in the Sanandaj-Sirjan zone (Figure 2) and the high Zagros (Zagros Trust Belt). Geologically, the area was considered one of the most active zones in western Iran during the Mesozoic and Cenozoic periods (Aghanabati 2004). All its geological structures follow the general structural trend of Zagros. In general, the studied area morphologically has mountainous parts, and a plain is hydrogeologically important. The mountainous part is mainly composed of thick layered limestones with vertical dipping. This section shows many karst landscapes such as dolines, caves, karrens, and karst valleys (Ghanavati 2014). The direct relationship between the physical properties of rocks, such as the percentage of porosity, plays an important role in the creation of karst and its extent, the size and type of sinkholes in the dissolution ability of carbonate rocks, which is indicative of rock porosity (Ford & Williams 2007).
Figure 2

Regional geological map of Iran showing the main Zagros thrust, Sanandaj-Sirjan zone.

Figure 2

Regional geological map of Iran showing the main Zagros thrust, Sanandaj-Sirjan zone.

Close modal

Climate, high altitude, and time are among the most effective external factors in creating karst's geological terrain. Because of tectonic movements in the region, the development of fracture patterns has appeared in the region. The expansion of dissolvable rocks has caused the formation of various types of karstic formations of hydrogeomorphological features importance. The development of karst aquifers with an average discharge of 4 m3/s can be seen in the region, and the most important of these faulted springs is Giyan spring (Sp4), its formation related to the faults in the region (Ghobadi et al. 2012).

Intelligent models

Artificial neural network

ANN has been developed based on the neural structure and function of the human brain (Agatonovic-Kustrin & Beresford 2000; Lingireddy & Brion 2005). A common ANN architecture used in this study is MLP, the simplest and most widely used ANN architecture (Tabari et al. 2015). A neural network can be trained to perform a specific function by adjusting the connection weight values between elements. As a brief description, MLP is a feedforward network consisting of an input, hidden layer(s), and an output layer (Demuth & Beale 1992; Onyari & Ilunga 2013).

The input layer receives the external data, and the output layer produces the final result. Hidden layers are neural nodes between the input and output layers that provide nonlinearity. More complex problems can be solved by increasing the number of neurons or using hidden layers (Rumelhart et al. 1986; Onyari & Ilunga 2013). In this study, the optimal number of neurons is considered from 1 to 20 by trial and error. The selection of the minimum number of nodes is based on the research attempted and is usually considered based on the number of inputs (Sheela & Deepa 2013). This simulation used the Tangent hyperbolic (Tan) and Sigmoid activation function (Sig), Levenberg–Marquardt (LM), and Conjugate Gradient learning rules (Conj).

Principal component analysis (PCA)

Principal component analysis (PCA) is a method for statistical analysis and simplification of data sets. PCA mathematically uses an orthogonal transformation to linearly transform the observations of a series of potentially related variables, representing a series of uncorrelated linear variables. These unrelated variables are principal components (Barnett & Preisendorfer 1987; Jang 2017; Hsu et al. 2021). PCA is a powerful tool that attempts to explain the variance of a large data set of correlated variables with a smaller set of independent variables, as well as an MLP implemented to determine the nonlinear ordering of these components. PCA provides information about the most meaningful parameters, which describe the entire data set while allowing training data with minimal loss of original information (Ilaboya & Kayode 2018). The steps of PCA are followed as normalization, correlation coefficient matrix, computation of eigenvalues and eigenvectors, calculation of participation rate, calculation of cumulative participation rate, and calculation of principal component loading (Wu et al. 2021). From the Hebbian principle, there are two choices, Oja's and Sanger's, to implement the model (Bayatvarkeshi et al. 2020). In this research, the learning principle was implemented with two main components: activation functions and second learning rules similar to the neural method.

Co-active neuro-fuzzy inference system (CANFIS)

The co-active neuro-fuzzy inference system (CANFIS) network proposed by Jang et al. (1997) integrates a modular neural network (MNN) and adaptive fuzzy inputs to increase the accuracy of the modeling process. It can be said that the neural network increases the performance of the network by integrating CANFIS and fuzzy inference systems (FIS). The high performance of CANFIS has been demonstrated in modeling many hydrological parameters (Vernieuwe et al. 2005; Zhang et al. 2009). The CANFIS network uses a fuzzy neuron that applies membership functions (MFs). Two main MFs, Bell and Gaussian, are used in the CANFIS network. The number of modular networks will equal the number of outputs; the CANFIS network also uses a hybrid axon to process MF outputs in MNN outputs (Tabari et al. 2012). In general, in this model, two-phase structures, the Tsukamoto model and the Sugeno model, were used (Aytek 2009). The Tsukamoto structure is more complicated than the Sugeno model (Bayatvarkeshi et al. 2020). The TSK structure with Bell and Gaussian MFs and the number of MFs were also applied in this research work. Tangent hyperbolic (Tan), Sigmoid (Sig), Levenberg–Marquardt (LM), and Conjugate Gradient learning rules (Conj) were used to simulate the spring discharge.

NeuroSolutions 5 software was used to simulate ANN, CANFIS, and PCA-ANN models. Parameters such as porosity, hydraulic conductivity, and specific yield were measured by frequent field visits and precipitation data of five meteorological stations such as Varayaneh (St1), Faresban (St2), Synoptic (St3), Giyan (St4), and Barzool (St5) and elevations were used as input data. The discharge of springs, namely: Famaseb karst springs (Sp1), Faresban (Sp2), Ghale Baroodab (Sp3), Gian (Sp4), and Gonbad Kabood (Sp5) for the years 1994–2020 were obtained from the Hamadan Regional Water Organization (HMRW 2023). The positions of all springs are presented in Figure 1. The data were divided into two categories: train data (70%) and test data (30%), and this percentage is acceptable in various research work related to neural models (Coulibaly et al. 2001; Guzman et al. 2019; Bayatvarkeshi et al. 2020; Di Nunno & Granata 2020; Mohammadi et al. 2021). The division of training and test sets was selected randomly with the aid of the black box of Neurosolution 5 software which is commonly available in the software itself. Among the data sets, 30% was considered as the test data set and the remaining 70% was considered as the training set. The three intelligent models used are based on the error measurement indicators.

SPSS16 software was used for quality control and data reconstruction, including statistical reconstruction of the monthly average of precipitation data of five metrological stations. The skewness coefficient was used to check the normality of the data presented in Table 1, the value of the skewness coefficient is between (−2, 2). It can be said that the discharge and precipitation data are normal. The flowchart for this research is shown in Figure 3.
Table 1

Skewness coefficient of data

ParameterDischargePrecipitation
Skewness 1.77 1.97 
ParameterDischargePrecipitation
Skewness 1.77 1.97 
Figure 3

Flowchart for the methodology of the research.

Figure 3

Flowchart for the methodology of the research.

Close modal
The average amount of precipitation for the historical period of five weather stations in the region is shown in Figure 4. The data from each were used as input for the simulation. The trends of precipitation variations in five weather stations are similar to each other; in the wet months of the year (October to March), the precipitation is increasing, and in the dry months of the year, this trend decreases. In June, July and August they are the lowest.
Figure 4

The average precipitation of five weather stations in the region during the historical period (1994–2020).

Figure 4

The average precipitation of five weather stations in the region during the historical period (1994–2020).

Close modal
According to Figure 5, the average monthly discharge during the 27 years in Nahavand Plain shows that the highest discharge of five karst springs belongs to the months of April and May. In contrast, the lowest discharge is in the months of October and November (wet season).
Figure 5

The average discharge of five karstic springs in the region during the historical period (1994–2020).

Figure 5

The average discharge of five karstic springs in the region during the historical period (1994–2020).

Close modal

Model evaluation

The models were evaluated by choosing the best model for simulating the discharge of the studied springs in the region. Three indicators of correlation coefficient (r), normalized root mean square error (nRMSE), and mean bias error (MBE) (Shcherbakov et al. 2013) were used. The calculation was based on relations (1) to (3):
(1)
(2)
(3)
where is the normalization coefficient, usually equal to the maximum value measured in the forecast horizon or the difference between the maximum and minimum values. The normalization coefficient can be calculated in the entire time horizon or short-term observation period. X is the measured value at time i, and Y is the simulated value at time i. and are the mean of X and Y, and n is the total data.
To simulate the discharge of springs of intelligent models, namely: ANN, PCA-ANN, and CANFIS, 20 neurons were implemented by trial and error methods. Three indices, r, MBE, and nRMSE, were used to select the best model. The summary of the results of all the implementations of intelligent neural models based on the r index in the test phase is presented in Figure 6.
Figure 6

Comparison of the r of all the executions of the neural intelligence model with 20 neurons in the test phase.

Figure 6

Comparison of the r of all the executions of the neural intelligence model with 20 neurons in the test phase.

Close modal

For each neural intelligence model, three models (ANN, PCA-ANN, and CANFIS) were implemented for 20 neurons with activation and MFs, learning rules, and other information. Figure 6 shows the results of these performances in the test phase. The value of the correlation coefficient in the test phase is less than 0.7 in all executions; the maximum value for the three ANN models is 0.645 with the Sigmoid activation function, in the CANFIS model, is 0.654 with the activation function Tangent hyperbolic (Tan) and Gaussian membership function. Finally, for the PCA model, the r value is 0.652 with the Sigmoid activation function and Sanger's implementation, which is preferred for PCA-ANN because it naturally divides the PCA components based on the magnitude (Bayatvarkeshi et al. 2020), was calculated. The Levenberg–Marquardt was introduced and preferred as the best learning rule for all three models, which is also mentioned based on previous research (Zare Abyaneh et al. 2016; Bayatvarkeshi et al. 2020). The value of the nRMSE coefficient of all three models is 0.094. A summary of the execution based on the best neuron that has the highest accuracy among 20 neurons separately for each learning rule, activation function, and membership of ANN and PCA-ANN models is presented in Tables 2 and 3. A summary of all CANFIS models simulated is given in Table 4.

Table 2

The results of the best performance of each subgroup based on the best neuron of the ANN model

Activation function
Tangent hyperbolic
Sigmoid
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Test and train n 16 
Epoch 1,000 1,000 15 15 
Structure 5-2-1 5-2-1 5-16-1 5-7-1 
Test r (l/s) 0.637 0.638 0.645 0.469 
MBE (l/s) 26.410 28.540 24.740 19.2 
nRMSE 0.0946 0.0945 0.0948 0.1087 
Train r (l/s) 0.752 0.756 0.749 0.504 
MBE (l/s) 0.027 −1.32 − 1.22 19.29 
nRMSE 0.081 0.081 0.083 0.1086 
Activation function
Tangent hyperbolic
Sigmoid
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Test and train n 16 
Epoch 1,000 1,000 15 15 
Structure 5-2-1 5-2-1 5-16-1 5-7-1 
Test r (l/s) 0.637 0.638 0.645 0.469 
MBE (l/s) 26.410 28.540 24.740 19.2 
nRMSE 0.0946 0.0945 0.0948 0.1087 
Train r (l/s) 0.752 0.756 0.749 0.504 
MBE (l/s) 0.027 −1.32 − 1.22 19.29 
nRMSE 0.081 0.081 0.083 0.1086 

The bold values indicate the best model for simulation.

Table 3

The results of the best performance of each subgroup based on the best neuron of the PCA-ANN model

Membership function
Sanger's
Oja's
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Activation function
Tangent hyperbolicTangent hyperbolicSigmoidSigmoidTangent hyperbolicTangent hyperbolicSigmoidSigmoid
Test and train n 13 11 18 12 
Epoch 1,000 1,000 116 117 1,000 1,100 118 116 
Test r (l/s) 0.638 0.640 0.645 0.481 0.627 0.638 0.649 0.481 
MBE (l/s) 29.92 28.17 65.23 −595.38 26.79 24.38 62.97 137.02 
nRMSE 0.094 0.094 0.094 0.206 0.095 0.094 0.098 0.1118 
Train r (l/s) 0.754 0.757 0.754 0.520 0.761 0.756 0.726 0.513 
MBE (l/s) −0.004 −0.969 40.80 −575.58 0.235 −3.72 42.45 133.74 
nRMSE 0.081 0.081 0.083 0.227 0.080 0.081 0.090 0.1123 
Membership function
Sanger's
Oja's
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Activation function
Tangent hyperbolicTangent hyperbolicSigmoidSigmoidTangent hyperbolicTangent hyperbolicSigmoidSigmoid
Test and train n 13 11 18 12 
Epoch 1,000 1,000 116 117 1,000 1,100 118 116 
Test r (l/s) 0.638 0.640 0.645 0.481 0.627 0.638 0.649 0.481 
MBE (l/s) 29.92 28.17 65.23 −595.38 26.79 24.38 62.97 137.02 
nRMSE 0.094 0.094 0.094 0.206 0.095 0.094 0.098 0.1118 
Train r (l/s) 0.754 0.757 0.754 0.520 0.761 0.756 0.726 0.513 
MBE (l/s) −0.004 −0.969 40.80 −575.58 0.235 −3.72 42.45 133.74 
nRMSE 0.081 0.081 0.083 0.227 0.080 0.081 0.090 0.1123 

The bold values indicate the best model for simulation.

Table 4

Results of the CANFIS model

Membership function
Bell
Gaussian
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Activation function
Tangent hyperbolicTangent hyperbolicSigmoidSigmoidTangent hyperbolicTangent hyperbolicSigmoidSigmoid
Test and train Epoch 1,000 1,000 18 68 1,000 1,000 20 16 
Structure 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 
Test r (l/s) 0.631 0.609 0.572 0.036 0.654 0.340 0.623 0.422 
MBE (l/s) 1.09 2.88 151.15 −167.72 − 1.3 −1,753.73 101.96 −27.47 
nRMSE 0.096 0.098 0.107 0.128 0.094 0.464 0.098 0.1123 
Train r (l/s) 0.728 0.686 0.641 0.048 0.731 0.371 0.726 0.464 
MBE (l/s) −19.88 −14.17 134.81 −185.64 − 21.44 −1,686.61 80.32 −40.10 
nRMSE 0.0866 0.091 0.103 0.133 0.086 0.524 0.088 0.1116 
Membership function
Bell
Gaussian
Learning rules
Levenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate GradientLevenberg–MarquardtConjugate Gradient
Activation function
Tangent hyperbolicTangent hyperbolicSigmoidSigmoidTangent hyperbolicTangent hyperbolicSigmoidSigmoid
Test and train Epoch 1,000 1,000 18 68 1,000 1,000 20 16 
Structure 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 2,2,2,2,2 
Test r (l/s) 0.631 0.609 0.572 0.036 0.654 0.340 0.623 0.422 
MBE (l/s) 1.09 2.88 151.15 −167.72 − 1.3 −1,753.73 101.96 −27.47 
nRMSE 0.096 0.098 0.107 0.128 0.094 0.464 0.098 0.1123 
Train r (l/s) 0.728 0.686 0.641 0.048 0.731 0.371 0.726 0.464 
MBE (l/s) −19.88 −14.17 134.81 −185.64 − 21.44 −1,686.61 80.32 −40.10 
nRMSE 0.0866 0.091 0.103 0.133 0.086 0.524 0.088 0.1116 

The bold values indicate the best model for simulation.

Among the 20 implemented neurons: n = 2, with the Tangent hyperbolic activation function, the Levenberg–Marquardt learning rules, and the Conjugate Gradient. The Sigmoid activation function and the learning rules of Levenberg–Marquardt are n = 16. The Sigmoid activation function and the Conjugate Gradient learning rules are n = 7. The separation of the activation function and different learning rules was considered based on the error measurement indicators. Finally, the implemented model with the value of n = 16, the MBE value equal to 24.740 (l/s), and the highest correlation coefficient value was selected as the best model for discharge simulation with the ANN model.

Table 3 presents the best neuron separated by MFs, activation functions, and learning rules. In this table, the best value of n for Sanger's membership function, two learning rules (Levenberg–Marquardt and Conjugate Gradient), and two activation functions (Tangent hyperbolic and sigmoid) equal to 3, 4, 11, and 13 were selected based on higher accuracy and less error among 20 neurons. The executions based on Oja's membership function with the mentioned statistical functions and rules showed that the best value of n equals 2, 3, 12, and 18 among 20 executed neurons.

Finally, Sanger's membership function, Levenberg–Marquardt learning rules, and Sigmoid activation function with the value of n = 13 became the best neuron and execution with a high accuracy of 9.4% and correlation coefficient of 0.652 for simulating the discharge of karstic springs in the area.

Two Bell and Gaussian MFs were implemented in the CANFIS model separately with different learning rules and activation functions. The results of the performances are presented in Table 4. Based on the three error measurement indicators examined, the model with Gaussian membership function, Levenberg–Marquardt learning rules, and Tangent function has higher accuracy and less error among other implementations used for the simulation of spring discharges.

The model's accuracy using the nRMSE index shows that, in all three used models, the model error in the five investigated karstic springs is less than 10% (9.4%), which shows the accurate modeling of discharge in the study area. Among the three ANN, PCA-ANN, and CANFIS models, which are based on the used error measurement indicators, can be declared that the CANFIS performance is somewhat better than the ANN and PCA-ANN. The CANFIS model with TSK structure, Gaussian membership function, hyperbolic tangent activation function, and Levenberg–Marquardt learning rules with the highest accuracy is MBE = −1.3 (l/s). The highest correlation coefficient value of 0.654 between the measured and simulated values was preferred as the best intelligent model for simulating the discharge of karstic springs. The Levenberg–Marquardt algorithm is the best learning rule currently available (Hagan & Menhaj 1994). Figure 7 shows the simulated and observed discharge rate variations of the springs during the investigated period in the two stages of test and train with the best model provided for simulation (CANFIS).
Figure 7

The average percentage of changes in the simulated discharge value compared with the observed of the springs under the CANFIS model in the two stages of test and train (1994–2020).

Figure 7

The average percentage of changes in the simulated discharge value compared with the observed of the springs under the CANFIS model in the two stages of test and train (1994–2020).

Close modal

The average percentage of the simulated discharge rate changes compared with the observed discharge rate of the springs during the statistical period under investigation in the two stages of test and train is shown in Figure 7. The highest average percentage of the simulated discharge increases compared with the value observed in the test phase. This was observed for May in the period of 27 years, which has a value of 24.20%. The lowest fluctuation percentage was observed, with a value of −24.61% for December. The results in the train stage also indicate that October and April had the maximum fluctuation with an average value of 23 and 22%, respectively, compared with the observed value.

At this stage, the minimum average reduction percentage was also achieved, with the amount of −0.61% in January. In general, in the wet seasons, November, December, and January, the average percentage of fluctuation is the lowest compared with the dry seasons (June, July, and August).

Prediction of surface flow, including spring discharge, assists in providing reliable and useful information in managing and planning water resources. Using selected intelligent models has helped us significantly to interpret. In this research, five hydraulic parameters were measured in the field with precipitation and elevation data, and five parameters were considered as input to simulate the discharge of five karstic springs in Nahavand Plain, Hamedan Province, located in the west of Iran. Three ANN, PCA-ANN, and CANFIS intelligent models were implemented to simulate the discharge of selected springs. According to the analysis of results, the CANFIS model is the best model based on the r, nRMSE, and MBE index to simulate spring discharge. The value of the correlation coefficient computed in the ANN model is 0.645; in the CANFIS model, it is 0.654; and in the PCA-ANN model, it is 0.652. Leven-Marquardt learning rules were preferred as the best learning rules for all three models. The value of nRMSE of all three models is 9.4%. Finally, the CANFIS model was preferred for simulating karstic spring discharge due to its higher accuracy than the other two models. The average amount of the simulated discharge fluctuation percentage compared with the observed value during the investigated period in the months of the wet seasons of the year, including November, December, and January, has the lowest value compared with the dry seasons.

In this research, precipitation as time series data was used for the first time as input data to simulate the discharge of five karstic springs in the region and four hydrogeological parameters were also measured by attempting field visits.

For further studies, we suggest using satellite data of groundwater level as input for future research to improve the model's performance.

This research was not funded by any organization.

The data sets generated during and analysed during the current study are available from the corresponding author on reasonable request.

The authors declare there is no conflict.

Adnan
R. M.
,
Yuan
X.
,
Kisi
O.
&
Yuan
Y.
(
2017
)
Streamflow forecasting using artificial neural network and support vector machine models
,
American Scientific Research Journal for Engineering, Technology, and Sciences (ASRJETS)
,
29
,
286
294
.
Agatonovic-Kustrin
S.
&
Beresford
R.
(
2000
)
Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research
,
Journal of Pharmaceutical and Biomedical Analysis
,
22
,
717
727
.
https://doi.org/10.1016/S0731-7085(99)00272-1
.
Aghanabati
A.
(
2004
)
Geology of Iran
, vol.
2004
.
Tehran
:
Geological Survey of Iran
,
606
pp.
An
L.
,
Hao
Y.
,
Yeh
T.-C. J.
,
Liu
Y.
,
Liu
W.
&
Zhang
B.
(
2020
)
Simulation of karst spring discharge using a combination of time–frequency analysis methods and long short-term memory neural networks
,
Journal of Hydrology
,
589
,
125320
.
https://doi.org/10.1016/j.jhydrol.2020.125320
.
Andreo
B.
(
2012
)
Introductory editorial: advances in karst hydrogeology
,
Environmental Earth Sciences
,
65
,
2219
2220
.
https://doi.org/10.1007/s12665-012-1621-3
.
Aytek
A.
(
2009
)
Co-active neurofuzzy inference system for evapotranspiration modeling
,
Soft Computing
,
13
,
691
700
.
https://doi.org/10.1007/s00500-008-0342-8
.
Bakalowicz
M.
(
2005
)
Karst groundwater: a challenge for new resources
,
Hydrogeology Journal
,
13
,
148
160
.
https://doi.org/10.1007/s10040-004-0402-9
.
Banejad
H.
,
Mohebzadeh
H.
&
Olyaie
E.
(
2013
)
Applying ANN and GIS for estimation of effective parameters in determination of plant pattern (case study: Nahavand city)
,
Journal of Environmental Science and Technology
,
15
(
1
),
23
35
.
Barnett
T.
&
Preisendorfer
R.
(
1987
)
Origins and levels of monthly and seasonal forecast skill for United States surface air temperatures determined by canonical correlation analysis
,
Monthly Weather Review
,
115
,
1825
1850
.
https://doi.org/10.1175/1520-0493(1987)115<1825:OALOMA > 2.0.CO;2
.
Bayatvarkeshi
M.
,
Mohammadi
K.
,
Kisi
O.
&
Fasihi
R.
(
2020
)
A new wavelet conjunction approach for estimation of relative humidity: wavelet principal component analysis combined with ANN
,
Neural Computing and Applications
,
32
,
4989
5000
.
https://doi.org/10.1007/s00521-018-3916-0
.
Cerqueira
V.
,
Torgo
L.
&
Soares
C
. (
2019
)
Machine learning vs statistical methods for time series forecasting: size matters. arXiv preprint arXiv:1909.13316
.
Coulibaly
P.
,
Anctil
F.
,
Aravena
R.
&
Bobee
B.
(
2001
)
Artificial neural network modeling of water table depth fluctuations
,
Water Resources Research
,
37
,
885
896
.
https://doi.org/10.1029/2000WR900368
.
Demuth
H. B.
&
Beale
M. H.
(
1992
)
Neural Network Toolbox User's Guide
.
The Mathworks, Incorporated
, Version 4. 1-846.
Di Nunno
F.
&
Granata
F.
(
2020
)
Groundwater level prediction in Apulia region (Southern Italy) using NARX neural network
,
Environmental Research
,
190
,
110062
.
https://doi.org/10.1016/j.envres.2020.110062
.
Drew
D.
(
2017
)
Karst Hydrogeology and Human Activities: Impacts, Consequences and Implications: IAH International Contributions to Hydrogeology 20
.
Abingdon, UK
:
Routledge
.
Dudley
R.
,
Hodgkins
G.
,
Nielsen
M.
&
Qi
S.
(
2018
)
Estimating historical groundwater levels based on relations with hydrologic and meteorological variables in the US glacial aquifer system
,
Journal of Hydrology
,
562
,
530
543
.
https://doi.org/10.1016/J.JHYDROL.2018.05.019
.
El-Shafie
A.
,
Taha
M. R.
&
Noureldin
A.
(
2007
)
A neuro-fuzzy model for inflow forecasting of the Nile River at Aswan high dam
,
Water Resources Management
,
21
,
533
556
.
https://doi.org/10.1007/s11269-006-9027-1
.
Fasihi
R.
,
Tizro
A. T.
&
Marofi
S.
(
2024
)
Climate change impacts on the Nahavand karstic springs using the data mining techniques
,
Theoretical and Applied Climatology
,
155
(
4
),
3347
3359
.
https://doi.org/10.1007/s00704-023-04810-9
.
Ford
D.
&
Williams
P. D.
(
2007
)
Karst Hydrogeology and Geomorphology
.
Hoboken, NJ
:
John Wiley & Sons
.
Gai
Y.
,
Wang
M.
,
Wu
Y.
,
Wang
E.
,
Deng
X.
,
Liu
Y.
,
Yeh
T.-C. J.
&
Hao
Y.
(
2023
)
Simulation of spring discharge using graph neural networks at Niangziguan Springs, China
,
Journal of Hydrology
,
625
,
130079
.
https://doi.org/10.1016/j.jhydrol.2023.130079
.
Gaur
S.
,
Omar
P. J.
&
Eslamian
S.
(
2023
)
Advantage of grid-free analytic element method for identification of locations and pumping rates of wells
. In: Eslamian, S. & Eslamian, F. (Eds.)
Handbook of Hydroinformatics
.
Amsterdam
:
Elsevier
, pp.
1
10
.
https://doi.org/10.1016/B978-0-12-821962-1.00003-9
.
Ghanavati
E.
(
2014
)
Geomorphological model of overland flows in Gamasiab
,
Geographical Research Journal
,
18
,
174
182
.
Ghobadi
M.
,
Abdilor
Y.
&
Mohebi
Y.
(
2012
)
The importance of recognition of morphology, lithology and physical properties in development of karst in Nahavand area
,
Journal of Geotechechnical Geology (Applied Geology)
,
7
(
4
),
299
310
.
Goldscheider
N.
&
Drew
D.
(
2014
)
Methods in Karst Hydrogeology: IAH: International Contributions to Hydrogeology, 26
.
Boca Raton, FL:
CRC Press
.
Gondwe
B. R.
,
Merediz-Alonso
G.
&
Bauer-Gottwein
P.
(
2011
)
The influence of conceptual model uncertainty on management decisions for a groundwater-dependent ecosystem in karst
,
Journal of Hydrology
,
400
,
24
40
.
https://doi.org/10.1016/j.jhydrol.2011.01.023
.
Guo
J.
,
Zhou
J.
,
Qin
H.
,
Zou
Q.
&
Li
Q.
(
2011
)
Monthly streamflow forecasting based on improved support vector machine model
,
Expert Systems with Applications
,
38
,
13073
13081
.
https://doi.org/10.1016/j.eswa.2011.04.114
.
Guzman
S. M.
,
Paz
J. O.
,
Tagert
M. L. M.
&
Mercer
A. E.
(
2019
)
Evaluation of seasonally classified inputs for the prediction of daily groundwater levels: NARX networks vs support vector machines
,
Environmental Modeling & Assessment
,
24
,
223
234
.
https://doi.org/10.1007/s10666-018-9639-x
.
Hagan
M. T.
&
Menhaj
M. B.
(
1994
)
Training feedforward networks with the Marquardt algorithm
,
IEEE Transactions on Neural Networks
,
5
,
989
993
.
https://doi.org/10.1109/72.329697
.
HMRW
(
2023
)
Annual Hydrogeological Report of Nahavand Plain
.
Unpublished Report of Western Regional Water Company
.
Hamedan, Iran
.
Available at: https://www.hmrw.ir [Accessed 1 December 2023]
.
Hsu
W.-L.
,
Tsai
F.-M.
&
Shiau
Y.-C.
(
2021
)
Planning and assessment system for light rail transit construction in Taiwan
,
Microsystem Technologies
,
27
,
1051
1060
.
https://doi.org/10.1007/s00542-018-4023-y
.
Ilaboya
I. R.
&
Kayode
O. N.
(
2018
)
Physico-chemical analysis and modelling of groundwater quality parameters using water quality index method (WQI) and principal component analysis (PCA)
,
Trends in Civil Engineering and its Architecture
,
2
,
208
222
.
https://doi.org/10.32474/TCEIA.2018.02.000134
.
Jang
D.
(
2017
)
Estimation of Non-Revenue Water Ratio Using PCA and ANN in Water Distribution Systems
.
Incheon, Korea
:
Incheon National University
.
Jang
J.-S. R.
,
Sun
C.-T.
&
Mizutani
E.
(
1997
)
Neuro-fuzzy and soft computing – a computational approach to learning and machine intelligence
,
IEEE Transactions on Automatic Control
,
42
,
1482
1484
.
https://doi.org/10.1109/TAC.1997.633847
.
Kiyani
S.
,
Kiyani
V.
&
Behdarvand
N.
(
2021
)
Forecasting occur probability intense storm using Gumbel distribution; case study: Nahavand township
,
Central Asian Journal of Environmental Science and Technology Innovation
,
2
,
219
226
.
https://doi.org/10.22034/CAJESTI.2021.06.01
.
Kong-A-Siou
L.
,
Fleury
P.
,
Johannet
A.
,
Estupina
V. B.
,
Pistre
S.
&
Dörfliger
N.
(
2014
)
Performance and complementarity of two systemic models (reservoir and neural networks) used to simulate spring discharge and piezometry for a karst aquifer
,
Journal of Hydrology
,
519
,
3178
3192
.
https://doi.org/10.1016/j.jhydrol.2014.10.041
.
Kratzert
F.
,
Klotz
D.
,
Brenner
C.
,
Schulz
K.
&
Herrnegger
M.
(
2018
)
Rainfall-runoff modelling using long short-term memory (LSTM) networks
,
Hydrology and Earth System Sciences
,
22
,
6005
6022
.
https://doi.org/10.5194/hess-22-6005-2018
.
Lingireddy
S.
&
Brion
G. M.
(
2005
)
Artificial Neural Networks in Water Supply Engineering
.
Reston, VA
:
ASCE Publications
.
Mané
U.
(
1975
)
General climatology. HJ Critchfield, 1974. Prentice Hall, Englewood Cliffs, NJ, 446 pp.,£ 6.15
,
Earth Science Reviews
,
11
,
184
185
.
https://doi.org/10.1016/0012-8252(75)90105-1
.
Mohammadi
B.
,
Mehdizadeh
S.
,
Ahmadi
F.
,
Lien
N. T. T.
,
Linh
N. T. T.
&
Pham
Q. B.
(
2021
)
Developing hybrid time series and artificial intelligence models for estimating air temperatures
,
Stochastic Environmental Research and Risk Assessment
,
35
,
1189
1204
.
https://doi.org/10.1007/s00477-020-01898-7
.
Nassery
H.
(
1992
)
Hydrogeology Study of Karstic Springs in Doroudzan Dam Basin
.
Master thesis
,
Shiraz University
,
Iran
.
Omar
P. J.
,
Gaur
S.
&
Dikshit
P. K. S.
(
2021
)
Conceptualization and development of multi-layered groundwater model in transient condition
,
Applied Water Science
,
11
,
162
.
https://doi.org/10.1007/s13201-021-01485-3
.
Onyari
E. K.
&
Ilunga
F.
(
2013
)
Application of MLP neural network and M5P model tree in predicting streamflow: a case study of Luvuvhu catchment, South Africa
,
International Journal of Innovation, Management and Technology
,
4
,
11
.
https://doi.org/10.7763/IJIMT.2013.V4.347
.
Paleologos
E. K.
,
Skitzi
I.
,
Katsifarakis
K.
&
Darivianakis
N.
(
2013
)
Neural network simulation of spring flow in karst environments
,
Stochastic Environmental Research and Risk Assessment
,
27
,
1829
1837
.
https://doi.org/10.1007/s00477-013-0717-y
.
Rawat
S. S.
,
Mathur
S.
,
Sharma
H. C.
&
Singh
P. K.
(
2019
)
Modelling of spring flow using artificial neural network
,
Journal of Indian Water Resources Society
,
39
(
3
),
10
17
.
Rudolph
M. G.
,
Collenteur
R. A.
,
Kavousi
A.
,
Giese
M.
,
Wöhling
T.
,
Birk
S.
,
Hartmann
A.
&
Reimann
T.
(
2023
)
A data-driven approach for modelling karst spring discharge using transfer function noise models
,
Environmental Earth Sciences
,
82
,
339
.
https://doi.org/10.1007/s12665-023-11012-z
.
Rumelhart
D. E.
,
Hinton
G. E.
&
Williams
R. J.
(
1986
)
Learning representations by back-propagating errors
,
Nature
,
323
,
533
536
.
https://doi.org/10.1038/323533a0
.
Shcherbakov
M. V.
,
Brebels
A.
,
Shcherbakova
N. L.
,
Tyukov
A. P.
,
Janovsky
T. A.
&
Kamaev
V. A. E.
(
2013
)
A survey of forecast error measures
,
World Applied Sciences Journal
,
24
,
171
176
.
https://doi.org/10.5829/idosi.wasj.2013.24.itmies.80032
.
Sheela
K. G.
&
Deepa
S. N.
(
2013
)
Review on methods to fix number of hidden neurons in neural networks
,
Mathematical Problems in Engineering
,
2013
,
425740
.
https://doi.org/10.1155/2013/425740
.
Smith
B. A.
&
Hunt
B. B.
(
2010
)
A comparison of the 1950s drought of record and the 2009 drought, Barton Springs segment of the Edwards Aquifer, Central Texas, Gulf Coast Association of Geological Society's Transactions, 60, 611622
.
Stevanović
Z.
(
2019
)
Karst waters in potable water supply: a global scale overview
,
Environmental Earth Sciences
,
78
,
662
.
https://doi.org/10.1007/s12665-019-8670-9
.
Tabari
H.
,
Hosseinzadeh Talaee
P.
&
Abghari
H.
(
2012
)
Utility of coactive neuro-fuzzy inference system for pan evaporation modeling in comparison with multilayer perceptron
,
Meteorology and Atmospheric Physics
,
116
,
147
154
.
https://doi.org/10.1007/s00703-012-0184-x
.
Tabari
H.
,
Hosseinzadeh Talaee
P.
&
Willems
P.
(
2015
)
Short-term forecasting of soil temperature using artificial neural network
,
Meteorological Applications
,
22
,
576
585
.
https://doi.org/10.1002/met.1489
.
Towler
E.
,
Woodson
D.
,
Baker
S.
,
Ge
M.
,
Prairie
J.
,
Rajagopalan
B.
,
Shanahan
S.
&
Smith
R.
(
2022
)
Incorporating mid-term temperature predictions into streamflow forecasts and operational reservoir projections in the Colorado river basin
,
Journal of Water Resources Planning and Management
,
148
,
04022007
.
https://doi.org/10.1061/(ASCE)WR.1943-5452.0001534
.
Vernieuwe
H.
,
Georgieva
O.
,
De Baets
B.
,
Pauwels
V. R.
,
Verhoest
N. E.
&
De Troch
F. P.
(
2005
)
Comparison of data-driven Takagi–Sugeno models of rainfall–discharge dynamics
,
Journal of Hydrology
,
302
,
173
186
.
https://doi.org/10.1016/j.jhydrol.2004.07.001
.
Wunsch
A.
,
Liesch
T.
,
Cinkus
G.
,
Ravbar
N.
,
Chen
Z.
,
Mazzilli
N.
,
Jourde
H.
&
Goldscheider
N.
(
2022
)
Karst spring discharge modeling based on deep learning using spatially distributed input data
,
Hydrology and Earth System Sciences
,
26
,
2405
2430
.
https://doi.org/10.5194/hess-26-2405-2022
.
Zare Abyaneh
H.
,
Bayat Varkeshi
M.
,
Golmohammadi
G.
&
Mohammadi
K.
(
2016
)
Soil temperature estimation using an artificial neural network and co-active neuro-fuzzy inference system in two different climates
,
Arabian Journal of Geosciences
,
9
,
1
10
.
https://doi.org/10.1007/s12517-016-2388-8
.
Zeydalinejad
N.
,
Nassery
H. R.
,
Shakiba
A.
&
Alijani
F.
(
2020
)
Prediction of the karstic spring flow rates under climate change by climatic variables based on the artificial neural network: a case study of Iran
,
Environmental Monitoring and Assessment
,
192
,
1
20
.
https://doi.org/10.1007/s10661-020-08332-z
.
Zhang
L.
,
Zhao
W.
,
He
Z.
&
Liu
H.
(
2009
)
Application of the Takagi–Sugeno fuzzy system for combination forecasting of river flow in semiarid mountain regions
,
Hydrological Processes: An International Journal
,
23
,
1430
1436
.
https://doi.org/10.1002/hyp.7265
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).