This paper investigates the performance of five supervised machine learning algorithms, including support vector machine (SVM), logistic regression (LogR), decision tree (DT), multiple perceptron neural network (MLP-NN), and K-nearest neighbours (KNN) for predicting the water quality index (WQI) and water quality class (WQC) in the coastal aquifer of the Gaza Strip. A total of 2,448 samples of groundwater were collected from the coastal aquifer of the Gaza Strip, and various physical and chemical parameters were measured to calculate the WQI based on weight. The prediction accuracy was evaluated using five error measures. The results showed that MLP-NN outperformed other models in terms of accuracy with an R value of 0.9945–0.9948, compared with 0.9897–0.9880 for SVM, 0.9784–0.9800 for LogR, 0.9464–0.9247 for KNN, and 0.9301–0.9064 for DT. SVM classification showed that 78.32% of the study area fell under poor to unsuitable water categories, while the north part of the region had good to excellent water quality. Total dissolved solids (TDS) was the most important parameter in WQI predictions while and were the least important. MLP-NN and SVM were the most accurate models for the WQI prediction and classification in the Gaza coastal aquifer.

  • Machine learning (ML) algorithms are used for predicting water quality index.

  • Prediction performance of LogR, DT, KNN, SVM, and MLP-NN are compared.

  • MLP-NN and SVM-based prediction and quality classification models performed better than other ML-developed models.

  • Gaza coastal aquifer is experiencing a severe deterioration in water quality, as it is currently unsafe for drinking purposes without adequate treatment.

Graphical Abstract

Graphical Abstract
Graphical Abstract

Water is one of the greatest significant sources found on the Earth's surface and is vital for the sustainability of all life; however, it is a source under persistent risk of contamination by life itself. Rapid urbanisation and industrialisation have deteriorated water quality at a worrying rating, giving rise to many serious illnesses. The poor quality of water consequences is known to be one of the main reasons for the increase in traumatic illnesses (Ahmed et al. 2019). The Gaza Strip is going through a serious crisis of depleting accessible fresh water due to the over-pumping of groundwater, the only resource used for domestic purposes, agriculture, and small industries. The problem has been worsened over time by seawater intrusion into the coastal aquifer and infiltration of poorly treated wastewater into the coastal aquifer (Aish et al. 2015). Water is a vital factor for social and economic growth as well as the sustainability of the environment. Most people in the Gaza Strip live in a catastrophic situation that needs imperative and thoughtful measures to recover the water situation in terms of quantity and quality. The groundwater in the coastal aquifer of the Gaza Strip is almost saline, except for some fresh water in the form of shallow water lenses. Therefore, the amount of drinkable groundwater is very much less and is found only in a small number of regions in the Gaza Strip, for example, Beit Lahiya. To secure fresh water for drinking and agricultural purposes in the Gaza Strip has become a vital and strategic objective for the Palestinian Water Authority (PWA) to accomplish (El Sheikh et al. 2003). Domestic and agricultural water quality has almost completely deteriorated. The assessment and survey of groundwater quality are vital for the optimal applied management system of groundwater resources (Kawo & Karuppannan 2018; Li et al. 2018; Egbueri 2020; Islam et al. 2020a). The increasing growth of people is directly related to the rising demand for potable water (Islam et al. 2017; Rahman et al. 2020). This excessive demand has made researchers more to work and do research in order to develop new methods for predicting water quality (Uddin et al. 2021). Since it is an important factor in the water cycle and drinking water resources, groundwater has been shown to be a global problem under significant global stress (Ahmed et al. 2019; Saha et al. 2020). Thus, the assessment of water quality globally has become of great importance in recent years. Horton (1965) intended the first water quality index (WQI) for the purpose of updating some standards that cover water into a single number to determine the water quality general status. Subsequently, several indicators have been established (Mukate et al. 2019; Hossain & Patra 2020; Islam et al. 2020b). The physiochemical and biological variables involved in estimating the WQI must be reasonably selected to obtain logical results (Abbasi, T. & Abbasi, S. A. 2012). Several WQIs have been implemented by numerous investigators to evaluate the appropriateness of drinking water from underground water wells, wetlands and the water quality of rivers (Islam et al. 2017, 2019, 2021). Though, the worsening of water quality could be generated by many aspects, such as insufficient good sanitation system, contaminants resulting from industries and extreme use of fertiliser for agricultural purposes, climate change, and non-existent management strategy for groundwater (Trevett et al. 2005; Alam et al. 2007; Loecke et al. 2017; Islam et al. 2018). Moreover, water quality assessment includes issues such as large-scale sample collection, laboratory testing, and data manipulation, which are often time-consuming and costly in relation to equipment, chemicals, reagent, and the economic value of the human resources (Tiyasha Tung & Yaseen 2020). Water quality can be impacted by a variety of factors, including the calcium content in natural waters, industrial waste, rocks, and contaminated wastewater, which serve as sources of calcium, leading to poor water quality. Additionally, hard water has been linked to serious and long-term health problems such as heart disease, skin irritation, hair damage, dental issues, respiratory difficulties, and gastrointestinal problems (Sharma & Bhattachary 2017). Impurities in water sources due to chemical, physical, and radioactive pollutants are the most common reason for water pollution (Chatterjee et al. 2017). In fact, the consequences of polluted drinking water are extremely harmful and pose a serious condition to human health, environment, and infrastructure. Rendering to a United Nations (UN) report, 1.5 million people die every year as a result of diseases that result in polluted water. Water pollution is documented to cause 80% of health issues in poor countries. Every year 5 million deaths and 2.5 billion illnesses are documented (Aldhyani et al. 2020). In order to come up with solutions to deal with and overcome these conditions, it was necessary to use realistic, cost-effective, accurate, and quick methods to assess the quality of water. In order to come up with the optimal solution, in this case, an artificial neural network (ANN) model is an alternate choice for creating neural networks that help forecast the underground water quality founded on the results of laboratory analyses that do not require costly and highly advanced measurement tools. Artificial intelligence (AI) technology is a potential and powerful multifunctional method in fields related to the sciences of water (Kisi et al. 2018; Kim et al. 2019; Abba et al. 2020; Bilali et al. 2021; Hayder et al. 2021; Singha et al. 2021). A number of researchers have used AI methods around the world, such as ANN, support vector machine (SVM), and random forest (RF), in several different water-related studies. The learning machine RF algorithm has been developed for its application to predict the quality of groundwater (Singha et al. 2021), and worked on predicting the quality of river waters (Asadollah et al. 2021). ANN technology has been widely used in various fields, including science and engineering. For example, this technology has been used to predict heavy metals pollution (Singha et al. 2020) and the water table prediction (Zhu et al. 2020). Apart from these previously mentioned investigations, many researchers have conducted many studies to predict WQI by assessing the performance of the varied number of AI models. Wang et al. (2017) have used a support vector regression (SVR) model based on swarm optimisation to forecasting WQI. Muharemi et al. (2018) have used neural network and K-nerest neighbours (KNN) algorithm for classification purposes created by the logistic regression algorithm (LogR) to attain a suitable method to solve and report the problem of changes in drinking water quality. Mohammed Pour et al. (2015) have studied the water quality problem, using three different algorithms, including SVM and other two methods of ANNs. Then the performance of the developed networks using root-mean-square error (RMSE), mean absolute error (MAE), and coefficient of determination (R2) was used for comparison purposes. Based on the study's results, the SVM algorithm can compete with neural networks. Najafzadeh and other researchers have conducted multiple studies on the use of machine learning (ML) algorithms for predicting water quality in the Karun River and Rafsanjan Plain, Iran. In their research, they have proposed modifications to SVR, used multivariate adaptive regression spline (MARS), and least square-support vector machine (LS-SVM), and employed gene expression programming (GEP), evolutionary polynomial regression (EPR), model tree (MT), and four well-known DDMs for water quality prediction. In a separate study, they also applied four robust data-driven techniques (DDTs) for predicting groundwater quality index (GQI) values (Najafzadeh & Ghaemi 2019; Najafzadeh et al. 2019, 2021, 2022; Najafzadeh & Niazmardi 2021). A study carried out by Ahmed et al. (2019) applied 15 algorithms owned to AI to predict the WQI, where the classification and regression developed networks outperformed other models. It has been found (Bui et al. 2020a) that the prediction performance of hybrid AI models is better than the classical models of WQI prediction using 4 classical methods and 12 AI hybrid methods. A deep learning machine (Singha et al. 2021) has been used to predict WQI using three conventional models. The prediction results showed that the predictive model of deep learning is a more powerful and precise method than the classical model in predicting groundwater quality. In the study by Kouadri et al. (2021a, 2021b), the authors used various physicochemical and biological variables such as temperature, pH, conductivity, dissolved oxygen, and faecal coliforms to classify water quality using ML algorithms such as LogR, multilayer perceptron (MLP), decision tree (DT), naive Bayes (NB), and KNN. They found that the DT algorithm performed best among the models with a classification accuracy of 99%.

According to the literature reviewed, a variety of AI models have been utilized in different hydrogeological conditions with varying degrees of accuracy for predicting WQI. LogR, DT, long short-term memory (LSTM), SVR, multiple perceptron neural network (MLP-NN), and KNN were among the models used in these studies. However, these AI techniques are not commonly used in hydrology for predicting groundwater quality. A thorough review of the literature revealed that no prior study has examined or validated the performance of these AI methods for predicting the WQI of the coastal aquifer in the Gaza Strip. This study aims to address this gap by investigating the application of five WQI prediction models based on ML in the Gaza Strip coastal aquifer, including LogR, DT, MLP-NN, KNN, and SVM algorithms. This study focuses on the importance of regular assessment of groundwater quality in the Gaza coastal aquifer, as groundwater is a crucial resource for human consumption and is largely impacted by human-caused pollution. To the authors' knowledge, no prior research has been conducted in this area. The novelty of this study is the use of ML techniques to forecast the WQI, which proves to be more efficient than conventional methods. The study investigates several models using five algorithms to predict WQI using various physicochemical variables as inputs. This approach is intended to reduce the time consumed in evaluating the water quality and to allow for quicker predictions in critical situations in the Gaza Strip coastal aquifer. We aspirate that the utilisation of ML algorithms in this study to predict the WQI can provide a more efficient, accurate, and effective means of monitoring and managing the water quality of the Gaza coastal aquifer. Through this, the PWA can acquire a clearer comprehension of the factors affecting the water quality and take proactive measures to maintain or improve its quality.

Study area description

A territory of the Palestinian Authority or the Gaza Strip (or Gaza) is located in the southeastern corner of the Mediterranean Sea but is separated from the other part of the Palestine-West Bank. The area is a narrow section located along the southwestern part of the Palestinian coastal plains. It is located between longitudes [34°20′ and 34°25″E] and latitudes [31°16″ and 31°45″N], and its area is estimated to be about 365 km2 (Figure 1). The length is about 42 km on the western coast of the Mediterranean Sea, and the width varies from 6 to 12 km. The Negev desert is located in the east, the Mediterranean in the west, and the Sinai desert in the south (Aish et al. 2015). The population density in the Gaza Strip is relatively high compared with some other regions in the world (5,936 people/km2), and the population is estimated at 2.1 million people, with a growth rate of 2.8%, and that was at the end of 2021 (PCBS 2021). The Gaza Strip experiences a Mediterranean climate with moderate temperatures, ranging from an average of 25.8 °C in summer to 13.4 °C in winter. The region is located in an arid to semi-arid area and experiences all its precipitation between October and April. The average seasonal rainfall varies between 474 mm/year in the north and 250 mm/year in the south, and there is a risk of severe drought in some years, reaching up to 83% (Al-Najjar et al. 2021). Most of the topographical area of the Gaza Strip is defined as a flat area of increasing slope ranging from 0 to 5%, oriented westward towards the sea, allowing surface runoff. The landscape is basically a front beach plain. Along the coast, there is a sandy beach bordered in the east by a series of sand dunes up to 40 m high. Land surface elevations range from mean sea level (MSL) to about 110 MSL in the eastern parts. Gaza's water resources are mainly limited to that part of the coastal aquifer that lies under its area (Aish et al. 2015). The landscape of the Gaza Strip includes deposits from different geological periods, with irregular stratified limestone sandstones and red-brown layers. The coastal area is a small part of a larger sedimentation system that extends from Alexandria in Egypt to Haifa Bay in Israel. The topography of the coastal plain is shaped by Kurkar Aeolian hills, some of which have deep depressions filled with clay deposits (Zaineldeen & Aish 2012).
Figure 1

Location map of the study area.

Figure 1

Location map of the study area.

Close modal

The Pleistocene granular aquifer in the Gaza Strip is an extension of the coastal aquifer on the Mediterranean coast. It extends from Ashqelon in the north to Rafah in the south and from the sea coast to 10 km inland. The aquifer consists of various layers of dune sandstone, silty clay, and clay that occur as lenses, begin on the coastline, and extend to a distance of about 5 km from the sea. The aquifer is divided into an upper and a deep subaquifer. The aquifer is built on the sea soil clay (Saqiye Group) of the Neocene age, which has a hydraulic conductivity of about 108 m/s. The coastal aquifer, located in the east-south part of the Gaza Strip, is relatively thin and has no apparent subaquifers (Aish 2013). The water demand in the Gaza Strip is influenced by population growth and economic development and is primarily met by pumping from over 8,000 wells. In 2020, around 97 million cubic metres were used for irrigation and 93.5 million cubic metres for domestic and industrial purposes. However, the unsustainable pumping practices have led to the depletion of groundwater reserves and 97% of the water pumped from the coastal aquifer does not meet the water quality standards set by the World Health Organization (WHO). The water table is 19 m below sea level and the sustainable water pumping limit is estimated to be 50–60 million cubic metres annually (PCBS & PWA 2022). The Gaza Strip relies heavily on groundwater from the coastal aquifer as its main source of water. Intensive pumping from over 4,000 wells has led to seawater intrusion, which in turn has affected the quality of the groundwater. The groundwater quality is not consistent throughout the region and is affected by factors such as excessive withdrawal and human activities, leading to high levels of nitrates and chlorides and making much of it unsuitable for drinking (El Baba et al. 2020).

Data collection

This study used water quality analysis results from the PWA of the State of Palestine to evaluate the water quality of five different locations in the coastal aquifer of the Gaza Strip. The dataset consisted of 2,448 samples collected from 272 wells between 2009 and 2018. Missing values were handled using the multiple imputation method and data normalisation was performed to improve data quality. The WQI was calculated and a histogram was created to visualize the data distribution (Figure 2). Correlation was applied to measure the relationship between variables. The dataset was divided into training and test sets and five ML algorithms (SVM, MLP-NN, LogR, DT, and KNN) were applied using 10 parameters to determine the relevance of water quality.
Figure 2

The dataset distribution histogram.

Figure 2

The dataset distribution histogram.

Close modal

After applying these techniques, we focused on modelling and obtaining results from the proposed model, which was also based on the prepared dataset of the test part. Finally, all the obtained results were compared based on the accuracy of the developed models and possibly some existing methods. This methodology ensures the rationality and reliability of the whole study. The dataset for this study was collected from PWA. The dataset contains 10 water quality parameters consisting of physical elements (TDS) and chemical elements (pH, Ca2+, Mg2+, Na+, K+), anions (Cl, , ), and pollution indicators (). These parameters were considered significant values. Table 1 shows the given weights and relative weights of the water quality parameters, and Table 3 shows the statistical analysis of the parameters used in this study.

Table 1

Allocated weights and relative weights of water quality variables

VariableUnitWHOsWeight (wi)Relative weight (Wi)
pH  6.5–8.5 0.103 
TDS mg/L 500 0.172 
Calcium mg/L 75 0.069 
Magnesium mg/L 50 0.069 
Sodium mg/L 200 0.103 
Potassium mg/L 12 0.034 
Chloride mg/L 250 0.138 
Nitrate mg/L 45 0.172 
Sulphate mg/L 250 0.103 
Alkalinity mg/L 120 0.034 
   wi = 29 Wi = 1.000 
VariableUnitWHOsWeight (wi)Relative weight (Wi)
pH  6.5–8.5 0.103 
TDS mg/L 500 0.172 
Calcium mg/L 75 0.069 
Magnesium mg/L 50 0.069 
Sodium mg/L 200 0.103 
Potassium mg/L 12 0.034 
Chloride mg/L 250 0.138 
Nitrate mg/L 45 0.172 
Sulphate mg/L 250 0.103 
Alkalinity mg/L 120 0.034 
   wi = 29 Wi = 1.000 

Box plot analysis and outlier detection

In this study, box plots were used for outlier discovery analysis due to their ability to show the distribution, dispersion, and variation of univariate and categorical data. The box plot analysis showed that most variables had outliers that were considered normal, so an upper bound strategy was used to filter them out. The maximum threshold was set as the variable value just below the outlier values, and this process was repeated for all variables. The outliers were removed manually to avoid losing data, and the researchers were lenient in choosing the upper threshold to preserve the dataset and only penalize values that seemed unlikely to occur (Gazzaz et al. 2012; Ahmed et al. 2019).

WQI calculation

The WQI is one of the most commonly used methods for calculating water quality and its appropriateness for human practices (Bahir et al. 2020; El Baba et al. 2020; Kouadri et al. 2021a, 2021b). The standards recommended by the WHO (2011, 2017) for water quality for drinking purposes were used to calculate the WQI. The weights of the different water quality parameters are considered to be inversely proportional to the corresponding suggested standards (Sadat-Noori et al. 2014). To estimate the WQI, each factor must be assigned a weight (wi) between 1 and 5 based on its importance and impact on drinking water and public human health (Kouadri et al. 2021a, 2021b). The maximum weight of 5 was assigned to variables such as chloride, total dissolved solids, and nitrate due to their great importance in water quality appraisal (Srinivasamoorthy et al. 2008). On the other hand, the bicarbonate and potassium parameters are assigned the minimum weight of (1) since both play an irrelevant role in water quality appraisal. Other variables such as pH, sodium, calcium, sulfate, and magnesium were assigned weights ranging from 2 to 4 depending on their significance in the water quality for drinking purposes (Ketata-Rokbani et al. 2011). In the next step, the relative weight (Wi) of each parameter is calculated using Equation (1). The assigned weights, relative weights, and limits required by WHO are presented in Table 2.
(1)
where wi is the weight of every variable, n is the number of variables, and Wi is the relative weight. Then, in step three, a quality rating scale (qi) is computed for every variable using Equation (2).
(2)
where qi is the quality ranking, Ci is the value of each chemical variable in each water sample in milligrams per litre, and Si is the WHO standard for each chemical variable in milligrams per litre (Table 1). For calculating the WQI, the SI is first computed for each chemical variable using Equation (3), which is then used to calculate WQI according to Equation (4).
(3)
(4)
where SIi is the sub-index rating, qi is the quality rating, and Wi is the relative weight.
Table 2

Classification of groundwater quality

WQI rateWater class
0–50 Excellent 
50–100 Good 
100–200 Poor 
200–300 Very poor 
Above 300 Unsuitable for drinking 
WQI rateWater class
0–50 Excellent 
50–100 Good 
100–200 Poor 
200–300 Very poor 
Above 300 Unsuitable for drinking 

In this study, five types of water quality were classified which are excellent, good, poor, very poor, and unsuitable for human use (Sahu & Sikdar 2008). When the WQI rate is 0–50, then it is excellent; when it is 50–100, then it is good; when it is 100–200, then it is poor; when it is 200–300, then it is very poor, and when it is above 300, then it is unsuitable for drinking. Once the WQI is calculated, each sample's water quality class (WQC) is determined using the WQI in the classification algorithms (Srivastava & Kumar 2013; Ahmed et al. 2019). The classification of groundwater quality is shown in Table 2.

Application of ML-based prediction models

Regression and classification algorithms were used in this study. Regression algorithms were used to estimate the WQI, and classification algorithms were used to classify the samples into the previously determined WQC. Five classification and regression algorithms were used. The algorithms used are listed below:

Logistic regression (LogR)

LogR is a classification algorithm. It is created by applying the logistic function or the sigmoid function. It is the most commonly used algorithm in binary classification, but in this study, multinomial LogR is used because there are more than two classes (Ahmed et al. 2019). During this work, the tuning of hyperparameters was performed using GridSearchCV method to obtain the best parameters, which were then fed into the LogR model that provided optimal prediction results.

Decision tree (DT)

A DT is a simple, understandable algorithm that can be generally used for both regression and recognition. The DT makes decisions based on the values of all relevant input variables after training. It randomly selects the root variable and tends to the values of the other variables accordingly. It contains all parameter decisions organised in a top-down tree and evolves the decision based on different parameter values (Quinlan 1990). The prediction used hyperparameter optimisation using grid search and implementing the standard scaling function on a dataset, as well as cross-validation to avoid overfitting.

K-nearest neighbours (KNN)

The KNN algorithm generally classifies by finding the given points closest to the N neighbours and assigning the class to the majority of the n neighbours. In the case of a tie, various techniques can be applied to solve the problem, such as increasing n or adding a bias towards a category. KNN is not suggested for large datasets because all processing occurs during testing, iterating all training data and computing nearest neighbours every time (Beyer et al. 1999). In this study, KNN was tuned by tune hyperparameter using GridSearchCV for k to achieve the best prediction performance. The best value for n was found to be 9, while k_range = range (1.31).

Support vector machine (SVM)

SVM is generally used for classification but can also be used for regression. When visualising data points plotted on a plane, SVMs outline the hyperplane between classes and expand the margin to maximise the discrimination between two classes, resulting in fewer nearby errors (Ahmed et al. 2019). In this prediction model, the default hyperparameter used was the C value of 1, the gamma value of scale, and the kernel value of ‘rbf’, which gave the best performance results.

Neural network/Multilayer perceptron (MLP)

ANN is a model based on the structure of nodes that consists of multiple layers of neurons. It has an input layer that takes in prediction variables and an output layer that displays the prediction. During training, the model updates the weights of each neuron in each layer by iterating with each training data point. Once trained, the model uses these weights to activate specific units based on the inputs. MLP is a commonly used ANN model for classification and regression tasks (Ahmed et al. 2019). In this study, an MLP-NN with a configuration (10,30,1) of max 100 eps and a learning rate (‘constant’, ‘adaptive’) using Python version 3.9.7 was used for classification and prediction of WQI and WQC.

Model development

In this study, five ML models (LogR, DT, KNN, SVM, and MLP-NN) were developed to predict GQI using Python version 3.9.7. The LogR, SVM, MLP, DT, and KNN packages were installed in Python to perform the analysis. To avoid misleading results, the data were normalised zero-to-one before developing the models. The normalisation method used was minimum and maximum normalisation, which improved the performance of the models by reducing computation time and errors in the fitting process (Tumer & Edebali 2019; Singha et al. 2021). The following equation attends to determine the normalisation of the input data.
(5)
where Xmin is the minimum value, Xmax is the maximum value, and Xn is the normalised value. The normalised input dataset has been divided into two parts, one part called training data and the other is test data. In Bhagat et al. (2020a), it is documented that there is no such standard rule for dividing the datasets during model calibration and testing. A characteristic value for a test dataset should be between 10 and 40% of the total dataset. However, the data division for the computing algorithms will be finalised after the experiments by changing the partition ranges from 75 to 25 or 85 to 15. Finally, the splitting range is finalised based on the minimum error (i.e., minimal RMSE) achieved during model calibration (Singha et al. 2021). In this study, the dataset was split into two datasets before running the models, with 70% of the data used for training and 30% of the data used for testing network performance.

Model calibration

Model calibration involves adjusting the training algorithm by reducing the cost function. Weights are calculated by the model learning from the used training dataset. At this point, a communal deficiency can happen, namely, overfitting, which leads to noise and adverse effects during training. Hyperparameters cannot gain information directly from the dataset used in training, but they increase the complexity to achieve the optimal structure of the model. Therefore, model hyperparameters were set for all models during model calibration.

Model performance metrics

In order to benchmark the effectiveness of the techniques presented in this study, we used five commonly recognised statistical measures: RMSE, MAE, scatter index (SI), discrepancy ratio (DR), RMSE-standard deviation of observations (RSR), and correlation coefficient (R) (Najafzadeh & Oliveto 2020). Moriasi et al. (2007) found that the performance of the model can be classified as very good (RSR 0.50), good (0.50065), satisfactory (0.60050), and unsatisfactory (RSR > 0.70). A low value of mean absolute percentage error denotes high model fidelity and vice versa (Lu & Ma 2020). The mathematical expressions for the numerous metrics used are as follows.
(6)
where is the observed value and is the predicted value.
(7)
where is the actual observation time series values, is the predicted time series values, and N is the number of error values.
(8)
(9)
(10)
where are the number of error values; is the actual observation time series values, and is the predicted time series values. The calculation of the SI involves dividing the RMSE by the average of the observations at each grid point. This produces a percentage representation of the deviation of the RMSE from the mean observation, effectively indicating the expected error percentage for the parameter (Najafzadeh & Oliveto 2020).
(11)

As present in the equation, n is the number of observations in the period under consideration, Oi is the observed first-order value, O is the observed mean value, Pi is the expected value from the first model, and P is the expected mean value from the model (Golmohammadi et al. 2014).

Groundwater quality statistical analysis

The statistical results for the physical and chemical parameters selected for this study are presented in Table 3. Several statistical tools were used in this study, including minimum, maximum, mean, standard deviation, first quarter, third quarter, and skewness and were calculated for each physicochemical parameter. The mean value of TDS (1,540.68 mg/L), Cl (536.75 mg/L), Na+ (346.49 mg/L), (275.03 mg/L), (118.89 mg/L), Ca2+ (93.71 mg/L), and Mg2+ (63.99 mg/L) exceeded the WHO (2011, 2017) the permissible limit of water for the purpose of drinking is taken from the groundwater of the study area. The skewness of the chosen parameters ranged from 0.146 to 24.52. All physical and chemical parameters showed positive skewness (right) and were in the same plane, while K+ was estimated to be significantly skewed. Water samples collected from the coastal aquifer are characterised by total dissolved solids values ranging from 154 to 9,219 mg/L. Sodium and magnesium are the predominant cations and range from 2.7 to 1,470 mg/L and from 1.6 to 379 mg/L, respectively. The water samples have pH between 6.1 and 8.96, nitrate concentrations between 10.2 and 528 mg/L, calcium between 7.05 and 2,376 mg/L, and potassium between 1.06 and 430 mg/L. The pH with a mean value of 7.58 indicates that the groundwater in the study area is predominantly alkaline. Chloride is the predominant anion (25.52–2,759 mg/L) and sulphate (1.42–1,873 mg/L). Groundwater in the Gaza coastal aquifer is largely enriched in moderate Ca2+ and Mg2+ concentrations, mainly due to the geogenic sources, especially carbonates (calcite, aragonite, and dolomite). Therefore, the dissolution of calcite and dolomite plays an important role in releasing Ca2+, Mg2+, and in the groundwater of the study area. Moreover, the enrichment of Ca2+ and Mg2+ may be due to the carbonate dissolution of the aquifer matrix resulting from the occurrence of the ion exchange process. In the Gaza Strip, farmers regularly use chemical fertilisers based on urea, single superphosphate, and nitrogen-phosphate-potassium (N-P-K), which are the main sources of elevated content in groundwater (Singha et al. 2019; El Baba et al. 2020). Referring to the standards of the WHO (2011, 2017), most cations and anions show that the water quality of the study area's coastal aquifer has deteriorated significantly. The correlation matrix is valuable because it independently demonstrates each parameter's significance and influence on water's hydrochemical mechanism (Helena et al. 2000; Islam et al. 2017, 2020b; Patil et al. 2020; Singha et al. 2021). When the values of r in the Pearson correlation matrix (Figure 3) are +1 or −1, they are treated as strong correlation coefficient values and indicate the overall correlation, i.e., useful dependence, between two parameters. When the values are closer to zero, it means that there is no living correlation between two variables at the p < 0.05 level (Singh et al. 2011; Patil et al. 2020). When r is greater than 0.75, the parameters are highly correlated, and when r is between 0.5 and 0.75, the parameters are moderately correlated. The correlation matrix was used in this study to account for the interaction between the physicochemical parameters and the WQI values. The WQI, which is the focus of this study, has a very weak correlation with , pH, and nitrate, a moderate correlation with K+, and a strong correlation with TDS, Ca2+, Mg2+, Na+, , and chloride. Water TDS correlates negatively with pH and positively with r < 0.45 with , Ca2+, and Cl, 0.45 > r > 0.7 with TDS, Mg2+, Na+, and WQI, r > 0.75 with , which has a strong correlation. Sodium correlates moderately with calcium, and Mg2+ correlates moderately with and WQI, while there is no correlation with the other parameters. pH does not correlate with other parameters, with an r coefficient ranging from 0.033 to −0.26. The correlation of TDS with , K+, and is weak, while all parameters Ca2+, Mg2+, Na+, , Cl, and WQI show a strong correlation with it. has no relationship with Ca2+, Mg2+, Na+, K+, , Cl, and WQI, whereas Ca2+, Mg2+, Na+, , and Cl have a strong and moderate correlation with each other.
Table 3

Basic statistics of physical and chemical parameters of groundwater in the study area

ParametersMin1st QuartileMean3rd QuartileSDMaxSkewnessWHO StandardsNumber of samples exceeding standards
(mg/L) 45.23 211 275.03 323 100.35 1,568.4 1.53 120 2,382 
(mg/L) 1.42 43 164.77 241 160.59 1,873 1.94 250 565 
Cl (mg/L) 25.52 170 536.75 784.1 445.69 2,759 1.40 75 1,570 
Na+ (mg/L) 2.7 110 346.49 520 273.18 1,470 0.92 200 1,432 
Mg2+ (mg/L) 1.62 38 63.99 79 39.83 379 1.87 50 1,312 
K+ (mg/L) 1.06 2.8 5.99 6.2 11.13 430 24.52 12 195 
Ca2+ (mg/L) 7.05 55 93.71 114 75.77 2,376 12.89 75 1,344 
(mg/L) 10.2 58.44 118.89 155 81.38 528 1.53 45 2,085 
TDS (mg/L) 154 727.3 1,540.82 2,141 989.6473 9,219 1.25 500 2,249 
pH 6.1 7.3 7.58 7.84 0.394 8.96 0.146 6.5–8.5 35 
WQI 33.53 105.20 184.21 249.05 94.37 499.57 0.719 – – 
ParametersMin1st QuartileMean3rd QuartileSDMaxSkewnessWHO StandardsNumber of samples exceeding standards
(mg/L) 45.23 211 275.03 323 100.35 1,568.4 1.53 120 2,382 
(mg/L) 1.42 43 164.77 241 160.59 1,873 1.94 250 565 
Cl (mg/L) 25.52 170 536.75 784.1 445.69 2,759 1.40 75 1,570 
Na+ (mg/L) 2.7 110 346.49 520 273.18 1,470 0.92 200 1,432 
Mg2+ (mg/L) 1.62 38 63.99 79 39.83 379 1.87 50 1,312 
K+ (mg/L) 1.06 2.8 5.99 6.2 11.13 430 24.52 12 195 
Ca2+ (mg/L) 7.05 55 93.71 114 75.77 2,376 12.89 75 1,344 
(mg/L) 10.2 58.44 118.89 155 81.38 528 1.53 45 2,085 
TDS (mg/L) 154 727.3 1,540.82 2,141 989.6473 9,219 1.25 500 2,249 
pH 6.1 7.3 7.58 7.84 0.394 8.96 0.146 6.5–8.5 35 
WQI 33.53 105.20 184.21 249.05 94.37 499.57 0.719 – – 
Figure 3

The Pearson correlation coefficient between input and output variables.

Figure 3

The Pearson correlation coefficient between input and output variables.

Close modal

Spatial distribution of WQI

The weighting of physicochemical parameters according to the weighted arithmetic WQI method (Table 1) is used to calculate the GQI. The WQI value varies from 33.53 to 499.57 within the study area with a mean, median, standard deviation, and skewness value of 184.12, 60.5, 94.73, and 0.719, respectively (Table 3). The maximum and minimum weights of TDS and (0.172) and and K+ (0.034) indicate their highest and lowest significance, respectively, in the calculation of WQI (Table 1). The WQIs are then incorporated into ArcGIS 10.3 using the ordinary Kriging and inverse distance-weighted (IDW) methods. Of these two methods, the IDW interpolation is chosen to produce the spatial distribution map (Figure 4) because it has the lowest error values (MSE and RMSE) compared with Kriging. The calculated quality indices of groundwater samples are divided into five subcategories: <50 (Excellent), 50–100 (Good), 101–200 (Poor), 201–300 (Very Poor), and >300 not fit for drinking (unsuitable water) (Gorgij et al. 2017). The results show that a total of 26, 505, 964, 608, and 346 samples are classified into the groundwater categories of ‘excellent’, ‘good’, ‘poor’, ‘very poor’, and ‘unsuitable’, respectively. Figure 3 shows that most of the study area falls into the categories of poor, very poor, and unsuitable water (about 78.32% of the total area), while mainly the northern part of the study area falls into the quality category of ‘good’ (Figure 3). The relatively low groundwater quality in these areas could be due to seawater intrusion or continued intensive agricultural use in this area.
Figure 4

Spatial distribution map of WQI of the study area.

Figure 4

Spatial distribution map of WQI of the study area.

Close modal

Performance of developed ML models

ML algorithms are used in a number of studies across areas with a high accuracy rate. This study proposed that ML methods can predict the WQI in the Gaza Strip coastal aquifer with both accuracy and precision. This study compared five ML algorithms under classification and regression modes. Three of the five algorithms accurately predicted the groundwater class and quality parameters. The data were split into training and testing data, and their respective performances were good. Groundwater monitoring is always expensive, and ML algorithms can make classification and regression accurate for the unknown or yet to be explored for WQI. This study can be extended to wastewater quality parameters and propagation of pollutants. The coastal aquifer of Gaza selected for this study is not conducive to systematic groundwater investigations due to topographic inconvenience and the unstable political situation. It calls for a need to use ML algorithms. In this study, two types of supervised ML algorithms are employed, i.e., classification and regression. In this study, we have worked on evaluating the results obtained from the two types of algorithms differently. Several evaluation metrics are used for regression, such as RMSE, MAE, R, DR, SI, and RSR and the following measures are used for classification: accuracy, precision, recall, and FI score.

Results for classification algorithms

Classification algorithms were used to predict the WQC assigned to the samples based on their previously calculated WQI. The 10 parameters listed in Table 3 were used for classification. At the beginning, the same 10 parameters were considered. The prediction results showed that SVM performed better than the other algorithms in training the dataset in such an environment, with an accuracy of 0.95, a precision of 0.95, a recall of 0.96, and an F1 score of 0.95, as given in Table 4. In this part, we went through the results of our study and found that SVM performs better in predicting WQC, while MLP-NN performs better than the other three learning machines (Table 4). These results are significantly better than the results of a previous research study conducted by Nair & Vijaya (2022), which had an accuracy of 0.61, a precision of 0.43, a recall of 0.51, and an F1 score of 0.42.

Table 4

Performance metrics for classification models result

AlgorithmAccuracyPrecisionRecallF1 score
Logistic regression 0.91 0.90 0.86 0.89 
Decision tree 0.90 0.85 0.89 0.87 
K-nearest neighbours 0.87 0.89 0.78 0.81 
SVM 0.95 0.95 0.96 0.95 
MLP-NN 0.92 0.93 0.93 0.93 
AlgorithmAccuracyPrecisionRecallF1 score
Logistic regression 0.91 0.90 0.86 0.89 
Decision tree 0.90 0.85 0.89 0.87 
K-nearest neighbours 0.87 0.89 0.78 0.81 
SVM 0.95 0.95 0.96 0.95 
MLP-NN 0.92 0.93 0.93 0.93 

Results for regression algorithms

Table 5 presents the results obtained in terms of MSE, RMSE, MAE, R Squared, and RSR values with the training and testing dataset for groundwater quality prediction using the calculated WQI. Comparison of these values indicates a slight improvement in WQI performance with the adjusted MLP-NN [RMSE = (0.1158–0.1106), MAE = (0.0099–0.0095), R = (0.9945–0.9948), DR = (0.0012–0.0011), SI = (0.0629–0.0601) ,and RSR = (0.1043–0.1007)] as compared with the SVM [RMSE = (0.1583–0.1690), MAE = (0.0169–0.0204), R = (0.9897–0.9880), DR = (0.0017–0.0018), SI = (0.0859–0.0919), and RSR = (0.1426–0.1539)], LogR [RMSE =(0.2291–0.2182), MAE = (0.0373–0.0340), R = (0.9784–0.9800), DR = (0.0024–0.0023), SI = (0.1243–0.1187), and RSR = (0.2063–0.1987)], KNN [RSME = (0.3582–0.4173), MAE = (0.0886–0.0204), R = (0.9464–0.9247), DR = (0.0037–0.0044), SI = (0.1944–0.2271), and RSR = (0.3226–0.3801)], and DT [RMSE = (0.4079–0.4607), MAE = (0.1022–0.1387), R = (0.9301–0.9064), DR = (0.0043–0.0048), SI = (0.2214–0.2507), and RSR = (0.3621–0.4196)]. From the obtained results, it is clear that the MLP-NN algorithm is the most efficient regression model, although SVM and LogR were found to perform well in the prediction of the WQI, while the performance of KNN and DT algorithms was slightly poorer in comparison with the mentioned above algorithms (Table 5). Supervised learning-based algorithms prediction models (LogR, DT, KNN, SVM, and MLP) are trained and validated, and then their performances are compared to find the best prediction model for the present study. Table 4 shows that the developed MLP-NN model attained the highest precision in terms of MSE (0.0134), MAE (0.0099), RMSE (0.1158), RSR (0.1043), DR (0.0012), SI (0.0629), and R (0.9948) throughout model training compared with the other models where it is understood that MLP algorithm performs good with the current datasets. Moreover, as compared with the other five ML models, the MLP-NN-based prediction model attains the highest accuracy during model testing in terms of RMSE (0.1106), MAE (0.0095), RSR (0.1007), DR (0.0011), SI (0.0601), and R (0.9948) (Table 5).

Table 5

Performance metrics for regression models result

AlgorithmRMSEMAEDRSIRRSR
Logistic regression       
 Training data 0.2291 0.0373 0.0024 0.1243 0.9784 0.2063 
 Testing data 0.2182 0.0340 0.0023 0.1187 0.9800 0.1987 
Decision tree       
 Training data 0.4079 0.1022 0.0043 0.2214 0.9301 0.3621 
 Testing data 0.4607 0.1387 0.0048 0.2507 0.9064 0.4196 
K-nearest neighbours       
 Training data 0.3582 0.0886 0.0037 0.1944 0.9464 0.3226 
 Testing data 0.4173 0.1251 0.0044 0.2271 0.9247 0.3801 
SVM       
 Training data 0.1583 0.0169 0.0017 0.0859 0.9897 0.1426 
 Testing data 0.1690 0.0204 0.0018 0.0919 0.9880 0.1539 
MLP-NN       
 Training data 0.1158 0.0099 0.0012 0.0629 0.9945 0.1043 
 Testing data 0.1106 0.0095 0.0011 0.0601 0.9948 0.1007 
AlgorithmRMSEMAEDRSIRRSR
Logistic regression       
 Training data 0.2291 0.0373 0.0024 0.1243 0.9784 0.2063 
 Testing data 0.2182 0.0340 0.0023 0.1187 0.9800 0.1987 
Decision tree       
 Training data 0.4079 0.1022 0.0043 0.2214 0.9301 0.3621 
 Testing data 0.4607 0.1387 0.0048 0.2507 0.9064 0.4196 
K-nearest neighbours       
 Training data 0.3582 0.0886 0.0037 0.1944 0.9464 0.3226 
 Testing data 0.4173 0.1251 0.0044 0.2271 0.9247 0.3801 
SVM       
 Training data 0.1583 0.0169 0.0017 0.0859 0.9897 0.1426 
 Testing data 0.1690 0.0204 0.0018 0.0919 0.9880 0.1539 
MLP-NN       
 Training data 0.1158 0.0099 0.0012 0.0629 0.9945 0.1043 
 Testing data 0.1106 0.0095 0.0011 0.0601 0.9948 0.1007 

To better understand the accuracy of the developed models, a plot is made from the best-fit line of the best-obtained model (Figure 5). Figure 4 displays the distribution of the predicted values of the model with respect to the test values, which are closer to the best-fit line in the case of the MLP model than the others, confirming the validity of the developed MLP-NN model. In general, the MLP-NN model agreed well with the actual values in the testing phase, while this was not the case for the DT and KNN models. The DT and KNN models obtained the weakest performance in the training and testing phases. Referring to Bui et al. (2020b), neural network-based models performed very accurately even on a poorly structured dataset and are able to achieve high accuracy compared with tree-based models, especially RF. Based on the obtained R values, the performance of the predictive models in the present study followed the order of MLP-NN > SVM > LogR > KNN > DT. Compared with some previous studies, WQI prediction models were developed using the RF algorithm and gradient boosting (GB) algorithm, and the XGBoost model was more efficient compared with the RF model (MAE 2.3053; MSE 9.5669, RMSE 3.0930) with a decrease in values of MAE 1.9642, MSE 7.2011, and RMSE 2.6835 (Ahmed et al. 2019). Additionally, a higher coefficient of determination was found for the XGBoost model (R2 0.7485) compared with the RF model (R2 0.6705). However, in another study, researchers developed an ANN prediction model using a Bayesian regularisation algorithm to predict WQI with a correlation coefficient (R) of 0.94 to test model accuracy, which showed a successful prediction of WQI in that study (Singha et al. 2021).
Figure 5

Plot showing prediction versus testing data of MLP-NN, SVM, LogR, KNN, and DT models.

Figure 5

Plot showing prediction versus testing data of MLP-NN, SVM, LogR, KNN, and DT models.

Close modal

Besides, an innovative model named LSTM was developed to predict some of the water quality parameters such as pH and temperature, with a notable value of R2 98.56 and 98.97% in the short-term, while R2 is 95.76 and 96.88% in the long-term prediction (Hu et al. 2018). Moreover, in a recent study, a significant reduction in RMSEs, i.e., 84.98, 76.81, and 82.83%, was noticed in the case of the DL model compared with RF, XGBoost, and ANN model, respectively (Singha et al. 2021). The results obtained in this study were found to be similar to the results of previous studies for predicting the WQI. However, the results in this study were significantly better than those of a study conducted by Nair & Vijaya (2022), which used an MLP model and had an RMSE of 2.432 and an MAE of 1.9143. This suggests that the method used in this study was more effective in predicting the WQI compared with the previous study.

Variable importance by ML models

The input variables play an important role as they control the stability and robustness of the predictive model (Beyene et al. 2009; Gültekin & Erdogdu Sakar 2018). The importance of the input variables in predicting the WQI is calculated by the predictive models (Figure 6). It can be seen that TDS has a significant impact on the prediction of WQI by reaching higher relative importance in all prediction models (LogR 0.38, MLP 0.36, SVM 0.36, KNN 0.35, and DT 0.31). The most thrilling fact is that the TDS value has a comparatively higher relative importance than the other variables in most cases (Figure 5). The pH variable also has higher relative importance in all models MLP-NN (0.23), SVM (0.21), KNN (0.21), LogR (0.19), and DT (0.15). High variation in variable importance is observed for and Mg2+ (Figure 6). The results also show that MLP-NN, SVM, LogR, KNN, and DT have a similar trend of high relative importance for Ca2+ for predicting WQI (Figure 6). A comparable situation was observed for and , which obtain lower relative importance in the SVM model (0.006–0.005) and KNN model (0.006–0.009), whereas MLP, LogR, and DT relatively show moderate relative importance (0.02–0.01), (0.02–0.02), and (0.02–0.02), respectively, of these two variables for predicting WQI. Figure 6 obviously shows that most models assign moderate relative importance to NO3, with the exception of the KNN model, which identifies this variable as highly significant (DT 0.12) for predicting WQI. The identification of the minimum significant variables is slightly unclear among the models. reaches the smallest significance in the KNN (0.006) and SVM (0.006) models, while MLP-NN and LogR highpoint (0.01) and (0.02) as the minimum significant variables in predicting WQI. It is noteworthy that all models give high priority to the variable TDS, estimating large values for the relative importance of this variable with percentages as follows: (LogR 38%, MLP-NN 36%, SVM 36%, KNN 35%, and DT 31%) compared with the other variables, indicating that the prediction calculation of all ML models is strongly influenced by TDS (Figure 6). The calculated relative importance of the input variables depends on the optimisation algorithm used for each prediction model. Comparing Table 1 and Figure 6, it is clear that the changing order of priorities in model calculation with ML follows a comparable trend to the relative weights of the calculated physical and chemical parameters by the WQI calculation method, which justifies the robustness and consistency of the developed ML algorithms used in the current study. In conclusion, it is clear from analysing the results of applying MLP and SVM predictive learning models to predict changes in groundwater quality that the results are logical and highly realistic. Based on the good results, it can be said that the applied MLP-NN and SVM learning algorithms can accurately determine the importance of input variables for accurate water quality prediction and can be used in other areas instead of classical methods.
Figure 6

Relative importance of input variables from LogR, DT, KNN, MLP, and SVM models.

Figure 6

Relative importance of input variables from LogR, DT, KNN, MLP, and SVM models.

Close modal

The WQI is traditionally calculated using water quality parameters determined by time-consuming laboratory analyses. This work investigated alternative ML methods for predicting the WQI of the Gaza coastal aquifer, and some studies were found to apply them. In this study, three ML models performed well in classifying and predicting the WQI class, which may be due to a large amount of data and normalisation of the data between zero and one. The results showed that MLP-NN performed better in WQI prediction and SVM performed better in WQC classification of water, while LogR performed better than the other two learning machines. The prediction performances of all models were compared using performance metrics, and the result clearly indicates that the prediction model MLP-NN exceeded the other models by attaining the highest model training and testing accuracy in terms of MLP-NN RMSE = (0.1158–0.1106), MAE = (0.0099–0.0095), R = (0.9945–0.9948), DR = (0.0012–0.0011), SI = (0.0629–0.0601), and RSR = (0.1043–0.1007). Though, the model MLP-NN was found to have relatively higher predictive performance than the SVM, LogR, KNN, and DT models. The order of performance of the developed models is MLP-NN > SVM > LogR > KNN and > DT according to their R values in the test phase, i.e., 0.9948, 0.9880, 0.9800, 0.9247, and 0.9064, respectively. The result also confirms that the MLP-NN and SVM models are able to effectively identify the significance of input variables in the WQI calculation. Therefore, the developed ML models MLP-NN and SVM models can be efficiently used to evaluate the groundwater quality in the current study area. In addition, these models can also be used in various water research studies, such as predicting water quality parameters. The only limitation of this study is that the sample dataset was collected once a year, i.e., the samples were collected in one of the seasons considered in the predictive models. Data from multiple seasons may lead to a deeper understanding of groundwater quality in this study. This consideration is likely to promote the high adoption of ML models in the field of water resource studies. The present study can be further improved by investigating the predictive capability of the developed models in comparison with other ML models considering various possible hydrometeorological inputs.

The authors appreciate the Palestinian Water Authority for providing the required water quality dataset and information to complete this research.

This research has not received any kind of funding.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Abba
S. I.
,
Hadi
S. J.
,
Sammen
S. S.
,
Salih
S. Q.
,
Abdulkadir
R. A.
,
Pham
Q. B.
&
Yaseen
Z. M.
2020
Evolutionary computational intelligence algorithm coupled with a self-tuning predictive model for water quality index determination
.
J. Hydrol.
587
,
124974
.
Abbasi
T.
&
Abbasi
S. A.
2012
Water Quality Indices. Elsevier. 384 pp. Hardback ISBN 978-0-444-54304-2 USD 131.00-eBook; ISBN 978-0-444-54305-9 USD 131.00-Elsevier
.
Ahmed
U.
,
Mumtaz
R.
,
Anwar
H.
,
Shah
A. A.
,
Irfan
R.
&
García-Nieto
J.
2019
Efficient water quality prediction using supervised machine learning
.
Water
11
(
11
),
2210
.
https://doi.org/10.3390/w11112210
.
Aish
A. M.
2013
Drinking water quality assessment of the Middle Governorate in the Gaza Strip, Palestine
.
Water Resour. Ind.
4
(
2013
),
13
20
.
https://doi.org/10.1016/j.wri.2013.09.004
.
Aish
A. M.
,
Zaqoot
H. A.
&
Abdeljawad
S. M.
2015
Artificial neural network approach for predicting reverse osmosis desalination plants performance in the Gaza Strip
.
Desalination
367
(
2015
),
240
247
.
Alam
M. J.
,
Islam
M. R.
,
Muyen
Z.
,
Mamun
M.
&
Islam
S.
2007
Water quality parameters along rivers
.
Int. J. Environ. Sci. Technol.
4
(
1
),
159
167
.
Aldhyani
T. H. H.
,
Al-Yaari
M.
,
Alkahtani
H.
&
Maashi
M.
2020
Water quality prediction using artificial intelligence algorithms
.
Appl. Bionics Biomech.
2020
,
6659314
.
https://doi.org/10.1155/2020/6659314
.
Al-Najjar
H.
,
Ceribasib
G.
&
Ahmet Iyad Ceyhunlub
A. I.
2021
Effect of unconventional water resources interventions on the management of Gaza coastal aquifer in Palestine
.
Water Supply
21
(
8
),
4205
.
doi:10.2166/ws.2021.170
.
Asadollah
S. B. H. S.
,
Sharafati
A.
,
Motta
D.
&
Yaseen
Z. M.
2021
River water quality index prediction and uncertainty analysis: a comparative study of machine learning models
.
J. Environ. Chem. Eng.
9
,
104599
.
https://doi.org/10.1016/j.jece.2020.104599
.
Bahir
M.
,
Ouhamdouch
S.
,
Ouazar
D.
&
El Moçayd
N.
2020
Climate change effect on groundwater characteristics within semi-arid zones from western Morocco
.
Groundw. Sustain. Dev.
11
,
100380
.
https://doi.org/10.1016/j.gsd.2020.100380
.
Beyene
J.
,
Atenafu
E. G.
,
Hamid
J. S.
,
To
T.
&
Sung
L.
2009
Determining relative importance of variables in developing and validating predictive models
.
BMC Med. Res. Methodol.
9
(
1
),
1
10
.
https://doi.org/10.1186/1471-2288-9-64
.
Beyer
K.
,
Goldstein
J.
,
Ramakrishnan
R.
&
Shaft
U.
1999
When is ‘nearest neighbor’ meaningful?
In:
Proceedings of the International Conference on Database Theory
,
10–12 January 1999
,
Jerusalem, Israel
, pp.
217
235
.
Bui
D. T.
,
Khosravi
K.
&
Tiefenbacher
J.
2020a
Improving prediction of water quality indices using novel hybrid machine-learning algorithms
.
Sci. Total Environ
.
15
,
721
.
137612
.
https://doi.org/10.1016/j.scitotenv.2020.137612
.
Bui
D. T.
,
Tsangaratos
P.
,
Nguyen
V. T.
,
Van Liem
N.
&
Trinh
P. T.
2020b
Comparing the prediction performance of a deep learning neural network model with conventional machine learning models in landslide susceptibility assessment
.
Catena
188
,
104426
.
https://doi.org/10.1016/j.catena.2019.104426
.
Chatterjee
S.
,
Rodriguez
C. L.
&
Rasio
F. A.
2017
Binary black holes in dense star clusters: exploring the theoretical uncertainties
.
Astrophys. J.
834
,
68
(22 pp.). doi:10.3847/1538-4357/834/1/68
.
El Sheikh
R.
,
Ahmed
M.
&
Hamdan
S.
2003
Strategy of water desalination in the Gaza Strip
.
Desalination
156
(
2003
),
39
42
.
Gazzaz
N. M.
,
Yusoff
M. K.
,
Aris
A. Z.
,
Juahir
H.
&
Ramli
M. F.
2012
Artificial neural network modeling of the water quality index for Kinta River (Malaysia) using water quality variables as predictors
.
Mar. Pollut. Bull.
2012
(
64
),
2409
2420
.
Golmohammadi
G.
,
Shiv Prasher
S.
,
Madani
A.
&
Rudra
R.
2014
Evaluating three hydrological distributed watershed models: MIKE-SHE, APEX, SWAT
.
Hydrology
1
,
20
39
.
doi:10.3390/hydrology1010020
.
Gültekin
B.
&
Erdogdu Sakar
B.
2018
Variable importance analysis in default prediction using machine learning techniques
. In:
DATA 2018: Proceedings of the 7th International Conference on Data Science, Technology and Applications
,
July 2018
, pp.
56
62
.
https://doi.org/10.5220/0006872400560062
.
Hayder
G.
,
Kurniawan
I.
&
Mustafa
H. M.
2021
Implementation of machine learning methods for monitoring and predicting water quality parameters
.
Biointerface Res. Appl. Chem.
11
(
2
),
9285
9295
.
Helena
B.
,
Pardo
R.
,
Vega
M.
,
Barrado
E.
,
Fernandez
J. M.
&
Fernandez
L.
2000
Temporal evolution of groundwater composition in an alluvial aquifer (Pisuerga River, Spain) by principal component analysis
.
Water Res.
34
(
3
),
807
816
.
Horton
R. K.
1965
An index number system for rating water quality
.
J. Water Pollut. Control Fed.
37
(
3
),
300
306
.
Hossain
M.
&
Patra
P. K.
2020
Water pollution index – a new integrated approach to rank water quality
.
Ecol. Indic.
117
,
106668
.
https://doi.org/10.1016/j.ecolind.2020.106668
.
Islam
A. R. M. T.
,
Ahmed
N.
,
Bodrud-Doza
M.
&
Chu
R.
2017
Characterising groundwater quality ranks for drinking purposes in Sylhet district Bangladesh, using entropy method, spatial autocorrelation index and geostatistics
.
Environ. Sci. Pollut. Res.
24
(
34
),
26350
26374
.
https://doi.org/10.1007/s11356-017-0254-1
.
Islam
A. R. M. T.
,
Shen
S.
&
Haque
M. A.
2018
Assessing groundwater quality and its sustainability in Joypurhat district of Bangladesh using GIS and multivariate statistical approaches
.
Environ. Dev. Sustain.
20
(
5
),
1935
1959
.
https://doi.org/10.1007/s10668-017-9971-3
.
Islam
A. R. M. T.
,
Bodrud-doza
M.
,
Rahman
M. S.
,
Amin
S. B.
,
Chu
R.
&
Mamun
H. A.
2019
Sources of trace elements identification in drinking water of Rangpur district, Bangladesh and their potential health risk following multivariate techniques and Monte-Carlo simulation
.
Groundw. Sustain. Dev.
9
,
100275
.
https://doi.org/10.1016/j.gsd.2019.100275
.
Islam
A. R. M. T.
,
Mamun
A. A.
,
Rahman
M. M.
&
Zahid
A.
2020a
Simultaneous comparison of modified-integrated water quality and entropy weighted indices: implication for safe drinking water in the coastal region of Bangladesh
.
Ecol. Indic.
113
,
106229
.
https://doi.org/10.1016/j.ecolind.2020.106229
.
Islam
A. R. M. T.
,
Siddiqua
M. T.
,
Zahid
A.
,
Tasnim
S. S.
&
Rahman
M. M.
2020b
Drinking appraisal of coastal groundwater in Bangladesh: an approach of multi-hazards towards water security and health safety
.
Chemosphere
255
,
126933
.
https://doi.org/10.1016/j.chemo sphere.2020.126933
.
Islam
A. R. M. T.
,
Talukdar
S.
&
Mahato
S.
2021
Machine learning algorithm-based risk assessment of riparian wetlands in Padma River Basin of Northwest Bangladesh
.
Environ. Sci. Pollut. Res.
28
,
34450
34471
.
https://doi.org/10.1007/s11356-021-12806-z
.
Kawo
N. S.
&
Karuppannan
S.
2018
Groundwater quality assessment using water quality index and GIS technique in Modjo River Basin, central Ethiopia
.
J. African Earth Sci.
147
,
300
311
.
https://doi.org/10.1016/j.jafrearsci.2018.06.034
.
Ketata-Rokbani
M.
,
Gueddari
M.
&
Bouhlila
R.
2011
Use of geographical information system and water quality index to assess groundwater quality in El Khairat Deep Aquifer (Enfidha, Tunisian Sahel)
.
Iran. J. Energy Environ.
2
(
2
),
133
144
.
Kim
J.
,
Han
H.
,
Johnson
L. E.
,
Lim
S.
&
Cifelli
R.
2019
Hybrid machine learning framework for hydrological assessment
.
J. Hydrol
.
577
,
October 2019, 123913. https://doi.org/10.1016/j.jhydrol.2019.123913
.
Kisi
O.
,
Azad
A.
,
Kashi
H.
,
Saeedian
A.
,
Ali
S.
,
Hashemi
A.
&
Ghorbani
S.
2018
Modeling groundwater quality parameters using hybrid neuro-fuzzy methods
.
Water Resour. Manage
.
33
(
2
), pp.
847
861
.
https://doi.org/10.1007/s11269-018-2147-6
.
Kouadri
S.
,
Elbeltagi
A.
,
Towfiqul Islam
A.
&
Kateb
S.
2021a
Performance of machine learning methods in predicting water quality index based on irregular data set: application on IIIizi region (Algerian southeast)
.
Appl. Water Sci.
11
,
190
.
https://doi.org/10.1007/s13201-021-01528-9
.
Kouadri
S.
,
Kateb
S.
&
Zegait
R.
2021b
Spatial and temporal model for WQI prediction based on back-propagation neural network, application on EL MERK region (Algerian southeast)
.
J. Saudi Soc. Agric. Sci.
20
(
5
),
324
336
.
Loecke
T. D.
,
Burgin
A. J.
,
Riveros-Iregui
D. A.
,
Ward
A. S.
,
Thomas
S. A.
,
Davis
C. A.
&
Clair
M. A. S.
2017
Weather whiplash in agricultural regions drives deterioration of water quality
.
Biogeochemistry
133
(
1
),
7
15
.
Lu
H.
&
Ma
X.
2020
Hybrid decision tree-based machine learning models for short-term water quality prediction
.
Chemosphere
249
,
126169
.
https://doi.org/10.1016/j.chemosphere.2020.126169
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
Trans. ASABE
50
(
3
),
885
900
.
https://doi.org/10.13031/2013.23153
.
Muhammad Pour
R.
,
Shahrauddin
S.
,
Chang
C. K.
,
Zakaria
N. A.
,
AB Ghani
A.
&
Chan
N. W.
2015
Prediction of water quality index in constructed wetlands using support vector machine
.
Environ. Sci. Pollut. Res.
22
(
8
),
6208
6219
.
Muharemi
F.
,
Logofătu
D.
,
Andersson
C.
&
Leon
F.
2018
Approaches to building a detection model for water quality: a case study
. In:
(Sieminski, A., Kozierkiewicz, A., Nunez, M., Ha, Q.)
Modern Approaches for Intelligent Information and Database Systems
.
Springer
,
Cham
, pp.
173
183
.
Mukate
S.
,
Wagh
V.
,
Panaskar
D.
,
Jacobs
J. A.
&
Sawant
A.
2019
Development of new integrated water quality index (IWQI) model to evaluate the drinking suitability of water
.
Ecol. Indic.
101
,
348
354
.
https://doi.org/10.1016/j.ecolind.2019.01.034
.
Nair
J. P.
&
Vijaya
M. S.
2022
River water quality prediction and index classification using machine learning. J. Phys. Conf. Ser. 2325. International Conference on Electronic Circuits and Signalling Technologies. https://doi.org/10.1088/1742-6596/2325/1/012011
.
Najafzadeh
M.
&
Ghaemi
A.
2019
Prediction of the five-day biochemical oxygen demand and chemical oxygen demand in natural streams using machine learning methods
.
Environ. Monit. Assess.
191
(
6
),
380
.
https://doi.org/10.1007/s10661-019-7446-8
.
Najafzadeh
M.
&
Niazmardi
S.
2021
A novel multiple-Kernel support vector regression algorithm for estimation of water quality parameters
.
Nat. Resour. Res.
30
(
5
),
3761
3775
.
https://doi.org/10.1007/s11053-021-09895-5
.
Najafzadeh
M.
&
Oliveto
G.
2020
Riprap incipient motion for overtopping flows with machine learning models
.
J. Hydroinformatics
22
(
4
),
749
767
.
https://doi.org/10.2166/hydro.2020.129
.
Najafzadeh
M.
,
Ghaemi
A.
&
Emamgholizadeh
S.
2019
Prediction of water quality parameters using evolutionary computing-based formulations
.
Int. J. Environ. Sci. Technol.
16
(
10
),
6377
6396
.
Najafzadeh
M.
,
Homaei
F.
&
Mohamadi
S.
2022
Reliability evaluation of groundwater quality index using data-driven models
.
Environ. Sci. Pollut. Res.
29
(
6
),
8174
8190
.
https://doi.org/10.1007/s11356-021-16158-6
.
Palestinian Central Bureau of Statistics (PCBS)
2021
About 13 Million Palestinians in the Historical Palestine and Diaspora, on the Occasion of the International Population day 11/7/2019
.
Available from: http://www.pcbs.gov.ps/post.aspx?lang=en&ItemID=3503 (accessed 20 June 2022)
.
Patil
V. B. B.
,
Pinto
S. M.
,
Govindaraju
T.
,
Hebbalu
V. S.
,
Bhat
V.
&
Kannanur
L. N.
2020
Multivariate statistics and water quality index (WQI) approach for geochemical assessment of groundwater quality – a case study of Kanavi Halla Sub-Basin, Belagav India
.
Environ. Geochem. Health
42
(
9
),
2667
2684
.
Quinlan
J. R.
1990
Decision trees and decision-making
.
IEEE Trans. Syst. Man Cybern.
1990
(
20
),
339
346
.
Rahman
M. M.
,
Bodrud-Doza
M.
,
Siddique
T.
,
Zahid
A.
&
Islam
A. R. M. T.
2020
Spatiotemporal distribution of fluoride in drinking water and associated probabilistic human health risk appraisal in the coastal region, Bangladesh
.
Sci. Total Environ.
724
,
138316
.
https://doi.org/10.1016/j.scitotenv.2020.138316
.
Sadat-Noori
S. M.
,
Ebrahimi
K.
&
Liaghat
A. M.
2014
Groundwater quality assessment using the water quality index and GIS in Saveh-Nobaran aquifer, Iran
.
Environ. Earth Sci
.
71
(
9
),
3827
3843
.
doi:10.1007/s12665-013-2770-8
.
Saha
N.
,
Bodrud-doza
M.
,
Islam
A. R. M. T.
&
Begum
B. A.
2020
Hydrogeochemical evolution of shallow and deeper aquifers in central Bangladesh: arsenic mobilisation process and health risk implications from the potable use of groundwater
.
Environ. Earth Sci.
79
(
20
),
477
.
https://doi.org/10.1007/s12665-020-09228-4
.
Sharma
S.
&
Bhattachary
A.
2017
Drinking water contamination and treatment techniques
.
Appl. Water Sci.
7
,
1043
1067
.
https://doi.org/10.1007/s13201-016-0455-7
.
Singh
K. P.
,
Basant
N.
&
Gupta
S.
2011
Support vector machines in water quality management
.
Anal. Chim. Acta
703
,
152
162
.
doi:10.1016/j.aca.2011.07.027
.
Singha
S.
,
Pasupuleti
S.
,
Durbha
K. S.
,
Singha
S. S.
,
Singh
R.
&
Venkatesh
A. S.
2019
An analytical hierarchy process-based geospatial modeling for delineation of potential anthropogenic contamination zones of groundwater from Arang block of Raipur district, Chhattisgarh, Central India
.
Environ. Earth Sci.
78
,
694
.
https://doi.org/10.1007/s12665-019-8724-z
.
Singha
S.
,
Pasupuleti
S.
,
Singha
S. S.
&
Kumar
S.
2020
Effectiveness of groundwater heavy metal pollution indices studies by deep-learning
.
J. Contam. Hydrol.
235
,
103718
.
https://doi.org/10.1016/j.jconhyd.2020.103718
.
Singha
S.
,
Pasupuleti
S.
,
Singha
S. S.
,
Singh
R.
&
Kumar
S.
2021
Prediction of groundwater quality using efficient machine learning technique
.
Chemosphere
276
,
130265
.
Srinivasamoorthy
K.
,
Chidambaram
M.
,
Prasanna
M. V.
,
Vasanthavigar
M.
,
John Peter
A.
&
Anandhan
P.
2008
Identification of major sources controlling groundwater chemistry from a hard rock terrain – a case study from Mettur taluk, Salem district, Tamilnadu, India
.
J. Earth Syst. Sci.
117
(
1
),
49
58
.
Srivastava
G.
&
Kumar
P.
2013
Water quality index with missing parameters
.
Int. J. Res. Eng. Technol.
2013
(
2
),
609
614
.
The Palestinian Central Bureau of Statistics (PCBS) & Palestinian Water Authority (PWA)
2022
Issue a Joint Press Release on the Occasion of World Water Day; on March 22nd, 2022
.
Available from: https://pcbs.gov.ps (accessed 3 July 2022)
.
Tiyasha Tung
T. M.
&
Yaseen
Z. M.
2020
A survey on river water quality modelling using artificial intelligence models: 2000–2020
.
J. Hydrol.
585
,
124670
.
https://doi.org/10.1016/j. jhydrol.2020
.
Trevett
A. F.
,
Carter
R. C.
&
Tyrrel
S. F.
2005
Mechanisms leading to postsupply water quality deterioration in rural Honduran communities
.
Int. J. Hyg. Environ. Health
208
(
3
),
153
161
.
Tumer
A. E.
&
Edebali
S.
2019
Modeling of trivalent chromium sorption onto commercial resins by artificial neural network
.
Appl. Artif. Intell.
33
(
4
),
349
360
.
https://doi.org/10.1080/08839514.2019.1577015
.
Uddin
M. G.
,
Nash
S.
&
Olbert
A. I.
2021
A review of water quality index models and their use for assessing surface water quality
.
Ecol. Indic.
122
,
107218
.
https://doi.org/10.1016/j.ecolind.2020.107218
.
Wang
X.
,
Zhang
F.
&
Ding
J.
2017
Evaluation of water quality based on a machine learning algorithm and water quality index for the Ebinur Lake Watershed China
.
Sci Rep.
12858
.
https://doi.org/10.1038/s41598-017-12853-y
.
World Health Organization (WHO)
2011
Guidelines for Drinking-Water Quality
, 4th edn.
World Health Organization
,
Geneva
,
Switzerland
, p.
542
.
World Health Organization (WHO)
2017
Guidelines for Drinking Water Quality
, 4th edn.
Geneva
.
Zhu
S.
,
Hrnjica
B.
,
Ptak
M.
,
Choinski
A.
&
Sivakumar
B.
2020
Forecasting of water level in multiple temperate lakes using machine learning models
.
J. Hydrol.
585
,
124819
.
https://doi.org/10.1016/j.jhydrol.2020.124819
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).