The main objective of the present study is to evaluate groundwater quality for irrigation purposes in the central-western part of Haryana state (India). For this, 272 groundwater samples were collected during the pre- and post-monsoon periods in 2022. Several indices, including SAR, PI, Na%, KR, magnesium adsorption ratio (MAR), and IWQI were derived. The results of SAR, Na%, and KR values indicate that the groundwater is generally suitable for irrigation. On the other hand, PI and MAR exceeded the established limits, primarily showing issues related to salinity and magnesium content in the groundwater. Furthermore, according to the IWQI classification, 47.06 and 25% of the total collected samples fell under the ‘severe restriction for irrigation’ category during the pre-monsoon and post-monsoon periods, respectively. Spatial variation maps indicate that water quality in the western portion of the study area is unsuitable for irrigation during both periods. Three ML algorithms, namely RF, SVM, and XGBoost were integrated and validated to predict the IWQI. The results revealed that the XGBoost with random search achieves the best prediction performances. The approaches established in this study have been confirmed to be cost-effective and feasible for groundwater quality, using hydrochemical parameters as input variables, and highly beneficial for water resource planning and management.

  • The study focuses on assessing groundwater quality for irrigation using various indices.

  • SAR, Na%, and KR values generally indicate groundwater suitability for irrigation, and PI and MAR values exceeded established limits.

  • RF, SVM, and XGBoost ML algorithms were employed and validated to predict IWQI.

  • XGBoost with random search offers a potentially effective tool for groundwater quality prediction and management.

Groundwater resources are a fundamental component of primary natural sources, playing an essential role in the socio-economic development of countries. Nevertheless, agronomy is the principal global consumer of groundwater resources, as highlighted by Kouadri et al. (2022). These resources face a range of challenges exposing their sustainability, including the influence of climate change, human activities, and natural developments (Makki et al. 2021; Abu El-Magd et al. 2023; Raheja et al. 2023a). Mostly, these issues lead to the degradation of water's chemical composition, making it unfit for drinking or irrigation. The United Nations has recognized admittance to fresh drinking water as a fundamental human right. Their findings show that roughly 10% of urban and rural populations lack access to clean and pure water for drinking (United Nations 2015). Nevertheless, regular monitoring and assessment strategies can help manage water quality, decrease contamination, and mitigate health hazards (Al-Barakah et al. 2017; Adimalla 2019; Tleuova et al. 2023; Raheja et al. 2024a). Several techniques and methodologies have been employed for this objective, with positive outcomes in estimating groundwater quality, delineating pollution risks, and assessing health hazards. These methodologies such as index-based, statistical methods, and GIS techniques, which are commonly used for groundwater quality assessment and mapping (Gebrehiwot et al. 2011; Gao et al. 2020; Khan et al. 2022; Gaagai et al. 2023; Omeka et al. 2023; Raheja et al. 2023b).

To assist in irrigation use, the irrigation water quality (IWQ) frequently relies on the utilization of various indexes and parameters as specified by Food, Agriculture Organization (FAO) guidelines (Ayers & Westcot 1994). Several studies have explored the application of index-based and statistical methods to minimize subjectivity while evaluating water suitability for irrigation. Ewaid et al. (2019) introduced a novel software tool designed to assess the suitability of irrigation water based on the FAO guidelines. While the findings of these studies indicate the suitability of conventional methods as they are both quick and cost-effective methods for water quality estimation and controlling processes, they require a significant volume of data. Hence, using traditional approaches to evaluate IWQ may be challenging regarding labor and costs, particularly for farmers in developing countries. Sustainable groundwater management plans currently address innovative and cost-effective methods to evaluate and predict groundwater quality. To meet this requirement, prediction-based approaches can prove to be invaluable in overcoming the challenges of groundwater planning and management.

In recent years, the GIS has developed as an influential technique for collecting, investigating, and visually representing spatial data. These data are then used to make informed decisions across several water resource fields (Raheja et al. 2022a; Awasthi et al. 2023). Omeka et al. (2023) used GIS tools to evaluate the water quality for irrigation purposes in southeastern parts of Obubra LGA, Nigeria, and the results specified that geographical information system–analytical hierarchical process (GIS-AHP) techniques are very dependable for understanding the complete groundwater quality. Islam et al. (2018) utilized GIS methods to estimate the groundwater quality in Bangladesh. They determined that spatial maps of the studied area offer dependable information for policymakers in a highly sustainable manner. Amrani et al. (2022) evaluated the suitability of groundwater in the Timahdite–Almis Guigou region of Morocco. Their findings suggested fluctuating qualities of groundwater, ranging from good to poor for drinking purposes while signifying that the groundwater was appropriate for irrigation uses.

Over the past few years, several scientists have employed machine learning (ML) modeling to forecast groundwater quality for irrigation purposes, aiming to address these challenges. The results obtained using ML approaches performed better than traditional methods (Abba et al. 2020; Guo et al. 2021; Mosavi et al. 2021; Kouadri et al. 2022; Abu El-Magd et al. 2023; Raheja et al. 2024b). Recently, El Bilali et al. (2021) evaluated groundwater quality for irrigation purposes using ML models in the Berrechid aquifer of Morocco. However, Egbueri & Agbasi (2022) predicted the water quality index (WQI) and overall index of pollution (OIP) using a multiple linear regression (MLR) model and multilayer perceptron neural networks (MLP-NN) in Nigeria, and they found that rephrase ML approaches have proven helpful in the assessment, classification, and forecast of water quality indices. Ewaid et al. (2018) developed a predictive model using MLR to quickly forecast irrigation water, water quality indices, and their applications in agriculture, manufacturing industry, and drinking uses of the Tigris River in Iraq. Kouadri et al. (2022) used three ML models such as long short-term memory (LSTM), MLR, and artificial neural network (ANN) to forecast the groundwater quality for irrigation in the Maharashtra region of India. Another study by Aldrees et al. (2023) used multi-expression programming (MEP) and random forest (RF) regression models to predict water quality in the Indus River Basin of Pakistan. Singha et al. (2021) used XGBoost, RF, and ANN models for groundwater quality evaluation in the Raipur region of India. In addition to established ML tools and techniques, newer methods are developed to enhance the accuracy and reliability of data mining, forecasting, and data extraction in water quality. Consequently, the water quality research area appears to be consistently progressing scientifically in recent times.

In view of assessing groundwater quality, the current study is carried out in the central-western portion of Haryana state (India), where groundwater serves as the predominant water source for drinking and irrigation purposes. So far, no single study estimated IWQ using a combined approach involving several indices and ML processes specifically for this central-western part of Haryana. Hence, there is significant value in broadly characterizing the hydrogeochemistry of groundwater, assessing water quality, and identifying the governing factors within a region to help the sustainable utilization of groundwater resources. With this view, this study was carried out to achieve the following objectives: (1) Evaluation of groundwater quality for irrigation purposes by using several indices such as SAR, PI, Na%, KR, magnesium adsorption ratio (MAR), and IWQI; (2) to determine the controlling factors in groundwater and their spatial distribution by ArcGIS; (3) to predict the IWQI by use of advance ML algorithms, and (4) to compare the performances of three ML algorithms such as RF, SVM, and XGBoost and choosing the best suitable models. This research outcome aims to support farmers and groundwater development in semi-arid countries by allowing rapid prediction of water quality for irrigation at a low cost.

Description of the study area

The research area, comprising Hisar, Rohtak, and Jind, is situated in the central-western region of Haryana, India. Hisar, a district in the west-central part of Haryana State, covers a total geographical area of 3,983 km2. Its coordinates range between 28° 56′ 00″ and 29° 38′ 30″ North latitudes, and 75° 21′ 12″ and 76° 18′ 12″ East longitudes. Rohtak district, also in Haryana, lies between 28° 40′ to 29° 05′ North latitude and 76° 13′ to 76° 51′ East longitude, encompassing an area of 1,745 km2, with elevations ranging from 216 to 275 m (CGWB 2013a). Jind district, another region in Haryana, spans between 29° 03′ to 29° 51′ North latitude and 75° 53′ 00″ to 76° 45′ 30′ East longitude, covering a geographical area of 2702 km2.

Climate and rainfall

The climate of the Hisar area falls within the arid category, including hot summers and cold winters. Temperatures fluctuate between 5.0 °C (minimum in January) and 43°C (maximum in May and June). The average annual rainfall in Hisar is 330 mm, with 75–80% of the precipitation occurring during the southwest monsoon period from June to September. Rohtak area is situated in a subtropical semi-arid region, characterized by hot summers and cold winters. The average temperature ranges from 7°C in January to 40.5°C in May and June. The approximate annual rainfall in Rohtak is 592 mm. The climate of the Jind area is tropical, with extremely hot summers and cold winters. The average annual rainfall in Jind is around 515 mm (CGWB 2013b).

Geology and drainage system

The study area falls under the Indo-Gangetic Plains, shaped by the sedimentary deposition of the river systems traversing the region. The examination of soil characteristics holds an essential role in determining both the quality and quantity of groundwater. Soils are derived from rocks due to weathering and erosion, influenced by geological forces such as rivers, wind, and rainfall, which can vary depending on the prevailing climate conditions. The primary geological formations found in the study area cover Aeolian Deposits, Alluvium, Proterozoic Rocks, and the Aravalli Super Group. Natural drainage is absent in the study area, but it is drained by a canal network and artificial drains.

Digital elevation model

In this study, a 30 × 30 m resolution of the SRTM (Shuttle Radar Topography Mission) dataset was taken from the USGS Earth Explorer site (United States Geological Survey), and an elevation map was prepared using ArcGIS software. Figure 1 displays the digital elevation model (DEM) plot of the study area. The prominently green regions specify high elevation and the orange zones have low-altitude regions (Figure 1).
Figure 1

Elevation map of the study area.

Figure 1

Elevation map of the study area.

Close modal

Data collection

For this research, groundwater samples were obtained from 136 locations across all blocks within three districts (Hisar, Rohtak, and Jind), as shown in Figure 2. The sampling primarily targeted government hand pumps and private tube wells during both pre-monsoon (136 samples) and post-monsoon (136 samples) seasons in 2022, following uniform procedures drawn by the American Public Health Association (APHA 2012). Before sampling, hand pumps and tube wells underwent a 3–5 min flushing to remove any residual water in the pipes. Each sample was meticulously collected in 1,000 mL bottles. The latitude and longitude of all sampling sites were recorded using a portable global positioning system (GPS). Post-collection, the samples were promptly transported to the laboratory to maintain their integrity and prevent degradation. Upon arrival, the samples were stored at 4 °C to ensure preservation. Subsequently, all the samples were filtered using 0.45 μm filter paper in preparation for further analysis.
Figure 2

The study area map with groundwater sample locations.

Figure 2

The study area map with groundwater sample locations.

Close modal

Field and laboratory analysis

The pH, EC, and TDS were evaluated on the site using a portable pH/TDS/electrical conductivity (EC) meter. The calcium (Ca2+), magnesium (Mg2+), and total hardness (TH) were observed using the ethylenediaminetetraacetic acid (EDTA) titration method. The sodium (Na+) and potassium (K+) were calculated using the flame photometer method. Anion analysis involved the chloride (Cl), carbonate (), and bicarbonate () titration methods, while significant anions nitrate (), sulfate (), and fluoride (F) were measured using a spectrophotometer.

Data reliability check

Using Equation (1), the ion balance error (IBE) was calculated to validate the accuracy of the comprehensive hydrochemical investigation conducted on each groundwater sample.
(1)

All the above concentrations of ions are in mg/L. The IBE value falls within the acceptable limit of ±10% for all groundwater samples.

Indexing approach

Numerous indices, such as SAR, PI, Na%, KR, MAR, and IWQI, were computed to classify the groundwater for irrigation uses as shown in Table 1.

Table 1

Groundwater quality for irrigation based on several indices

Sr. No.ParametersReference
1.  Richards (1954)  
2.  Doneen (1964)  
3.  Wilcox (1955)  
4.  Kelly (1956)  
5.   Paliwal KV (1972)  
6.  Meireles et al. (2010)  
Sr. No.ParametersReference
1.  Richards (1954)  
2.  Doneen (1964)  
3.  Wilcox (1955)  
4.  Kelly (1956)  
5.   Paliwal KV (1972)  
6.  Meireles et al. (2010)  

Irrigating water quality index (IWQI)

The IWQI is a dimensionless index and is used to evaluate the water quality for irrigation purposes based on its chemical composition. It provides a comprehensive evaluation by considering multiple water quality parameters and assigning a single index value to indicate the overall quality of the water for irrigation purposes (Meireles et al. 2010; Masoud et al. 2022). It was calculated by using five parameters: EC, SAR, Na+, Cl, and . The calculation of IWQI involves assigning weights to each water quality parameter based on its importance for irrigation and then aggregating these weighted parameters to obtain an overall index value. This index value typically varies from 0 to 100, where higher values indicate good water quality for irrigation. The detailed description to calculate IWQI is as follows:

Step 1. Relative weight (wi) for the IWQI can be computed using Equation (2) and shown in Table 2:
(2)
where F is the constant, i is the number of used parameters, Aij is the extent parameter i with factor j, j is the number of factors in IWQI, which varies from 1 to k.
Table 2

Relative weights of IWQI parameters as proposed by (Meireles et al. 2010)

ValueName of parameter
ECSARNa+Cl
 wi 0.211 0.189 0.202 0.194 0.204 
ValueName of parameter
ECSARNa+Cl
 wi 0.211 0.189 0.202 0.194 0.204 
Step 2. Water quality measurements (qi) can be calculated by the following equation
(3)
where qmax is the extreme value of qi for each class, xij is the actual value, xinf is the minimum limit, qimp is the class amplitude, xamp is the class amplitude to which the parameter belongs. Table 3 represents the estimated value of (qi) and IWQ parameters.
Table 3

Limiting values of parameters for calculation of quality measurement (qi)

qiECSARNa+Cl
85–100 200–750 <3 2–3 <4 1–1.5 
60–85 750–1,500 3–6 3–6 4–7 1.5–4.5 
35–60 1,500–3,000 6–12 6–9 7–10 4.5–8.5 
0–35 <200 or >3,000 >12 <2 or >9 >10 <1 or >8.5 
qiECSARNa+Cl
85–100 200–750 <3 2–3 <4 1–1.5 
60–85 750–1,500 3–6 3–6 4–7 1.5–4.5 
35–60 1,500–3,000 6–12 6–9 7–10 4.5–8.5 
0–35 <200 or >3,000 >12 <2 or >9 >10 <1 or >8.5 
Step 3. Finally, IWQI can be computed using the following equation:
(4)
where n is the number of parameters used for calculating IWQI.

ML algorithms

In this study, three ML algorithms, such as SVM, RF, and XGBoost, are used to predict the IWQI. However, the extensive literature on ML algorithms used can be found in Sánchez A (2003), Goel & Pal (2009), Singh et al. (2011), Chen & Guestrin (2016), Chen et al. (2020), El Bilali et al. (2021), Raheja et al. (2022b) and Aldrees et al. (2023).

Random forest

RF is a helpful method for classification and regression tasks. It combines various tree predictors, with each tree being generated independently using a bagged sample (67% of training samples). In regression, the tree predictor produces numerical values, whereas the RF classifier naturally allows class labels (Breiman 2001). The foremost aim is to perform regression predictions. Therefore, this section will focus exclusively on introducing the regression tree (RT). At every branching point of the RT, we calculate the average of the samples with leaf nodes and determine the mean square error (MSE) formed between each sample. The lowest the MSE value among leaf nodes as the branching condition, the RT will continue to grow until no more features are available or until the overall MSE reaches its optimal point (El Bilali & Taleb 2020; Zhang et al. 2021).

Support vector machine

Support vector machine (SVM) is a discriminative approach employed for both classification and regression methods. It operates by constructing a hyperplane to minimize errors, as initially proposed by Cortes & Vapnik (1995). Its objective is to maximize the margin, which is the distance between the hyperplane and the nearest data points of each class, making it robust against overfitting and generalizing well to unseen datasets. For modeling system (S) with observation dataset (DS), , xi and yi represent the input and outputs with a linear function as shown in the following equation.
(5)
The optimal function is the minimization of the function, as shown in Equations (6) and (7). Hence, the loss functions such as ɛ-insensitive, quadratic, and Hubber methods can be used as given in the following equations.
(6)
(7)

With as the kernel function (k), i.e., linear, radial basis, and polynomial, ω and b, are the weight and basis vectors, and C is referred to as a pre-specified value to penalize the training error. whereas ξi and ξi+ the lower and higher limitations on the output. Similar to other ML algorithms, methods like Grid Search, Random Search, and Bayesian Optimization are commonly used to fine-tune SVM's hyperparameters. These hyperparameters can significantly influence the performance of the SVM algorithm. Further details about the SVM model readers can be referred to (Cortes & Vapnik 1995; Brereton & Lloyd 2010).

Extreme gradient boosting (XGBoost)

XGBoost is a parallel tree-boosting algorithm that resolves large-scale problems with higher accuracy and has recently been used for classification and regression (T. Chen & Guestrin 2016). The procedure of XGBoost is as follows: Consider a dataset DS, which includes m features, and n is a number of examples, for instance, . The objective function of XGBoost that needs to be minimized is defined in the following equations.
(8)
(9)
where is the predicted value, yi is the actual value at the tth round, L is the loss function, T is the number of leaves in the tree, ωj is a score of jth leaf, and lambda (λ) is the regularization parameter (Chen & Guestrin 2016). Ω is a measure of how complex the model, this contributes to avoiding overfitting of the model. In the context of hyperparameter optimization, XGBoost offers several parameters that can significantly impact its performance and generalization ability. Methods like grid search, random search, and Bayesian optimization are commonly employed to fine-tune these hyperparameters and achieve the best model performance. For more exhaustive information on the XGBoost algorithm, readers can refer to Chen & Guestrin (2016).

The study's limitations include its reliance on data from specific periods, potentially missing long-term trends or extreme weather effects. While used ML algorithm performed well in predicting IWQI, exploring other advanced models and expanding the study's temporal and spatial scope would enhance accuracy and account for complex variable interactions.

Metric evaluation algorithms

The performance of different ML algorithms FOR IWQI was evaluated by four statistical criteria, such as Pearson's correlation coefficient (r), root mean square error (RMSE), mean absolute error (MAE), and relative bias (RBIAS). The values can be calculated as given in Table 4.

Table 4

Descriptive summary of three statistical indexes used for the algorithm's performances

Statistical indicatorsFormulationDescription
Pearson's correlation coefficient (r When r = 1 signifies the strongest correlation between the actual and forecast values, it does not necessarily imply the best-fitting model. If the value of r < 1 suggests a model that fits the data to a lesser extent. 
Root mean square error (RMSE)  A lower RMSE value than the results suggests a more accurate fit for the model. 
Mean absolute error (MAE)  A lower MAE value indicates a higher accuracy of the model, as it signifies that the predicted values are closer to the observed values on average. 
Relative bias (RBIAS)  When RBIAS > 0, it implies that the model tends to underestimate the target magnitude. When RBIAS < 0, it suggests that the model tends to overestimate the target magnitude. If RBIAS = 0, it indicates a perfect model. A higher value of RBIAS specifies a more significant bias in the model. 
Statistical indicatorsFormulationDescription
Pearson's correlation coefficient (r When r = 1 signifies the strongest correlation between the actual and forecast values, it does not necessarily imply the best-fitting model. If the value of r < 1 suggests a model that fits the data to a lesser extent. 
Root mean square error (RMSE)  A lower RMSE value than the results suggests a more accurate fit for the model. 
Mean absolute error (MAE)  A lower MAE value indicates a higher accuracy of the model, as it signifies that the predicted values are closer to the observed values on average. 
Relative bias (RBIAS)  When RBIAS > 0, it implies that the model tends to underestimate the target magnitude. When RBIAS < 0, it suggests that the model tends to overestimate the target magnitude. If RBIAS = 0, it indicates a perfect model. A higher value of RBIAS specifies a more significant bias in the model. 

Sodium adsorption ratio (SAR)

SAR is an important parameter for evaluating the suitability of water for irrigation and managing the potential risks associated with high sodium concentrations in soil and water. SAR is a significant indicator because Na+ can have harmful effects on soil structure and plant growth when present in high concentrations. The SAR is classified into four classes such as SAR < 10: water is considered excellent for irrigation, when 10 ≤ SAR < 18: water may be suitable for irrigation, when 18 ≤ SAR < 26: water is normally not suitable, and when SAR >26 water is unsuitable for irrigation due to the high sodium content, The results of the SAR classification indicate that out of total pre-monsoon samples, 30.88% have low sodium, 22.06% medium sodium, 9.56% are in the doubtful range (high sodium content), and 37.5% have very high sodium, which is unsuitable for irrigation. For the post-monsoon study, 34.56% of samples fall under the low sodium category, 16.92% of samples in the medium sodium range, 13.23% of samples in the high sodium category, and 35.29% of samples lie in the not suitable for irrigation category (Table 5).

Table 5

Classification of irrigation water based on SAR and PI

Name of parametersWater typeRangePercentage of samples in pre-monsoonPercentage of samples in post-monsoon
SAR Low sodium <10 30.88 34.56 
Medium sodium 10–18 22.06 16.92 
High sodium 18–26 9.56 13.23 
Very high sodium >26 37.5 35.29 
PI Excellent >75 2.94 2.94 
Good 25–75 46.32 45.58 
Unsuitable <25 50.74 51.48 
Name of parametersWater typeRangePercentage of samples in pre-monsoonPercentage of samples in post-monsoon
SAR Low sodium <10 30.88 34.56 
Medium sodium 10–18 22.06 16.92 
High sodium 18–26 9.56 13.23 
Very high sodium >26 37.5 35.29 
PI Excellent >75 2.94 2.94 
Good 25–75 46.32 45.58 
Unsuitable <25 50.74 51.48 

A spatial variation map of SAR using the inverse weighted difference (IDW) interpolation method was also obtained (Figure 3). A difference between pre- and post-monsoon maps reveals a larger SAR distribution in before-monsoon samples compared to those from the after-monsoon period. Furthermore, Figure 3 illustrates that the lower SAR values during both periods are predominantly located in the northeastern part of the study region.
Figure 3

Spatial variations of SAR, (a) pre-monsoon; (b) post-monsoon.

Figure 3

Spatial variations of SAR, (a) pre-monsoon; (b) post-monsoon.

Close modal

Permeability index

The PI serves as an additional parameter to assess the groundwater quality for irrigation (Doneen 1964). Its measure is used in soil science and hydrogeology to assess the hydraulic conductivity or permeability of soil layers. The PI classification is categorized into three classes: excellent if PI > 75, good within the range of 25–75, and unsuitable if the PI value is below 25. In pre- and post-monsoon season, only 2.94% of the sample is categorized into excellent category (Table 5). During the pre-monsoon period, 46.32% of samples were into suitable water type, and 50.74% were unsuitable for irrigation purposes. Post-monsoon samples indicate that 45.58 and 51.48% of groundwater samples fall into the categories of good and unfit for irrigation, respectively. The spatial maps highlight higher concentrations primarily during the before-monsoon season in contrast to the after-monsoon period (Figure 4). Overall, the PI results suggest that most of the study areas exhibit acceptable quality.
Figure 4

Spatial variations of PI, (a) pre-monsoon; (b) post-monsoon.

Figure 4

Spatial variations of PI, (a) pre-monsoon; (b) post-monsoon.

Close modal

Sodium percentage (Na %)

The sodium percentage provides information about the relative dominance of Na+ ions compared to other cations (such as Ca2+ and Mg2+) in the water. According to Todd (1995), higher sodium levels in groundwater can diminish soil permeability, delay plant growth, and reduce overall production. Na % values below 60% are generally considered safe for most crops and soils, indicating that the water or soil extract has a balanced cation composition. The Na % values between 60 and 80% may indicate a moderate risk of soil degradation due to sodium, particularly in soils with poor drainage or high clay content, and the Na% values above 80% show a high risk of sodium-related issues, such as soil dispersion, reduced permeability, and decreased crop yields. Table 6 presents the classification of Na % for both pre- and post-monsoon groundwater samples. These findings indicate that during the pre-monsoon period, 34.55 and 30.14% of samples fell in the excellent and good ranges, respectively, while 8.82% were classified as doubtful. In the post-monsoon period, 14.71, 36.76, 28.68, and 17.64% of groundwater samples fell into the excellent, good, permissible, and doubtful ranges, respectively. Spatial distribution maps of Na % during the pre-monsoon period exhibit lower concentrations in the northern portion of the study area (Figure 5).
Table 6

The suitability of groundwater for irrigation is based on multiple parameters

Name of parametersWater typeRangePercentage of samples in pre-monsoonPercentage of samples in post-monsoon
Na % Excellent <20 34.55 14.71 
Good 20–40 30.14 36.76 
Permissible 40–60 25 28.68 
Doubtful 60–80 8.82 17.64 
Unsuitable >80 1.47 2.21 
KR Suitable <1 84.56 84.56 
Unsuitable >1 15.44 15.44 
MAR Suitable <50 30.89 27.95 
Unsuitable >50 69.11 72.05 
Name of parametersWater typeRangePercentage of samples in pre-monsoonPercentage of samples in post-monsoon
Na % Excellent <20 34.55 14.71 
Good 20–40 30.14 36.76 
Permissible 40–60 25 28.68 
Doubtful 60–80 8.82 17.64 
Unsuitable >80 1.47 2.21 
KR Suitable <1 84.56 84.56 
Unsuitable >1 15.44 15.44 
MAR Suitable <50 30.89 27.95 
Unsuitable >50 69.11 72.05 
Figure 5

Spatial variations of Na %, (a) pre-monsoon; (b) post-monsoon.

Figure 5

Spatial variations of Na %, (a) pre-monsoon; (b) post-monsoon.

Close modal

Kelly's ratio

The Kelly's ratio (KR) is another important index to use in estimating the groundwater quality for irrigation. It takes into account both the Na+ content and the content of irrigation water or soil, as ions can influence the soil's ability to exchange Na+ ions. This makes KR a more comprehensive indicator of sodium hazard compared to SAR, especially in areas where v concentrations are significant and it divides water quality into two categories. When the KR value is <1, water is estimated suitable for irrigation, while >1 indicates it's unsuitable. KR values ranged from 0.14 to 12.06 during the before-monsoon period. KR values ranged from 0.009 to 6.24 during the post-monsoon season. Across both seasons, 84.56% of groundwater samples were suitable for irrigation, remaining 15.44% were unsuitable (Table 6). A spatial distribution map of KR during both seasons highlights higher values in the eastern, southern, and western parts of the study region (Figure 6).
Figure 6

Spatial variations of KR, (a) pre-monsoon; (b) post-monsoon.

Figure 6

Spatial variations of KR, (a) pre-monsoon; (b) post-monsoon.

Close modal

Magnesium adsorption ratio

The Mg2+ ion is another critical factor inducing groundwater suitability for irrigation. Higher levels of Mg2+ ions in groundwater can reduce crop productivity and lead to increased soil alkalinity (Chaudhary & SatheeshKumar 2018). MAR provides valuable information for evaluating the suitability of irrigation water and managing soil quality to ensure optimal conditions for crop growth and productivity. It helps in making informed decisions regarding irrigation practices and soil amendments to prevent soil degradation and maintain irrigation sustainability (Gupta & Gupta 1987). The highest MAR values of 92.48 and 88.33 during both periods, respectively. As per Table 6, 30.89% of samples were suitable, while 69.11% were unsuitable due to magnesium ratios exceeding 50 in the pre-monsoon season. Similarly, during the post-monsoon period, 27.95% of groundwater samples were suitable, while 72.05% were unsuitable for irrigation. The spatial map illustrates lower concentrations of MAR during both seasons, primarily in small patches within the western and southern parts of the study region (Figure 7).
Figure 7

Spatial variations of MAR, (a) pre-monsoon; (b) post-monsoon.

Figure 7

Spatial variations of MAR, (a) pre-monsoon; (b) post-monsoon.

Close modal

Groundwater quality assessment using IWQI

Table 7 provides the IWQI classification along with recommendations. As per IWQI classification, 25% of groundwater samples were classified as ‘ Severe Restriction,’ 47.06% of samples as ‘High Restriction,’ and 3.68% of samples fell into the ‘Moderate Restriction’ class during the pre-monsoon period. Whereas, in the post-monsoon period, 49.26% of groundwater samples fall in ‘Severe Restriction,’ 64.71% in ‘High Restriction,’ and 10.29% of samples fall in the ‘Moderate Restriction’ class. No groundwater sample lies in the low and no restriction class of IWQI during both periods.

Table 7

Classification of IWQI, percentage of groundwater samples, and recommendations in the study area

IWQI rangeWater use restrictionsPercentage of samples in pre-monsoonPercentage of samples in post-monsoonRecommendations
85 < IWQI ≤ 100 No restriction (NR) It is suggested for most soil types without a significant risk of salinity and alkalinity issues. Including leaching into irrigation practices is advisable, and it may not be suitable for soils with exceptionally poor permeability. 
70 < IWQI ≤ 85 Low restriction (LR) It is suggested for application in irrigated soils with a low texture or medium permeability rate, where salt leaching is recommended. In heavy-textured soils, soil alkalinity issues may arise, and it is recommended to avoid using it in soils with high clay content, precisely with a 2:1 clay ratio. 
55 < IWQI ≤ 70 Moderate restriction (MR) 3.68 10.29 Appropriate for application in soils with medium to high permeability values, with a recommendation for moderate salt leaching. 
40 < IWQI ≤ 55 High restriction (HR) 49.26 64.71 Appropriate for utilization in soils with excellent permeability and the absence of compacted layers. 
0 < IWQI ≤ 40 Severe restriction (SR) 47.06 25 Under typical situations, it is advisable to avoid using it for irrigation. However, in exceptional situations, irregular usage may be considered. When dealing with water containing low salt concentrations and a high SAR, the application of gypsum is required. 
IWQI rangeWater use restrictionsPercentage of samples in pre-monsoonPercentage of samples in post-monsoonRecommendations
85 < IWQI ≤ 100 No restriction (NR) It is suggested for most soil types without a significant risk of salinity and alkalinity issues. Including leaching into irrigation practices is advisable, and it may not be suitable for soils with exceptionally poor permeability. 
70 < IWQI ≤ 85 Low restriction (LR) It is suggested for application in irrigated soils with a low texture or medium permeability rate, where salt leaching is recommended. In heavy-textured soils, soil alkalinity issues may arise, and it is recommended to avoid using it in soils with high clay content, precisely with a 2:1 clay ratio. 
55 < IWQI ≤ 70 Moderate restriction (MR) 3.68 10.29 Appropriate for application in soils with medium to high permeability values, with a recommendation for moderate salt leaching. 
40 < IWQI ≤ 55 High restriction (HR) 49.26 64.71 Appropriate for utilization in soils with excellent permeability and the absence of compacted layers. 
0 < IWQI ≤ 40 Severe restriction (SR) 47.06 25 Under typical situations, it is advisable to avoid using it for irrigation. However, in exceptional situations, irregular usage may be considered. When dealing with water containing low salt concentrations and a high SAR, the application of gypsum is required. 

The spatial variations map of IWQI covering both periods is provided in Figure 8. Results from pre-monsoon periods specify that mostly the western part falls under the Severe Restriction zone (Figure 8(a)). In post-monsoon periods, some patches of the western portion of the study area come under the Severe Restriction zone (Figure 8(b)). Groundwater quality exhibits improvement during the post-monsoon period compared to the pre-monsoon period. This improvement is verified by spatial variations map for both seasons. This result can be attributed to the high rainfall during the monsoon season, which allows for more extensive water infiltration into the soil, thereby diluting the groundwater system.
Figure 8

Spatial variations map of IWQI, (a) pre-monsoon; (b) post-monsoon.

Figure 8

Spatial variations map of IWQI, (a) pre-monsoon; (b) post-monsoon.

Close modal

Hydrogeochemistry characteristics

Gibbs diagrams, established by Gibbs in 1970, serve as valuable tools to analyze the sources of dissolved chemical constituents in groundwater and help interpret the hydrochemical processes monitoring water chemistry. They consist of two key diagrams: one depicting the relative concentrations of Na+/(Na+ + Ca2+) on the horizontal axis against TDS on the vertical axis, while the other illustrates the ratio of TDS to Cl/(Cl + ) ions. These diagrams explain three zones representing dominant aquifer chemistry mechanisms: evaporation dominance, rock dominance, and precipitation dominance (Gibbs 1970).

In the context of this study, the majority of groundwater samples fall in evaporation dominance and rock dominance (Figure 9(a) and 9(b)). This suggests that these natural processes significantly influence the groundwater chemistry during both periods. However, few samples deviate from these zones, possibly due to anthropogenetic or human causes. The application of Gibbs's diagrams across the study area indicates that chemical weathering of rock-forming minerals and evaporation play substantial roles in determining groundwater quality.
Figure 9

Gibbs diagrams showing groundwater chemistry mechanisms (a) cations: TDS concerning the ratio of Na+ to (Na+ + Ca2+), (b) anions: TDS concerning the ratio of Cl to (Cl + ).

Figure 9

Gibbs diagrams showing groundwater chemistry mechanisms (a) cations: TDS concerning the ratio of Na+ to (Na+ + Ca2+), (b) anions: TDS concerning the ratio of Cl to (Cl + ).

Close modal

Chadha diagram

The Chadha diagram was developed by Chadha (1999) and used for the geochemical classification of groundwater based on its chemical composition. It is a graphical tool that helps in determining the suitability of irrigation water, primarily focusing on the concentrations of Na⁺, Ca2⁺, and Mg2⁺ ions. It provides a quick visual assessment of the water quality and its potential impact on soil structure and crop growth, and it is divided into eight different sub-fields. Each sub-field signifies the water types, as revealed in Figure 10. The hydrochemical facies were assessed for all the samples collected from the study area, which have been represented in the Chadha diagram (Figure 10). The most of groundwater samples fall in the (Ca2+–Mg2+–Cl) quadrant, suggesting that the water shows typical reverse ion exchange activities.
Figure 10

Chadha diagram classification of the studied water samples.

Figure 10

Chadha diagram classification of the studied water samples.

Close modal

Performance of ML algorithms

For this part of the research, the dataset is divided into partitions based on the maximum r-value and minimum RMSE value, which is determined through a trial-and-error approach. Based on the results, 70% (190 samples) and 30% (82 samples) of the entire dataset are used for training (calibration) and testing (validation) phases, respectively. All three ML methods require using several hyperparameters. Different algorithms have various parameters that significantly impact their performance. In this study, the RF algorithm utilizes parameters such as Max Depth, Min Samples Leaf, Min Samples Split, and N Estimators. For SVM, the parameters include C, Degree, Epsilon, and Gamma. In XGBoost, hyperparameters like Learning Rate, Iterations, Depth, and Subsample are employed. In this study, we employed three search methods: Grid Optimization, Random Optimization, and Bayesian Optimization to determine the optimal hyperparameters, thereby improving the performance of the ML algorithm. This comprehensive approach proved to be faster and more efficient compared to conventional optimization methods. The model yielding the best predictions, with its optimal hyperparameters, was chosen to estimate the IWQI within the study area. Table 8 provides hyperparameters and functions selected for IWQI prediction.

Table 8

Hyperparameter used with grid, random, and Bayes optimization approach for IWQI prediction

AlgorithmApproachDescription of parameters and functions
RF Grid Minimum samples split = 2, Maximum depth = 15, Minimum samples leaf = 2, n estimators = 100 
Random Minimum samples split = 7, Maximum depth = 7, Minimum samples leaf = 1, n estimators = 200 
Bayes Minimum samples split = 2.32, Maximum depth = 10.0, Minimum samples leaf = 1.0, n estimators = 69.10 
SVM Grid C = 10, Degree = 2, Epsilon = 1, Gamma = 1 
Random C = 6.95, Degree = 4, Epsilon = 0.3197, Gamma = scale, 
Bayes C = 9.47, Degree = 3.15, Epsilon = 0.915, Gamma = 1.0 
XGBoost Grid Learning rate = 0.1, Iterations = 100, Depth = 3, Subsample = 0.5 
Random Learning rate = 0.1, Iterations = 410, Depth = 6, Subsample = 0.5 
Bayes Learning rate = 0.021, Iterations = 196.6, Depth = 3.59, Subsample = 0.51, Gamma = 0.128 
AlgorithmApproachDescription of parameters and functions
RF Grid Minimum samples split = 2, Maximum depth = 15, Minimum samples leaf = 2, n estimators = 100 
Random Minimum samples split = 7, Maximum depth = 7, Minimum samples leaf = 1, n estimators = 200 
Bayes Minimum samples split = 2.32, Maximum depth = 10.0, Minimum samples leaf = 1.0, n estimators = 69.10 
SVM Grid C = 10, Degree = 2, Epsilon = 1, Gamma = 1 
Random C = 6.95, Degree = 4, Epsilon = 0.3197, Gamma = scale, 
Bayes C = 9.47, Degree = 3.15, Epsilon = 0.915, Gamma = 1.0 
XGBoost Grid Learning rate = 0.1, Iterations = 100, Depth = 3, Subsample = 0.5 
Random Learning rate = 0.1, Iterations = 410, Depth = 6, Subsample = 0.5 
Bayes Learning rate = 0.021, Iterations = 196.6, Depth = 3.59, Subsample = 0.51, Gamma = 0.128 

Results from Table 9 indicate that the XGBoost model with random search method performs well in terms of r-value and RMSE value compared to other models in forecasting the IWQI (r = 0.8829, RMSE = 60.5369, MAE = 2.754, and RBIAS = 0.203%). Figure 11(a)–11(c) provides scatter plots of observed and predicted IWQI values of different models during the testing phases. The model's accuracy is considered satisfactory when the values are evenly distributed on equal sides of the line of perfect agreement, representing that errors follow a Gaussian distribution. Figure 11(a)–11(c) illustrates that the predicted values generated by the XGBoost model with all three optimizations (grid, random, and Bayesian or Bayes search) exhibit a distribution closer to the line of perfect agreement as compared to the RF and SVM models. Additionally, XGBoost with random search demonstrated better performance, with predicted values closely aligning with the actual values of IWQI.
Table 9

Model performances for prediction of IWQI during the testing process

AlgorithmApproachRRMSEMAERBIAS
RF Grid 0.8416 60.6857 3.065 −0.634% 
Random 0.8511 60.6761 2.993 −0.564% 
Bayesian 0.7074 61.1891 4.084 −1.747% 
SVM Grid 0.7907 61.3005 8.393 −2.393% 
Random 0.7927 61.3234 8.340 −2.490% 
Bayesian 0.7932 61.3127 8.365 −2.445% 
XGBoost Grid 0.8776 60.6139 2.757 −0.212% 
Random 0.8829 60.5369 2.754 0.203% 
Bayesian 0.8569 60.6958 2.976 0.606% 
AlgorithmApproachRRMSEMAERBIAS
RF Grid 0.8416 60.6857 3.065 −0.634% 
Random 0.8511 60.6761 2.993 −0.564% 
Bayesian 0.7074 61.1891 4.084 −1.747% 
SVM Grid 0.7907 61.3005 8.393 −2.393% 
Random 0.7927 61.3234 8.340 −2.490% 
Bayesian 0.7932 61.3127 8.365 −2.445% 
XGBoost Grid 0.8776 60.6139 2.757 −0.212% 
Random 0.8829 60.5369 2.754 0.203% 
Bayesian 0.8569 60.6958 2.976 0.606% 

Bold values mean that the XGBoost model with random search method performs well in terms of R-value, RMSE, MAE, and RBIAS value.

Figure 11

(a) Scatterplots of actual and predicted values of IWQI in the RF algorithm during the testing process. (b) Scatterplots of actual and predicted values of IWQI in the SVM algorithm during the testing process. (c) Scatterplots of actual and predicted values of IWQI in the XGBoost algorithm during the testing process.

Figure 11

(a) Scatterplots of actual and predicted values of IWQI in the RF algorithm during the testing process. (b) Scatterplots of actual and predicted values of IWQI in the SVM algorithm during the testing process. (c) Scatterplots of actual and predicted values of IWQI in the XGBoost algorithm during the testing process.

Close modal

The present study investigates hydrogeochemistry and groundwater suitability for irrigation uses. The study has the following conclusions are drawn below:

  • Various indices were analyzed for IWQ assessment, and it found that PI and MAR exceed the limit, mainly describing the salinity and magnesium hazards in groundwater. The SAR, Na % and KR values suggest that the groundwater is suitable for irrigation.

  • The IWQI classification suggests that 47.06% of samples in Pre-monsoon and 25% of groundwater samples in Post-monsoon fall in Severe Restriction and recommends avoiding use for irrigation under normal conditions.

  • GIS-based IWQI identifies that the groundwater quality was improved in the post-monsoon period compared to the pre-monsoon period, and the western part is unsuitable for irrigation.

  • The Gibbs and Chadha plots illustrate that many ions present in groundwater are initiated from interactions dominated by evaporation, water–rock interactions, and reversed ion exchange phenomena.

  • The XGBoost model, particularly when optimized using the random search method, has shown superior performance in forecasting the IWQI by its high correlation coefficient (r = 0.8829) and low root mean squared error (RMSE = 60.5369), mean absolute error (MAE = 2.754) indicating strong predictive accuracy and minimal deviation from actual values. Furthermore, the relative bias (RBIAS) of 0.203% underscores the model's capability to maintain consistency in its predictions.

The findings of this study offer valuable insights into the planning and management of existing water resources within this region. However, the study also has certain limitations. The data used for the analysis were confined to specific periods, which may not capture long-term trends or the impact of extreme weather events. Additionally, while the XGBoost model performed well in predicting IWQI, it is essential to explore other advanced ML models that could further enhance predictive accuracy and consider more complex interactions among variables. Future research in this region should aim to expand the temporal and spatial scale of the study to include more extensive data over multiple years and different seasons.

The authors express appreciation to the National Institute of Technology, Kurukshetra-136119, Haryana, India for providing essential research facilities.

We extend our gratitude to the Ministry of Education (MOE), Government of India (GOI), for their support in funding this study through a scholarship awarded to the first author, Hemant Raheja (Grant No. 2K19/NITK/PHD/61900011).

All relevant data are included in the paper or its Supplementary Information.

The authors have no conflict of interest.

Abba
S. I.
,
Pham
Q. B.
,
Saini
G.
,
Linh
N. T. T.
,
Ahmed
A. N.
,
Mohajane
M.
,
Khaledian
M.
,
Abdulkadir
R. A.
&
Bach
Q. V.
(
2020
)
Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index
,
Environmental Science and Pollution Research
,
27
(
33
),
41524
41539
.
https://doi.org/10.1007/s11356-020-09689-x
.
Abu El-Magd
S. A.
,
Ismael
I. S.
,
El-Sabri
M. A. S.
,
Abdo
M. S.
&
& Farhat
H. I.
(
2023
)
Integrated machine learning–based model and WQI for groundwater quality assessment: ML, geospatial, and hydro-index approaches
,
Environmental Science and Pollution Research
,
30
(
18
),
53862
53875
.
https://doi.org/10.1007/s11356-023-25938-1
.
Al-Barakah
F. N.
,
Al-jassas
A. M.
&
Aly
A. A.
(
2017
)
Water quality assessment and hydrochemical characterization of Zamzam groundwater, Saudi Arabia
,
Applied Water Science
,
7
(
7
),
3985
3996
.
https://doi.org/10.1007/s13201-017-0549-x
.
Aldrees
A.
,
Javed
M. F.
,
Bakheit Taha
A. T.
,
Mustafa Mohamed
A.
,
Jasiński
M.
&
Gono
M.
(
2023
)
Evolutionary and ensemble machine learning predictive models for evaluation of water quality
,
Journal of Hydrology: Regional Studies
,
46
(
February
),
101331
.
https://doi.org/10.1016/j.ejrh.2023.101331
.
Amrani
S.
,
Hinaje
S.
,
El Fartati
M.
,
Gharmane
Y.
&
Yaagoub
D.
(
2022
)
Assessment of groundwater quality for drinking and irrigation in the Timahdite–Almis Guigou area (Middle Atlas, Morocco)
.
Applied Water Science
,
12
(
4
),
1
12
.
https://doi.org/10.1007/s13201-022-01609-3
.
APHA
(
2012
)
Standard methods for the examination of water and waste water. 22nd edn. 14 American Water Works Association (AWWA) and Water Environment Federation 15 (WEF), New York, NY: American Public Health Association (APHA), p. 1360
.
Awasthi
A.
,
Rishi
M. S.
,
Khosla
A.
&
Panjgotra
S.
(
2023
)
Geographic information system-based groundwater quality assessment for drinking and irrigation purposes in transboundary aquifers of river Ravi, India
.
Environmental Science and Pollution Research
,
30
(
12
),
34536
34552
.
https://doi.org/10.1007/s11356-022-24642-w
.
Ayers
R. S.
&
Westcot
D. W.
(
1994
) ‘
Water Quality for Agriculture
’,
Food, Agriculture Organization of the United Nations (FAO), Irrigation and Drainage, Rome, Paper No. 29. Rev1, M-56
.
https://lymphosign.com/doi/10.14785/lymphosign-2022-0012
Breiman
L. E. O.
(
2001
)
Random forests
,
Machine Learning
,
45
,
5
32
.
https://doi.org/10.1023/A:1010933404324
.
Brereton
R. G.
&
Lloyd
G. R.
(
2010
)
Support vector machines for classification and regression
,
The Analyst
,
135
(
2
),
230
267
.
https://doi.org/10.1039/B918972F
.
CGWB
. (
2013a
)
Ground Water Information Booklet Hisar District, Haryana
. In:
Central Ground Water Board, North Western Region
.
Chandigarh
:
Ministry of Water Resources Government of India North Western Region
. http://cgwb.gov.in/District_Profile/Haryana/Hissar.pdf.
CGWB.
(
2013b
)
Ground Water Information Booklet Jind district Haryana
. In:
Central Ground Water Board, North Western Region
.
Chandigarh
:
Ministry of Water Resources Government of India North Western Region.
http://cgwb.gov.in/District_Profile/Haryana/Jind.pdf.
Chadha
D. K.
(
1999
)
A proposed new diagram for geochemical classification of natural waters and interpretation of chemical data
,
Hydrogeology Journal
,
7
(
5
),
431
439
.
https://doi.org/10.1007/s100400050216
.
Chaudhary
V.
&
Satheeshkumar
S.
(
2018
)
Assessment of groundwater quality for drinking and irrigation purposes in arid areas of Rajasthan, India
,
Applied Water Science
,
8
(
8
),
1
17
.
https://doi.org/10.1007/s13201-018-0865-9
.
Chen
T.
&
Guestrin
C.
(
2016
) ‘
XGBoost A Scalable Tree Boosting System
’,
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
,
42
,
785
794
.
https://doi.org/10.1145/2939672.2939785
.
Chen
K.
,
Chen
H.
,
Zhou
C.
,
Huang
Y.
,
Qi
X.
,
Shen
R.
,
Liu
F.
,
Zuo
M.
,
Zou
X.
,
Wang
J.
,
Zhang
Y.
,
Chen
D.
,
Chen
X.
,
Deng
Y.
&
Ren
H.
(
2020
)
Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data
,
Water Research
,
171
,
115454
.
https://doi.org/10.1016/j.watres.2019.115454
.
Cortes
C.
&
Vapnik
V.
(
1995
)
Support-Vector networks
,
Machine Learning
,
20
,
273
297
.
https://doi.org/10.1023/A:1022627411411
.
Doneen
L. D.
(
1964
). '
Notes on water quality in agriculture
',
Water Science and Engineering Paper 4001
.
Department of Water Science and Engineering, University of California, Davis
.
Egbueri
J. C.
&
Agbasi
J. C.
(
2022
)
Combining data-intelligent algorithms for the assessment and predictive modeling of groundwater resources quality in parts of southeastern Nigeria
,
Environmental Science and Pollution Research
,
29
(
38
),
57147
57171
.
https://doi.org/10.1007/s11356-022-19818-3
.
El Bilali
A.
&
Taleb
A.
(
2020
)
Prediction of irrigation water quality parameters using machine learning models in a semi-arid environment
,
Journal of the Saudi Society of Agricultural Sciences
,
19
(
7
),
439
451
.
https://doi.org/10.1016/j.jssas.2020.08.001
.
El Bilali
A.
,
Taleb
A.
&
Brouziyne
Y.
(
2021
)
Groundwater quality forecasting using machine learning algorithms for irrigation purposes
,
Agricultural Water Management
,
245
,
106625
.
https://doi.org/10.1016/j.agwat.2020.106625
.
Ewaid
S. H.
,
Abed
S. A.
&
Kadhum
S. A.
(
2018
)
Predicting the Tigris river water quality within Baghdad, Iraq by using water quality index and regression analysis
,
Environmental Technology & Innovation
,
11
,
390
398
.
https://doi.org/10.1016/j.eti.2018.06.013
.
Ewaid
S. H.
,
Kadhum
S. A.
,
Abed
S. A.
&
Salih
R. M.
(
2019
)
Development and evaluation of irrigation water quality guide using IWQG V.1 software: A case study of Al-Gharraf Canal, Southern Iraq
,
Environmental Technology & Innovation
,
13
,
224
232
.
https://doi.org/10.1016/j.eti.2018.12.001
.
Gaagai
A.
,
Aouissi
H. A.
,
Bencedira
S.
,
Hinge
G.
,
Athamena
A.
,
Haddam
S.
,
Gad
M.
,
Elsherbiny
O.
,
Elsayed
S.
,
Eid
M. H.
&
Ibrahim
H.
(
2023
)
Application of water quality indices, machine learning approaches, and GIS to identify groundwater quality for irrigation purposes: A case study of Sahara Aquifer, Doucen Plain, Algeria
,
Water
,
15
(
2
),
289
.
https://doi.org/10.3390/w15020289
.
Gao
Y.
,
Qian
H.
,
Ren
W.
,
Wang
H.
,
Liu
F.
&
Yang
F.
(
2020
)
Hydrogeochemical characterization and quality assessment of groundwater based on integrated-weight water quality index in a concentrated urban area
,
Journal of Cleaner Production
,
260
,
121006
.
https://doi.org/10.1016/j.jclepro.2020.121006
.
Gebrehiwot
A. B.
,
Tadesse
N.
&
Jigar
E.
(
2011
)
Application of water quality index to assess suitability of groundwater quality for drinking purposes in Hantebet watershed, Tigray, Northern Ethiopia
.
ISABB Journal of Food and Agriculture Science
,
1
(
1
),
22
30
.
https://doi.org/10.5897/ISABB-JFAS.9000001
.
Gibbs
R. J.
(
1970
)
Mechanisms controlling world water chemistry
.
Science, New Series
,
170
(
3962
),
1088
1090
. http://www.jstor.org/stable/1730827.
Goel
A.
&
Pal
M.
(
2009
)
Application of support vector machines in scour prediction on grade-control structures
,
Engineering Applications of Artificial Intelligence
,
22
(
2
),
216
223
.
https://doi.org/10.1016/j.engappai.2008.05.008
.
Guo
H.
,
Huang
J. J.
,
Chen
B.
,
Guo
X.
&
Singh
V. P.
(
2021
)
A machine learning-based strategy for estimating non-optically active water quality parameters using sentinel-2 imagery
,
International Journal of Remote Sensing
,
42
(
5
),
1841
1866
.
https://doi.org/10.1080/01431161.2020.1846222
.
Gupta
S. K.
&
Gupta
I. C.
(
1987
)
Management of Saline Soils and Waters
.
New Delhi, India
:
Oxford and IBH Publication Co. Oxford & IBH Publishing Co
.
Islam
A. R. M. T.
,
Shen
S.
,
Haque
M. A.
,
Bodrud-Doza
M.
,
Maw
K. W.
&
Habib
M. A.
(
2018
)
Assessing groundwater quality and its sustainability in Joypurhat district of Bangladesh using GIS and multivariate statistical approaches
,
Environment, Development and Sustainability
,
20
(
5
),
1935
1959
.
https://doi.org/10.1007/s10668-017-9971-3
.
Kelly
J. L.
(
1956
)
A New interpretation of information rate
,
Bell System Technical Journal
,
35
(
4
),
917
926
.
https://doi.org/10.1002/j.1538-7305.1956.tb03809.x
.
Khan
M. Y. A.
,
El Kashouty
M.
,
Gusti
W.
,
Kumar
A.
,
Subyani
A. M.
&
Alshehri
A.
(
2022
)
Geo-Temporal signatures of physicochemical and heavy metals pollution in groundwater of Khulais region – Makkah Province, Saudi Arabia
,
Frontiers in Environmental Science
,
9
(
January
),
800517
.
https://doi.org/10.3389/fenvs.2021.800517
.
Kouadri
S.
,
Pande
C. B.
,
Panneerselvam
B.
,
Moharir
K. N.
&
Elbeltagi
A.
(
2022
)
Prediction of irrigation groundwater quality parameters using ANN, LSTM, and MLR models
,
Environmental Science and Pollution Research
,
29
(
14
),
21067
21091
.
https://doi.org/10.1007/s11356-021-17084-3
.
Makki
Z. F.
,
Zuhaira
A. A.
,
Al-Jubouri
S. M.
,
Al-Hamd
R. K. S.
&
Cunningham
L. S.
(
2021
)
GIS-based assessment of groundwater quality for drinking and irrigation purposes in central Iraq
,
Environmental Monitoring and Assessment
,
193
(
2
),
107
.
https://doi.org/10.1007/s10661-021-08858-w
.
Masoud
M.
,
Osta
M. E.
,
Alqarawy
A.
,
Elsayed
S.
&
Gad
M.
(
2022
)
Evaluation of groundwater quality for agricultural under different conditions using water quality indices, partial least squares regression models, and GIS approaches
,
Applied Water Science
,
12
,
244
.
https://doi.org/10.1007/s13201-022-01770-9
.
Meireles
A. C. M.
,
Andrade
E. M. d.
,
Chaves
L. C. G.
,
Frischkorn
H.
&
Crisostomo
L. A.
(
2010
)
A new proposal of the classification of irrigation water
,
Revista Ciência Agronômica
,
41
(
3
),
349
357
.
https://doi.org/10.1590/S1806-66902010000300005
.
Mosavi
A.
,
Sajedi Hosseini
F.
,
Choubin
B.
,
Taromideh
F.
,
Ghodsi
M.
,
Nazari
B.
&
Dineva
A. A.
(
2021
)
Susceptibility mapping of groundwater salinity using machine learning models
,
Environmental Science and Pollution Research
,
28
(
9
),
10804
10817
.
https://doi.org/10.1007/s11356-020-11319-5
.
Omeka
M. E.
,
Igwe
O.
,
Onwuka
O. S.
,
Nwodo
O. M.
,
Ugar
S. I.
,
Undiandeye
P. A.
&
Anyanwu
I. E.
(
2023
)
Efficacy of GIS-based AHP and data-driven intelligent machine learning algorithms for irrigation water quality prediction in an agricultural-mine district within the Lower Benue Trough, Nigeria
,
Environmental Science and Pollution Research
,
31
,
0123456789
.
https://doi.org/10.1007/s11356-023-25291-3
.
Paliwal
K. V.
(
1972
)
Irrigation with Saline Water. Monogram No. 2 (Newseries)
.
New Delhi
:
IARI
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2022a
)
An evaluation of groundwater quality and its suitability for drinking and irrigation uses under the GIS framework
,
Water Practice and Technology
,
17
(
11
),
2259
2277
.
https://doi.org/10.2166/wpt.2022.134
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2022b
)
Prediction of groundwater quality indices using machine learning algorithms
,
Water Practice and Technology
,
17
(
1
),
336
351
.
https://doi.org/10.2166/wpt.2021.120
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2023a
)
Groundwater quality appraisal using IWQI and PCA for irrigation uses
,
ISH Journal of Hydraulic Engineering
,
29
(
sup1
),
264
273
.
https://doi.org/10.1080/09715010.2023.2218829
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2023b
)
Assessment and modeling of groundwater quality using GIS and machine learning techniques for drinking purpose
,
World Environmental and Water Resources Congress
,
2010
,
1092
1112
.
https://doi.org/10.1061/9780784484852.100
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2024a
)
Evaluation of groundwater quality for drinking purposes based on machine learning algorithms and GIS
,
Sustainable Water Resources Management
,
10
(
1
),
11
.
https://doi.org/10.1007/s40899-023-00990-4
.
Raheja
H.
,
Goel
A.
&
Pal
M.
(
2024b
)
A novel approach for prediction of groundwater quality using gradient boosting-based algorithms
,
ISH Journal of Hydraulic Engineering
,
30
(
3
),
281
292
.
https://doi.org/10.1080/09715010.2024.2316615
.
Richards
L. A.
(
1954
)
Diagnosis and improvement of saline and alkaline soils
,
Soil Science Society of America Journal
,
18
(
3
),
348
.
https://doi.org/10.2136/sssaj1954.03615995001800030032x
.
Sánchez
A. V. D.
(
2003
)
Advanced support vector machines and kernel methods
,
Neurocomputing
,
55
(
1–2
),
5
20
.
https://doi.org/10.1016/S0925-2312(03)00373-4
.
Singh
K. P.
,
Basant
N.
&
Gupta
S.
(
2011
)
Support vector machines in water quality management
,
Analytica Chimica Acta
,
703
(
2
),
152
162
.
https://doi.org/10.1016/j.aca.2011.07.027
.
Singha
S.
,
Pasupuleti
S.
,
Singha
S. S.
,
Singh
R.
&
Kumar
S.
(
2021
)
Prediction of groundwater quality using efficient machine learning technique
,
Chemosphere
,
276
,
130265
.
https://doi.org/10.1016/j.chemosphere.2021.130265
.
Tleuova
Z.
,
Snow
D. D.
,
Mukhamedzhanov
M.
&
Ermenbay
A.
(
2023
)
Relation of hydrogeology and contaminant sources to drinking water quality in Southern Kazakhstan
,
Water (Switzerland)
,
15
(
24
),
1
20
.
https://doi.org/10.3390/w15244240
.
Todd
D. K.
(
1995
).
Groundwater Hydrology
, 3rd edn.
Hoboken, NJ
:
John Wiley & Sons. Inc
.
United Nations
. (
2015
)
Transforming our World: The 2030 Agenda for Sustainable Development
,
New York, NY:
Department of Economic and Social Affairs
.
Wilcox
L.
(
1955
)
Classification and Use of Irrigation Waters
.
Washington, DC
:
USDA
.
Circular, 969
.
Zhang
W.
,
Wu
C.
,
Zhong
H.
,
Li
Y.
&
Wang
L.
(
2021
)
Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization
,
Geoscience Frontiers
,
12
(
1
),
469
477
.
https://doi.org/10.1016/j.gsf.2020.03.007
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).