ABSTRACT
In this study, a water quality analysis was conducted at 24 surface water monitoring stations for 17 water quality parameters to investigate the spatial variability of the constituents, determine the possible sources of pollution and quantify the contribution of each identified source on individual parameters using multivariate statistical analysis (MSA) and the multivariate receptor model (MRM) conjunctively. Although most of the analysed water quality constituents were in the recommended water quality guideline standard, nutrient concentration at some of the monitoring stations was found above the permissible limit, making the water resources in the river basin prone to eutrophication and leaving the aquatic life completely at risk. On the other hand, based on factor analysis in the basin (which explained 86.9% of the total variance with four principal factors), agricultural (nutrient) waste is the dominant pollution source followed by ground water intrusion and mineral dissolution. Besides, the MRM analysis using the UNMIX model-assigned source contribution to individual parameters with a minimum signal-to-noise ratio (S/N) of 3.38 > 2 and R2 of 0.88 > 0.8. A coordinated land use management and continuous water quality monitoring could be a better approach to managing the increasing pollution level in the catchment.
HIGHLIGHTS
This study describes the assessment of spatial water quality variation in the Southern Ethiopia Rift valley, Ethiopia.
Cluster analysis was used to spatially group the monitoring sites.
Pollution source quantification and identification were performed using the UNMIX receptor model.
INTRODUCTION
Surface water pollution is nowadays a critical environmental issue, resulting from various anthropogenic and natural sources. Excess nutrients from fertilizer applications are causing eutrophication, ultimately triggering algal blooms and depleting oxygen levels, endangering aquatic life (Fan et al. 2010; Madjar et al. 2024). Besides, heavy metals from mining (Obasi & Akudinobi 2020) and industrial activities (Oladimeji et al. 2024) enter into water bodies posing severe health risks. Furthermore, bacterial contamination from untreated human waste, particularly in developing regions, is increasing water-borne disease prevalence (Haldar et al. 2022).
Despite the increased complexity of environmental contaminants, integrated multivariate statistical analysis (MSA) and receptor models provided a clear framework for identifying and pinpointing pollution sources based on the available water quality datasets (Ouyang et al. 2006). The assessment of surface water quality is thus becoming more important for decision making and MSA techniques such as the application of principal component analysis (PCA) and factor analysis (FA) have gained advantages in this regard where they can manage large sets of water quality data (Shrestha & Kazama 2007; Özdemir 2016; Hajigholizadeh & Melesse 2017). These statistical methods, when integrated with environmental multivariate receptor models (MRM) such as positive matrix factorization (PMF), absolute principal component score-multiple linear regression (APCS-MLR) and chemical mass balance (CMB), enhance pollution assessment and management capability, thereby quantifying their contribution to the water resource pollution (Gholizadeh et al. 2016; Zhang et al. 2020). The MRMs have been extensively utilized in surface water studies due to their ability to decompose complex pollutant mixtures without prior knowledge of source profiles (Chen et al. 2022). While the CMB model depends on predefined source profiles, PMF assigns pollution sources based on their characteristic chemical signatures.
Many studies reveal that the impact of pollution mainly from nonpoint sources such as agricultural and urban land uses in a watershed is heavily influenced by the type of land use and the planning and management approaches (Wang et al. 2005; Lee et al. 2012; Kim et al. 2018). As a result, the type, nature and planning of land use are critical factors in river pollution management (Ding et al. 2015; Chiang et al. 2021). According to Angello et al. (2020), land use planning and management in Ethiopia is poor, and stream corridor restoration evaluation is limited and often omitted from all river restoration programs, resulting in significant river pollution from various land uses. Consequently, many surface water resources in the country are becoming heavily nutrient-polluted, intensifying eutrophication in rivers (Menberu et al. 2021; Kassaye et al. 2022). River pollution in Ethiopia hence is becoming a critical issue, primarily due to land use changes causing erosion and runoff (Moges et al. 2018).
In Ethiopia, land use change is very fast which is partly due to factors such as high population pressure, urbanization and resettlement (Negesse et al. 2024). On the other hand, agricultural practices and unsustainable practices and depletion of natural resources are also intensifying the land use change predominantly in the rural areas (Megersa & Tullu 2018). As a result, the land use modification has greatly caused various hydrologic and water quality disruptions. The Abaya-Chamo Basin (ACB) is best characterized by its diversified animal and plant species composition. However, recent studies suggest the basin is under threat mainly due to population pressure. Yohannes et al. (2018) reported human-induced factors, such as increasing demand for fuelwood, intensification of construction using wood, agricultural land expansion and irrigation activities and increased bush burning activity for pasture. This land use alteration is highly impacting the water resources of the basin. Therefore, the combined analysis of both water quality and quantity due to land use modification has nowadays become more and more important for the environment and the people depending on it (Kebede et al. 2014). This study is, therefore, aimed at investigating, assessing and characterizing the spatial river pollution, determining their potential pollution sources and quantifying the contribution and composition of each pollution source for individual water quality constituents using integrated MSA and MRM.
MATERIALS AND METHODS
The study area
The rainfall pattern in the river basin has two annual peaks, usually occurring in April/May and August/September. The ACB climate is further characterized by a high rate of evaporation. The area is also characterized by high topographic variability ranging from 1,080 to 3,500 m above sea level, the lowest at the lake shore and the highest at the Guge Mountain.
Longitudinal variations of nitrate (a), sulphate (b), Ca2+ (c), TDS (d), pH and DO (f) on selected surface water resources of the ACB.
Longitudinal variations of nitrate (a), sulphate (b), Ca2+ (c), TDS (d), pH and DO (f) on selected surface water resources of the ACB.
Geologically, the volcanic rocks are predominantly found in the area evolving from the tertiary and quaternary periods. The distribution of the geological formations in the basin of the lowland part of the upper land has a recent quaternary period Aden series with basaltic flows and related spatter cones, whereas the parts have varying formations of tertiary period trap series alkali basalt and trachyte and quaternary period alluvium deposits (Tiruneh 2005). The basin has many water resources, some of which are the Hare River, Kulfo River, Sile River, Bilate River, Gidabo River, Baso River, Sego River, Abaya and Chamo Lake.
Water quality monitoring and analytical techniques
Water samples were collected for 17 physico-chemical cations and anions (Table 1) in the selected surface water sources (lakes and rivers) of the ACB at 24 monitoring stations.
Monitored water quality parameters and analytical methods used
No. . | Parameter . | Unit . | Analytical method . |
---|---|---|---|
1 | pH | Electrometric method | |
2 | Dissolved oxygen (DO) | mg/L | Electrometric method |
3 | Total dissolved solid (TDS) | mg/L | Gravimetric |
4 | Water temperature (T) | °C | Thermal analysis |
5 | Electrical conductivity (EC) | μs/cm | Electrometric method |
6 | Total hardness as CaCO3 | mg/L | EDTA titrimetric |
7 | Calcium (Ca2+) | mg/L | EDTA titrimetric |
8 | Magnesium (Mg2+) | mg/L | EDTA titrimetric |
9 | Bicarbonate (HCO3−) | mg/L | Acid titrimetric |
10 | Chloride (Cl−) | mg/L | Argentometric |
11 | Sulphate (SO42−) | mg/L | Turbidimetric |
12 | Sodium (Na+) | mg/L | Flame photometry |
13 | Potassium (K) | mg/L | Flame photometry |
14 | Total Iron (Fe) | mg/L | FerroVer iron method |
15 | Nitrate-nitrogen (NO3-N) | mg/L | Spectrophotometric |
16 | Nitrite-nitrogen (NO2-N) | mg/L | Nitraver-3 nitrite method |
17 | Phosphate (PO4-P) | mg/L | Stannous chloride method |
No. . | Parameter . | Unit . | Analytical method . |
---|---|---|---|
1 | pH | Electrometric method | |
2 | Dissolved oxygen (DO) | mg/L | Electrometric method |
3 | Total dissolved solid (TDS) | mg/L | Gravimetric |
4 | Water temperature (T) | °C | Thermal analysis |
5 | Electrical conductivity (EC) | μs/cm | Electrometric method |
6 | Total hardness as CaCO3 | mg/L | EDTA titrimetric |
7 | Calcium (Ca2+) | mg/L | EDTA titrimetric |
8 | Magnesium (Mg2+) | mg/L | EDTA titrimetric |
9 | Bicarbonate (HCO3−) | mg/L | Acid titrimetric |
10 | Chloride (Cl−) | mg/L | Argentometric |
11 | Sulphate (SO42−) | mg/L | Turbidimetric |
12 | Sodium (Na+) | mg/L | Flame photometry |
13 | Potassium (K) | mg/L | Flame photometry |
14 | Total Iron (Fe) | mg/L | FerroVer iron method |
15 | Nitrate-nitrogen (NO3-N) | mg/L | Spectrophotometric |
16 | Nitrite-nitrogen (NO2-N) | mg/L | Nitraver-3 nitrite method |
17 | Phosphate (PO4-P) | mg/L | Stannous chloride method |
All the analytical methods for the constituents were conducted by (APHA 1999) standard procedures. The monitoring stations were selected in consideration of the spatial/geographic location, available water sources, prevalence of anthropogenic influence and economic and social importance of the water resources.
According to the land use classification of Ayele et al. (2019), the ACB is dominated by grass & shrub land (33.49%) followed by farm land (28.63%). Other types of land uses are barren land (17.12%), vegetation (9.35%), Lake Abaya (5.91%), wood land (2.79%) and Lake Chamo (1.65%).
Assessment of the water quality by MSA
Development of better and efficient water quality management systems in a water resource is the main challenge these days. Despite some limitations, MSA techniques have become successful at filling the gap (Noori et al. 2010). However, availability of data with reasonable quality plays a key role in the effective interpretation of the water quality by MSA. Several MSA techniques are used nowadays to interpret water quality variables such as PCA, FA and CA. Statistical techniques are used to characterize and interpret river water quality variables by minimizing the bulk datasets without losing much information contained in the original data (Silva et al. 2017).
In the ACB, FA was used to determine the possible pollution sources contributing to the pollution of the surface water resources qualitatively. Prior to statistical analysis and result interpretation, the suitability of the collected samples for FA was checked using the parameters suitability test: Kaiser–Meyer–Olkin (KMO), a measure of sampling adequacy and Bartlett's test of sphericity which examines whether the available data are independent or not. A KMO value close to one would generally mean the correlations are compacted and hence the sampling and the samples are highly suitable for FA, whereas smaller values would generally mean that the variables in consideration have very little in common. Although KMO greater than 0.5 is often considered adequate (Ogwueleka 2015), higher values are usually recommended. On the other hand, CA was used to group all the monitoring stations according to their spatial similarity where Hierarchical Cluster Analysis (HCA) was used to classify the monitoring stations based on the similarity between constituents through Ward's method. All the statistical analyses are conducted using SPSS (version 26).
During FA, the most common extraction method is the principal component-based correlation matrix determination where eigenvalue greater than unity is considered. For maximizing the variance and extraction of the determining factors, Kaiser normalization is used (Angello et al. 2021). Each principal component loading determines the weight (impact) of the respective constituents and generally loadings greater than 0.75 are often taken as strong (Kilonzo et al. 2014), whereas the components loadings 0.5–0.75 and 0.3–0.5 are classified as moderate and weak, respectively (Cid et al. 2011). Similarly, the CA analysis was used to group the water quality monitoring station based on their chemical similarity, represented by a dendrogram. HCA is a commonly used technique to cluster the water samples according to their chemical composition and similarity.
Quantification of pollution sources by the UNMIX model
Multivariate statistical techniques are nowadays gaining popularity in determining the types of pollution sources contributing to water quality deterioration. However, quantification of the contribution and composition of each source are difficult. In this regard, with the multivariate receptor models (MRM) such as EPA's UNMIX (Norris et al. 2007), PMF (Huang et al. 2018) and Absolute Principal Component Score/Multiple Linear Regression, APCS/MLR (Gulgundi & Shetty 2016) the contribution of the pollution sources to individual water quality constituents can be quantified and decision-making has been simplified. The UNMIX model is one of the simplified tools developed to analyze large sets of water, air and soil/sediment samples and has become effective in recent times (Sun 2017). The UNMIX assumes the given water quality data are linearly related to the unknown number of sources with an unknown composition and contribution (Norris et al. 2007).
RESULTS AND DISCUSSION
Characterization of water quality in ACB
Detailed spatial water quality variation in the Abaya-Chamo Lake Basin is given in Table 2. Accordingly, the physico-chemical characterization in the rivers and lakes revealed that the pH has shown a nearly constant trend and no significant variation was observed across the monitoring stations. A mean of 7.9 ± 0.4 was recorded on the ACB monitoring station where few irregularities were observed between monitoring stations. The observed pH generally ranged from 7.2 to 8.7, indicating the reduced impact of acidity in the water resources. Similarly, the DO variation in the surface water resources of the ACB has shown nearly a uniform trend in concentration and all the monitoring stations have met the minimum requirement oxygen concentration on surface water resources as set by the Ethiopian water quality standard >4.5 mg/L (EEPA 2003). Furthermore, a strong negative correlation was observed between DO and TDS (r = −0.75), EC (−0.76) and bicarbonate (−0.86) at a significance level of p < 0.01. The high negative correlation between DO and TDS/EC could be attributed to the high presence of dissolved salt concentration that reduced the dissolvability of oxygen.
Spatial distribution of physico-chemical characteristics of the surface water sample in ACB
Monitoring st. . | pH . | DO (mg/L) . | TDS (mg/L) . | Temp. (°C) . | EC (μS/cm) . | TH (mg/L) . | TI (mg/L) . |
---|---|---|---|---|---|---|---|
GSR | 8.3 | 6.50 | 332.7 | 28.3 | 703 | 243 | 0.02 |
LCH | 8.4 | 5.73 | 1,205.7 | 27.2 | 2,539 | 111 | 0.58 |
WOR | 8.1 | 7.31 | 304.0 | 28.1 | 643 | 241 | 0.16 |
ZSR | 7.9 | 6.54 | 290.7 | 28.4 | 614 | 189 | 0.18 |
CER | 7.9 | 6.78 | 367.0 | 26.6 | 775 | 273 | 0.12 |
CSF | 8.0 | 6.45 | 83.6 | 26.4 | 179 | 147 | 0.22 |
SRS | 7.9 | 6.99 | 177.7 | 26.8 | 377 | 150 | 0.09 |
SIB | 8.0 | 6.69 | 111.3 | 27.0 | 237 | 89 | 0.32 |
ESF | 7.7 | 6.38 | 198.0 | 29.6 | 420 | 156 | 0.15 |
ESB | 8.3 | 6.59 | 225.0 | 29.2 | 476 | 160 | 0.16 |
BSF | 7.7 | 6.68 | 81.3 | 26.8 | 176 | 70 | 2.9 |
AMB | 7.2 | 6.75 | 37.7 | 25.4 | 80.6 | 80 | 3.1 |
BNR | 7.3 | 6.63 | 63.3 | 24.4 | 132 | 60 | 2.3 |
ALL | 8.7 | 6.56 | 564 | 27.3 | 1,190 | 180 | 0.71 |
BRB | 7.5 | 6.22 | 77.5 | 25.8 | 164 | 24 | 3 |
MMA | 7.6 | 6.74 | 76.6 | 24.4 | 160 | 90 | 3.6 |
RRW | 8.4 | 6.93 | 174 | 24.3 | 359 | 220 | 0.25 |
BIR | 7.4 | 5.95 | 64.4 | 31.4 | 154 | 70 | 3.8 |
BAB | 7.5 | 6.05 | 116 | 28.8 | 259 | 120 | 2.4 |
AFB | 7.2 | 6.15 | 31.4 | 27.6 | 70.6 | 50 | 1.36 |
HNB | 7.8 | 7.46 | 28.9 | 22.1 | 60.1 | 20 | 1.77 |
HAO | 7.5 | 6.97 | 38.1 | 24.1 | 79.8 | 20 | 1.01 |
KOB | 7.8 | 6.67 | 63.2 | 25.8 | 137 | 60 | 1.24 |
KAO | 7.6 | 6.01 | 64.5 | 25.2 | 136 | 60 | 1.66 |
Mean | 7.8 | 6.6 | 199.0 | 26.7 | 421.7 | 120.1 | 1.3 |
Max | 8.7 | 7.5 | 1,205.7 | 31.4 | 2,539.1 | 273.3 | 3.8 |
Min | 7.2 | 5.7 | 28.9 | 22.1 | 60.1 | 20.0 | 0.0 |
Std. Dev. | 0.40 | 0.42 | 252.58 | 2.09 | 531.31 | 75.04 | 1.26 |
Monitoring st. . | pH . | DO (mg/L) . | TDS (mg/L) . | Temp. (°C) . | EC (μS/cm) . | TH (mg/L) . | TI (mg/L) . |
---|---|---|---|---|---|---|---|
GSR | 8.3 | 6.50 | 332.7 | 28.3 | 703 | 243 | 0.02 |
LCH | 8.4 | 5.73 | 1,205.7 | 27.2 | 2,539 | 111 | 0.58 |
WOR | 8.1 | 7.31 | 304.0 | 28.1 | 643 | 241 | 0.16 |
ZSR | 7.9 | 6.54 | 290.7 | 28.4 | 614 | 189 | 0.18 |
CER | 7.9 | 6.78 | 367.0 | 26.6 | 775 | 273 | 0.12 |
CSF | 8.0 | 6.45 | 83.6 | 26.4 | 179 | 147 | 0.22 |
SRS | 7.9 | 6.99 | 177.7 | 26.8 | 377 | 150 | 0.09 |
SIB | 8.0 | 6.69 | 111.3 | 27.0 | 237 | 89 | 0.32 |
ESF | 7.7 | 6.38 | 198.0 | 29.6 | 420 | 156 | 0.15 |
ESB | 8.3 | 6.59 | 225.0 | 29.2 | 476 | 160 | 0.16 |
BSF | 7.7 | 6.68 | 81.3 | 26.8 | 176 | 70 | 2.9 |
AMB | 7.2 | 6.75 | 37.7 | 25.4 | 80.6 | 80 | 3.1 |
BNR | 7.3 | 6.63 | 63.3 | 24.4 | 132 | 60 | 2.3 |
ALL | 8.7 | 6.56 | 564 | 27.3 | 1,190 | 180 | 0.71 |
BRB | 7.5 | 6.22 | 77.5 | 25.8 | 164 | 24 | 3 |
MMA | 7.6 | 6.74 | 76.6 | 24.4 | 160 | 90 | 3.6 |
RRW | 8.4 | 6.93 | 174 | 24.3 | 359 | 220 | 0.25 |
BIR | 7.4 | 5.95 | 64.4 | 31.4 | 154 | 70 | 3.8 |
BAB | 7.5 | 6.05 | 116 | 28.8 | 259 | 120 | 2.4 |
AFB | 7.2 | 6.15 | 31.4 | 27.6 | 70.6 | 50 | 1.36 |
HNB | 7.8 | 7.46 | 28.9 | 22.1 | 60.1 | 20 | 1.77 |
HAO | 7.5 | 6.97 | 38.1 | 24.1 | 79.8 | 20 | 1.01 |
KOB | 7.8 | 6.67 | 63.2 | 25.8 | 137 | 60 | 1.24 |
KAO | 7.6 | 6.01 | 64.5 | 25.2 | 136 | 60 | 1.66 |
Mean | 7.8 | 6.6 | 199.0 | 26.7 | 421.7 | 120.1 | 1.3 |
Max | 8.7 | 7.5 | 1,205.7 | 31.4 | 2,539.1 | 273.3 | 3.8 |
Min | 7.2 | 5.7 | 28.9 | 22.1 | 60.1 | 20.0 | 0.0 |
Std. Dev. | 0.40 | 0.42 | 252.58 | 2.09 | 531.31 | 75.04 | 1.26 |
TH, total hardness; TI, total iron; SD, standard deviation.
The TDS in the ACB has shown a significant spatial variation across the monitoring stations where a maximum concentration of 1,205.7 mg/L was recorded at LCH, whereas a minimum (28.9 mg/L) was observed at HNB, the earlier monitoring station being a dumping site for leftover fish products and public recreation areas (Figure 3). Besides, high TDS standard deviation was calculated on almost all the monitoring stations mainly due to the high spatial variation and high variation in anthropogenic activities (Angello et al. 2021). On the other hand, the EC, which measures the proxy salt concentration in the surface water resources, was also found significantly high at monitoring stations with high TDS concentration (LCH). The excess presence of EC in the water resources is a good indication of significant quantity of dissolved salt (Barakat et al. 2016). The high EC level in LCH could be attributed to the increased anthropogenic and agricultural practice impact on the water resource. Accordingly, high and low EC concentrations were recorded at monitoring stations LCH and HNB corresponding to the Lake Chamo and the Bridge on Hare River, which is the tributary for Lake Chamo and characterized by intensive agriculture (Figure 2). They earlier receive a large number of sediments mainly from agricultural lands. The conductivity in the ACB has shown a strong positive correlation with bicarbonate (r = 0.9), chloride (0.86) and sodium (0.93). Similarly, the water temperature is the major factor that causes significant environmental impacts and controls various water quality variables (Silva et al. 2017). However, in the ACB surface water samples, the water temperature ranging from 22.1 to 31.4 °C with a mean temperature of 26.7 °C is under the national (EEPA 2003) and WHO (WHO 2011) guideline standards and showed a weak to moderate correlation with all parameters.
Excess presence of nutrients can seriously impact the well-being of aquatic life (Barakat et al. 2016). In ACB, however, the NO3-N concentration was found from 3.3 to 31 mg/L with a mean concentration of 13 mg/L (Table 3). All the monitoring stations have recorded NO3-N within the permissible limit, whereas at some monitoring stations such as BSF and BRB, the NO3-N concentration has recorded slightly higher concentrations that could be attributed to the leaching of agricultural land saturated with fertilizers. More than 70% of the monitoring stations in ACB have recorded a NO3-N concentration of <10 mg/L. On the other hand, the PO4-P concentration at some of the monitoring stations has deviated from the national guideline standard for surface water requirements. More than 20% of the monitoring station water samples had a PO4-P concentration of >1 mg/L, making the stations at the critical stage of eutrophication. The correlation analysis using Pearson correlation showed that PO4-P had a strong negative correlation with pH (r = −0.77) and a strong positive correlation with total iron (r = 0.83) and NO2-N (r = 0.75).
Spatial distribution of anions and cations on the surface water samples in ACB
MS . | Ca2+ . | Mg2+ . | HCO3− . | Cl− . | SO42− . | Na+ . | K+ . | NO3− . | NO2− . | PO43− . |
---|---|---|---|---|---|---|---|---|---|---|
GSR | 21 | 48 | 310 | 48 | 27.2 | 95.5 | 10.1 | 11.7 | 0.015 | 0.22 |
LCH | 13 | 26 | 777 | 160 | 51.6 | 901.1 | 16.8 | 8.7 | 0.019 | 0.23 |
WOR | 19 | 48 | 273 | 42 | 37.5 | 72.3 | 6.1 | 8.8 | 0.016 | 0.18 |
ZSR | 14 | 41 | 193 | 35 | 36.4 | 59.4 | 6.2 | 9.7 | 0.009 | 0.24 |
CER | 15 | 57 | 402 | 31 | 36.0 | 101.7 | 2.7 | 8.1 | 0.018 | 0.21 |
CSF | 16 | 15 | 123 | 12 | 35.3 | 25.2 | 3.3 | 7.7 | 0.011 | 0.15 |
SRS | 15 | 27 | 162 | 17 | 33.1 | 35.7 | 3.5 | 12.3 | 0.012 | 0.21 |
SIB | 15 | 17 | 109 | 12 | 33.5 | 27.4 | 4.0 | 9.6 | 0.003 | 0.28 |
ESF | 11 | 33 | 213 | 24 | 36.0 | 32.7 | 4.5 | 12.8 | 0.024 | 0.26 |
ESB | 14 | 38 | 236 | 23 | 37.1 | 53.7 | 3.5 | 12.4 | 0.019 | 0.21 |
BSF | 20 | 12.1 | 130 | 32.9 | 121.8 | 45.5 | 10 | 31 | 0.115 | 1.1 |
AMB | 12 | 16.5 | 140 | 38.9 | 110.5 | 18.1 | 7.1 | 9.5 | 0.135 | 1.43 |
BNR | 13.6 | 11.3 | 190 | 29 | 104.1 | 30.2 | 9.6 | 22 | 0.05 | 1.38 |
ALL | 16 | 39.8 | 250 | 118.8 | 132.3 | 606 | 17.1 | 9.1 | 0.037 | 0.39 |
BRB | 12 | 2.9 | 130 | 29 | 71.8 | 56.9 | 8.4 | 31 | 0.21 | 0.88 |
MMA | 16 | 18 | 120 | 24 | 53.4 | 19.2 | 2.5 | 17 | 0.069 | 0.5 |
RRW | 24 | 47.6 | 130 | 11 | 118 | 43.8 | 3.2 | 6.2 | 0.043 | 0.23 |
BIR | 16 | 13.1 | 184 | 48.9 | 141.4 | 44.6 | 11.2 | 14 | 0.26 | 1.81 |
BAB | 27.3 | 22.5 | 140 | 46.9 | 41.4 | 70 | 11 | 26 | 0.315 | 1.1 |
AFB | 16 | 8.3 | 300 | 24 | 125.9 | 15.1 | 6.6 | 23 | 0.28 | 0.93 |
HNB | 12 | 1.9 | 80 | 14 | 13.5 | 9.7 | 1.1 | 6.2 | 0.04 | 0.38 |
HAO | 12 | 1.9 | 80 | 19 | 16.2 | 14.7 | 1.2 | 6.2 | 0.02 | 0.5 |
KOB | 16 | 10.7 | 100 | 16 | 25.2 | 17.8 | 1.4 | 4.8 | 0.03 | 0.55 |
KAO | 17.6 | 10.3 | 104 | 19 | 22.2 | 18.7 | 1.7 | 3.3 | 0.03 | 0.43 |
Mean | 16.0 | 23.7 | 203.2 | 36.6 | 60.9 | 100.6 | 6.4 | 13.0 | 0.1 | 0.6 |
Max | 27.3 | 57.0 | 776.7 | 160 | 141.4 | 901.1 | 17.1 | 31.0 | 0.3 | 1.8 |
Min | 11.3 | 1.9 | 80.0 | 11.0 | 13.5 | 9.7 | 1.1 | 3.3 | 0.0 | 0.2 |
Std. Dev. | 3.93 | 16.58 | 147.03 | 34.30 | 42.23 | 207.25 | 4.60 | 7.95 | 0.09 | 0.47 |
MS . | Ca2+ . | Mg2+ . | HCO3− . | Cl− . | SO42− . | Na+ . | K+ . | NO3− . | NO2− . | PO43− . |
---|---|---|---|---|---|---|---|---|---|---|
GSR | 21 | 48 | 310 | 48 | 27.2 | 95.5 | 10.1 | 11.7 | 0.015 | 0.22 |
LCH | 13 | 26 | 777 | 160 | 51.6 | 901.1 | 16.8 | 8.7 | 0.019 | 0.23 |
WOR | 19 | 48 | 273 | 42 | 37.5 | 72.3 | 6.1 | 8.8 | 0.016 | 0.18 |
ZSR | 14 | 41 | 193 | 35 | 36.4 | 59.4 | 6.2 | 9.7 | 0.009 | 0.24 |
CER | 15 | 57 | 402 | 31 | 36.0 | 101.7 | 2.7 | 8.1 | 0.018 | 0.21 |
CSF | 16 | 15 | 123 | 12 | 35.3 | 25.2 | 3.3 | 7.7 | 0.011 | 0.15 |
SRS | 15 | 27 | 162 | 17 | 33.1 | 35.7 | 3.5 | 12.3 | 0.012 | 0.21 |
SIB | 15 | 17 | 109 | 12 | 33.5 | 27.4 | 4.0 | 9.6 | 0.003 | 0.28 |
ESF | 11 | 33 | 213 | 24 | 36.0 | 32.7 | 4.5 | 12.8 | 0.024 | 0.26 |
ESB | 14 | 38 | 236 | 23 | 37.1 | 53.7 | 3.5 | 12.4 | 0.019 | 0.21 |
BSF | 20 | 12.1 | 130 | 32.9 | 121.8 | 45.5 | 10 | 31 | 0.115 | 1.1 |
AMB | 12 | 16.5 | 140 | 38.9 | 110.5 | 18.1 | 7.1 | 9.5 | 0.135 | 1.43 |
BNR | 13.6 | 11.3 | 190 | 29 | 104.1 | 30.2 | 9.6 | 22 | 0.05 | 1.38 |
ALL | 16 | 39.8 | 250 | 118.8 | 132.3 | 606 | 17.1 | 9.1 | 0.037 | 0.39 |
BRB | 12 | 2.9 | 130 | 29 | 71.8 | 56.9 | 8.4 | 31 | 0.21 | 0.88 |
MMA | 16 | 18 | 120 | 24 | 53.4 | 19.2 | 2.5 | 17 | 0.069 | 0.5 |
RRW | 24 | 47.6 | 130 | 11 | 118 | 43.8 | 3.2 | 6.2 | 0.043 | 0.23 |
BIR | 16 | 13.1 | 184 | 48.9 | 141.4 | 44.6 | 11.2 | 14 | 0.26 | 1.81 |
BAB | 27.3 | 22.5 | 140 | 46.9 | 41.4 | 70 | 11 | 26 | 0.315 | 1.1 |
AFB | 16 | 8.3 | 300 | 24 | 125.9 | 15.1 | 6.6 | 23 | 0.28 | 0.93 |
HNB | 12 | 1.9 | 80 | 14 | 13.5 | 9.7 | 1.1 | 6.2 | 0.04 | 0.38 |
HAO | 12 | 1.9 | 80 | 19 | 16.2 | 14.7 | 1.2 | 6.2 | 0.02 | 0.5 |
KOB | 16 | 10.7 | 100 | 16 | 25.2 | 17.8 | 1.4 | 4.8 | 0.03 | 0.55 |
KAO | 17.6 | 10.3 | 104 | 19 | 22.2 | 18.7 | 1.7 | 3.3 | 0.03 | 0.43 |
Mean | 16.0 | 23.7 | 203.2 | 36.6 | 60.9 | 100.6 | 6.4 | 13.0 | 0.1 | 0.6 |
Max | 27.3 | 57.0 | 776.7 | 160 | 141.4 | 901.1 | 17.1 | 31.0 | 0.3 | 1.8 |
Min | 11.3 | 1.9 | 80.0 | 11.0 | 13.5 | 9.7 | 1.1 | 3.3 | 0.0 | 0.2 |
Std. Dev. | 3.93 | 16.58 | 147.03 | 34.30 | 42.23 | 207.25 | 4.60 | 7.95 | 0.09 | 0.47 |
MS, monitoring stations; SD, standard deviation.
Like physico-chemical water quality constituents in ACB, the anion and cation concentrations have shown a slight variation across all monitoring stations. The Ca2+ concentration was found to be within the national guideline standard with a mean concentration of 16 mg/L (Table 3). Similarly, an observed mean concentration of 23.7 mg/L Mg2+ was observed in the ACB making it within the limit standard. However, at some monitoring stations, such as WOR and CER, the concentration was slightly higher than the standard limit, making the water resources unsuitable for public uses. Very strong and positive correlation was found between Mg2+ and total hardness (TH) (r = 0.99), indicating Mg2+ was the major cause of the water hardness.
The TH in the ACB was significantly high mainly attributed to the presence of these Ca2+ and Mg2+ that varied from 11.3–27.3 mg/L and 1.9–57 mg/L, respectively. On the other hand, the high bicarbonate concentration in most of the water resources of ACB was possibly attributed to large cation-exchange capabilities of clay mineral and the high ground-surface water interaction that allows easy contact with sandstone and carbonate rocks. This can easily be interpreted by the high correlation between TH and Na+ (0.78) and Cl− (0.78). Similarly, the analyzed water quality sample in the surface water resource of the ACB also indicated that the Cl− concentration was high at some of the monitoring stations, although they all are within the guideline standard limit having a maximum concentration of 160.3 mg/L at LCH (Lake Chamo) monitoring station. The presence of Cl− in excess of 200–300 mg/L in drinking water may cause a salty taste to the water and beverage consumers (WHO 2011). On the other hand, a very strong positive correlation between Cl− and Na+ (r = 0.96) and K+ (r = 0.81) was observed, which could have possibly accounted for the high concentration of TDS and EC in the ACB, especially at LCH monitoring stations (Gebresilasie et al. 2021).
Pollution source apportionment using MSA
The pollution source apportionment in ACB was based on the FA where the suitability of the collected water samples was checked by two factors: KMO and Bartlett's test of sphericity. Accordingly, the KMO in ACB was 0.56 > 0.5, indicating the FA was more suitable for the analysis and interpretation of water quality variables in the ACB.
The FA in ACB extracted four principal components explaining a total variance of 86.9% with an eigen value >1. Accordingly, factor 1 explained 34.4% of the variance and had very strong loading for chloride (0.98), sodium (0.98), TDS (0.94), EC (0.93), bicarbonate (0.87) and strong loading for potassium (0.77). The presence of high component loading on sodium, bicarbonate and chloride is a good indication of the excess presence of dissolved salt in the water resources. It can also be interpreted by the presence of the impact of mineral constituents in the water resources (Yawar et al. 2017). Moreover, the high loading on EC, TDS and potassium could be attributed to the increased land use alteration that triggered sediment transport and the high ground water intrusion in the watershed. The second principal component that explained 26.42% of the variance has a strong component loading for phosphate, nitrite, nitrate and sulphate with a loading of 0.84, 0.84, 0.81 and 0.78, respectively. This could be interpreted by the high contribution of mixed pollution sources, mainly originating from agricultural lands and households (Rahman et al. 2021).
The third principal component had explained 15.6% of the variance with a very strong component loading on magnesium and TH having a calculated component loading of 0.89 and 0.91, respectively. The high loading on the constituents could be interpreted by the significant impact of the water resources by metal hardness. These impurities might be contributed by ground water and limestone originating from the ground surface. The fourth principal component extracted by FA in ACB had a strong positive loading for water temperature (0.88) and negative loading for DO (-0.74) that explained 10.3% of total variation. The component can be explained by the impact of seasonal variation on the water quality of the ACB water resources. From all the loadings, it can be deduced that the role of nonpoint sources originating mainly from agricultural land was evident and the impact is more pronounced on the two ACB lakes: Lake Chamo and Lake Abaya.
Spatial clustering of ACB water quality monitoring stations
Cluster 1 is represented by the Chamo Lake (LCH) monitoring station. The station is characterized by a relatively high pollution level compared to other monitoring stations in the ACB. The high number of pollution sources and increased anthropogenic impact in the lake could be a potential factor behind the high pollution level in the lake (Wang et al. 2012). Cluster 2 is composed of eight monitoring stations including BAB, BIR, MMA, BNR, AMB, BSF, AFB and BRB where urban land use impacts are prevailing (Figure 2). An internal analysis in this cluster revealed that the monitoring stations BAB and BIR have shown similar features. Monitoring stations in this cluster are characterized to have nearly similar characteristics and are affected by similar pollution sources. Similarly, cluster 3 is composed of eight monitoring stations, whereas cluster 4 consists of seven monitoring stations (Figure 4).
Quantification of pollution source contribution and composition
Before model interpretation, the UNMIX model in this study was checked for the minimum requirement of Noise to Signal ratio (N/S) and a coefficient of determination (R2) where both parameters were above the minimum requirement with 3.38 > 2 and 0.88, respectively. Accordingly, more than 88% of the total variance of each water quality constituent or their contribution can be explained using the predefined pollution profile (agricultural, ground water intrusion and mineral dissolution), leaving the remainder to be contributed by unknown (undefined) sources.
The UNMIX model predicted and assigned the pollution source contribution with a mean predicted to a measured ratio of 0.96, showing that the model has slightly underestimated the assignment of the contribution of the sources to the individual constituents. The mean absolute error was 4.18%, revealing the assignment of the pollution source contribution was quite good. The model performance for each parameter had good to very good range with an R2 ranging from 0.58 to 0.98 (TDS, Cl− and Na+). The model was relatively weak on TDS to assign the contribution of identified sources with an error of about 10.18%. Table 4 shows that source 1 (mineral dissolution) has the highest contribution for water temperature, DO, Ca+ and pH followed by ground water intrusion and agriculture, whereas it has a relatively low contribution for others. On the other hand, the contribution of source 2 for Cl- and Na+ was relatively higher, whereas the contribution of source 3 for TI, nitrate, nitrite and phosphate is high. Details of all the contributions of the pollution sources for selected water quality parameters are summarized in Table 4.
Percentage contribution of pollution sources for individual constituents
Constituent . | Source contribution (%) . | Predicted . | Measured . | Error (%) . | ||
---|---|---|---|---|---|---|
Source 1 . | Source 2 . | Source 3 . | ||||
TDS | 71.00 (37%) | 57.4 (29.9%) | 44.1 (22.7%) | 172.5 | 192.00 | 10.15 |
TH | 67.7 (59.7%) | 30.58 (26.9%) | 9.40 (8.3%) | 107.68 | 113.40 | 5.04 |
Ca2+ | 13.7 (91.9%) | 0.08 (0.60%) | 0.92 (6.15%) | 14.7 | 14.90 | 1.34 |
Mg2+ | 21.1 (71.5%) | 0.12 (0.5%) | 1.40 (5.85%) | 22.62 | 23.90 | 5.35 |
HCO3− | 154 (73.51%) | 36.1 (17.2%) | 11.60 (5.5%) | 201.7 | 209.50 | 3.72 |
pH | 7.26 (91.9%) | 0.16 (2.05%) | 0.14 (1.8%) | 7.56 | 7.90 | 4.3 |
Cl− | 10.8 (28.2%) | 21.8 (56.9%) | 5.45 (14.22%) | 38.05 | 38.30 | 0.65 |
Temp | 24.9 (93.2%) | 0.57 (2.1%) | 0.07 (0.2%) | 25.54 | 26.71 | 4.38 |
SO42− | 29.4 (49.5%) | 5.00 (8.4%) | 23.60 (39.7%) | 58.00 | 59.40 | 2.35 |
Na+ | 12.1 (11.7%) | 53.1 (51.4%) | 32.01 (31%) | 97.21 | 103.20 | 5.80 |
DO | 5.61 (96.9%) | 0.3 (4.9%) | 0.10 (1.6%) | 6.01 | 6.10 | 1.47 |
K | 1.43 (25.8%) | 1.03 (18.6%) | 2.66 (48%) | 5.12 | 5.54 | 7.58 |
TI | 0.10 (8.75%) | 0.03 (2.61%) | 1.03 (85.8%) | 1.16 | 1.20 | 3.33 |
NO3− | 3.96 (35.7%) | 0.16 (1.43%) | 6.42 (57.8%) | 10.54 | 11.10 | 5.04 |
NO2− | 0.01 (14.2%) | 0.001 (1.4%) | 0.06 (85.7%) | 0.068 | 0.07 | 2.85 |
PO4 | 0.15 (26.9%) | 0.002 (0.29%) | 0.39 (69.6%) | 0.54 | 0.56 | 3.57 |
Constituent . | Source contribution (%) . | Predicted . | Measured . | Error (%) . | ||
---|---|---|---|---|---|---|
Source 1 . | Source 2 . | Source 3 . | ||||
TDS | 71.00 (37%) | 57.4 (29.9%) | 44.1 (22.7%) | 172.5 | 192.00 | 10.15 |
TH | 67.7 (59.7%) | 30.58 (26.9%) | 9.40 (8.3%) | 107.68 | 113.40 | 5.04 |
Ca2+ | 13.7 (91.9%) | 0.08 (0.60%) | 0.92 (6.15%) | 14.7 | 14.90 | 1.34 |
Mg2+ | 21.1 (71.5%) | 0.12 (0.5%) | 1.40 (5.85%) | 22.62 | 23.90 | 5.35 |
HCO3− | 154 (73.51%) | 36.1 (17.2%) | 11.60 (5.5%) | 201.7 | 209.50 | 3.72 |
pH | 7.26 (91.9%) | 0.16 (2.05%) | 0.14 (1.8%) | 7.56 | 7.90 | 4.3 |
Cl− | 10.8 (28.2%) | 21.8 (56.9%) | 5.45 (14.22%) | 38.05 | 38.30 | 0.65 |
Temp | 24.9 (93.2%) | 0.57 (2.1%) | 0.07 (0.2%) | 25.54 | 26.71 | 4.38 |
SO42− | 29.4 (49.5%) | 5.00 (8.4%) | 23.60 (39.7%) | 58.00 | 59.40 | 2.35 |
Na+ | 12.1 (11.7%) | 53.1 (51.4%) | 32.01 (31%) | 97.21 | 103.20 | 5.80 |
DO | 5.61 (96.9%) | 0.3 (4.9%) | 0.10 (1.6%) | 6.01 | 6.10 | 1.47 |
K | 1.43 (25.8%) | 1.03 (18.6%) | 2.66 (48%) | 5.12 | 5.54 | 7.58 |
TI | 0.10 (8.75%) | 0.03 (2.61%) | 1.03 (85.8%) | 1.16 | 1.20 | 3.33 |
NO3− | 3.96 (35.7%) | 0.16 (1.43%) | 6.42 (57.8%) | 10.54 | 11.10 | 5.04 |
NO2− | 0.01 (14.2%) | 0.001 (1.4%) | 0.06 (85.7%) | 0.068 | 0.07 | 2.85 |
PO4 | 0.15 (26.9%) | 0.002 (0.29%) | 0.39 (69.6%) | 0.54 | 0.56 | 3.57 |
Source 1 indicates mineral dissolution; source 2 indicates ground water intrusion; source 3 indicates nonpoint sources (agricultural).
CONCLUSION
In this study, the water quality of the rift valley water resources of ACB obtained from 24 monitoring stations has been evaluated based on the application of MSA and MRM. The FA and hierarchical clustering in CA has been used to spatially analyze and represent the water quality variation where four meaningful clusters have been identified, which could be used as a representative location for future water quality intervention programmes. The MSA result interpretation revealed that intensive agricultural practices are the dominant pollution sources in the watershed, followed by groundwater contribution and mineral dissolution. On the other hand, MSA based on the hierarchical analysis in ACB water quality monitoring stations revealed that cluster 4, which constitutes the Chamo Lake, is on the verge of water quality deterioration due to the uncontrolled controlled anthropogenic activity including illegal fish market in the lake's peripheries. The MRM analysis showed that the contribution of agriculture to each water quality constituent, specifically nutrient pollutants, was quite high. Hence, integrated MSA and MRM could be a feasible option for water quality management and pollution control in the study area.
ACKNOWLEDGEMENTS
The authors are very thankful for the Arba Minch University, Institute of Water Technology, Water Resource Research Center for providing some water quality data.
DATA AVAILABILITY STATEMENT
All relevant data are included in the paper or its Supplementary Information.
CONFLICT OF INTEREST
The authors declare there is no conflict.