ABSTRACT
Effective wastewater management is crucial in regions experiencing water scarcity and environmental stressors, such as pollution and climate change. Optimizing treatment processes is essential for achieving environmental sustainability. This study aims to highlight the importance of effective wastewater management strategies, particularly in regions facing water scarcity. Our objective was to identify key factors influencing the treatment process. Therefore, we evaluated associations between physicochemical parameters using multivariate statistical methods, including Principal Component Analysis (PCA) and Hierarchical Ascendant Classification (HAC). Our findings categorize the monthly water samples into three distinct groups based on levels of organic pollution: the first group (July, August, and September) is characterized by high oxygenation levels and significantly low organic pollution, indicating optimal system operation. The second group (April, October, November, and December) exhibits low oxygenation and low organic pollution, promoting sludge settling and pollutant reduction. The third group (January, February, March, May, and June) shows significantly high organic pollution and low oxygenation, which corresponds to unfavorable environmental conditions. Our study demonstrates the effectiveness of multivariate statistical methods in optimizing wastewater treatment processes, providing crucial insights for environmental sustainability and water resource management.
HIGHLIGHTS
Use of multivariate statistical analysis for characterizing treated wastewater.
Identification of specific seasons or periods when organic pollution levels are notably high, facilitating targeted interventions and resource allocation.
Optimization of wastewater management, enabling improved practices and enhancing overall efficiency and sustainability of treatment processes.
INTRODUCTION
Water is fundamental to the development of nations, as a consistent supply of fresh water is necessary for establishing permanent communities (Hashem & Qi 2021). Securing sufficient access to water resources is crucial for fostering sustainable development (Bhaduri et al. 2016).
In many regions worldwide, ongoing population growth and rising water demand are leading to more frequent freshwater shortages (Hussain et al. 2019).
Algeria, marked by its arid and semi-arid climate, experiences irregular rainfall and high evapotranspiration rates (Guergueb & Ferhat 2021; Derdour et al. 2022), which contribute to significant challenges in managing its water resources. These challenges include adverse climatic conditions, overconsumption exceeding the natural renewal rate of the resource, and diffuse or occasional pollution (Masmoudi et al. 2016). With an annual water potential of approximately 23.2 billion m3 comprising 10.2 billion m3 of surface water and 13 billion m3 of groundwater (MRE 2020) and a population of approximately 43 million, Algeria confronts substantial constraints on freshwater resources (Hamiche et al. 2015). Water availability in Algeria falls below the recognized benchmark of 1,700 m3 per person per year. According to the World Health Organization (WHO 1989), availability is less than 1,000 m3 per person per year. Projections suggest a severe water shortage post 2050, with an anticipated 40% decrease in water availability per person to less than 500 m3 per year (Terink et al. 2013).
According to MRE (2020), Algeria utilized 4.06 billion m3 of water for drinking purposes and 7.6 billion m3 for irrigation. As in many other countries, irrigation is the primary use of water resources in Algeria, a trend driven by the long-term decline in water availability observed in recent decades (Drouiche et al. 2012).
To address these challenges, Algeria has implemented a new strategy to mobilize and secure various water resources, ensuring sustainability and integrated, rational water resource management at the national level. The reuse of treated wastewater is considered a primary alternative for expanding water resources, particularly in arid regions (Chojnacka et al. 2020; Hussain & Qureshi 2020). Wastewater reuse for irrigation provides an additional, reliable, and safe source of water and nutrients (Becerra-Castro et al. 2015), aiding in natural resource preservation and promoting integrated water management.
Ensuring the safety of reused wastewater for crop irrigation is a critical issue globally (Ghernaout & Elboughdiri 2020). Given the variability in wastewater characteristics based on its source, thorough examination during treatment and reuse processes is essential (Ilori et al. 2019). Adhering to established quality standards for wastewater discharge into natural environments can mitigate the harmful effects of inadequate treatment. Improperly treated wastewater poses significant risks to public health, the environment, and the economy (Libutti et al. 2018). Therefore, it is crucial to adopt effective treatment and utilization methods to protect public health (Hashem & Qi 2021).
Despite advancements in the design and operation of urban wastewater infrastructure over the past decade, challenges related to effluent quality persist (Zhang et al. 2021). This emphasizes the need for better management and interpretation of data from wastewater treatment facilities. Enhanced data analysis can provide critical insights for optimizing treatment processes and ensuring compliance with safety standards.
Continuous analysis campaigns play a pivotal role in improving the understanding and management of wastewater quality. However, they often generate vast amounts of complex and heterogeneous data that can be challenging to interpret meaningfully. To address this challenge, multivariate statistical techniques such as Principal Component Analysis (PCA) and Hierarchical Ascendant Classification (HAC) are indispensable. These methods not only establish relationships among interconnected data but also quantify wastewater quality and compress large databases of quantitative variables to extract relevant information (Aguado & Rosen 2008). Specifically, these approaches have proven particularly effective in analyzing seasonal variations in wastewater flows generated by domestic and industrial sources (Ouali et al. 2009), thereby providing crucial insights for optimizing wastewater treatment processes.
Numerous studies worldwide have used multivariate statistical analysis to establish relationships between quality parameters and pollution sources in wastewater treatment data (Aguado & Rosen 2008; Lefkir et al. 2016; Abbaa et al. 2021; Rekrak et al. 2021; Mouhtady et al. 2022; Rahmat et al. 2022; El Aatik et al. 2023; Moussaoui et al. 2023; Peng et al. 2023; Newhart et al. 2024). These methods are recognized for their ability to effectively handle large, complex, and diverse datasets, providing comprehensive insights into the relationships among variables influencing wastewater quality. Multivariate statistical analysis enables precise modeling and identification of key factors impacting the performance of wastewater treatment facilities. This approach ensures more accurate and effective management of wastewater treatment processes.
This study aims to highlight the importance of effective wastewater management strategies at an activated sludge treatment plant in Mascara, a region facing water scarcity. The main objective is to identify key factors influencing the treatment process and to highlight seasonal variations in organic pollution levels to understand the operational patterns at the plant. To achieve this objective, two types of multivariate statistical methods were applied. The first involves PCA, which reduces the dimensionality of datasets and evaluates associations between physicochemical parameters. The second method is HAC, which groups parameters into homogeneous categories. By identifying critical factors and analyzing seasonal variations, this research offers an innovative approach to optimize wastewater treatment processes, crucial for environmental sustainability. The final aim of this study is to inform policymakers and operators on necessary adaptations to optimize water resource management under diverse environmental conditions.
DATA AND METHODS
Study area
The city of Mascara is situated in an agricultural region covering an area of 75 km2, with 71% of the territory consisting of agricultural land, totaling 55 km2. Within this area, 35 km2 is located in the Ghriss plain, where groundwater resources have been significantly depleted due to increased exploitation and decreased rainfall. However, the treated wastewater from urban areas could be used in agriculture to compensate for the deficit and free up conventional resources for drinking water supply (Chadli et al. 2022).
Description of the mascara activated sludge treatment plant
The Mascara wastewater treatment plant (WWTP) is located 2 km south of the provincial capital, near the Kouair River, which receives the treated water. The plant is designed to serve a population equivalent of 100,000 inhabitants with a treatment capacity of 13,000 m3 per day. It treats wastewater from both urban and industrial sources using a low-loaded activated sludge process (or extended aeration). The treatment plant operates in three phases: pre-treatment, biological treatment using low-loaded activated sludge, and sludge treatment (drying beds). At present, the treated wastewater from the sewage treatment plant is specifically intended for irrigating an extensive agricultural area spanning 1,095 ha in the Ghriss Plain. This innovative and sustainable approach involves the efficient treatment of wastewater, facilitating its reuse to address local agricultural water requirements.
Collecting data
The available database for this study includes physicochemical parameters of raw (influent) and treated (effluent) waters from the plant, with a monthly time step. The database covers the period from January to December 2020. These data are measured and provided by the National Sanitation Office of the Mascara province. Measurements are conducted four times per month, and an average value is assigned for each month. Additionally, the Ghriss meteorological station provides data on air temperature (Tair) and wind speed (Wind).
The physicochemical parameters include potential of hydrogen (pH), volume (V), water temperature (Tw), chemical oxygen demand (COD), biochemical oxygen demand over 5 days (BOD5), suspended solids (SS), dissolved oxygen (DO), ammoniacal nitrogen (), nitrates (), and orthophosphates ().
Statistical analysis methods
The adopted methodological approach relies on the use of multivariate statistical methods: PCA and HAC. These methods were applied using STATISTICA.
Many studies have applied HAC and PCA for the classification and characterization of wastewater quality (Ouali et al. 2009; Rekrak et al. 2021; Mouhtady et al. 2022; Rahmat et al. 2022; El Aatik et al. 2023; Moussaoui et al. 2023). This technique helps establish a relationship between quality parameters and sources of pollution in wastewater treatment data (Rahmat et al. 2022). It identifies the variables that contribute the most to effluent quality, providing crucial information for operators and decision-makers to modify treatment processes accordingly (Abbaa et al. 2021).
PCA is a multidimensional statistical method used to establish, on one hand, a similarity assessment among samples (samplings), and on the other hand, an assessment of the relationships between variables (Hotelling 1933). The fundamental concept of PCA is to transform a large set of data containing associated variables into a smaller set of uncorrelated variables while maintaining the largest amount of information relating to the variation between the variables in the original dataset (Mouhtady et al. 2022). The method involves analyzing the correlation coefficients of each parameter successively with the first, second, and third principal components and representing the results graphically. Principal components generated during the analysis are arranged in such a manner that they correspond to a decreasing contribution of variance; i.e., Principal Component 1 (PC1) explains the highest amount of variance in the original data (Vieira et al. 2012).
It is important to note that PCA is sensitive to outliers (Garry 2007). To minimize the influence of these values, PCA is applied to centered and normalized data, giving equal weight to each variable. The initial classification of the parameters involves separating them based on their positive or negative correlations with the principal components. A factor loading greater than 0.75 is considered strong, while a range of 0.5–0.75 is considered moderate, and a range of 0.3–0.5 is considered weak (Schreiber 2020).
The HAC method is primarily used to complement the results obtained by PCA. It organizes a large dataset into clusters based on a given set of characteristics (Sharma et al. 2021). Cluster analysis facilitates the grouping of observations or variables according to their similarities or differences (Hussain 2004). The HAC output is represented by a dendrogram that categorizes observations or variables into groups or subgroups based on similarities. In this study, a dendrogram was obtained by performing Ward's method using squared Euclidean distance as a measure of similarity.
RESULTS AND DISCUSSION
Physicochemical parameters
The physicochemical parameters of the effluent wastewater were analyzed and compared with standards recommended by Algerian norms (JORA 2012), as well as those set by the World Health Organization (WHO) and the Food and Agriculture Organization (FAO).
Parameter . | Unit . | Mean . | Min . | Max . | Standard deviation . | Cv (%) . | SD Algerian Standards . |
---|---|---|---|---|---|---|---|
Tw | °C | 15.22 | 9.75 | 23.85 | 5.07 | 33.3 | 25 |
pH | – | 7.60 | 7.33 | 8.35 | 0.26 | 3.5 | 6.5–8.5 |
BOD5 | mg/L | 42.56 | 22.50 | 78.00 | 14.91 | 35 | 30 |
COD | mg/L | 92.36 | 57.75 | 133.50 | 19.45 | 21 | 90 |
TSS | mg/L | 37.90 | 22.00 | 61.75 | 12.95 | 34.2 | 30 |
DO | mg/L | 3.20 | 1.67 | 5.22 | 1.05 | 32.8 | – |
mg/L | 6.84 | 4.13 | 10.49 | 1.64 | 24 | – | |
mg/L | 0.95 | 0.09 | 4.24 | 1.12 | 118 | 30 | |
mg/L | 2.64 | 2.13 | 3.49 | 0.39 | 15 | 1à 2 |
Parameter . | Unit . | Mean . | Min . | Max . | Standard deviation . | Cv (%) . | SD Algerian Standards . |
---|---|---|---|---|---|---|---|
Tw | °C | 15.22 | 9.75 | 23.85 | 5.07 | 33.3 | 25 |
pH | – | 7.60 | 7.33 | 8.35 | 0.26 | 3.5 | 6.5–8.5 |
BOD5 | mg/L | 42.56 | 22.50 | 78.00 | 14.91 | 35 | 30 |
COD | mg/L | 92.36 | 57.75 | 133.50 | 19.45 | 21 | 90 |
TSS | mg/L | 37.90 | 22.00 | 61.75 | 12.95 | 34.2 | 30 |
DO | mg/L | 3.20 | 1.67 | 5.22 | 1.05 | 32.8 | – |
mg/L | 6.84 | 4.13 | 10.49 | 1.64 | 24 | – | |
mg/L | 0.95 | 0.09 | 4.24 | 1.12 | 118 | 30 | |
mg/L | 2.64 | 2.13 | 3.49 | 0.39 | 15 | 1à 2 |
The inter-monthly irregularities of these parameters are assessed by their coefficients of variation. Notably, the coefficients of variation for (14%) and (23%) are slightly low, indicating that the data have relatively low dispersion around their mean. This suggests that the inter-monthly irregularities of these parameters are less significant. Conversely, the coefficients of variation are slightly high for Tw (33%), TSS (34%), BOD5 (35%), and DO (33%), indicating slightly greater dispersion of values around their mean. This highlights the variability in the data and the potential for extreme values. Specifically, the coefficient of variation for stands at 118%, indicating a strong monthly irregularity. This suggests considerable fluctuation in values from month to month, with significant dispersion relative to the monthly mean. Furthermore, the low coefficient of variation for pH (3%) implies a higher degree of stability in this parameter. A low coefficient of variation for pH suggests that this aspect of water quality tends to remain close to its average, indicating more consistent and predictable behavior. Conversely, these irregularities indicate significant deviations from the mean, suggesting dynamic changes in these water quality indicators. Hence, parameters with a low coefficient of variation, such as pH, along with exhibiting a high coefficient of variation, were not considered in the PCA and HCA (Ketrouci et al. 2023).
Figure 3 shows a substantial variation in the physicochemical composition of wastewater between different months due to the significant differences between medians and maximum values. The results indicate substantial variability in concentration rates both across different months and among various parameters.
This underscores the importance of considering both temporal and parameter-specific variations when assessing the composition of wastewater. Understanding these variations is crucial for effective wastewater management and treatment strategies, as it provides insights into the specific challenges posed by different months and parameters in maintaining water quality standards.
Water temperature
Potential of hydrogen
The pH of the raw wastewater at the outlet exhibits slight alkalinity, ranging from 7.33 to 8.35, with an average of 7.60. This alkalinity in the effluent may be attributed to the composition of domestic waste, such as soap water, feces, and urine (Paliwal et al. 1998). This range supports the growth of bacteria necessary for the biological breakdown of organic pollutants, as noted in a previous study by Kadouche and colleagues (Kadouche et al. 2018). The pH values of the treated wastewater illustrated in Figure 4(b) conform to the recommended range of 6.5–8.5 according to Algerian standards (JORA 2012).
Total suspended solids (TSS)
TSS constitutes a mixture of organic matter, inorganic matter, and microorganisms present in wastewater. TSS can negatively impact water transparency and decrease light penetration, which can ultimately affect photosynthesis. Sedimentation and filtration processes can be used to remove solid particles from wastewater. At the outlet of the basin, the concentrations range from 22 to 66.7 mg/L, with an average of 37.89 mg/L (Figure 4(c)). Although SS in treated waters have decreased, several sampling months recorded values that exceeded the limit allowed by Algerian regulations for irrigation water. This is due to filter clogging caused by high levels of particles in urban effluent. The recorded values for the months of August, September, October, November, and December are below the permissible limit set by Algerian regulations for irrigation water (<30 mg/L) (JORA 2012). This reduction is attributed to the settling of suspended matter and physical filtration, which involves the retention of coarse and fine materials on the surface (Chachuat 1998).
Chemical oxygen demand
COD is a method used to determine the concentration of dissolved or suspended organic or mineral substances in water (Clescerl et al. 1998). It measures the amount of oxygen required for their complete chemical oxidation (Rodier 2009). The COD values vary between 57.7 and 133.5 mg/L, with an annual average of 92.36 mg/L (Figure 4(d)). Concentrations in January, February, March, May, and June exceed the standards required by Algerian legislation for irrigation water (90 mg/L) (JORA 2012). The lowest COD values are observed during July, August, September, and December, where concentrations fall below the authorized limit for discharge into the natural environment. The reduction in COD concentration is primarily caused by the physical retention of organic matter in the discharge filters and, to some extent, its oxidation by microbial flora in the presence of oxygen.
Biochemical oxygen demand
BOD is a critical water quality parameter that measures the amount of DO consumed by microorganisms in breaking down organic matter in water (Arcand et al. 1989). Therefore, BOD5 is a crucial parameter used in wastewater treatment to assess the level of organic pollution in water. The concentrations of BOD5 at the outlet range from 78 to 22. 5 mg/L, averaging 42.6 mg/L, as shown in Figure 4(e). During August, November and December, the recorded values fell below 30 mg/L, attributed to the degradation of biodegradable organic matter by microorganisms (Kadouche et al. 2018). However, throughout the rest of the year, concentrations significantly exceed the standards set by Algerian legislation for irrigation water, which is 30 mg/L (JORA 2012).
Ammonia nitrogen (NH4+)
Nitrate ()
Nitrates are the end product of nitrogen oxidation and can be present in treated water. High levels of nitrates in surface waters can lead to eutrophication, while also disrupting the disinfection process for drinking water. Algerian regulations have set a limit of 30 mg/L for nitrate concentration in irrigation water to prevent groundwater pollution (JORA 2012). The nitrate values observed in this study range from 0.08 to 4.24 mg/L (Figure 5(g)). The nitrate concentration in the treated water is below the acceptable limits for irrigation.
Dissolved oxygen
DO in water is the amount of oxygen available for aquatic organisms. It is a crucial parameter for assessing water quality and determining its ability to support aquatic life. The recorded DO values in the effluent at the outlet of the basin are high, ranging between 1.67 and 5.21 mg/L, with an average of 3.20 mg/L (Figure 5(h)), indicating the efficiency of the biological treatment process in the WWTP and the satisfactory degradation of organic matter (Haidara et al. 2022). This value falls within the standard limit of 5 mg O2/L, as stated by WHO standards for direct discharge into the environment (WHO 1989).
Orthophosphates (PO4−3)
Phosphorus is a crucial nutrient for the growth of plants and microorganisms, and it can exist in various oxidized forms. Elevated levels of phosphorus in wastewater can contribute to eutrophication. The results for orthophosphates in the treated wastewater in this study vary between 2.13 and 3.5 mg/L, with an average of 2.64 mg/L of phosphate (Figure 5(i)), which complies with the Algerian standard of 2 mg/L (JORA 2012). The reduction in phosphate levels could be attributed to the consumption of phosphate by bacteria during the treatment process (Rekrak et al. 2021).
Correlation matrix
The Pearson correlation matrix analyzes the relationships between different components, quantifying how well the variance of each component is explained by their correlations with one another (Singh et al. 2004). A correlation coefficient close to 1 indicates a strong positive relationship, meaning that as one variable increases, the other also increases. Conversely, a coefficient close to −1 signifies a strong negative relationship, where an increase in one variable corresponds to a decrease in the other. Values near 0 indicate little to no linear relationship between the variables, suggesting that changes in one variable do not predict changes in the other.
Table 2 shows the Pearson correlation matrix between the parameters. The analysis reveals that temperature has a positive correlation with DO, and a negative correlation with TSS, BOD5, and COD. Moreover, the positive correlation between temperature and parameters indicates their increase during the warm period (Bahlaoui et al. 1998). Additionally, the positive correlation between DO suggests an increase in its levels during the warm period. Conversely, during cold periods, an increase in TSS, BOD5, and COD concentrations is suggested by the negative correlation observed with temperature. Furthermore, BOD5 exhibits a high correlation with COD and a moderate correlation with TSS.
. | V . | TSS . | BOD5 . | COD . | . | . | Tw . | OD . | Tair . | Wind . |
---|---|---|---|---|---|---|---|---|---|---|
V | 1.00 | |||||||||
TSS | 0.19 | 1.00 | ||||||||
BOD5 | 0.10 | 0.66 | 1.00 | |||||||
COD | 0.02 | 0.54 | 0.73 | 1.00 | ||||||
0.33 | −0.07 | 0.18 | 0.29 | 1.00 | ||||||
0.33 | 0.07 | 0.13 | 0.38 | 0.01 | 1.00 | |||||
Tw | 0.71 | −0.08 | −0.26 | −0.23 | 0.23 | 0.16 | 1.00 | |||
OD | 0.59 | −0.08 | −0.30 | −0.46 | −0.04 | 0.20 | 0.82 | 1.00 | ||
Tair | 0.67 | −0.14 | −0.24 | −0.49 | 0.06 | 0.12 | 0.77 | 0.95 | 1.00 | |
Wind | 0.59 | −0.07 | −0.02 | −0.14 | 0.57 | 0.18 | 0.55 | 0.43 | 0.53 | 1.00 |
. | V . | TSS . | BOD5 . | COD . | . | . | Tw . | OD . | Tair . | Wind . |
---|---|---|---|---|---|---|---|---|---|---|
V | 1.00 | |||||||||
TSS | 0.19 | 1.00 | ||||||||
BOD5 | 0.10 | 0.66 | 1.00 | |||||||
COD | 0.02 | 0.54 | 0.73 | 1.00 | ||||||
0.33 | −0.07 | 0.18 | 0.29 | 1.00 | ||||||
0.33 | 0.07 | 0.13 | 0.38 | 0.01 | 1.00 | |||||
Tw | 0.71 | −0.08 | −0.26 | −0.23 | 0.23 | 0.16 | 1.00 | |||
OD | 0.59 | −0.08 | −0.30 | −0.46 | −0.04 | 0.20 | 0.82 | 1.00 | ||
Tair | 0.67 | −0.14 | −0.24 | −0.49 | 0.06 | 0.12 | 0.77 | 0.95 | 1.00 | |
Wind | 0.59 | −0.07 | −0.02 | −0.14 | 0.57 | 0.18 | 0.55 | 0.43 | 0.53 | 1.00 |
Application of PCA
In order to identify the main factors influencing the treated wastewater, a PCA was applied to the analyzed physicochemical parameters data. The analysis extracted four principal factors following Kaiser's rule based on eigenvalues greater than one (Hurley et al. 2012). However, the analysis revealed that the first three components (PC1, PC2 and PC3) effectively explain the dataset's characteristics, accounting for 78% of the total variance. The remaining components do not follow the general trend of actions between variables.
Table 3 displays the correlations between variables and principal factors, component loading factors, cumulative percentages, and percentages of variance for each principal component.
Parameters . | Principal component . | ||
---|---|---|---|
F1 . | F2 . | F3 . | |
V | 0.788 | 0.281 | 0.330 |
TSS | 0.026 | 0.821 | −0.175 |
BOD5 | −0.196 | 0.850 | 0.136 |
COD | −0.334 | 0.840 | 0.228 |
0.015 | 0.083 | 0.954 | |
0.320 | 0.455 | 0.003 | |
OD | 0.869 | −0.087 | 0.222 |
Tw | 0.950 | −0.159 | −0.091 |
Tair | 0.933 | −0.191 | 0.040 |
Wind | 0.545 | −0.040 | 0.691 |
Eingen value | 4.01 | 2.53 | 1.29 |
% of variance | 40.1 | 25.3 | 12.9 |
Cumulative % of variance | 40.1 | 65.4 | 78.2 |
Parameters . | Principal component . | ||
---|---|---|---|
F1 . | F2 . | F3 . | |
V | 0.788 | 0.281 | 0.330 |
TSS | 0.026 | 0.821 | −0.175 |
BOD5 | −0.196 | 0.850 | 0.136 |
COD | −0.334 | 0.840 | 0.228 |
0.015 | 0.083 | 0.954 | |
0.320 | 0.455 | 0.003 | |
OD | 0.869 | −0.087 | 0.222 |
Tw | 0.950 | −0.159 | −0.091 |
Tair | 0.933 | −0.191 | 0.040 |
Wind | 0.545 | −0.040 | 0.691 |
Eingen value | 4.01 | 2.53 | 1.29 |
% of variance | 40.1 | 25.3 | 12.9 |
Cumulative % of variance | 40.1 | 65.4 | 78.2 |
The first axis, which alone explains 40.10% of the total inertia, is strongly constructed by physical variables (DO, Tw and V) and climatic conditions (Tair and Wind). Their relative contribution to this axis is 87, 95, 79, 93, and 54%, respectively (Table 3 and Figure 6). The first principal component (PC1) appears to indicate effluent associated with warm months and rich in DO.
The second axis accounts for 25.30% of the initial data variance, as depicted in Table 3. This axis is strongly correlated with TSS, COD, and BOD5, parameters associated with pollution (Table 3 and Figure 6). It represents variations in parameters linked to organic load. The second principal component (PC2) appears to indicate effluent rich in organic matter, suggesting the degree of organic pollution in treated wastewater.
The third axis represents 12.9% of the total variance and is positively correlated with levels of (Table 3 and Figure 6). The presence of ammonia on this axis indicates a malfunction in the nitrification activity of a wastewater treatment system, which involves the oxidation of ammonia into nitrite. PC2 expresses an effluent rich in ammonia, indicating nitrogen pollution in treated wastewater.
At the onset of the cycle (Figure 7), a closed loop pattern is observed, which may be interpreted as follows: In January, February and March, the treated waters exhibit considerably higher levels of pollution compared to those in April. The treatment process appears to be functioning effectively in April, as the situation improves compared to the first three months of the year (effluents are less polluted in April). From April onwards, there is indeed a rapid return to a situation similar to that of January where the wastewater is more polluted.
The cyclical nature of the relationship between the first component (OD, air temperature, water temperature, wind and volume) and the second component (TSS, COD, and BOD5) is summarized into three periods (Figure 7):
The first period (overload) is distinguished by a substantial increase in concentrations of TSS, BOD5, and COD, alongside a low DO concentration. This period of the cycle potentially signifies a stage of dysfunction or overload within the treatment system. During this period, activated sludge wastewater treatment systems may encounter challenges in efficiently degrading organic matter present in the wastewater. Various factors contribute to this, including significant organic overload occurring when the amount of organic matter entering the treatment system surpasses its processing capacity. This leads to an accumulation of organic matter within the system, hindering microbial degradation despite the efforts of microorganisms within the treatment system. Unfavorable environmental conditions, such as extremely low temperatures, equipment or process malfunctions, in addition to reduced microbial activity levels due to these conditions, can exacerbate the situation. Furthermore, colder temperatures during these periods can decelerate biological processes in the wastewater treatment system, resulting in decreased efficiency in organic matter degradation. Consequently, organic load in wastewater may accumulate further, while the availability of oxygen for biological processes may diminish due to reduced oxygen solubility in water at lower temperatures. This period may also be associated with decreased activity of microorganisms responsible for decomposing organic matter, leading to an increased accumulation of organic compounds in treated wastewater.
The second period (degradation) is characterized by a decrease in the concentration of organic load and a significant increase in DO. This period of the cycle may correspond to a period where the treatment system operates optimally. It can be termed as a phase of stable performance or efficient operation of the activated sludge wastewater treatment process.
During this period, the operational conditions of the treatment system are optimal, enabling efficient degradation of organic matter by the microorganisms present in the activated sludge. The decrease in TSS, COD, and BOD5 concentrations indicates an enhanced treatment efficiency, as organic materials are decomposed more thoroughly. The increase in DO concentration in the treated wastewater may result from effective aeration in the treatment system. Adequate aeration supplies sufficient oxygen to the microorganisms, enabling them to degrade organic matter effectively, consequently leading to an increase in DO concentration in the treated water.
The third period (settling) is characterized by a simultaneous decrease in the concentration of organic load and DO. This phase may correspond to a period of sludge settling in the wastewater treatment process. During this period, activated sludge, containing degraded organic matter and other suspended particles, settles to the bottom of the clarifier basin. This results in a reduction in the concentration of TSS, COD, and BOD5 in the clarified water above the sludge. The decrease in DO concentration can also be observed during this period, as some of the DO is utilized by the microorganisms present in the activated sludge to degrade organic matter. Additionally, settling sludge may consume oxygen during the aerobic decomposition process of organic matter.
Hierarchical Ascendant Classification
The first class, encompassing the months of July, August and September, is characterized by low concentrations of TSS, DOC and BOD5, as well as high levels of DO, indicating robust oxygenation activity. These observations suggest satisfactory performance of the wastewater treatment process during these months, with operators effectively maintaining low levels of SS and dissolved organic matter while promoting adequate oxygenation of the water. This interpretation also suggests that environmental conditions during this period could be conducive to better water quality and optimal wastewater treatment operation.
The second class, comprising April, October, November and December, exhibits significantly lower concentrations of TSS, COD and DOB5, along with a reduced level of DO compared to the first class. This suggests a better performance of the treatment process during these months, with operators effectively reducing the levels of SS, COD, and BOD5. The lower levels of DO may indicate efficient oxygen utilization by microorganisms during the treatment process.
The third statistical class, consisting of the months of January, February, March, May and June, is characterized by higher concentrations of BOD5, COD, and TSS, indicating a higher level of organic pollution. The elevated concentrations of these parameters suggest that the wastewater from these months likely contains a significant amount of degradable organic matter. This may necessitate specific attention and treatment procedures to effectively reduce this organic load and comply with environmental standards for wastewater quality.
Optimizing seasonal management of activated sludge wastewater treatment
To optimize the wastewater management plan for the station, several key strategies should be implemented. First, ensure regular monitoring of system performance by tracking essential parameters such as turbidity, DO concentration and organic matter levels.
Second, employ statistical methods to identify periods of varying pollution intensity, focusing on concentrations of organic matter indicators like COD, BOD5, and TSS. This allows for operational adjustments based on seasonal fluctuations.
Finally, manage seasonal dynamics by recognizing and adapting to variations in the performance of the activated sludge treatment system. This includes accounting for changes in water temperature, microbiological biomass, and oxygen demand, thereby optimizing performance throughout the year.
Period-specific management strategies
The optimized wastewater management plan for the station should incorporate strategies tailored to three distinct operational periods.
1. The environmental stress period: during this period, closely monitor system performance and adjust operations to minimize the effects of organic overload. This may include optimizing sludge return ratios, increasing aeration, or adding coagulants to aid in the flocculation of suspended matter.
2. The optimal operation period: take advantage of this period to maximize treatment efficiency, ensuring ideal conditions for microbial growth and organic matter degradation. Optimize sludge control strategies to maintain high efficiency and consistent purified water quality.
3. The sludge settling period: during this period, focus on the effective management of excess sludge produced by the activated sludge treatment process. Optimize settling operations to maximize recovery of activated sludge and minimize losses of microbiological biomass.
CONCLUSION
The aim of this study was to develop a practical method for optimizing wastewater treatment and management processes. The data were efficiently processed using multivariate statistical approaches such as PCA and HAC. Our results reveal similarities among certain months throughout the year in terms of levels of organic pollution. Furthermore, the combined PCA and HAC analysis identified three distinct periods within the wastewater treatment process, each characterized by specific operational conditions based on the level of organic pollution.
The first period (July, August and September) is characterized by decreased organic load and increased DO which reflects optimal system performance and efficient operation. This period represents the optimal phase of system operation. The second period (April, October, November, and December) exhibits simultaneous reductions in organic load and DO, corresponding to the sludge settling phase, where treated water is clarified. The third period (January, February, March, May, and June) shows high levels of TSS, BOD5, and COD, along with low DO, indicating a potential overload or dysfunction in the system and representing an environmental stress period. Understanding these periods is crucial for implementing effective management strategies to optimize treatment performance and address environmental and operational challenges.
Overall, the study successfully identifies key cyclical periods within the wastewater treatment process, highlighting their operational characteristics and implications for treatment efficiency and management.
ACKNOWLEDGEMENTS
The authors express their gratitude to the National Sanitation Office for making the data available. The authors thank Dr Ismahene Mesbah from Aix-Marseille University, France, for her assistance in reviewing the grammar and syntax of this manuscript.
DATA AVAILABILITY STATEMENT
All relevant data are included in the paper or its Supplementary Information.
CONFLICT OF INTEREST
The authors declare there is no conflict.