Navi Mumbai Municipal Corporation of Maharashtra state, India, unified a tertiary treatment plant (TTP) of 20 × 103 m3/day capacity with ultrafiltration technology in an existing Koparkhairane sewage treatment plant (STP) for producing effluent quality usable for industrial purposes. As prior art, an artificial neural network-genetic algorithm (ANN-GA) along with uncertainty estimation using prediction interval is employed to model secondary treated effluent (STE) flow rate (QT) and other quality parameters such as biochemical oxygen demand, chemical oxygen demand, and total suspended solids (TSS) to conclude the reliability of the range in which the input available to TTP. ANN-GA model provides a coefficient of determination above 0.90 for all STE parameters modelled other than TSS. Inferring that a good quantity and quality of 20 × 103 m3/day STP treated water is currently available, where a decreasing trend of QT is also noticed and highlighted. Further, the Wilcoxon signed-rank test on the quality parameter of effluent TTP for industrial reuse standard infers TSS shows infringement during the initial period but started adhering to standards over time. The research delineates at the outset of exploring water reuse policy in India, emphasizing Maharashtra state, modelling STE using ANN-GA and performance evaluation of TTP.

  • Secondary treated effluent modelling using the artificial intelligence algorithm.

  • Artificial neural network integrated with the genetic algorithm is used in this study.

  • Uncertainty estimation using the prediction interval technique.

  • Ultrafiltration technology is checked for treatment compliance.

  • The Wilcoxon signed-rank test is used for compliance analysis.

The United Nations (UN) – General Assembly formulated 17 goals known as Sustainable Development Goals (SDGs), where 193 UN member countries pledged to implement these SDGs to achieve prosperity and peace for the people and world before 2030. The 17 goals of the SDG framework cover diverse domains, including the economy, society, and environment. Although the goals have varying purposes and different strategies for implementation, it is believed that goals mutually complement one another. For instance, Obaideen et al. (2022) conducted a comprehensive analysis of all SDG objectives, revealing that better and more efficient wastewater treatment (SDG 6: Clean Water and Sanitation) could potentially pave the way to achieve 11 out of 17 SDGs; this significant impact is due to the augmented availability and access to water.

When it comes to water availability, different countries possess varied accessibility. Let's consider India as an illustration, having a populace of 18% of the global population, comprising just 4% of the world's freshwater resources, and their availability is scattered across the region. Another cause of concern is that out of 345 river stretches in India, 45 rivers are severely polluted by domestic and industrial wastewater. As a result, India is ranked as the 13th most water-stressed country globally. Concerning water resources management, the Indian constitution declares that water and sanitation fall within states' subjects, which means intra-state water and wastewater management is the responsibility of the concerned states. With such challenges, like massive population, scarce water availability, and excessive water pollution, it is highly challenging for government utilities in developing countries like India to effectively meet the demand for various needs and safeguard water resources.

Status and schemes for sewage treatment plants in India

Effectively managing water resources involves not just augmenting water but also reducing water pollution that arises due to the emission of wastewater into freshwater resources. According to the recent report by CPCB (2021), 72,368 × 103 m3/day of domestic sewage is generated in urban centres of India, where the available treatment capacity of sewage treatment plants (STPs) for treating up to secondary treatment level is 31,841 × 103 m3/day. There seems to be a significant gap between available treatment capacity and functional capacity (sewage collected and treated), which is about 63%. The matter of concern is whether the mentioned operating capacity can adhere to produce the effluent quality to the prescribed stringent effluent discharge limit. The prescribed effluent standards in India are set and revised by the respective State, Central Pollution Control Board, and Honourable National Green Tribunal (NGT); revision of standards in India over time is well-reported by Schellenberg et al. (2020).

Despite the multitude of challenges, the Government of India introduced several schemes over time to improve water and sanitation infrastructure, such as Jawaharlal Nehru National Urban Renewal Mission (JNNURM) (launched in December 2005), Atul Mission for Rejuvenation and Urban Transformation (AMRUT) (launched in June 2015), and the recently launched AMRUT 2.0 (October 2021). Primarily, AMRUT 2.0 targets revitalizing water bodies, expanding clean water access from 500 cities to 4,800 statutory towns (earlier AMRUT's mission was to provide water supply for 500 cities). In addition, managing sewerage and septage (collection and treatment) in 500 cities and, more importantly, prioritizing the safe reuse of treated wastewater (SRTW). It is essential to remember that SDG 6.3 and AMRUT 2.0 apprise not only wastewater treatment but also a significant increase in recycling and SRTW to promote a circular economy (Zhang et al. 2016).

Water reuse policy interventions in India

In order to achieve a circular economy, STPs act as a resource recovery factory to extract resources from sewage, such as energy, nutrients, and reusable water (Ali et al. 2022). A report by the Ministry of Jal Shakti (MoJS), Government of India, states that treated wastewater as ‘Apna Jal’ means ‘our water’ to detach the perception of treated sewage as a liability instead as a resource. The Government of India formulated several acts and policies pertaining to water reuse, but it all started with the Water Prevention and Control of Pollution Act of 1974 to restore aquatic resources by not discharging sewage or pollutants. In 1986, the Ganga Action Plan and the Environment Protection Act were passed to protect river Ganga, and the latter is an act relating to protecting and improving the environment of human beings along with plants and other living creatures. Concerning water reuse, emphasis has been given in the National Environment Policy of 2006, which spotlights the need to prepare action plans by addressing water pollution and regulatory frameworks for major cities in India. The National Urban Sanitation Policy drafted in 2008 emphasizes the safe disposal and handling of human waste and endorses recycling and reuse.

The National Water Policy 2012 (revised draft – 2020) necessitates recycling and reusing wastewater. The growing water demand and limited freshwater availability obliged the formulation of a comprehensive nation-level framework for SRTW; thus, the MoJS collaborated with the India-European Union (EU) Water partnership and GIZ, a German development agency, to craft a strategic policy (MoJS 2022). At this point, it is essential to mention that the developed framework is intended for non-potable reuse, such as aquifer recharge, on-site usage within treatment plants (landscaping), municipal uses (parks, fire fighting, toilet flushing), agriculture, aquaculture, and most importantly, industrial use.

Water reuse in the context of Maharashtra state

Agriculture requires significant water consumption, where the national demand is almost 70%, but providing treated wastewater for industries with higher effluent quality is even more challenging. India's water demand for various industrial purposes varies between 8 and 10%, whereas state-level demands deduce that Maharashtra state is the second largest consumer, with an annual requirement of 6.7 billion cubic metres (Joseph et al. 2019). Along with the higher water demands for industries, CPCB (2021) reports that Maharashtra stands first in generating a significant volume of municipal sewage of 9,107 × 103 m3/day with a proposed and operational capacity of STP as 9,295 × 103 m3/day, creating an excellent opportunity for SRTW in the industrial sector. This endeavour aims to mitigate the depletion of freshwater resources and eliminate the harmful effects caused by wastewater on the natural environment. In this context, Maharashtra has several noteworthy initiatives in SRTW; for example, in 2008, Maharashtra Generation Company Limited (MahaGenCo) sought to expand its power capacity, requiring additional water supply. Instead of relying on freshwater sources, MahaGenCo approached Nagpur Municipal Corporation to provide raw wastewater, where the transportation and treatment are under the scope of MahaGenCO, where the treated wastewater is used for power generation. The project was mutually beneficial for both parties involved with significant spillover benefits; interested readers can avail more information regarding this case study from the World Bank report (World Bank 2019).

Another important step taken by the Maharashtra government to improve water reuse was a policy devised on 1 November 2017 to use only recycled wastewater for industrial purposes, with no more freshwater utilization in industries; a deadline of 3 years was given to 71 municipal bodies to devise a plan and start using treated wastewater. In this context, municipalities investigated neighbouring industrial areas’ water requirements along with specific types of industries present (known as demand profiling) and the operational efficiency of STPs to devise a plan for SRTW for the non-potable purpose.

In this context, Navi Mumbai Municipal Corporation (NMMC), one of the progressive and exemplary urban local bodies of Maharashtra, proposed a compelling plan to integrate the tertiary treatment plant (TTP) into the existing couple of STPs to meet the industrial water demand of Maharashtra Industrial Development Corporation (MIDC) areas. The effectiveness of TTP is dependent on the STP's capability to treat sewage up to the required input quality levels for TTP. To illustrate the importance, a report by NGT on compliance of treatment facilities in India states that out of 15,403 STPs (non-municipal and municipal), 608 are not following the adhered-to prescribed standards, and legal cases are filed on 15 STPs (NGT 2019). Within this frame of reference, there is an obligatory requirement to model the performance of existing STPs and to check the reliability of input parameters to TTP, such as flow rate (QT), biochemical oxygen demand (BOD), chemical oxygen demand (COD), and total suspended solids (TSS).

Modelling water quantity and quality of STPs

Over the years, research studies have reported that modelling an STP is an intricate task because the time series dynamics of quantity and quality parameters of wastewater comprise seasonality, trend, and periodic components, creating high non-linearity and non-stationarity in the data. Modelling QT of STP is complex because of the stochastic variation in time series due to some of the uncertainty involved, such as the non-linear rainfall-runoff relationship of the combined sewer system, wearing infrastructure, groundwater infiltration, and many more. Modelling QT would be beneficial to understanding the quantum of secondary treated effluent (STE) available diurnally to TTP (to fix the capacity of TTP), planning operational schedules, and optimizing pump plans. We reviewed the various academic literature on modelling QT of STP, where different researchers tried varied artificial intelligence (AI) and time series modelling techniques for forecasting daily QT data: Fernandez et al. (2009) utilized the neuro-fuzzy technique using two input features, Boyd et al. (2019) applied autoregressive integrated moving average (ARIMA), Zhu & Anderson (2019) developed and employed iterated stepwise multi-linear regression, and Heo et al. (2021) integrated multiple deep learning models to predict both long- and short-term forecast using ensembled deep learning architecture.

Some researchers worked with low temporal resolution hourly QT data using non-linear autoregressive exogenous input neural networks (Ghazouli et al. 2021) and seasonal ARIMA (Do et al. 2022). Wang et al. (2022) critically reviewed the modelling of non-linear data of STPs and acclaimed that an artificial neural network (ANN) would be a propitious option for modelling. ANN is a forerunner while modelling municipal STPs (Hamed et al. 2004; Wadkar et al. 2021); many such studies investigated and compared different modelling techniques with ANN and inferred the resilience of the approach (Asami et al. 2021; Hejabi et al. 2021). However, it is crucial to remember that ANN can mimic closer to the actual values of the interested parameter of STP if sufficient training data establishes a more robust model architecture.

Establishing a robust model architecture would be a challenging problem because of the number of hyperparameters involved in ANN modelling; detecting optimal hyperparameters would be an arduous and time-consuming task because of the vast search space. Different ways to figure out optimal hyperparameters include grid search, random search, and applying heuristic algorithms. Most of the research works attempted grid search (an exhaustive search technique through a predefined combination of parameters), which requires ample time and high computational requirements. Some studies used heuristic and evolutionary algorithms such as differential evolution, gene expression programming, and genetic algorithm (GA), to mention a few. For instance, Liu & Chen (2023) applied evolutionary and greedy search optimization algorithms, such as GA and sequential feature selection methods, to model BOD and found that support vector regression integrated with GA provided better performance than other modelling techniques. In wastewater optimization modelling, GA performed better, along with different AI algorithms reported in several studies (Bagheri et al. 2015; Mu et al. 2016; Picos & Peralta-Hernández 2018); to the best of our knowledge, there has been no research work involving ANN-GA for modelling QT. In addition, most studies try optimization to mimic the real-world system perfectly by improving the performance metrics and not including uncertainty in the modelling (Hejabi et al. 2021). This research employed the prediction interval (PI) method to capture the uncertainty associated with the outcome of ANN-GA modelling. By incorporating this modelling approach, we can assess the reliability and resilience of the TTP input parameters.

Research objectives

This research focuses on two primary objectives; as prior art, an ANN-GA modelling technique is leveraged to model the volume of the STE from existing STP to conclude the reliability of the input flow available to TTP along with its quality parameters such as BOD, COD, and TSS. This objective intends that TTP relies upon a range in which the influent parameters must be available; a higher magnitude of parameters from STE (or failure of existing STP to produce intended quality) may lead to the failure of TTP. This soft sensor model will inform a decision on whether to proceed with the treatment of STE using TTP or divert STE to creeks as a preventive measure against the potential failure of TTP. Another reason for this objective is that Maharashtra's policy for industrial reuse strictly mentions using treated wastewater, so it is necessary to check the uninterrupted availability and the intended quality of STE. The second objective is to assess the performance of TTP by examining the tertiary treated wastewater (TTW) and statistically inferring the adherence to standards of NMMC for industrial reuse of effluent parameters: BOD, COD, and TSS, from the inception of TTP operation from September 2022.

This research article is outlined as follows in the subsequent section. The Study area section briefs about the existing Koparkhairane STP and the plan of action by NMMC on the construction and commissioning of TTP. The Materials and methods section provides a systematic way of achieving the research objective of modelling and performance evaluation. The Results section delineates modelling for forecasting and performance evaluation of TTP. A section on Discussion has been included to provide more insights about results with some additional analysis. Finally, the conclusion with the future scope and practical implications of the study are provided in the Conclusion section.

NMMC, located in Maharashtra state, is one of the largest planned cities globally and has a successful streak of being one of the cleanest mega cities in India; additionally, it holds a five-star rating for its maintenance of a city free of waste (NMMC 2021). The NMMC region is situated on the eastern shores of Thane Creek, in the Thane district, comprising seven fully developed nodes spanning from Airoli in the north to CBD-Belapur in the south with an area of 108.6 km2. Out of this area, 60% of the region is dedicated to urban zones, including residential, commercial, and industrial, which includes transport infrastructure such as rails, roads, and many more. Concerning municipal sewage treatment infrastructure, seven STPs are placed strategically in seven nodes comprising different capacities with sequential batch reactor (SBR) technology. NMMC STPs, better performance and spillover effect provided an excellent opportunity for advancing towards SRTW (Jothiprakash et al. 2020).

In this context, NMMC conducted demand profiling within the corporation and determined that the significant demand for this resource would be in the MIDC industrial area. MIDC is situated on the eastern side of Thane-Belapur road; the industrial nodes of MIDC include Vashi, Turbhe, Ghansoli, Airoli, and Koparkhairane, requiring almost 40 × 103 m3/day of recycled water. In response, NMMC decided to unify ultrafiltration (UF) technology with ultraviolet treatment to existing STPs. The existing STPs, namely Koparkhairane and Airoli, have been selected to unify the TTP of 20 × 103 m3/day each due to their proximity to the MIDC area.

The Koparkhairane STP, the second largest STP within the corporation, comprising 87.5 × 103 m3/day in capacity, was chosen as the research focal point. A map showing all NMMC STP locations, along with the aerial view of Koparkhairane STP, is depicted in Figure 1. Koparkhairane STP receives an average municipal sewage inflow of 40 × 103 m3/day, equivalent to wastewater generation from 370,000 individuals. It is essential to mention that the residential sewage generated in the Koparkhairane sector enters this STP, and treatment occurs before discharging it into nearby creeks. The SBR technology utilized in this plant is designed to bring the effluent quality to NMMC discharge guidelines for discharging into creeks: BOD ≤ 5 mg/L, COD ≤ 50 mg/L, TSS ≤ 10 mg/L, and dissolved oxygen ≥ 2 mg/L. One of our previous studies investigating the statistics and compliance of NMMC's STP using its past influent and effluent data with multiple performance indicators and hypothesis testing inferred that Koparkhairane STP is capable of handling all effluent parameters within the acceptable discharge limit (Ramkumar et al. 2022). Interested readers can avail themselves of more information regarding the non-parametric investigation for performance analysis of STP from the work mentioned above. Though wastewater quality parameters are well within the acceptable limit, the statistical analysis shows an average QT of 47.7 × 103 m3/day with a highly right-skewed distribution, inferring a massive inflow during the rainy season (Ramkumar et al. 2022). Another cause of concern is that the sewerage system constructed during the 1990s has almost reached its service life period. So, it is necessary to check the QT available in STP because the uncertainty involved, as already discussed in the Introduction, motivated this research. TTPs comprising different units are discussed in detail in the following section on the performance of TTP. Koparkhairane TTP has been functioning with limited flow from mid-September 2022.
Figure 1

Location of NMMC STPs and an aerial view of Koparkhairane STP. (Source: for aerial view photograph - Google Earth).

Figure 1

Location of NMMC STPs and an aerial view of Koparkhairane STP. (Source: for aerial view photograph - Google Earth).

Close modal

Since the study involves both the modelling of STE and performance evaluation of TTP, it requires enormous data to conclude the effectiveness of both treatment modules. Data have been collected to perform ANN-GA modelling from Koparkhairane STP for STE parameters such as QT, BOD, and COD having diurnal data of 3 years in length (January 2020 to December 2022), where TSS has been modelled using available minimal data (January 2020 to November 2020). For the performance evaluation of TTP, we performed the Wilcoxon signed-rank test on a data length of almost 2 months from the inception of the functioning of TTP (September 2022 to October 2022). A step-by-step methodology to attain the desired objective is outlined in the subsequent section.

Given that the research encompasses two primary objectives, the methodology to accomplish the intended objective is elucidated in two parts. Thus, the Materials and Methods section is further divided and explained step by step as modelling STE of Koparkhairane STP and performance of Koparkhairane TTP.

Modelling STE of Koparkhairane STP

The process for modelling and forecasting STE parameters using ANN-GA is depicted in Figure 2; the steps involved in modelling are as follows: (1) data pre-processing, (2) model building using ANN-GA, (3) model evaluation using performance metrics, and finally (4) uncertainty evaluation using the PI method.
Figure 2

Methodology for ANN-GA modelling with uncertainty estimation.

Figure 2

Methodology for ANN-GA modelling with uncertainty estimation.

Close modal

Data pre-processing

In the pre-processing phase, data are examined to identify missing data and outliers from the operational records of STP. We employed spline interpolation to impute the missing values in the time series dataset. Concerning the elimination of outliers, we calculated the interquartile range (IQR) of the time series and then excluded the values below (Q1–1.5 IQR) and above (Q3 + 1.5 IQR) (Alsulaili & Refaie 2021).

Another crucial pre-processing part includes data division for model building and data normalization. Data were divided into two non-overlapping subsets, training and test set; data allocation for training is 80%, and testing is 20%, followed by data normalization. The data were normalized by scaling from 0 to 1 by the following Equation (1) to avoid computational complexity.
(1)
where is the observed ith value, and are the minimum and maximum values of the time series, and is the scaled value of the actual observed value.

Artificial neural network

ANN is a mathematical modelling tool mimicking the human brain and nervous system for prediction and forecasting. This mathematical framework comprises the ability to portray any complex non-linear phenomena by capturing the relationship between the input and output of any system. In a simple context, there is no requirement for an explicit explanation about the physical elucidation of the system required for modelling. One of the popular types of ANN framework is the feed-forward neural network, which means the data transmission happens in the forward direction.

To elucidate further about the architecture of ANN, it constitutes numerous basic units, namely neurons, which function simultaneously (parallel). The most widely applied ANN architecture is multi-layer perceptron (MLP) comprising layers such as input, hidden, and output, where layers include connections between them (connections consist of weight and biases), enabling neurons to process information. The back-propagation algorithm adjusts the weights and biases to improve the model's performance. The MLP comprising N hidden layers and one single output takes the following mathematical form as provided in Equation (2):
(2)
where Y is the single output, is the weight vector, is the input feature (i = 1, 2, … N), b is bias, and f is the transfer function.

The MLP presented in this research comprises a deep feed-forward and back-propagation network architecture encompassing five distinct layers: one input layer, three hidden layers, and one output layer. Two dropout layers have been added, (i) between the first and second hidden layers, and (ii) from the second to the third hidden layer. The compelling reason behind applying the dropout layer is to control overfitting because of the deep architecture of ANN; a dropout regularization randomly disables some nodes during training, resulting in numerous different network configurations. Yaqub et al. (2020) suggest avoiding more dropout layers, which may result in loss of trend and accuracy in wastewater modelling; that is the reason behind restricting to two dropout layers. We tried different activation functions between all the network layers, such as Sigmoid, Hyperbolic tangent (Tanh), Rectified Linear Unit (ReLU), and Softmax with the optimizer as Adam algorithm. The learning rate parameter of the Adam optimizer is 0.0001; this is a critical hyperparameter governing the overriding of old data by new data, which is crucial in the modification of weight based on loss gradient. Thus, the parameters mentioned above for formulating the network architecture, such as input vector lag (past historical data), number of hidden neurons in each layer, dropout rate, and activation functions, are tuned using GA to get minimum loss function (mean squared error).

Genetic algorithm

The primary goal of optimization is to figure out the optimal solution out of all the available solutions. Among several optimization algorithms, GA is a metaheuristic algorithm that draws inspiration from Darwin's theory. GA employs the concept of survival of the fittest by direct search method mimicking natural selection and genetics, which effectively handled optimization problems in different scientific areas, such as reservoir operating policy (Jothiprakash & Shanthi 2006) and desalination efficiency of treatment plants (Sadi et al. 2019), to name a few.

GA follows a set of protocols, such as the creation of population, selection, crossover, and mutation. GA starts with generating a set of initial solutions within a feasible range using randomization called population; a solution retrieved from the current population is used for developing a new population. GA employs genetic operators and a selection process to create a new population. By employing a genetic operator, new solutions (offspring) can be made based on the current solution (parents). Whereas the selection process involves choosing some individuals from the population to create a new generation, the selection is based on the fitness function, which relies on a higher fitness score. Some ways to carry out the selection process include remainder, shift linear, stochastic uniform, roulette wheel, and tournament (Kılıç et al. 2014). Another crucial genetic operator is the crossover, which intends to facilitate the exchange of genetic information (genes) between individuals and is essential for the convergence of GA. In the crossover phase, the next generation is formed by merging two randomly chosen current-generation individuals by different methods like single point, double point, or uniform crossover. In essence, from the crossover, two individuals, known as parents, create offspring called sons; the son inherits traits of both parents. Furthermore, the parents are individuals selected from the original population, where the number of chosen parents for crossover depends on the crossover rate. Another crucial strategy in the GA algorithm is a mutation, meant to avoid local solutions and incorporate variability in the new population with some unprecedented examples.

Performance metrics

The performance of the developed ANN-GA model is evaluated using three different performance metrics or statistical indices based on the actual observation and predicted results. The deployed performance metrics are the coefficient of determination (R2), root mean squared error (RMSE), and RMSE observation standard deviation ratio (RSR) calculated using the following Equations (3)–(5), respectively:
(3)
where
(4)
(5)
where is the ith observed value, is the ith predicted or simulated value, is the mean of the observed values, and SDobs is the standard deviation of the observed values.

Uncertainty estimation: prediction interval

PI is prevalent while forecasting with a regression model, where the PI provides the range in which the actual outcome may fall (Meeker et al. 2017). We employed Gaussian PI, which involves the following steps; the retrieved optimal architecture from the GA is used to create PI. Several models are fitted in the training dataset with the same network architecture retrieved from GA; it is to be remembered that each model should make different predictions. Secondly, imputing non-identical random seeds generates random initial weights on the network architecture, making different predictions due to the stochastic gradient optimization algorithm. In this research, we assigned 20 random seeds to generate 20 models that produced varying predictions; 95% Gaussian intervals' upper and lower limit can be computed using the following Equations (6) and (7):

(6)
(7)
where is the mean of the 20 models' prediction, s is the standard deviation, and is the number of developed models.

Performance of Koparkhairane TTP

The performance assessment or compliance concerning its effluent standards at the TTW end (NMMC fixed effluent standards of TTP based on the requirements of industries) are checked for parameters such as BOD, COD, and TSS. To evaluate the performance of TTP, a non-parametric statistical approach known as the Wilcoxon signed-rank test was employed; detailed information regarding the various statistical tests to infer the compliance of STPs is provided by Ramkumar et al. (2022). The test procedure to conduct the Wilcoxon signed-rank test is as follows:

  • (a)

    The TTP effluent parameter of a single sample of n values is denoted as . For each for i = 1, 2,…, n, the signed differences were found by , where is the test value or, in this case, the prescribed effluent standard for the parameter.

  • (b)

    The next step is to compute the absolute difference, and ranking with .

  • (c)

    The number of non-zero values is found.

  • (d)

    Each ranked item is affixed with its corresponding sign of , mathematically, it can be written as, .

  • (e)

    The summation of positive-signed ranks was computed using , which can be rewritten as .

  • (f)
    With the above set of procedures, one can compute the W and (number of non-zero s), which was further utilized to calculate the normal test statistic Z using the following Equation (8):
    (8)
  • (g)

    Based on the Z, the value of p can be computed to conduct a significance test on the prescribed discharge standard of TTP against the alternate hypothesis; this research utilized a significance level () of 0.05.

Modelling of STE parameters using ANN-GA

The reliability of achieving expected quality in the TTW relies on the intended quality of STE to receive at the influent of TTP. For instance, UF technology at Koparkhairane STP requires influent quality to be less than 10 mg/L for BOD, 50 mg/L for COD, and 20 mg/L for TSS. In this context, the retrospective data from STE, such as QT, BOD, COD, and TSS, are modelled for forecasting.

Data pre-processing is carried out by imputing the missing value, and outliers are ascertained and removed by computing the quartiles of univariate data and imputing them with the spline interpolated data. The raw STE data comprised missing datapoint entries of 37 (QT), 4 (BOD), and nil for COD and TSS; concerning the number of outliers, 5 in the case of QT, 18 for BOD, 19 for COD, and 6 for TSS. The corresponding pre-processed data statistics are provided in Table 1. Other than TSS, the rest of the parameters are trained and tested with 3 years of data. One conspicuous value in Table 1 is the skewness (Sk) value of BOD, which is highly right-skewed with a value of 0.82; it needs to be explored in detail to understand the effect in modelling. The rest of the parameters range between – 0.1 and 0.34 (inferring nearly normal distribution). Further data is normalized before carrying out modelling.

Table 1

Statistics of pre-processed data of STE parameters

STE parameterNumber of data pointsMean ± SDSkMinimumMaximum
QT (×103 m3/day) 1,096 30.58 ± 7.52 0.34 7.32 57.32 
BOD (mg/L) 1,096 3.44 ± 0.80 0.82 2.5 5.50 
COD (mg/L) 1,096 30.95 ± 6.62 −0.10 15.5 46.5 
TSS (mg/L) 309 7.15 ± 0.71 −0.01 5.0 9.20 
STE parameterNumber of data pointsMean ± SDSkMinimumMaximum
QT (×103 m3/day) 1,096 30.58 ± 7.52 0.34 7.32 57.32 
BOD (mg/L) 1,096 3.44 ± 0.80 0.82 2.5 5.50 
COD (mg/L) 1,096 30.95 ± 6.62 −0.10 15.5 46.5 
TSS (mg/L) 309 7.15 ± 0.71 −0.01 5.0 9.20 

One of the necessary steps for capturing the temporal pattern of STE using ANN-GA is to select the range in which ANN and GA hyperparameters must be varied; this involves some random search. The range of selection for regulating the hyperparameters of ANN and corresponding parameters in GA for the search process for fine-tuning implemented in this research work is provided in Supplementary Table S1. This study used previous time lags (number of past observations) as input for forecasting the next time step; different researchers tried varying lags with the lags retrieved from the autocorrelation function or by the trial-and-error method. For a case in point, Surakhi et al. (2021) tried GA for tuning the input time lag for forecasting univariate time series, which provided better results; this research persuaded us to adjust the input vector using GA. Along with time lag, the rest of the hyperparameters, such as activation function, hidden neurons, and dropout rate, are tuned using GA.

The corresponding optimal network architecture retrieved using the GA algorithm is depicted in Table 2. The input vector's lag length varies between 2 and 10 for all four parameters; the maximum is recorded for BOD, whereas the minimum is recorded for COD. For each STE parameter, the optimal architecture of the ANN exhibits varying numbers of hidden neurons in each layer, as illustrated in Table 2. The dropout rate within the three hidden layers comprises a range of values between 0.13 and 0.77. Concerning activation function, other than softmax, all three functions are found to be in the ANN model architecture. With this optimal architecture, testing has been carried out; their performance is provided in Table 3.

Table 2

Optimal hyperparameters retrieved using GA for ANN modelling

STE parameterNetwork architecture
Input vector lagHidden layer
Dropout rate
Activation function
HL-1HL-2HL-3DR-1DR-2AF-1AF-2AF-3AF-4
QT 20 28 17 0.29 0.70 Tanh Tanh Sigmoid Sigmoid 
BOD 10 16 22 14 0.21 0.77 ReLu Tanh Tanh Sigmoid 
COD 24 26 0.40 0.17 Tanh ReLu Sigmoid Tanh 
TSS 24 25 0.13 0.13 Tanh Tanh Sigmoid ReLu 
STE parameterNetwork architecture
Input vector lagHidden layer
Dropout rate
Activation function
HL-1HL-2HL-3DR-1DR-2AF-1AF-2AF-3AF-4
QT 20 28 17 0.29 0.70 Tanh Tanh Sigmoid Sigmoid 
BOD 10 16 22 14 0.21 0.77 ReLu Tanh Tanh Sigmoid 
COD 24 26 0.40 0.17 Tanh ReLu Sigmoid Tanh 
TSS 24 25 0.13 0.13 Tanh Tanh Sigmoid ReLu 

Note: HL, hidden layer; DR, dropout rate; AF, activation function.

Table 3

ANN-GA modelling performance metrics of STE parameters

STE parameterTraining set
Testing set
R2RMSERSRR2RMSERSR
QT (×103 m3/day) 0.94 2.187 0.30 0.94 2.367 0.27 
BOD (mg/l) 0.97 0.133 0.23 0.88 0.484 0.43 
COD (mg/l) 0.96 1.268 0.20 0.93 1.708 0.29 
TSS (mg/l) 0.78 0.352 0.42 0.72 0.392 0.48 
STE parameterTraining set
Testing set
R2RMSERSRR2RMSERSR
QT (×103 m3/day) 0.94 2.187 0.30 0.94 2.367 0.27 
BOD (mg/l) 0.97 0.133 0.23 0.88 0.484 0.43 
COD (mg/l) 0.96 1.268 0.20 0.93 1.708 0.29 
TSS (mg/l) 0.78 0.352 0.42 0.72 0.392 0.48 

Note: The unit provided for each parameter is for RMSE.

The batch size and epochs, two crucial parameters of ANN, are kept as 32 and 1,000, respectively. The model architecture with optimal hyperparameters provided better results in almost all STE parameters. Concerning the training stage, parameters such as QT, BOD, and COD, which are trained using nearly 2 years in data length (877 data points), have an R2 above 0.90 in all cases, as provided in Table 3. Whereas the TSS, where the training was carried out with 244 data points, provided a lower R2 of 0.78. Meanwhile, the R2 in the testing set is almost comparable to the training set, ranging from 0.94 to 0.72, deducing there is no overfit occurred in training; the reason may be due to the data pre-processing, application of dropout rate parameter, and optimal architecture retrieved using GA.

As shown in Table 3, the RMSE value of the testing set is more than the training set. Especially BOD, the RMSE with a value of 0.484 mg/L, and the corresponding R2 is about 0.88 due to the extreme values in the testing set. For instance, the statistical analysis of the training and test set is provided in Supplementary Table S2. This table infers that the training BOD's mean and SD vary significantly from the testing set, which provided slightly inferior results compared to the training set.

The RSR is an error index statistic that reports the model's robustness. The optimal value of RSR ranges from zero to any positive value; a lower RSR or near zero means an ideal model or vice versa. The RSR of the training set differs between 0.2 and 0.42, where the maximum RSR is observed for TSS; it is essential to mention that if TSS has been trained with higher data points, a better performance with a lower RSR may be possible to achieve. One of the critical aspects to be noticed in the testing set RSR is 0.43 observed for BOD, whereas the training set has an RSR of only 0.23. This is because of the higher standard deviation (SD) of BOD in the testing set. TSS is observed with a higher RSR of 0.42 in the training and 0.48 in the testing set, respectively.

Concerning the uncertainty associated with the ANN-GA model, a time series plot showing an actual, forecasted, and uncertain range of QT is illustrated in Figure 3(a) (a snippet is shown in Figure 3(b) to show a magnified view of various associated observations). The better performance of modelling QT shows a very minimal range in which the upper and lower limits of uncertainty levels vary; on average, the range of uncertainty is around 1.18 × 103 m3/day between the upper and lower limits. The limited range in which the observations vary shows the robustness of the model. The better performance of the model can be visually verified by the scatter plot depicted in Figure 3(c) and 3(d), as the figures show the observation close to the 45-degree line. The rest of the STE parameters time series plot showing forecasted and actual observations are depicted in Supplementary Figures S1 (BOD), S2 (COD), and S3 (TSS).
Figure 3

Time series forecasting for STE QT with uncertainty estimation. (a) Time series showing actual, observed, and uncertainty range. (b) Snippet showing an enlarged view of time series with actual, observed, and uncertainty range. (c) Scatter plot showing actual vs. forecasted – training period. (d) Scatter plot showing actual vs. forecasted – testing period.

Figure 3

Time series forecasting for STE QT with uncertainty estimation. (a) Time series showing actual, observed, and uncertainty range. (b) Snippet showing an enlarged view of time series with actual, observed, and uncertainty range. (c) Scatter plot showing actual vs. forecasted – training period. (d) Scatter plot showing actual vs. forecasted – testing period.

Close modal

Another noteworthy concern is that the QT is decreasing in the trend at Koparkhairane STP over time, observed from statistical analysis. The average yearly QT at the influent of STP is 47.7 × 103 m3/day (2020), 35.67 × 103 m3/day (2021), and 32.81 × 103 m3/day (2022), respectively. The decrease in average QT may be due to a reduction in stormwater flow into STPs due to STP being designed for combined sewer overflow. We further analysed the data of non-monsoon flow into STP, which provided an average QT of 39.13 × 103 m3/day (2020), 35.18 × 103 m3/day (2021), and 32.8 × 103 m3/day (2022), inferring a substantial reduction in influent QT even during dry seasons. This may be due to the uncertainties, such as infrastructure ageing and loss during sewage transmission. This crucial factor needs to be considered, and further scrutinized analysis in this area is the future scope of work. Regarding the current situation, STP can provide 20 × 103 m3/day of better-quality water of STE for further processing into TTP, and the calibrated model can be used as a soft sensor for forecasting, which can be integrated into the influent end of TTP.

Performance of Koparkhairane TTP

In two ways, the TTW is distributed in NMMC; firstly, TTW is pumped to overhead tanks (OHTs) located in STP premises to convey the water through distribution networks by gravity in nearby STP's residential areas for non-potable purposes. The MIDC area is located at a higher altitude than the STP area, so from the TTW sump, through pumping mains, water is conveyed to OHT in the MIDC area for further distribution into the industrial area. A simple schematic of the OHT and distribution lines from Koparkhairane STP to the MIDC area is depicted in Figure 4(a).
Figure 4

Tertiary treatment plant of Koparkhairane STP. (a) Distribution network arrangement in STP. (b) Tertiary treatment units of UF technology.

Figure 4

Tertiary treatment plant of Koparkhairane STP. (a) Distribution network arrangement in STP. (b) Tertiary treatment units of UF technology.

Close modal

The chosen treatment technology for TTP involves using UF technology alongside a UV disinfection unit. The adoption of this particular technology is driven by the higher effluent quality standard mandated by industries located within the MIDC area (for example, TSS ≤ 1 mg/L and turbidity ≤ 0.5 NTU). The inclusion of UV treatment into the UF system stems from its remarkable ability to eliminate microbial contamination effectively. Given that a significant volume of TTW is supplied for non-potable purposes in residential areas of NMMC, the adoption of UV treatment was made obligatory.

Moreover, gaining a comprehensive understanding of the treatment modules of TTP is imperative as it significantly brings the STE to higher quality. The TTP modules consist of several distinct units, beginning with the pre-treatment system, membrane filtration units (UF treatment), disinfection unit (UV), and maintenance cleaning units and ending with a sump to store TTW. The corresponding TTP layout of Koparkhairane with treatment modules is depicted as a schematic diagram in Figure 4(b). The pre-treatment unit comprises a feed tank and a cleaning strainer. The STE from STP must have residual chlorine of 0.3–0.5 mg/L and is stored in a reinforced concrete feed tank, where two pumps lift the raw STE (additionally, one pump acts as standby) to feed it into a cleaning strainer consisting of varying sizes of stainless steel meshes to filter out coarse material to safeguard UF trains.

The membrane unit (UF train) is specifically designed to treat 833 cumecs per hour of STE; these versatile units work in different stages, including production, backwash, cleaning, and standby or shutdown. In the production stage, water is drawn from the strainer using pumps and delivered to the UF membrane columns under pressure, where the particles get trapped. Each membrane train is equipped with a flow meter and pressure transmitter that measures the transmembrane pressure to evaluate any decrease in permeability due to membrane fouls. The backwash mode maintains the transmembrane pressure in the membrane column by reversing the flow direction. During the backwash stage, the membrane is subjected to a gentle air stream to remove accumulated particles and flushed to remove the solid particles where the backwash water is not added with any chemical solutions. Prior to commencing the cleaning stage, it is necessary to conduct a backwash in conjunction with a gravity drain of backwash water. During the cleaning stage, chemical solutions like low-concentration sodium hypochlorite or citric acid are used for regular maintenance or intensive cleaning. This process involves circulating these solutions into the UF train using clean-in-pump tanks. Based on the incoming flow, some UF trains get into standby mode if the inflow volume is less than the design flow, and shutdown mode is for major maintenance. The filtered water from UF trains, known as permeate, flows to the subsequent treatment units. After the UF treatment, the UV unit disinfects the treated water to remove microbiological contamination. Then, TTW is stored in a sump and pumped to OHT for supply to the NMMC residential and MIDC industrial area through a network of pipes.

Concerning adherence to prescribed standards, as per NMMC, the effluent quality expected for industrial reuse should be BOD ≤ 5 mg/L, COD ≤ 20 mg/L, TSS ≤ 1 mg/L, and Turbidity ≤ 0.5 NTU. We further statistically analysed the observed 38 samples collected between September 2022 and October 2022 of BOD, COD, and TSS using the Wilcoxon signed-rank test, where the formulated Null hypotheses (Ho) and Alternate hypotheses (Ha) are provided below:

  • (a)

    Ho: median of the parameter = Prescribed effluent standard of the parameter, Ha: median of the parameter <> Prescribed effluent standard of the parameter.

  • (b)

    Ho: median of the parameter ≤ Prescribed effluent standard of the parameter, Ha: median of the parameter > Prescribed effluent standard of the parameter.

  • (c)

    Ho: median of the parameter ≥ Prescribed effluent standard of the parameter, Ha: median of the parameter < Prescribed effluent standard of the parameter.

Since the inception of treatment using TTP in September 2022, at an average of 1 × 103 m3/day, the flow has been maintained for the trial run and supplied for the NMMC residential area for non-potable purposes. The median for BOD, COD, and TSS is 4.8, 19.6, and 1 mg/L, respectively. The performed hypothetical analysis infers that BOD and COD adhere to the prescribed discharge standards for industrial reuse. But in the case of TSS, there seems to be an infringement deduced by a hypothetical test concerning the standards. The reason may be due to the relatively minimal flow at the inception of TTP, leading to a mean magnitude of TSS around 1.3 mg/L. From the middle of October, the flow was further increased to 20% of its design capacity. From this period onwards, TSS was well below the limit of 1 mg/L. It is essential to remember that proper mitigation measures should be taken if the TTW quality does not reach the prescribed standards. There were provisions given to distribute TTW based on the effluent quality. The hardware sensors will meticulously monitor the effluent at the TTP sump and compare it against the prescribed standard; then, the real-time decision will be taken whether the TTW will be distributed to the NMMC residential area or MIDC industrial area or bypassed into creeks. Thus, the present study with hypothetical analysis infers that TTP produced the intended quality required for industrial reuse over time.

STP and TTP's sustainable and reliable operation relies on several key elements: obtaining timely data about the quality and process parameters, utilizing acquired data to forecast essential parameters, and making informed decisions based on this data to manage the system. In this context, the objective was formulated to develop a data-driven soft computing model that can effectively forecast influent parameters (STE) for TTP, informing decisions on whether to proceed with the treatment of STE using TTP or divert STE to creeks as a preventive measure against the potential failure of TTP (if STE encompasses prescribed standards of NMMC). Another objective of this research is to evaluate the performance of TTP in acquiring prescribed discharge standards for industrial water reuse.

Concerning the first objective of preparing a soft computing model based on hybrid ANN-GA along with uncertainty estimation provides excellent forecasting accuracy results. The R2 of QT, BOD, COD, and TSS in the testing stage is 0.94, 0.88, 0.93, and 0.72, respectively. One of the advantages of this study is that it focuses solely on utilizing past data of the parameter to model the time series rather than depending on a variety of water quality parameters. Another crucial aspect is that when finding optimal input features to model time series, several studies rely on a trial-and-error approach or autocorrelation analysis to find optimum time lags. This study utilized GA to tune the input features and model hyperparameters (hidden layer, dropout layer, and activation function) to achieve minimum mean squared error and improve the forecasting performance, considerably saving time in model development compared to grid search and random search technique. Uncertainty estimation in the developed hybrid model will provide the upper and lower threshold within which the outcome may fall, which is also a novelty of the study. It is imperative to mention here that this study utilized diurnal data for developing the model, which is one of the limitations because the model developed with a higher frequency of data may provide more real-time information, leading to better control over the system. In addition, we employed ANN to model time series; several other independent AI techniques were also available, such as long short-term memory networks, random forest, support vector regression, and many more. However, after thoroughly reviewing the existing literature and considering the success achieved with ANN modelling, we opted for this method due to its ability and simplicity in modelling with higher forecasting accuracy. Our future work will be modelling hard-to-measure parameters with high-frequency data by employing diverse state-of-the-art AI techniques hybridized with optimization algorithms to integrate with SCADA or PLC to ameliorate STPs energy and chemical consumption.

Another objective is assessing the TTP's adherence to established guidelines for industrial water reuse using the Wilcoxon signed-rank test. This assessment implies improved performance in BOD and COD levels while TSS gradually meets prescribed standards over time. The influent characteristics of TTP (STE) should follow a range of values to meet the effluent standard at the TTP stage. These criteria encompass pH within the range of 6.5–9, temperature ≥22 °C, BOD ≤ 10 mg/L, COD ≤ 50 mg/L, TSS ≤ 20 mg/L, and residual chlorine within the range of 0.3–0.5 ppm. Within this context, we have expanded our analysis by thoroughly examining the magnitude of the acquired STE data comprising pH levels, temperature readings, BOD and COD concentrations between September and October 2022. We aimed to identify any observed values that deviate from the specified range of acceptable values, which may affect TTP's performance. Analysis deduces that these characteristics fall within acceptable limits, with average pH, temperature, BOD, and COD values measuring 6.6, 28.5 °C, 4.8 mg/L, and 34.45 mg/L, respectively. The treatment implemented in TTP substantially reduced the level of BOD and COD in STE, as indicated by removal efficiency analysis. The average removal efficiency for BOD was recorded as 22%, while an even higher rate of 43.4% was achieved for COD. These findings were derived from a trial run of TTP with limited capacity utilization and analysed with the collected diurnal data. As discussed for STE modelling, an increased frequency of data attained with a more reliable sensor presents an opportunity to develop a digital twin for these treatment systems. This technological advancement allows for the continuous monitoring of current conditions and enables predictive analysis to anticipate real-world scenarios. Our forthcoming research on TTP will concentrate on improving digital twins, explicitly focusing on TTP's forecasting and optimization aspects.

The primary objective of this research is to deduce the quantum of QT along with the anticipated quality of STE water available for TTP to assess the reliability of its input parameters' dynamic range. This objective is inferred using ANN-GA modelling along with an uncertainty estimation; the developed soft sensor models provided excellent results for QT, BOD, and COD with an R2 value of 0.94, 0.88, and 0.93 in the testing stage, whereas TSS shows a value of 0.72. The lower forecasting accuracy of TSS is due to the minimal data length used for model development and testing. The model dynamics and the upper and lower limit of uncertainty estimate provide strong evidence that 20 × 103 m3/day of good-quality STE water is always available for further processing into TTP. The developed model can be used as a soft sensor and requires only past parameter data, which needs to be forecasted rather than diverse water quality or quantity parameters. Another advantage of this soft sensor is that it helps in decision-making on whether to proceed with the treatment of STE using TTP or divert STE to creeks as a preventive measure against the potential failure of TTP. The decreasing trend of QT was highlighted, which may be due to the age of infrastructure, loss during sewage transmission and emphasized the need for further study. Another objective of this study includes assessing the adherence to prescribed standards by TTP for industrial water reuse. Analysis of TTW data for September and October 2022 using the Wilcoxon signed-rank test deduces that BOD and COD are well within the prescribed standards, where TSS shows infringement during the initial period and started following standards over time; consistency of parameters on achieving the prescribed standards infers the TTW can be used for industrial reuse. The developed ANN-GA model and performance evaluation will help to control the treatment system and decision-making. One of the limitations of this research is that the diurnal data was utilized to develop the model. STP comprising high-frequency data with diverse water quality, quantity, and process parameters can be even more efficient in modelling with the advantage of real-time control. Our future scope is to develop a more robust model with state-of-the-art AI techniques tuned with various optimization algorithms to integrate with digital twin systems for anomaly prediction and ameliorate STP energy and chemical consumption. Thus, the research work covered different aspects of water reuse, such as policy initiatives in India on water reuse, modelled the STE using ANN-GA along with uncertainty estimation of Koparkhairane STP, and TTP performance evaluation using the Wilcoxon signed-rank test.

The authors thank NMMC for providing the necessary data to carry forward this research successfully. We also thank the Maharashtra Pollution Control Board (MPCB) and the Government of Maharashtra for funding this project.

This project is carried out with funds distributed for studying the compliance of sewage treatment plants of Maharashtra by the Maharashtra Pollution Control Board (MPCB). Project funding code: RD/119 – MPCB-009-001.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Ali
M.
,
Hong
P.-Y.
,
Mishra
H.
,
Vrouwenvelder
J.
&
Saikaly
P. E.
2022
Adopting the circular model: Opportunities and challenges of transforming wastewater treatment plants into resource recovery factories in Saudi Arabia
.
J. Water Reuse Desal.
12
(
3
),
346
365
.
https://doi.org/10.2166/wrd.2022.038
.
Asami
H.
,
Golabi
M.
&
Albaji
M.
2021
Simulation of the biochemical and chemical oxygen demand and total suspended solids in wastewater treatment plants: Data-mining approach
.
J. Clean Prod.
296
,
126533
.
https://doi.org/10.1016/j.jclepro.2021.126533
.
Bagheri
M.
,
Mirbagheri
S. A.
,
Bagheri
Z.
&
Kamarkhani
A. M.
2015
Modeling and optimization of activated sludge bulking for a real wastewater treatment plant using hybrid artificial neural networks-genetic algorithm approach
.
Process Saf. Environ.
95
,
12
25
.
https://doi.org/10.1016/j.psep.2015.02.008
.
Boyd
G.
,
Na
D.
,
Li
Z.
,
Snowling
S.
,
Zhang
Q.
&
Zhou
P.
2019
Influent forecasting for wastewater treatment plants in North America
.
Sustainability
11
(
6
),
1
14
.
https://doi.org/10.3390/su11061764
.
CPCB
2021
National Inventory of Sewage Treatment Plants (2021)
.
Central Pollution Control Board, Government of India Publication
,
Delhi
,
India
.
Do
P.
,
Chow
C. W. K.
,
Rameezdeen
R.
&
Gorjian
N.
2022
Wastewater inflow time series forecasting at low temporal resolution using SARIMA model: A case study in South Australia
.
Environ. Sci. Pollut. Res.
29
(
47
),
70984
70999
.
https://doi.org/10.1007/s11356-022-20777-y
.
Fernandez
F. J.
,
Seco
A.
,
Ferrer
J.
&
Rodrigo
M. A.
2009
Use of neurofuzzy networks to improve wastewater flow-rate forecasting
.
Environ. Model. Softw.
24
(
6
),
686
693
.
https://doi.org/10.1016/j.envsoft.2008.10.010
.
Ghazouli
K. E.
,
Khattabi
J. E.
,
Shahrour
I.
&
Soulhi
A.
2021
Wastewater flow forecasting model based on the nonlinear autoregressive with exogenous inputs (NARX) neural network
.
H2Open J.
4
(
1
),
276
290
.
https://doi.org/10.2166/H2OJ.2021.107
.
Hamed
M. M.
,
Khalafallah
M. G.
&
Hassanien
E. A.
2004
Prediction of wastewater treatment plant performance using artificial neural networks
.
Environ. Model. Softw.
19
(
10
),
919
928
.
https://doi.org/10.1016/j.envsoft.2003.10.005
.
Hejabi
N.
,
Saghebian
S. M.
,
Aalami
M. T.
&
Nourani
V.
2021
Evaluation of the effluent quality parameters of wastewater treatment plant based on uncertainty analysis and post-processing approaches (case study)
.
Water Sci. Technol.
83
(
7
),
1633
1648
.
https://doi.org/10.2166/wst.2021.067
.
Heo
S.
,
Nam
K.
,
Loy-Benitez
J.
&
Yoo
C.
2021
Data-driven hybrid model for forecasting wastewater influent loads based on multimodal and ensemble deep learning
.
IEEE. Trans. Ind. Inform.
17
(
10
),
6925
6934
.
https://doi.org/10.1109/TII.2020.3039272
.
Joseph
N.
,
Ryu
D.
,
Malano
H. M.
,
George
B.
,
Sudheer
K. P.
&
Anshuman
.
2019
Estimation of industrial water demand in India using census-based statistical data
.
Resour. Conserv. Recycl.
149
,
31
44
.
https://doi.org/10.1016/j.resconrec.2019.05.036
.
Jothiprakash
V.
&
Shanthi
G.
2006
Single reservoir operating policies using genetic algorithm
.
Water Resour. Manag.
20
(
6
),
917
929
.
https://doi.org/10.1007/s11269-005-9014-y
.
Jothiprakash
V.
,
Tobias
M. J.
&
Seetharam
K. E.
2020
The socioeconomic spillovers of sanitation: Sewage treatment plants in Navi Mumbai, India
.
ADBI Development Case Studies
2020
(
5
),
1
22
.
Kılıç
U.
,
Ayan
K.
&
Arifoğlu
U.
2014
Optimizing reactive power flow of HVDC systems using genetic algorithm
.
Int. J. Electr. Power Energy Syst.
55
,
1
12
.
https://doi.org/10.1016/j.ijepes.2013.08.006
.
Liu
Y. Z.
&
Chen
Z.
2023
Prediction of biochemical oxygen demand with genetic algorithm-based support vector regression
.
Water Qual. Res. J.
58
(
2
),
87
98
.
https://doi.org/10.2166/wqrj.2023.015
.
Meeker
W. Q.
,
Hahn
G. J.
&
Escobar
L. A.
2017
Statistical Intervals: A Guide for Practitioners and Researchers
, 2nd edn.
John Wiley & Sons, Inc.
,
Hoboken, New Jersey
.
MoJS
2022
National Framework on Safe Reuse of Treated Water
.
Ministry of Jal Shakti, Government of India Publication
,
Delhi
,
India
.
Mu
B.
,
Niu
L.
&
Yuan
S.
2016
Wastewater discharge optimization modeling using neural network and genetic algorithm
. In:
Proceedings of 2015 4th International Conference on Computer Science and Network Technology, ICCSNT 2015, 01 (ICCSNT)
, pp.
713
718
.
https://doi.org/10.1109/ICCSNT.2015.7490843
.
NGT
2019
Compliance Report in the Honourable NGT Matter of OA No. 593/2017 (2019)
.
National Green Tribunal (NGT)
,
Delhi
,
India
.
NMMC
2021
Environmental Status Report of Navi Mumbai Municipal Corporation (2020–2021)
.
Navi Mumbai Municipal Corporation (NMMC)
,
Mumbai
,
India
.
Obaideen
K.
,
Shehata
N.
,
Sayed
E. T.
,
Abdelkareem
M. A.
,
Mahmoud
M. S.
&
Olabi
A. G.
2022
The role of wastewater treatment in achieving sustainable development goals (SDGs) and sustainability guideline
.
Energy Nexus
7
,
100112
.
https://doi.org/10.1016/j.nexus.2022.100112
.
Ramkumar
D.
,
Jothiprakash
V.
&
Patil
B. N.
2022
Performance assessment of sewage treatment plants using compliance index
.
J. Water Sanit. Hyg. Dev.
12
(
6
),
485
497
.
https://doi.org/10.2166/washdev.2022.055
.
Sadi
M.
,
Fakharian
H.
,
Ganji
H.
&
Kakavand
M.
2019
Evolving artificial intelligence techniques to model the hydrate-based desalination process of produced water
.
J. Water Reuse Desal.
9
(
4
),
372
384
.
https://doi.org/10.2166/wrd.2019.024
.
Schellenberg
T.
,
Subramanian
V.
,
Ganeshan
G.
,
Tompkins
D.
&
Pradeep
R.
2020
Wastewater discharge standards in the evolving context of urban sustainability – The case of India
.
Front. Environ. Sci.
8
.
https://doi.org/10.3389/fenvs.2020.00030
.
Surakhi
O.
,
Zaidan
M. A.
,
Fung
P. L.
,
Motlagh
N. H.
,
Serhan
S.
,
Alkhanafseh
M.
,
Ghoniem
R. M.
&
Hussein
T.
2021
Time-lag selection for time-series forecasting using neural network and heuristic algorithm
.
Electronics
10
(
20
),
1
22
.
https://doi.org/10.3390/electronics10202518
.
Wadkar
D. V.
,
Nangare
P.
&
Wagh
M. P.
2021
Evaluation of water treatment plant using artificial neural network (ANN) case study of Pimpri Chinchwad Municipal Corporation (PCMC)
.
Sustain. Water Resour. Manag.
7
(
4
).
https://doi.org/10.1007/s40899-021-00532-w
Wang
G.
,
Jia
Q. S.
,
Zhou
M. C.
,
Bi
J.
,
Qiao
J.
&
Abusorrah
A.
2022
Artificial neural networks for water quality soft-sensing in wastewater treatment: A review
.
Artif. Intell. Rev.
55
(
1
),
565
587
.
https://doi.org/10.1007/s10462-021-10038-8
.
World Bank
2019
Wastewater From Waste to Resource – The Case of Nagpur in India
.
World Bank
,
Washington, DC
.
Yaqub
M.
,
Asif
H.
,
Kim
S.
&
Lee
W.
2020
Modeling of a full-scale sewage treatment plant to predict the nutrient removal efficiency using a long short-term memory (LSTM) neural network
.
J. Water Process Eng.
37
,
101388
.
https://doi.org/10.1016/j.jwpe.2020.101388
.
Zhang
Q.
,
Prouty
C.
,
Zimmerman
J. B.
&
Mihelcic
J. R.
2016
More than target 6.3: A systems approach to rethinking sustainable development goals in a resource-scarce world
.
Engineering
2
(
4
),
481
489
.
https://doi.org/10.1016/J.ENG.2016.04.010
.
Zhu
J. J.
&
Anderson
P. R.
2019
Performance evaluation of the ISMLR package for predicting the next day's influent wastewater flowrate at Kirie WRP
.
Water Sci. Technol.
80
(
4
),
695
706
.
https://doi.org/10.2166/wst.2019.309
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data