Water is a necessary resource that enables the existence of all life forms, including humans. Freshwater usage has become increasingly necessary in recent years. Facilities for treating seawater are less dependable and effective. Deep learning methods have the ability to improve salt particle analysis in saltwater's accuracy and efficiency, which will enhance the performance of water treatment plants. This research proposes a novel technique in optimization of water reuse with nanoparticle analysis based on machine learning architecture. Here, the optimization of water reuse is carried out based on nanoparticle solar cell for saline water treatment and the saline composition has been analyzed using a gradient discriminant random field. Experimental analysis is carried out in terms of specificity, computational cost, kappa coefficient, training accuracy, and mean average precision for various tunnelling electron microscope (TEM) image datasets. The bright-field TEM (BF-TEM) dataset attained a specificity of 75%, kappa coefficient of 44%, training accuracy of 81%, and mean average precision of 61%, whereas the annular dark-field scanning TEM (ADF-STEM) dataset produced specificity of 79%, kappa coefficient of 49%, training accuracy of 85%, and mean average precision of 66% as compared with the existing artificial neural network (ANN) approach.

  • Deep learning methods have the ability to improve salt particle analysis in saltwater's accuracy and efficiency, which will enhance the performance of water treatment plants.

  • This research proposes a novel technique in the optimization of water reuse with nanoparticle analysis based on machine learning architecture. Here, the optimization of water reuse is carried out based on nanoparticle solar cells for water treatment.

Disposal of solid wastes by landfill is a more widely utilized disposal technique compared to others due to its economic advantages as well as simplicity (Rajput et al. 2022). Leachate develops in landfill areas as a result of water's impact on waste composition, precipitation, and physical, chemical, and biological processes. Wastewater with a high level of contaminants, including monovalent and multivalent ions, bacteria, and heavy metals, is referred to as leachate. Chemical composition of leachate is influenced by the type of trash in the landfill, waste content, landfill age, and environmental conditions (Zhong et al. 2022). Chemical oxygen demand (COD), total phosphorus (TP), and organic nitrogen concentrations in normal leachate vary between 140 and 152,000 mg/L, 0.1 and 30 mg/L, and 14 and 2,500 mg/L. In landfill regions, liners with low hydraulic conductivity (k ≤ 10 − 7 cm/s) are employed to stop leachate from getting into groundwater. Numerous variables, including void ratio, medium porosity, pore size, soil type, liquid type, and contaminant concentration, all affect hydraulic conductivity (Fu et al. 2021). The structure and characteristics of the clay are altered throughout the leachate's passage through the clayey soil, altering its hydraulic conductivity. The leachate's increased ion concentration, ionic valence, and organic matter concentration all help to boost hydraulic conductivity, while its increased microbial population and concentration of suspended solids help to lower it. Some of the factors that contribute to improvement in hydraulic conductivity as a result of leachate-liner interaction include thinned diffuse double layer (DDL) thickness of the clay, agglomeration, flocculation of clay particles, formation of cracks in soil structure, and a rise in void ratio. Growing global population increases water usage and daily pollution production (Alshehri et al. 2021). Approximately 2.4 billion people worldwide do not have access to reliable sanitation services, both in urban and rural regions. Within 20 years, 2 billion more people are expected to reside in towns and cities, mostly in developing nations, increasing the demand for sanitation. Sewage is released untreated and damaging lakes, rivers, and coastal areas in more than 90% of developing nations. Traditional sanitation practises, which rely on water-wasting toilets, are not an appropriate answer for industrialized and developing nations. The sanitation systems were created and planned under the assumption that human excreta was a type of waste that could be disposed of in the environment and be assimilated by it. EcoSan is a cycle of a sustainable closed system used in sanitation systems that can bridge the gap between agriculture and sanitation. The EcoSan strategy emphasizes resource management and offers a comprehensive approach to environmentally and financially sound sanitation. The major goal is to promote sustainable development by reducing nutrient and water cycles while using less resources and energy. The EcoSan system is a different method for avoiding the drawbacks of traditional wastewater systems (Melesse et al. 2020).

Smart technology and newly developed artificial intelligence (AI) and machine learning (ML) are filling a void in water applications that were previously left unfilled by conventional methods and ways of thinking. Smart technology and newly developed AI and ML are filling a void in water applications that were previously left unfilled by conventional methods as well as ways of thinking. According to some projections, the water industry will spend up to 10% of over $90 billion in investments that are anticipated to mature by 2030 on AI. Because of its universality, resilience, and relative ease of design, AI, ML, and smart methods are projected to methods as well as solve complex and challenging problems in water applications, resulting in cost savings and process optimization. Water as well as wastewater treatment, monitoring of natural methods, and precision/water-based agriculture are a few water-related applications that have seen substantial ML usage. These industry studies rely on a variety of ML methods, with the most popular ones being artificial neural network (ANN), recurrent neural network (RNN), radio frequency (RF), support vector machine (SVM), and adaptive neuro-fuzzy inference system (ANFIS), with sporadic use of AI techniques like fuzzy inference systems (FISs). ANN-RF and SVM-RF are two examples of hybrid algorithms that combine two ML systems in applications. Studies have shown success using both AI and ML to optimize modelling processes in water-based settings (Chang et al. 2022).

Contribution of this research is as follows:

  • 1.

    To propose a novel technique in the optimization of water reuse with nanoparticle analysis based on ML architecture.

  • 2.

    Optimization of water reuse is carried out based on nanoparticle solar cells for water treatment.

  • 3.

    Saline composition has been analyzed using a gradient discriminant random field based on saline water treatment.

Intelligent scissors, distance regularized level set evolution, and graph cuts are some recently developed advanced segmentation algorithms that have enhanced performance, which reduces high-throughput advantage of computerized data analysis. The concentration of dissolved oxygen (DO) in wastewater can be significantly reduced by a high temperature, which causes aquatic species to perish (Aghilesh et al. 2021). A drop in DO will cause nitrogen to be transformed into nitrate molecules, which will also produce unpleasant odours. The wastewater's odour and health may be harmed if the hue gradually turns from grey to blackish brown (Wang et al. 2021). Wastewater must be treated before being released freely into a body of water in order to protect public health (Jiang et al. 2021). Water is necessary for human living, but wastewater needs our attention because it is the primary source of most diseases. Additionally, the main source of sickness is water (Wei et al. 2022). 80% of disease outbreaks in 2002 were attributed to water contamination by microbes, according to WHO data (Wang et al. 2020). Because losses are a result of the uncertainty introduced by risks, risk management is essential for lowering losses (Li et al. 2021). Effective management is required to reduce these risks. As part of this management, procedures must be carried out effectively in order to accomplish an objective. Pre-analysis, prediction, and control are all effective ways to lower possible risks (Nam et al. 2022). Therefore, risk management is required to lower the likelihood of failure and offer convenience for handling prospective risks. The methodical process of identifying environmental threats, reviewing probable outcomes, and controlling the amount of environmental risk is one specific illustration of environmental risk management (Astray et al. 2021). The four steps of risk management for the environment are issue formulation, risk analysis, risk characterization, and risk management (Zhao et al. 2020). Large datasets can be efficiently classified using ML techniques in a variety of domains, including biology and text mining (Alqahtani et al. 2022). Particle shape categorization is now accomplished using deep learning techniques like CNN algorithms and ANN (Bagheri et al. 2019). This method produces good results for particular particle forms, but in order to produce training datasets, it requires prior knowledge about the latter. An unsupervised ML technique, which does not need prior input or data training, is preferable to accomplish generalization in data analysis (Hoa et al. 2019). Unsupervised ML techniques may effectively evaluate the size as well as shape information of nanoparticles, according to recently published studies; however, they are only applicable to nanoparticles with distinct shapes, significant visual contrast, and good dispersions, such as gold nanorods (isolated particles). Due to a variety of nanoparticle compositions, dispersions, and imaging settings, it is actually challenging to ensure a sharp contrast as well as a homogeneous background in energy management images (Zeng et al. 2018; Chen et al. 2021).

Wastewater processing plants can benefit from adaptive ML models by minimizing downtime and increasing profits through more efficient management. The use of an ensemble technique increased prediction accuracy by 5% when compared to using only the individual base models (Zaghloul & Achari 2022). Controlling chlorination using AI approaches has been shown to be successful while predicting disinfection by-products (DBP) concentrations and key parameters for adsorption and membrane-filtration processes using ML models has shown similar promise. By establishing the connection between input and output among system variables, we may improve AI and ML application efficiency that suffers from differences in the quality of data used for testing and training (Lowe et al. 2022). Successful real-time surveillance, optimization, uncertainty prediction, and fault identification for intricate environmental systems are all possible due to ML algorithms, which proved useful in predicting the uncertain behaviours of treatment procedures. Hyperparameter tweaking is used during the machine learning algorithm (MLA) selection process to discover the optimal solution in the least amount of time while using the fewest available computing resources (Sundui et al. 2021).

From the related study findings, it was found that there is a gap in the application of nanoparticles in solar panels to improve water reuse and to determine the saline composition. Also, the application of ML techniques in water reuse is very limited. Incorporating these approaches in the present study makes it novel and distinct with the adoption of datasets from bright-field tunnelling electron microscope (BF-TEM) and annular dark-field scanning TEM (ADF-STEM).

This section discusses a novel method in the optimization of water reuse with nanoparticle analysis based on ML architecture. Here, the optimization of water reuse is carried out based on nanoparticle solar cells for water treatment. For the aforementioned reasons of easiness, acceptable accuracy, speed, and high performance without the requirement to be skilled professionals in physical issues, ML models are increasingly being utilized to handle most of the complicated practical, real-world problems faced in numerous domains of life (Sarker 2021). The nanoparticles have a porous structure that allows them to absorb large amounts of water while retaining their ability to reject dissolved salts and other contaminants. Nanoparticles, due to their low bulk density, active surface state, and distinctive photoluminescent, and biocompatible features, nanoparticles can take in infrared light and generate visible light, which solar cells can then convert into extra electricity. A considerable increase in photocurrent is achieved when light that hits on the solar panel strikes the nanoparticles, scatters, and may hit additional nanoparticles and bounce several times within the system (Olaimat et al. 2021). The saline composition has been analyzed using a gradient discriminant random field based on saline water treatment. The proposed architecture is shown in Figure 1.
Figure 1

Proposed architecture.

Figure 1

Proposed architecture.

Close modal

Pre-processing begins with separating the image's background from the particles. Setting a global brightness threshold to remove the backdrop from the image is one approach that is frequently used to do this. This method takes advantage of contrast in brightness between particles as well as background in TEM image. The inhomogeneity of the background presents a problem non-depending on this method to create accurate background-removed photos. This inhomogeneity might be caused by differences in the sample backdrop matrix's thickness, surface unevenness, or by uneven electron microscope illumination. When a global filtering threshold is used, inhomogeneous background causes regional background to be both over- as well as under-filtered in the removal method. Because of this, the technique either fails to effectively isolate particles or introduces artefacts, which distort shapes. Such insufficient background removal can compromise the precision of particle shape data (Wen et al. 2021).

Solar energy can be absorbed by PV cells. Due to the fact that they are semiconductor devices, they may transform the absorbed energy into useful electrical power. P-N junction diodes are used in the construction of the PV module structure. These diodes can transform incident light that reaches their surface into electrical energy (Pastuszak & Węgierek 2022). Figure 2 illustrates the fundamental design, wiring, and operation of a PV module:
Figure 2

Working of photovoltaic.

Figure 2

Working of photovoltaic.

Close modal

In constructing the PV, there are two layers of silicon present: a negative N layer and a positive P layer that is doped with boron. The tempered glass-coated PV module absorbs solar energy when it is exposed to the sun. After a certain amount of time, the energy absorbed exceeds the band gap energy level, which causes electrons to travel from conduction band to valence band via that band. Due to this, electrons in the conduction band can flow freely and form electron–hole pairs. Since the flow of electric current is caused by the movement of electrons, the electricity produced during the process is used to power the load. In order to generate adequate electricity, an array configuration is insufficient because these systems experience numerous losses (Al-Ezzi & Ansari 2022). The ideal method for maximizing each string's effectiveness is MPPT (maximum power point tracking). The most power possible from PV modules can be obtained by using such control approaches.

The energy management system (EMS) keeps state of charge (SOC) between SOCmax and SOCmin (20–90%) within a predetermined range. The EMS operating techniques implemented state that generating power from solar panels and wind turbines takes precedence over other sources of energy in order to meet load needs. The excess power will be used to charge the battery with a reference value if the amount of power supplied by the PV and wind is greater than the amount needed to power the load, or PPV + PWind > PLoad by Equation (1):
(1)
where PPV and PWind are the power generated by the PV and wind turbines, respectively, and PB-ref is the reference battery power that will be used to charge the battery. Power demand is PLoad. The battery keeps charging until its state of charge, or SOC, reaches its maximum point, or SOCmax, which is 90%. The electrolyzer (ELZ) will then use the remaining power to produce hydrogen for the solid oxide fuel cell (SOFC) until the pressure reaches its maximum level. The extra energy produced will then be exported to the grid. The energy storage system's overall size and weight can be decreased by using a DC/DC converter with an incorporated magnetic structure. Additionally, a magnetic structure that is integrated helps lessen the output current ripple (Kumar et al. 2020).
Referencing overall DC-link voltage and energy time-variations should be done to emphasize the workings of the proposed hybrid energy storage electric converter (HESEC). These can be accomplished using Equations (2) and (3):
(2)
(3)
where the equivalent DC-link and power currents, iDC and ip, are proportional to energy exchanged by electrical drive and DC-link. In reference to (5), cH and cL are per-unit capacitances in Equation (4), and CDC stands for the equivalent DC-link series capacitance:
(4)
The per-unit capacitor voltages are represented by vH and vL in Equation (5):
(5)
Still referring to both (4) and (5), it can be said that if suitable combinations of (cH,cL) and (vH,vL) exist, VDC and EDC can be changed separately. DC-link energy base value might be provided as Equation (6) to support the preceding claim:
(6)
The per-unit energy content of the DC-link can thus be simply obtained as Equation (7) by multiplying (4) by (7):
(7)
in which ζ denotes unbalanced capacitance factor:
(8)
Therefore, by altering the continuous operating (Uc) voltage within a specific range, one can usefully introduce DC-link voltage unbalances while maintaining the total DC-link voltage. In order to regulate the energy flows among each HESEC component optimally, proper DC-link energy and voltage rules are required. Solar panels are used to gather this electricity from the ambient sunlight. Light energy is converted directly into DC electrical energy by the solar panel. To charge the battery, the DC–DC converter adjusts the DC voltage. The wireless sensor networks (WSN) node is powered by a rechargeable battery. The sensor measurement unit is used by the WSN node to measure the desired physical quantity. This data is processed by a microcontroller in the computing unit (Akgül et al. 2018). Figure 3 represents the solar energy harvesting methodology.
Figure 3

Block diagram of solar energy harvesting.

Figure 3

Block diagram of solar energy harvesting.

Close modal

Gradient discriminant random field-based saline water composition analysis:

Variations in batch-wise training come from the gradient variance. The use of a random sample has the advantage of requiring much fewer computations per iteration while the disadvantage is noisy gradient. Note that iterations are used to calculate the convergence rate in this section. We must define the Lyapunov process by Equation (9) in order to study the training dynamics per iteration (Hou & Yi 2020):
(9)
The variable ht is a random number, and the equation evaluates the separation between present solution, wt, and ideal solution, w. Consequently, using Equations (10) and (11), it is possible to determine the stochastic gradient descent's (SGD) convergence rate (Hu et al. 2021):
(10)
(11)
It implies how much progress can be made in one iteration. The convergence rate is improved by decreasing VAR{∇ψw(dt)}. Average convergence rate at an iteration's precision is produced by the expectation of Equations (12) and (13):
(12)
(13)
Let's assume that ψw(dt) is convex so as to make the analysis of Equation (14) simpler.
(14)
The objective estimation of E{∇ψw(d)} is E{∇ψw(dt)}. Therefore, increasing an iteration's contribution is equivalent to minimizing VAR{∇ψw(dt)}. This angle has been adequately covered. With respect to dt, an iteration's contribution, ht+1ht, changes. Equations (15) and (16) show the following as variance of ht+1ht:
(15)
(16)

The equation shows that VAR{ht+1ht}, 0, indicating that gradient updates do not contribute uniformly. It's intriguing to note that ∇ψw(dt) 2 and ∇ψw(dt), the deciding variables in this equation, depend on dt, indicating a relationship between ht+1ht and dt.

This original idea drives us to research the variables in dt that influence convergence rate ht+1ht and how to solve load balance issues in training. There has been numerous research on the variance reduction on ∇ψw(dt), but few have gone in this way. Every iteration k, the update shown in Equation (17) is carried out:
(17)
where stepsizes satisfy , and , i.e., is average of n noisy gradients evaluated at z(k). As a result, using more than one gradient improves the accuracy of the gradient estimation. In fact, by Equation (18) is what we have from assumption:
(18)
We will focus on two error terms, which are different from analysis for SGD. Expected squared distance between z(k) and z* is defined by the first term, called expected optimization error, and expected consensus error, or expected error, gauges disparities of individual estimates among all agents. Equation (19) specifies the average squared distance between each iterate zi(k) and ideal z:
(19)
Therefore, examining the two terms will give us information about how distributed stochastic gradient descent (DSGD) is performing. Denote to simplify the notation., ∀k We start by looking for an inequality that limits U(k), which is comparable to in SGD and was inspired by the study for SGD. Equation turns out to be one of these relationships as in Equation (20):
(20)

The anticipated consensus error V(k) representing the extra disturbances brought on by the variations in solutions. Two extra terms are probably insignificant in the long run, though, assuming V(k) decays quickly enough relative to U(k), and we would infer that convergence rate of U(k) is similar to R(k) for SGD. for . Plugging this into Equation (21) leads to inequality . Hence, when , we have that

(21)
With the same amount of processing power, DSGD performs similarly to a centralized stochastic gradient descent algorithm after a transient. Using a multilevel logistic prior, a type of Markov random field (MRF), the geographical information is included. In the Bayesian technique, the posterior distribution is often maximized to provide classification results. Maximal a posterior (MAP) ymap = arg max y p y|X predicts class result by maximizing p y |X while taking into account the input X. MAP estimate can be expressed as an Equation (22) according to Bayes' theorem:
(22)
where p(X | y) is a likelihood function, which signifies the likelihood of input data given the labels, and p(y) denotes the prior over the labels in y in Equation (23) and in Equation (24):
(23)
p(X | y) can be written as
(24)
where is a coefficient independent of y. Finally, the MAP categorization is used as Equation (25):
(25)
The final classification outcome for the suggested method will be the MAP classification. In (25), the MRF-based multilevel logistic (MLL) prior will be utilized as the prior p(y) to encode the spatial information, while the density p yi | xi given by the pixel classifier modelled by CNN with spectral characteristics. This issue is resolved using the MRF-based local binary patterns (LBP) technique, which can also discover the association between spectral and spatial data. The Gibbs distribution, according to the Hammersley–Clifford theorem, expresses the likelihood that an MRF will be relevant. As a result, Equation (26) provides the MLL prior
(26)
where Z stands for the normalizing constant and is the smoothness parameter, C stands for a pair of adjacent pixel set and is a unit impulse function, where . The MLL prior favours piecewise smooth classification since it increases the likelihood that adjacent pixels will have the same label. Finally, the equation provides the MAP estimation when (25) and (26) are combined (27):
(27)

A challenging combinatorial optimization issue is referred to as the minimization of the equation in (19).

The machine used for the experiment has the following hardware components: an Intel Core i5 7200U processor, 8 GB of RAM, a 1 TB hard drive, and NVIDIA GTX 760MX graphics. Additionally, Python 3.5 environments were utilized to simulate how the suggested strategy might be put into practise. In order to establish results of offered technique, we carried out a statistical analysis by evaluating expected performance.

Nanoparticle description: Drop-casting nanoparticle dispersions onto Cu TEM grids with the holey carbon film coating was used to create the TEM samples of nanoparticles. ADF-STEM or BF-TEM modes of the JEOL F200 (Akishima, Tokyo, Japan) were used to photograph the nanoparticle TEM samples.

Table 1 gives analysis for TEM dataset. Here, the dataset analyzed are BF-TEM and ADF-STEM in terms of specificity, computational cost, kappa coefficient, training accuracy, and mean average precision.

Table 1

Analysis based on various TEM image dataset

DatasetTechniquesSpecificityComputational costKappa coefficientTraining accuracyMean average precision
BF-TEM ANN 71 45 41 77 55 
CNN 73 48 43 79 59 
OWR_SC_MLA 75 51 44 81 61 
ADF-STEM ANN 72 51 45 79 60 
CNN 76 53 48 83 63 
OWR_SC_MLA 79 55 49 85 66 
DatasetTechniquesSpecificityComputational costKappa coefficientTraining accuracyMean average precision
BF-TEM ANN 71 45 41 77 55 
CNN 73 48 43 79 59 
OWR_SC_MLA 75 51 44 81 61 
ADF-STEM ANN 72 51 45 79 60 
CNN 76 53 48 83 63 
OWR_SC_MLA 79 55 49 85 66 

Figure 4(a)–4(e) gives analysis for BF-TEM dataset. Proposed technique attained specificity of 75%, computational cost of 51%, kappa coefficient of 44%, training accuracy of 81%, and mean average precision of 61%; existing ANN attained specificity of 71%, computational cost of 45%, kappa coefficient of 41%, training accuracy of 77%, and mean average precision of 55%; CNN attained specificity of 73%, computational cost of 48%, kappa coefficient of 43%, training accuracy of 79%, and mean average precision of 59%. A glass cuvette filled with water was used to create the sample solution that was utilized to create the scattering pattern. Sodium chloride was added in intervals of 10 ppt. Salt solution's maximum concentration was 100 ppt, while its minimum concentration was 0 ppt. Using the ready samples, two classes were produced, which are tabulated in Table 1. When tested with our dataset, the suggested ML model outperformed these models in terms of saline water particle identification accuracy, but the other techniques' Kappas remained the same (Arabameri et al. 2021). Figures 4 and 5 depict other performance measurement metrics utilized for evaluation, which are detailed below.
Figure 4

Analysis for BF-TEM in terms of (a) specificity, (b) computational cost, (c) kappa coefficient, (d) training accuracy, and (e) mean average precision.

Figure 4

Analysis for BF-TEM in terms of (a) specificity, (b) computational cost, (c) kappa coefficient, (d) training accuracy, and (e) mean average precision.

Close modal
Figure 5

Analysis for ADF-STEM in terms of (a) specificity, (b) computational cost, (c) kappa coefficient, (d) training accuracy, and (e) mean average precision.

Figure 5

Analysis for ADF-STEM in terms of (a) specificity, (b) computational cost, (c) kappa coefficient, (d) training accuracy, and (e) mean average precision.

Close modal

The following parameters must be understood in order to perceive and comprehend the categorization model that has been put into use: True Positives (Tp) – If the actual class labels and expected class labels match, this calculates to true. True Negatives (Tn) – If actual class labels match projected class labels, this evaluates to false. False Positives (Fp) and False Negatives (Fn) specifications indicate whether actual class labels correspond to predicted class by the model or the opposite (Patel et al. 2022; Gerevini et al. 2023).

Figure 5(a)–5(e) gives analysis for ADF-STEM dataset. Proposed technique attained specificity of 79%, computational cost of 55%, kappa coefficient of 49%, training accuracy of 85%, and mean average precision of 66%; existing ANN attained specificity of 72%, computational cost of 51%, kappa coefficient of 45%, training accuracy of 79%, and mean average precision of 60%; CNN attained specificity of 76%, computational cost of 53%, kappa coefficient of 48%, training accuracy of 83%, and mean average precision of 63%. These metrics scores show that by categorizing salt particles, our deep learning classification model makes precise classifications and predictions. The implemented method was compared with comparable salinity-based prediction methods in order to demonstrate that our classification approach is reliable and produces cutting-edge results (Khan et al. 2022). We sought to compare it with best ML algorithms used to evaluate salinity because there has not been much research done on identification of seawater salinity concentration, making it impossible to compare it with other methods used in this field of research.

This research proposes a novel technique in the optimization of water reuse with nanoparticle analysis based on ML architecture. The saline composition has been analyzed using gradient discriminant random field based on saline water treatment. The produced solution functions as seawater and utilized to identify concentration of salt particles in seawater because concentration of salt particles in sea or seawater ranges to about 35 ppt. This could be helpful for desalinated water treatment plants. It was noted that our adopted method had shown to be effective and precise in terms of precision. Additionally, a real-world dataset was used to target salinity of seawater utilizing a combination of ML methods, allowing performance as well as resilience of proposed method to be evaluated. Proposed technique attained specificity of 79%, computational cost of 55%, kappa coefficient of 49%, training accuracy of 85%, and mean average precision of 66%.

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University Abha 61421, Asir, Kingdom of Saudi Arabia for funding this work through the Large Groups Project under grant number RGP.2/142/44.

Deanship of Scientific Research at King Khalid University Abha 61421, Asir, Kingdom of Saudi Arabia through the Large Groups Project under grant number RGP.2/142/44.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Aghilesh
K.
,
Mungray
A.
,
Agarwal
S.
,
Ali
J.
&
Garg
M. C.
2021
Performance optimisation of forward-osmosis membrane system using machine learning for the treatment of textile industry wastewater
.
Journal of Cleaner Production
289
,
125690
.
Al-Ezzi
A. S.
&
Ansari
M. N. M.
2022
Photovoltaic solar cells: a review
.
Applied System Innovation
5
(
4
),
67
.
Alshehri
M.
,
Kumar
M.
,
Bhardwaj
A.
,
Mishra
S.
&
Gyani
J.
2021
Deep learning based approach to classify saline particles in sea water
.
Water
13
(
9
),
1251
.
Arabameri
A.
,
Pal
S. C.
,
Rezaie
F.
,
Nalivan
O. S.
,
Chowdhuri
I.
,
Saha
A.
,
Lee
S.
&
Moayedi
H.
2021
Modeling groundwater potential using novel GIS-based machine-learning ensemble techniques
.
Journal of Hydrology: Regional Studies
36
,
100848
.
Chang
H. M.
,
Xu
Y.
,
Chen
S. S.
&
He
Z.
2022
Enhanced understanding of osmotic membrane bioreactors through machine learning modeling of water flux and salinity
.
Science of The Total Environment
838 (1)
,
156009
.
Chen
F.
,
Du
Y.
,
Qiu
T.
,
Xu
Z.
,
Zhou
L.
,
Xu
J.
, Sun, M., Li, Y. &
Sun
J.
2021
Design of an intelligent variable-flow recirculating aquaculture system based on machine learning methods
.
Applied Sciences
11
(
14
),
6546
.
Gerevini
L.
,
Cerro
G.
,
Bria
A.
,
Marrocco
C.
,
Ferrigno
L.
,
Vitelli
M.
,
Ria
A.
&
Molinara
M.
2023
An end-to-end real-time pollutants spilling recognition in wastewater based on the IoT-ready SENSIPLUS platform
.
Journal of King Saud University – Computer and Information Sciences
35
(
1
),
499
513
.
Hou
S. C.
&
Yi
X. X.
2020
Quantum Lyapunov control with machine learning
.
Quantum Information Processing
19
,
8
.
Hu
B.
,
Seiler
P.
&
Lessard
L.
2021
Analysis of biased stochastic gradient descent using sequential semidefinite programs
.
Mathematical Programming
187
,
383
408
.
Jiang
W.
,
Pokharel
B.
,
Lin
L.
,
Cao
H.
,
Carroll
K. C.
,
Zhang
Y.
, Galdeano, C., Musale, D. A., Ghurye, G. L. &
Xu
P.
2021
Analysis and prediction of produced water quantity and quality in the Permian Basin using machine learning techniques
.
Science of The Total Environment
801
,
149693
.
Khan
M. S. I.
,
Islam
N.
,
Uddin
J.
,
Islam
S.
&
Nasir
M. K.
2022
Water quality prediction and classification based on principal component regression and gradient boosting classifier approach
.
Journal of King Saud University – Computer and Information Sciences
34
(
8A
),
4773
4781
.
Kumar
D.
,
Nema
R. K.
&
Gupta
S.
2020
A comparative review on power conversion topologies and energy storage system for electric vehicles
.
International Journal of Energy Research
44
,
7863
7885
.
Melesse
A. M.
,
Khosravi
K.
,
Tiefenbacher
J. P.
,
Heddam
S.
,
Kim
S.
,
Mosavi
A.
&
Pham
B. T.
2020
River water salinity prediction using hybrid machine learning models
.
Water
12
(
10
),
2951
.
Olaimat
M. M.
,
Yousefi
L.
&
Ramahi
O. M.
2021
Using plasmonics and nanoparticles to enhance the efficiency of solar cells: review of latest technologies
.
Journal of the Optical Society of America B
38
,
638
651
.
Patel
J.
,
Amipara
C.
,
Ahanger
T. A.
,
Ladhva
K.
,
Gupta
R. K.
,
Alsaab
H. O.
,
Althobaiti
Y. S.
&
Ratna
A.
2022
A machine learning-based water potability prediction model by using synthetic minority oversampling technique and explainable AI
.
Computational Intelligence and Neuroscience
2022
,
9283293
.
Rajput
S. P.
,
Webber
J. L.
,
Bostani
A.
,
Mehbodniya
A.
,
Arumugam
M.
,
Nanjundan
P.
&
Wendimagegen
A.
2022
Using machine learning architecture to optimize and model the treatment process for saline water level analysis
.
Water Reuse 13 (1), 51–67
.
Sundui
B.
,
Calderon
O. A. R.
,
Abdeldayem
O. M.
,
Lázaro-Gil
J.
,
Rene
E. R.
&
Sambuu
U.
2021
Applications of machine learning algorithms for biological wastewater treatment: updates and perspectives
.
Clean Technologies and Environmental Policy
23
,
127
143
.
Wang
J.
,
Peng
J.
,
Li
H.
,
Yin
C.
,
Liu
W.
,
Wang
T.
&
Zhang
H.
2021
Soil salinity mapping using machine learning algorithms with the Sentinel-2 MSI in Arid Areas, China
.
Remote Sensing
13
(
2
),
305
.
Wen
H.
,
Luna-Romera
J. M.
,
Riquelme
J. C.
,
Dwyer
C.
&
Chang
S. L. Y.
2021
Statistically representative metrology of nanoparticles via unsupervised machine learning of TEM images
.
Nanomaterials
11
(
10
),
2706
.
Zhong
H.
,
Yuan
Y.
,
Luo
L.
,
Ye
J.
,
Chen
M.
&
Zhong
C.
2022
Water quality prediction of MBR based on machine learning: a novel dataset contribution analysis method
.
Journal of Water Process Engineering
50
,
103296
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).