ABSTRACT
The majority of the environmental outputs from gas refineries are oily wastewater. This research reveals a novel combination of response surface methodology and artificial neural network to optimize and model oil content concentration in the oily wastewater. Response surface methodology based on central composite design shows a highly significant linear model with P value <0.0001 and determination coefficient R2 equal to 0.747, R adjusted was 0.706, and R predicted 0.643. In addition from analysis of variance flow highly effective parameters from other and optimization results verification revealed minimum oily content with 8.5 ± 0.7 ppm when initial oil content 991 ppm, temperature 46.4 °C, pressure 21 Mpa, and flowrate 27,000 m3/day which is nearly closed to suggested oily content 8.5 ppm. An artificial neural network (ANN) technique was employed in this study to estimate the oil content in the treatment process. An artificial neural network model was remarkably accurate at simulating the process under investigation. A low mean squared error (MSE) and relative error (RE) equal to 1.55 × 10−7 and 2.5, respectively, were obtained during the training phase, whilst the testing results demonstrated a high coefficient of determination (R2) equal to 0.99.
HIGHLIGHTS
Produced water is salty water that has been trapped in geological formations and brought to the surface during the development and production of oil and gas.
Response surface methodology based on a central composite design shows a highly significant linear model with p-value <0.0001.
An artificial neural network (ANN) technique was employed in this study to estimate the oil content in the western process.
INTRODUCTION
Produced water (PW) is salty water that has been trapped in geological formations and brought to the surface during the development and production of oil and gas. PW usually has a high concentration of pollutants, such as metals and hydrocarbons, thus it needs to be properly managed and treated before being disposed of. Thus, the majority of studies have focused on treating oily water to meet environmental regulations as well as to reuse and recycle produced water. The employment of more advanced treatment techniques is required by some major environmental rules as well as financial limitations (Wei et al. 2019). High concentrations of oil and grease (O&G), total dissolved solids (TDS), suspended solids (SS), dissolved and volatile organic compounds, dispersed oil, radionuclides, heavy metals, bacteria, and dissolved gases are among the contaminants of concern found in the produced water (Konvensional 2017). The process of treating produced water from an oilfield typically consists of multiple stages: pretreatment, which eliminates larger particles and gas along with the bulk oil; main treatment, which eliminates small droplets and particles; polishing treatment, which eliminates ultra-small particles; and optional tertiary treatment, which eliminates dissolved matters and gases (Konvensional 2017). Moreover, BTEX chemicals (benzene, toluene, ethylbenzene, and xylene) and other harmful and toxic air pollutants are mostly produced by petroleum refineries. Additionally, they are a significant source of the following criterion air pollutants: sulfur dioxide (SO2), carbon monoxide (CO), hydrogen sulfide (H2S), nitrogen oxides (NOx), and particulate matter (PM). Less hazardous hydrocarbons are also released by refineries, including light volatile fuels and oils and natural gas (methane) (Jabbar et al. 2019; Alardhi et al. 2023a).
Since oily wastewater contains hazardous substances that inhibit the growth of organisms, treating it is essential (Baig et al. 2022). Many treatments, such as chemical destabilization (coagulation, flocculation) (Zhao et al. 2021), membrane filtering techniques (Barambu et al. 2021; Samuel et al. 2022), adsorption in nanoparticles (Alardhi et al. 2023b), biological methods (Jabbar et al. 2023), and electrochemical processes (electrocoagulation) (Merma et al. 2020), can be used to treat these wastewaters. A total of three parts compose the conventional oil–water separation process: the oil content control and monitoring unit, the filtration unit, and the separating unit. The treatment units include the separator and filter units, which employ a range of designs and theories. Typically, the treatment process begins with the use of gravity and centrifugal separators. Other separating methods, referred to as polishing treatment, are then applied. The flotation, coagulation and flocculation, filtration, biological treatment, absorption, and adsorption processes are examples of the polishing unit (E.P.A.W. DC 2011). Standard procedures such as gravity and centrifugation are applied to oily wastewater that is divided into two phases; in the meantime, the emulsified oily bilge water needs to be separated using chemical or biological de-emulsification (Karakulski et al. 1995). Usually, the first step in treating oily water on board is to use gravity to separate the heavier and lighter portions according to density differences. This technique uses oleophilic polymer coalescing materials in the form of parallel plates or loose-packed media to draw oil droplets to the plate (E.P.A.W. DC 2011). Coalescing plate separators made of oleophilic polymers include nylon, fiberglass, and polyethylene (Amran & Mustapha 2021).
The oil droplets must first drift freely and disperse, separate from the coalescing material, and rise to the surface of the tank before they can no longer stick to the plate or media. After that, when the sensors identify the presence of oil, the oil–water separation process is automatically started, and the collected oil is taken out of a waste oil tank. However, this approach is only effective when the phases of water and oil are separated (Koss 1996). In other words, the gravitational technique is incorrect in some instances where the bilge water is emulsified oil produced by chemical emulsifiers (cleaning and agents solvents) as well as mechanical systems like the ship's motion and transfer system pump (E.P.A.W. DC 2011).
Wastewater treatment is currently optimized and modeled using both artificial neural network (ANN) techniques and response surface methodology (RSM). The response surface technique is used to determine the relationship between experimental factors and responses (Zin et al. 2020).
When the wastewater treatment procedure is carried out under substandard conditions, more materials may be used, more time may be spent, and the results may be less favourable. As a result, it is crucial to model and optimize operational parameters on processes. ANN and RSM techniques are commonly used in the optimization and modelling approach. RSM and ANN models were improved for these issues because of their accuracy in making predictions about complex nonlinear systems.
ANNs, a type of machine learning technique that is highly predictive, have been used in the petroleum and energy sectors (Sheikhoushaghi et al. 2022). The feed-forward back-propagation algorithm-trained multilayer perceptron (MLP) architecture is the most widely used artificial neural network (ANN) model. The MLP design consists of at least three layers of processing units coupled by weighted connections. The input vector is found in the first layer, while the output vector is found in the last layer. Neural networks are represented by the intermediate layers, which are also called hidden layers, which modify the input data by creating several weighted connections (Elmabrouk et al. 2014). One of the many reasons ANNs are preferable to mathematical models is their ability to recognize various types of interdependence in biological systems without requiring a previous understanding of the kinetic and metabolic processes of the systems (Heydari et al. 2021). The use of ANNs and other models to enhance water treatment has been the subject of numerous studies, as Table 1 illustrates.
No. . | Water treatment process . | Modeling . | Reference . |
---|---|---|---|
1 | Membrane filtration | ANN and RSM | Sahrae et al. (2023) |
2 | Membrane filtration | Linear machine learning models and RSM | Usman et al. (2023) |
3 | Membrane filtration | ANN | Schmitt et al. (2018) |
4 | Coagulation | ANN and RSM | Fard et al. (2021) |
5 | Adsorption | ANN | Thangavelu & Zou (2022) |
6 | Electrocoagulation | RSM | Jasim et al. (2023) |
7 | Gravitational and addition of chemicals | ANN | This study |
No. . | Water treatment process . | Modeling . | Reference . |
---|---|---|---|
1 | Membrane filtration | ANN and RSM | Sahrae et al. (2023) |
2 | Membrane filtration | Linear machine learning models and RSM | Usman et al. (2023) |
3 | Membrane filtration | ANN | Schmitt et al. (2018) |
4 | Coagulation | ANN and RSM | Fard et al. (2021) |
5 | Adsorption | ANN | Thangavelu & Zou (2022) |
6 | Electrocoagulation | RSM | Jasim et al. (2023) |
7 | Gravitational and addition of chemicals | ANN | This study |
In order to improve the production of oil content removal efficiency, this work attempts to employ RSM and ANNs to simulate the oily water treatment process. RSM-based central composite design with four parameters developed in (DESIG EXPERT software version 7) and an ANN based on the feed-forward neural network (FFNN) principle is created using MATLAB software. Two back-propagation algorithms were utilized to train the feed-forward network utilizing an array of training data [96 data sets] collected over a month from a real operating plant. It is suggested that plant engineers can use this ANN model as a platform to estimate how plants would behave when handling oily water, giving them important information for implementing appropriate control strategies and unsuitable operating conditions.
Previous research studied the oily water treatment system using mathematical models and relied on laboratory tests. This study, however, is based on operational conditions and realistic data from an Iraqi oil field. It presents the impact of temperature, pressure, flow rate, and oil content, all of which were modeled and optimized through the use of two different approaches, ANN and RSM on the overall performance of oil content reduction. Next, the best model is chosen by comparing the anticipated outcomes of these models with the observed values.
The aim of this study is to predict the oil content of produced water from an Iraqi oil field depending on the dataset that was obtained from an operational plant in Iraq in June 2023. An artificial neural network ANN technique was employed in this study to improve the central composite design model for oil content in wastewater. The dataset included a variety of parameters, such as the temperature, pressure, unit flow rate, and oily water concentration (ppm) of the feed water. Despite the significant differences in parameters, the ANN model was remarkably accurate at simulating the process under investigation.
There are four initial sections to this study. An outline of the main issue the study addresses and a list of the most recent solutions are provided in the first section, the introduction. The second section explains the data collection procedure and gives a thorough explanation of the RSM and ANN algorithms used. Results and explanations for both models are presented in the third section, along with a comparison of them. The main conclusions and impacts are finally summed up in the conclusion.
PLANT DESCRIPTION AND PROCESS FLOW
Chemical dosing system
There are 10 kinds of chemicals required to inject in the oily water treatment system. The details of chemical injection are given in Table 2.
Injection point . | Concentration (%) . | Dosage (mg/L) . | Type . |
---|---|---|---|
Inlet of parallel plate skimmer vessel skid A and B | 30 | 140 | Coagulant |
Parallel plate skimmer vessel skid A and B | 0.5 | 5 | Coagulant-aid |
Outlet of double medium filter skid | 30 | 100 | Biocide |
Inlet of control-storage tank A and coalescing oil separator skid | 20 | 80 | Reverse emulsion breaker |
Inlet of water inlet pumps | 30 | 40 | Corrosion inhibitor |
Inlet of water inlet pumps | 30 | 50 | Scale inhibitor |
Inlet of backflushing pumps of walnut shell filter skid | 2–4 | 50 | Filter aid |
Outlet of drainage pumps | 30 | 20 | Deoxidizer |
Inlet of gas stripping column | 40 | 60 | Citric acid |
Outlet of gas stripping column pumps | 20 | 60 | NaOH |
Injection point . | Concentration (%) . | Dosage (mg/L) . | Type . |
---|---|---|---|
Inlet of parallel plate skimmer vessel skid A and B | 30 | 140 | Coagulant |
Parallel plate skimmer vessel skid A and B | 0.5 | 5 | Coagulant-aid |
Outlet of double medium filter skid | 30 | 100 | Biocide |
Inlet of control-storage tank A and coalescing oil separator skid | 20 | 80 | Reverse emulsion breaker |
Inlet of water inlet pumps | 30 | 40 | Corrosion inhibitor |
Inlet of water inlet pumps | 30 | 50 | Scale inhibitor |
Inlet of backflushing pumps of walnut shell filter skid | 2–4 | 50 | Filter aid |
Outlet of drainage pumps | 30 | 20 | Deoxidizer |
Inlet of gas stripping column | 40 | 60 | Citric acid |
Outlet of gas stripping column pumps | 20 | 60 | NaOH |
Oily water treatment system
The oily water treatment system mainly includes the coalescing oil separators, parallel plate skimmer vessel, gas stripping column, filters, the back-flushing wastewater tank and pumps, etc. Two trains with a capacity of 10,000 m3/day for each one will include the equipment mentioned above. The oily water treatment plant will receive the produced water from de-salters and oil storage tanks (water draw-off, oily water from closed drain, and new CPF). The water quality could be complex and could be dissolved hydrogen sulphide. In order to maximize the performance of the dissolved hydrogen sulphide removal, gas stripping columns are designed. Oily water comes from inlet water pumps will go to the coalescing oil separators and parallel plate skimmer vessel to remove the oil and SS, to the gas stripping column to remove H2S, to the filters to remove oil and SS, to meet the standard specification for treated water, and then it will be sent to the water injection station.
Coalescing oil separators
2 × 350 m3/h coalescing oil separators (for one train) will receive water from the 2 × 3,000 m3 control-storage tanks by 3 × 250 m3/h inlet water pumps (for one train) and reduce the oil concentration to below 80 mg/L under normal operating conditions, rendering it suitable for further treatment in the downstream. It will also remove a proportion of suspended solids, and reduce it to below 80 mg/L under normal operating conditions. This outlet water quality should be suitable for further treatment in the downstream parallel plate skimmer vessel.
Parallel plate skimmer vessel
2 × 350 m3/h capacity parallel plate skimmer vessel (for one train) will receive water from the coalescing oil separators and reduce the oil concentration to below 30 mg/L under normal operating conditions, rendering it suitable for further treatment in the downstream. It will also remove a proportion of suspended solids, and reduce it to below 20 mg/L under normal operating conditions. This outlet water quality should be suitable for further treatment in the downstream gas stripping column.
Gas stripping column
2 × 250 m3/h capacity gas stripping column (for one train) will receive water from the parallel plate skimmer vessel and reduce the dissolved hydrogen sulphide (H2S) to 30 mg/L under normal operating conditions, rendering it suitable for further treatment in the downstream nutshell filters.
Gas stripping column pumps
The treated water from the gas stripping column will be pumped into the downstream system. In order to maximize the performance of the oily water treatment system, the oil droplets should be as large as possible. This means that the pumps must be designed to minimize the shearing of the droplets.
Filtration
Walnut shell filters
3 × 180 m3/h capacity units will be installed for the base case flow of 10,000 m3/day, allowing one vessel to be withdrawn from service for backwashing, and maintenance. AII will be online under normal operation. Treated water from the water injection station will be used to backwash the filters.
Double medium filters
6 × 90 m3/h capacity units will be installed for the base case flow of 10,000 m3/day, allowing one vessel to be withdrawn from service for backwashing, and maintenance. Before the water injection station is built, treated water will be sent to the existing evaporation water pool.
Oily water process flow
METHODOLOGY AND DATA COLLECTION
RSM is considered an effective tool to optimize effective parameters in wastewater refinery oil. RSM based on central composite design gives an accurate set of experiments for investigation and data obtained for 30 runs were generalized and analyzed by Design-Expert software version 7.
The best prediction accuracy with the appropriate correlation coefficient (R2) of the ANN model with the mean squared error (MSE) would be employed. The oil content of produced water is predicted using an ANN model that was developed. The predicted data from the ANN model will be compared with the industry data (experimental data). Two training techniques in the MATLAB R2021a program will be used to train the ANN. Several performance metrics will be taken into consideration to select the optimal training algorithm. The oil field in south Iraq will supply the training data set for the ANN model.
To develop the characterization model for the oily water treatment facility, we used the database of an Iraqi oil field. Four essential process parameters for this investigation were obtained for a month from a local field in Iraq: the oil content (ppm), temperature (°C), pressure (Mpa), and flow rate (m3/day) of the input oily water, furthermore operational parameters ranges as follow: 840.02–991.9 ppm, 46.4–68.2 °C, 18.1–20.55 Mpa, and 27,000–29,390 m3/day.
This system's raw dataset is made up of 96 data points that were sampled in June 2023. The oil content of treated water is taken into consideration as an output parameter. Table 3 illustrates the units of measurement together with the maximum and minimum values for each parameter. Oily water up to 20,000 (m3/day) can be processed by the plant.
Parameters of process . | Min. . | Max. . |
---|---|---|
Inputs | ||
Oil content (ppm) | 840.02 | 991.9 |
Temperature (°C) | 46.4 | 68.2 |
Pressure (Mpa) | 18.1 | 20.55 |
Flow rate (m3/day) | 27,000 | 29,390 |
Outputs | ||
Oil content (ppm) | 8.01 | 11.2 |
Parameters of process . | Min. . | Max. . |
---|---|---|
Inputs | ||
Oil content (ppm) | 840.02 | 991.9 |
Temperature (°C) | 46.4 | 68.2 |
Pressure (Mpa) | 18.1 | 20.55 |
Flow rate (m3/day) | 27,000 | 29,390 |
Outputs | ||
Oil content (ppm) | 8.01 | 11.2 |
MODELING USING RSM AND ANN
Response surface methodology
The upper and lower levels of each effective parameter were pre-coded according to initial data from refinery oil as follows: oil content (840–991 ppm), temperature (46.4–68.2 °C), and pressure (18–21 Mpa) and flow rate (27,000–29,390 m3/day).
The experimental design was carried out with six replications of the center point and one replication of the factorial and axial points. Each factor was changed in five levels. The data were constructed and randomized using the Design-Expert software 7, as explained in Table 3.
Samples of wastewater were taken to evaluate the oil content of each sample, and given data were then analyzed via the Design-Expert software 7. Experiment matrix and regression data analysis were designed by Design-Expert software 7. The model was statistically analyzed with the analysis of variance (ANOVA) (Zin et al. 2020).
Artificial neural network
The ANN model simulates the functions of the human brain using mathematical calculations (Alardhi et al. 2024). The architecture format of the ANN model is unique and draws inspiration from the structure of the biological nervous system. The complex and nonlinear organization of neurons in ANN models is similar to that of the human brain (Malekian & Chitsaz 2021). Three groups include the ANN applications that address real-world issues: (i) pattern categorization, (ii) prediction, and (iii) control and optimization (Malekian & Chitsaz 2021). An ANN typically consists of one input layer, one or more hidden layers, and one output layer. Several nodes, or neurons in biological terms, are present in each layer of the neural network architecture and link to the layer after it. Throughout the training process, the weights assigned to the nodes can be changed. To adjust the strength of the signal from the linked neuron, the weights are increased or reduced (Fiyadh et al. 2023).
One of the most important performance factors in determining if the ANN model can be considered usable is its MSE number, which gives information about the model's accuracy. It displays the variance between the experimental data and the expected values. The fit's accuracy rises with a decreasing MSE value. To assess and indicate how closely the data fit the regression line, the value of R2 is utilized. There is less of a variance between the fitted values and the observed data when the R-value is nearer 1. The range of the R2 value is 0 to 1. An acceptable R2 value is defined as more than 0.8.
Artificial neural network design
RESULTS AND DISCUSSION
Response surface methodology
An RSM depending on CCD was developed to evaluate the feasible combinations of four studied parameters (oil content ppm, temperature C, pressure Mpa and flow rate m3/day). The lower and upper bound of parameters are identified initially conditions from oil refinery. Optimization of selected effective parameters is considered crucial parameters for reducing oil content in wastewater. RSM is a strategy of experimentation in which designed parameters are changed together instead of a single parameter at a time. The design based on CCD is satisfactory to apply in this set of experiments because each parameter varies in five levels desired to study a wide range of values for each parameter as shown in Table 7.
The value of alpha was selected based on the hypothesis of rotatability to ensure uniform variance at points that are equal distance from the origin (center point) and hence provide equal precision of response evaluation in any design direction (Montgomery 2017). The center point was run with six replications to get a reasonable estimation of an experimental error (pure error). The center point values are utilized to reveal curvature in the response (they play a part in estimating the coefficient of quadratic terms). The axial point is also used to calculate the coefficient of quadratic terms, while factorial points are used mainly to estimate the coefficients of linear terms and two-way interaction (Montgomery 2017).
The ANOVA for CCD is shown in Table 4. The goodness of the linear model was revealed to be highly significant, with P > F equal to 0.0001. Based on the ANOVA Table 4, all terms showed an insignificant effect on the response except flowrate shows highly significant, where the p value is below 0.05 (p < 0.05). D (flow rate) revealed the most significant factor affecting oil content with the highest p value of 0.0001.
Source . | Sum of squares . | df . | Mean square . | F value . | p-value Prob > F . | . |
---|---|---|---|---|---|---|
Model | 33.27833 | 4 | 8.319583 | 18.471 | <0.0001 | significant |
A – Oil content | 0.001667 | 1 | 0.001667 | 0.0037 | 0.9520 | |
B – Temperature | 0.081667 | 1 | 0.081667 | 0.181315 | 0.6739 | |
C – pressure | 0.06 | 1 | 0.06 | 0.133211 | 0.7182 | |
D – flow rate | 33.135 | 1 | 33.135 | 73.56576 | <0.0001 | |
Residual | 11.26033 | 25 | 0.450413 | |||
Lack of fit | 7.832 | 20 | 0.3916 | 0.571123 | 0.8309 | not significant |
Pure error | 3.428333 | 5 | 0.685667 | |||
Cor total | 44.53867 | 29 | ||||
R2 | 0.747 | |||||
R adjust | 0.706 | |||||
R predict | 0.643 | |||||
St. dev. | 0.67 | |||||
Mean | 9.83 | |||||
Adequate precision | 17.1 |
Source . | Sum of squares . | df . | Mean square . | F value . | p-value Prob > F . | . |
---|---|---|---|---|---|---|
Model | 33.27833 | 4 | 8.319583 | 18.471 | <0.0001 | significant |
A – Oil content | 0.001667 | 1 | 0.001667 | 0.0037 | 0.9520 | |
B – Temperature | 0.081667 | 1 | 0.081667 | 0.181315 | 0.6739 | |
C – pressure | 0.06 | 1 | 0.06 | 0.133211 | 0.7182 | |
D – flow rate | 33.135 | 1 | 33.135 | 73.56576 | <0.0001 | |
Residual | 11.26033 | 25 | 0.450413 | |||
Lack of fit | 7.832 | 20 | 0.3916 | 0.571123 | 0.8309 | not significant |
Pure error | 3.428333 | 5 | 0.685667 | |||
Cor total | 44.53867 | 29 | ||||
R2 | 0.747 | |||||
R adjust | 0.706 | |||||
R predict | 0.643 | |||||
St. dev. | 0.67 | |||||
Mean | 9.83 | |||||
Adequate precision | 17.1 |
Model quality was determined via correlation coefficient (R2), equal to 0.747, R adjusted was 0.706, and R predicted 0.643. It is considered a good result because the difference between R adjusted and R predicted is less than 0.2, which increases the model adequacy. Furthermore, the adequate precision utilized to identify signal-to-noise is thought to be recommendable more than 4. Here the value 17.1 showed that the empirical model is of adequate signal and can be used to navigate the design space (Breig & Luti 2021).
From the determined regression coefficient in Table 5, the resulting matrix from CCD was fitted and can be described in first-order equation (correlation) (Equation (4)) where A is the oil content, B is the temperature, C is the pressure, and D is the flowrate). Correlation is explained by the ‘correlation coefficient, a dimensionless signifier, always between −1 and 1. However, such a simple evaluation carries uncertainty of the correlation coefficient being influenced by a distance from the center point, which can have an enormous impact. Furthermore, the relationship may not be linear but parabolic; therefore, values are interpreted with care (Breig & Luti 2021).
. | . | . | Standard . | 95% CI . | 95% CI . | |
---|---|---|---|---|---|---|
Factor . | Coefficient estimate . | Df . | Error . | Low . | High . | VIF . |
Intercept | 9.826667 | 1 | 0.122531 | 9.57431 | 10.07902 | |
A – Oil content | 0.008333 | 1 | 0.136994 | −0.27381 | 0.290477 | 1 |
B – Temperature | 0.058333 | 1 | 0.136994 | −0.22381 | 0.340477 | 1 |
C – Pressure | −0.05 | 1 | 0.136994 | −0.33214 | 0.232143 | 1 |
D – Flow rate | 1.175 | 1 | 0.136994 | 0.892857 | 1.457143 | 1 |
. | . | . | Standard . | 95% CI . | 95% CI . | |
---|---|---|---|---|---|---|
Factor . | Coefficient estimate . | Df . | Error . | Low . | High . | VIF . |
Intercept | 9.826667 | 1 | 0.122531 | 9.57431 | 10.07902 | |
A – Oil content | 0.008333 | 1 | 0.136994 | −0.27381 | 0.290477 | 1 |
B – Temperature | 0.058333 | 1 | 0.136994 | −0.22381 | 0.340477 | 1 |
C – Pressure | −0.05 | 1 | 0.136994 | −0.33214 | 0.232143 | 1 |
D – Flow rate | 1.175 | 1 | 0.136994 | 0.892857 | 1.457143 | 1 |
Oil content = 9.83 + 8.333E − 003 * A + 0.058 * B − 0.050 * C + 1.18 * D (4).
Validation and optimum conditions
Based on the optimization result, the optimum parameters that minimize oil content are an initial oil content of 991 ppm, a temperature of 46.4 °C, and pressure 21 Mpa, and a flow rate of 27,000 m3/day. A validation experiment was performed using optimized parameters to verify the predicted result. The experiment revealed an average oil content of 8.5 ± 0.7 ppm, near the expected (assumed) value of 8.55 ppm (Fard et al. 2021; Thangavelu & Zou 2022).
ANN model performance
The effectiveness of the ANN model
Parameters . | Model No. . | |||
---|---|---|---|---|
1 . | 2 . | 3 . | 4 . | |
No. of hidden layers | 1 | 1 | 2 | 1 |
No. of neurons | 10 | 11 | 12 | 10 |
R2 (Training) | 0.85 | 0.82 | 0.92 | 0.999 |
Transfer Functions | Tansing | Tansing | Purelin | Tansing |
MSE | 0.001 | 0.001 | 0.002 | 1.55 × 10−7 |
Parameters . | Model No. . | |||
---|---|---|---|---|
1 . | 2 . | 3 . | 4 . | |
No. of hidden layers | 1 | 1 | 2 | 1 |
No. of neurons | 10 | 11 | 12 | 10 |
R2 (Training) | 0.85 | 0.82 | 0.92 | 0.999 |
Transfer Functions | Tansing | Tansing | Purelin | Tansing |
MSE | 0.001 | 0.001 | 0.002 | 1.55 × 10−7 |
RSM versus ANN models
The experimental final oil content values obtained by the different CCD combinations during the 30 runs were described in Table 7, together with the anticipated CCD values and their residuals. Eight axial points, 16 factorial points, and 6 replications of the center points are included in the experiment matrix that was created using CCD in order to determine the pure error. All RSM inputs and outputs were used in the training, testing, and validation of the ANN system. Every experiment was conducted twice, and samples were gathered over the course of the same hour.
Std . | Run . | Point type . | Oil content . | Temperature . | Pressure . | Flowrate . | Oil content CCD . | Oil content ANN . | |||
---|---|---|---|---|---|---|---|---|---|---|---|
Actual . | Predict . | Residual . | Predict . | Residual . | |||||||
17 | 1 | Axial | 764.5 | 57.3 | 19.5 | 28,195 | 9.9 | 9.81 | −0.09 | 9.98 | −0.08 |
30 | 2 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 10.5 | 9.83 | −0.67 | 9.9 | 0.6 |
6 | 3 | Fact | 991 | 46.4 | 21 | 27,000 | 10 | 8.5 | −1.5 | 9.9 | 0.1 |
26 | 4 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 9.8 | 9.83 | 0.03 | 9.7 | 0.1 |
22 | 5 | Axial | 915.5 | 57.3 | 22.5 | 28,195 | 9.5 | 9.73 | 0.23 | 9.4 | 0.1 |
27 | 6 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 9 | 9.83 | 0.83 | 8.9 | 0.1 |
18 | 7 | Axial | 1,066.5 | 57.3 | 19.5 | 28,195 | 9.1 | 9.84 | 0.74 | 9.1 | 0 |
1 | 8 | Fact | 840 | 46.4 | 18 | 27,000 | 8 | 8.6 | 0.6 | 7.8 | 0.2 |
5 | 9 | Fact | 840 | 46.4 | 21 | 27,000 | 8.3 | 8.54 | 0.24 | 8.2 | 0.1 |
25 | 10 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 8.9 | 9.83 | 0.93 | 8.7 | 0.2 |
3 | 11 | Fact | 840 | 68.2 | 18 | 27,000 | 8.1 | 8.75 | 0.65 | 8 | 0.1 |
12 | 12 | Fact | 991 | 68.2 | 18 | 29,390 | 11 | 11.12 | 0.12 | 11.2 | −0.2 |
9 | 13 | Fact | 840 | 46.4 | 18 | 29,390 | 11.4 | 10.9 | 0.5 | 11.3 | 0.1 |
15 | 14 | Fact | 840 | 68.2 | 21 | 29,390 | 11.5 | 11 | −0.5 | 11.5 | 0 |
20 | 15 | Axial | 915.5 | 79.1 | 19.5 | 28,195 | 11 | 9.94 | −1.16 | 10.9 | 0.1 |
8 | 16 | Fact | 991 | 68.2 | 21 | 27,000 | 8.6 | 8.67 | 0.07 | 8.7 | 0.1 |
7 | 17 | Fact | 840 | 68.2 | 21 | 27,000 | 8 | 8.65 | 0.65 | 8.1 | 0.1 |
13 | 18 | Fact | 840 | 46.4 | 21 | 29,390 | 10.9 | 10.89 | −0.01 | 10.8 | 0.1 |
23 | 19 | Axial | 915.5 | 57.3 | 19.5 | 25,805 | 8 | 7.48 | −0.02 | 7.8 | 0.2 |
28 | 20 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 11 | 9.83 | −1.17 | 10.8 | 0.2 |
24 | 21 | Axial | 915.5 | 57.3 | 19.5 | 30,585 | 11.9 | 12.1 | 0.2 | 11.8 | 0.1 |
11 | 22 | Fact | 840 | 68.2 | 18 | 29,390 | 11.1 | 11.1 | 0 | 11.1 | 0 |
10 | 23 | Fact | 991 | 46.4 | 18 | 29,390 | 11 | 11 | 0 | 11.1 | 0.1 |
14 | 24 | Fact | 991 | 46.4 | 21 | 29,390 | 11 | 10.9 | −0.01 | 10.9 | 0.1 |
2 | 25 | Fact | 991 | 46.4 | 18 | 27,000 | 8.9 | 8.65 | −0.35 | 8.9 | 0 |
19 | 26 | Axial | 915.5 | 35.5 | 19.5 | 28,195 | 9 | 9.7 | 0.7 | 8.8 | 0.2 |
16 | 27 | Fact | 991 | 68.2 | 21 | 29,390 | 10.5 | 11.02 | 0.52 | 10.4 | 0.1 |
21 | 28 | Axial | 915.5 | 57.3 | 16.5 | 28,195 | 10.7 | 9.98 | −0.72 | 10.6 | 0.1 |
29 | 29 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 10.1 | 9.83 | −0.18 | 10.1 | 0 |
4 | 30 | Fact | 991 | 68.2 | 18 | 27,000 | 8.1 | 8.7 | 0.6 | 8.1 | 0 |
Std . | Run . | Point type . | Oil content . | Temperature . | Pressure . | Flowrate . | Oil content CCD . | Oil content ANN . | |||
---|---|---|---|---|---|---|---|---|---|---|---|
Actual . | Predict . | Residual . | Predict . | Residual . | |||||||
17 | 1 | Axial | 764.5 | 57.3 | 19.5 | 28,195 | 9.9 | 9.81 | −0.09 | 9.98 | −0.08 |
30 | 2 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 10.5 | 9.83 | −0.67 | 9.9 | 0.6 |
6 | 3 | Fact | 991 | 46.4 | 21 | 27,000 | 10 | 8.5 | −1.5 | 9.9 | 0.1 |
26 | 4 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 9.8 | 9.83 | 0.03 | 9.7 | 0.1 |
22 | 5 | Axial | 915.5 | 57.3 | 22.5 | 28,195 | 9.5 | 9.73 | 0.23 | 9.4 | 0.1 |
27 | 6 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 9 | 9.83 | 0.83 | 8.9 | 0.1 |
18 | 7 | Axial | 1,066.5 | 57.3 | 19.5 | 28,195 | 9.1 | 9.84 | 0.74 | 9.1 | 0 |
1 | 8 | Fact | 840 | 46.4 | 18 | 27,000 | 8 | 8.6 | 0.6 | 7.8 | 0.2 |
5 | 9 | Fact | 840 | 46.4 | 21 | 27,000 | 8.3 | 8.54 | 0.24 | 8.2 | 0.1 |
25 | 10 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 8.9 | 9.83 | 0.93 | 8.7 | 0.2 |
3 | 11 | Fact | 840 | 68.2 | 18 | 27,000 | 8.1 | 8.75 | 0.65 | 8 | 0.1 |
12 | 12 | Fact | 991 | 68.2 | 18 | 29,390 | 11 | 11.12 | 0.12 | 11.2 | −0.2 |
9 | 13 | Fact | 840 | 46.4 | 18 | 29,390 | 11.4 | 10.9 | 0.5 | 11.3 | 0.1 |
15 | 14 | Fact | 840 | 68.2 | 21 | 29,390 | 11.5 | 11 | −0.5 | 11.5 | 0 |
20 | 15 | Axial | 915.5 | 79.1 | 19.5 | 28,195 | 11 | 9.94 | −1.16 | 10.9 | 0.1 |
8 | 16 | Fact | 991 | 68.2 | 21 | 27,000 | 8.6 | 8.67 | 0.07 | 8.7 | 0.1 |
7 | 17 | Fact | 840 | 68.2 | 21 | 27,000 | 8 | 8.65 | 0.65 | 8.1 | 0.1 |
13 | 18 | Fact | 840 | 46.4 | 21 | 29,390 | 10.9 | 10.89 | −0.01 | 10.8 | 0.1 |
23 | 19 | Axial | 915.5 | 57.3 | 19.5 | 25,805 | 8 | 7.48 | −0.02 | 7.8 | 0.2 |
28 | 20 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 11 | 9.83 | −1.17 | 10.8 | 0.2 |
24 | 21 | Axial | 915.5 | 57.3 | 19.5 | 30,585 | 11.9 | 12.1 | 0.2 | 11.8 | 0.1 |
11 | 22 | Fact | 840 | 68.2 | 18 | 29,390 | 11.1 | 11.1 | 0 | 11.1 | 0 |
10 | 23 | Fact | 991 | 46.4 | 18 | 29,390 | 11 | 11 | 0 | 11.1 | 0.1 |
14 | 24 | Fact | 991 | 46.4 | 21 | 29,390 | 11 | 10.9 | −0.01 | 10.9 | 0.1 |
2 | 25 | Fact | 991 | 46.4 | 18 | 27,000 | 8.9 | 8.65 | −0.35 | 8.9 | 0 |
19 | 26 | Axial | 915.5 | 35.5 | 19.5 | 28,195 | 9 | 9.7 | 0.7 | 8.8 | 0.2 |
16 | 27 | Fact | 991 | 68.2 | 21 | 29,390 | 10.5 | 11.02 | 0.52 | 10.4 | 0.1 |
21 | 28 | Axial | 915.5 | 57.3 | 16.5 | 28,195 | 10.7 | 9.98 | −0.72 | 10.6 | 0.1 |
29 | 29 | Center | 915.5 | 57.3 | 19.5 | 28,195 | 10.1 | 9.83 | −0.18 | 10.1 | 0 |
4 | 30 | Fact | 991 | 68.2 | 18 | 27,000 | 8.1 | 8.7 | 0.6 | 8.1 | 0 |
The ability of RSM-CCD and the created ANN model to forecast a decrease in oil content was assessed. Table 8 displays the calculated statistical parameters that compare and evaluate the degrees of correctness of the two models. The model's lowest MSE and highest R2 scores indicate how accurate it is. Since R2 evaluates the connection between the response under study and the operating parameters, higher findings (up to 1) suggest a strong correlation between the two datasets. Regression analysis usually employs MSE to validate experimental results since a lower value indicates that the data are concentrated around the line of best-fit (prediction-errors). In light of the aforementioned data, both models demonstrated high predictive ability. Nonetheless, an analysis of the two models reveals that the ANN model outperforms the RSM-CCD model for R2 and another goodness-of-fit metric. Consequently, compared to the CCD model, the ANN model predicts a higher reduction in oil content through the west water process.
Measure . | RSM . | ANN . |
---|---|---|
test . | ||
R2 | 0.74 | 0.994 |
MSE | 8.32 | 1.55 × 10−7 |
Number of runs used | 30 | 30 |
Measure . | RSM . | ANN . |
---|---|---|
test . | ||
R2 | 0.74 | 0.994 |
MSE | 8.32 | 1.55 × 10−7 |
Number of runs used | 30 | 30 |
The present result is consistent with that put forth by Ezemagu et al. (2021), which found that the ANN model performed better than the RSM.
In a second overall comparison, linear regression analysis was used to evaluate the actual and predicted values of the ANN and CCD models (Figure 6). Once more, Figure 14 of ANN shows considerable fitting with a higher R2 value, suggesting that ANN produces better optimization results than CCD. Consequently, when compared to the CCD model, the ANN model's capacity for generalization is significantly higher.
Nonetheless, there are a few advantages to CCD modeling. For example, by emphasizing each factor's contribution to regression models, the CCD's structure can help remove unnecessary components from the model. Additionally, because ANN modeling involves multiple iterative calculations, it requires more computation time than CCD. However, the ANN-generated model showed better prediction accuracy than the CCD model did. This can be explained by its comprehensive ability to replicate the nonlinearity of the system; CCD's restriction to the second-order polynomial requires only one computation step for a response surface model. However, ANN consistently yields superior results than CCD in every aspect.
LITERATURE REVIEW
In order to maximize the performance of the system, environmental engineering factors are crucial for optimization. The majority of wastewater treatment systems involve several variables, making conventional approach optimization rigid, unreliable, and time-consuming. Therefore, it is preferred to use a different approach that will be more efficient and adaptable for parameter optimization of different wastewater treatment procedures. There are many studies that dealt with mathematical models with different methods of water treatment, but regarding water produced from oil fields, we did not find many. Therefore, this study includes some studies that dealt with the subject of water treatment in general with the use of different mathematical models:
A Box–Behnken design was used by Sanchez-Sanchez et al. (2024) for the analysis using a gray wolf optimizer (GWO)-coupled ANN model and response surface methodology (RSM) to analyze the effect of three operating parameters (volumetric exchange ratio [VER], aeration rate [AR], and cycle time [CT]) manipulated during an aerobic granular sludge process (AGS) sequencing batch reactor on modeling the removal of chemical oxygen demand (COD) in mixed wastewater. The most efficient architecture for COD showed the highest efficiency for modeling the AGS. The RSM model and plot results indicate that the CT and AR were the most influential on COD removal efficiency. When compared with models with statistical indices, GWO-ANN demonstrated higher performance compared to RSM.
The effectiveness of the electrocoagulation approach, which uses aluminum electrodes in a batch bi-polar system, was tested by Moneer et al. (2023) in order to ascertain how well oil was removed from oily wastewater. The impacts of four independent factors: oil volume (X1), temperature (X2), initial pH (X3), and treatment duration (X4) were investigated to look into turbidity recovery and conductivity changes. The Box–Behnken design was used to improve conditions. The findings showed that reducing oil volume and lengthening treatment times greatly improve turbidity removal and conductivity changes. When the treatment method is used under ideal operating conditions, it can remove turbidity and conductivity with removal efficiencies of 97.3 and 73.4%, respectively.
In 2020, Najafzadeh & Oliveto (2022) looked into a novel approach in which machine learning (ML) models are used to generate equations for the estimation of the rate at which currents will cause scouring to spread along pipes. The primary dimensionless parameters from the credible literature were identified as the following: the current angle of attack on the pipeline, the Shields parameter, the approaching flow Froude number, and the ratio of embedment depth to pipeline diameter. Various optimization techniques and setting parameters derived from evolutionary and classification components were used to construct machine learning models. Additionally, the explicit equations produced by machine learning models were utilized to show how the suggested methods agree with experimental findings.
Saberi-Movahed et al. (2020) reported the development of a novel GMDH method called the GMDH network by using an extreme learning machine (GMDH-ELM). This method eliminates the need to update the weighting coefficients of the quadratic polynomials used in conventional GMDH during the training stage through the use of other evolutionary algorithms or backpropagation techniques. In actuality, a relationship between the input and output in each neuron of the GMDH model was established using an intermediate parameter. From a different perspective, this study demonstrated for the first time that it is possible to enhance the procedure and obtain comparably higher levels of accuracy by successfully integrating the idea of ELM into the basic structure of GMDH.
In a piping modeling study by Najafzadeh (2019), the conjugate depths of the hydraulic jump in the circular pipes were assessed using three numerical models based on evolutionary computing: gene-expression programming (GEP), model tree (MT), and evolutionary polynomial regression (EPR). To find a functional relationship between the input and output variables, ideas of particular force produced two non-dimensional factors. The results achieved with conventional methods were compared with the performances of the proposed ways. When MT was compared to those other artificial intelligence (AI) models and empirical equations, its performance showed an accurate prediction of conjugate depths (R2 = 0.995 and RMSE = 0.023). Instead of using polynomial quadratic neurons in each GMDH neuron, GEP was carried out in each, as demonstrated by another work by the same last author (Najafzadeh & Saberi-Movahed 2019) The size of the sediment, the shape of the pipeline, and the features of the waves upstream of the pipeline are effective parameters on the three-dimensional scour rates. The dimensional analysis technique considers four-dimensionless parameters as input variables. In addition, scour rates in the pipeline's left and right segments as well as vertical scour rates are established as output parameters. Several statistical indices are used to assess the training and testing stage results of the proposed GMDH-GEP models. The GMDH-GEP models' performances are contrasted with those of the GEP, GMDH, ANN, and conventional equation-based regression models.
Rastegar et al. (2011) used RSM to forecast influent COD, up-flow velocity (Vup), and HRT behaviors in the bioreactor. Reduced quadratic and cubic models were shown to be the most effective models for COD removal and biogas generation rate, respectively, according to RSM. Based on two crucial reactions, the ideal region was determined to have an influent COD of 630 mg/L, a Vup of 0.27 m/h, and an HRT of 21.4 h. As a consequence, the COD removal efficiency was 76.3%, and the biogas production rate was 0.25 L/L feed.
Tir & Mostefa (2008) conducted an initial experimental investigation (Tir & Moulai-Mostefa 2008) in order to determine the most accurate operating parameters, which are subsequently utilized to calculate the efficiency of oil removal. Turbidity and COD measurements were used to estimate oil separation in an experimental design that employed the response surface approach. It was possible to identify an ideal area with low COD and turbidity levels. The primary influences of the operational factors were also looked into as part of the improved procedure. The experimental findings demonstrated the high efficiency of electrocoagulation, which achieved 99% turbidity, 90% COD in less than 22 min, and a current density of 25 mA cm(−2). The variance coefficient (R2) value was high in the ANOVA results.
CONCLUSIONS
It is still challenging for the petroleum industry to reduce the environmental impact of produced water from oil fields and wastewater from petrochemical processes. RSM and ANN were used to simulate the oily water treatment procedure in an Iraqi oil field case study. Actual industrial parameters were entered into RSM/CCD to minimize oil content and used to validate the model, and the findings demonstrated the model's high accuracy. The ANN model was created by implementing the optimal design of the neural network with one hidden layer for the provided architecture, which is the 4–10–1 architecture. The testing results showed an R2 of more than 0.999, and the training set as a whole had a low MSE of less than 1.55 × 10−7, demonstrating the effectiveness of ANN models in predicting the oil content of treated water. The results are highly accurate based on the several indices, as well as the anticipated and actual outcomes, used in this study to test the reliability of the ANN models. R2, RE, and MSE are these markers which are equal to 0.994, 2.5, and 1.55, respectively. The created ANN model could be integrated with various optimization techniques to accelerate learning and prediction.
DATA AVAILABILITY STATEMENT
All relevant data are included in the paper or its Supplementary Information.
CONFLICT OF INTEREST
The authors declare there is no conflict.