Oil content (OC) is one of the important evaluation indicators in oilfield wastewater (OW) treatment. The purpose of this study is to realize online real-time detection of OC in OW by combining ultraviolet spectrophotometry with the convolutional neural network (CNN). In this paper, 80 groups of OW transmission data were measured for model establishment. Three CNN models with different structures are established to generalize the super parametric optimization process of the model. Furthermore, as a common method used in spectroscopy, the synergy interval partial least squares (siPLS) model is built in order to compare its accuracy with the CNN model. The results indicated the CNN model has a better performance than siPLS, in which the CNN model numbered Model 3 has the lowest root mean square error (MSE) of prediction (RMSEP) of 1.606 mg/L. As a consequence, the CNN model can be used in the monitoring of OW. This article will guide a rapid analysis of the OC of OW.

  • Transmission spectra of the oil wastewater with different oil contents were determined.

  • Accuracy of the siPLS and CNN models for the prediction of the oil content in the oil wastewater was compared.

  • The CNN model is more suitable to analyze the oil content of oilfield wastewater than the siPLS model.

  • The CNN models with different depth structures were compared, and the deepest model exhibited the highest ability of prediction.

Graphical Abstract

Graphical Abstract
Graphical Abstract

Based on the application of tertiary oil extraction in China oilfields in recent years (Seyyedsar et al. 2016; Hamza et al. 2017; Wei et al. 2018), the annual oilfield wastewater (OW) has continued to increase (Cui et al. 2019; Zhang et al. 2019a, 2019b), in which emulsion oil, suspended solids, and dissolved salts are major pollutants. OW must be treated and reinjected into the oil well after reaching the standard (He et al. 2020). Oil content (OC) is one of the important parameters in OW treatment, so this work is to research an effective method to rapidly analyze the OC of OW.

A traditional method to detect the OC of OW is using the standard curve (Qi et al. 2018). This method is based on the comparison of the absorbance of the extract collected from OW samples with the standard curve to obtain the OC. However, this method is not suitable for rapid measurement of OC in OW because it requires manual operation and involves complicated steps. The spectral detection method is used for the analysis of a variety of substances and has the characteristics of non-contact, fast, and high precision (Leng et al. 2021; Cardoso et al. 2022; Liu et al. 2022; Ma et al. 2022), so it is more suitable to be used in the online real-time monitor. Many studies are improving this point in recent years. Yan et al. (2020a, 2020b) proposed a semi-supervised soft sensor modeling method based on a deep neural network that included an embedment of a manifold into a deep neural regression network, which is used to predict the total Kjeldahl nitrogen concentration of the sewage treatment process. Yang et al. (2022) applied near-infrared (NIR) spectroscopy to reactive group analysis of the photothermal curing silicone, and they established the calibration model for the epoxy and the C = C double bond values. The result shows that the determination coefficient (R2) of the calibration model for the epoxy and the C = C double bond values were 0.9631 and 0.9689, respectively, and the corresponding root mean square error (RMSE) was 0.0179 and 0.0143, respectively. Zhang et al. (2021) compared the ability of UV and infrared (IR) spectroscopy based on partial least squares (PLS) to quantitatively analyze diesel content in the oil mixture. The results show that IR spectroscopy combined with PLS has a higher accuracy, in which RMSEP and RMSE of cross validation (RMSECV) reaching 0.0181 and 0.111, respectively. Qi et al. (2018) determined the UV transmittance spectra of the configured OW by measurements dealing with the UV–Vis spectrophotometer. The researchers focused on the relationship between the absorption coefficient and its OC, they discovered that the fitting effect of the third-order polynomial at 234 nm is the best, and the fitting accuracy is 0.9940. Shimamoto & Tubino (2016) established the PLS model to quantitatively identify the differences between biodiesel and vegetable oil based on UV–Vis spectroscopy and they found that full-spectrum data can identify differences that single-wavelength data cannot. The above papers suggest that spectrophotometry combined with an appropriate quantitative analysis model has high accuracy with less response time in substance detection. However, there are few studies on the direct analysis of actual water samples at the oilfield site due to the complex nature of OW, which makes the analysis of OC difficult. Therefore, this work selected and established the quantitative analysis model based on data from actual water samples at the oilfield site.

As a popular deep learning method, the convolutional neural network (CNN) has been widely employed in image recognition (Bora et al. 2020), video analysis (Ma et al. 2017; Hu et al. 2019), and language identification (Pande et al. 2022; Zhang et al. 2022a, 2022b). Lately, the CNN has been gradually applied to spectral data analysis (Liu et al. 2017; Malek et al. 2018; Zhang et al. 2019a, 2019b). For the prediction of nine different soil properties, four different regression methods based on NIR spectroscopy and Fourier transform infrared spectroscopy (FTIR) are analyzed by Haghi et al. (2021) in terms of predictive performance. The outcomes of this work pointed out that the one-dimensional CNN model gives better results compared to those obtained by PLS, support vector regression, and the two-dimensional CNN model for all the considered soil components regarding the RMSEP. Yan et al. (2020a, 2020b) developed quantitative calibration models by online Raman spectroscopy combined with the CNN to compare with the PLS quantitative calibration model, and the experimental results show that the CNN model using raw spectra was found better than the PLS model developed with preprocessed spectra for most analytes. Zhang et al. (2022a, 2022b) reported that the combination of hyperspectral imaging technology and deep convolutional generative adversarial network could generate single maize kernel spectral data and OC data, and the data was used to improve the performance of the OC prediction model. Wu et al. (2022) evaluated the ability of the one-dimensional CNN, SVM and improved PLS discriminant analysis in combination with Raman spectroscopy to specify three types of vegetable blend oil, and the results proved that the overall performance of the one-dimensional CNN was significantly higher than the other two methods. The above studies prove that the CNN performed well in spectrum analysis, which can be attributed to its ability of non-linear fitting. Meanwhile, the CNN has better processing performance of the high-dimensional independent variable matrix and discrete points than linear analysis. Nevertheless, the model-building process of the CNN is invisible, unlike linear fitting, which can find the polynomial between independent variables and dependent variables. The number of samples is also one of the factors that influence the performance of the CNN. When the sample size is small, the accuracy of the model is low, and the adaptability is poor. On the other hand, in different fields of wastewater monitoring, the best qualitative analysis model may be different.

Above all, it is obvious that the UV transmission spectrum combined with the quantitative analysis model, such as PLS or CNN, is suitable for online, real-time monitoring of OC in OW. The CNN model and PLS model have their own advantages and disadvantages, so it is necessary to compare the two models and select the one with the highest accuracy. In this paper, the UV spectra of 80 groups of OW samples were measured. The optimization process of the CNN model is complicated due to its many super parameters, so three CNN models with different structures were established and compared to summarize the model optimization process. Then these three CNN models were compared with the siPLS model to select the quantitative analysis model with the highest accuracy. It is hoped that the outcomes of this work can contribute to the machine learning implementations in the fields related to OW evaluation.

Material and transmittance spectra measurement

OW samples were collected from the sewage treatment stations of five oil extraction plants in Daqing. There are a total of 80 groups of OW samples collected from different injection wells and each injection well was sampled twice at a 2 h interval. The UV–Vis spectrophotometer (TU1900) was used to measure the transmission spectrum of OW with an optical path of 10 mm. The wavelength range was 190–900 nm and the resolution was 2 nm.

The standard curve method (spectrophotometry) was used to measure the OC of each OW sample and the distribution of OC was shown in Table 1. The spectra of 80 groups of OW samples were randomly separated into a training set (1–70 samples) and a test set (71–80 samples). The UV spectra of partial OW samples with different OC are shown in Figure 1.
Table 1

The OC in re-injection samples

NumberOC (mg/L)NumberOC (mg/L)NumberOC (mg/L)
0.99 28 11.91 55 9.68 
16.76 29 6.59 56 2.40 
11.64 30 13.41 57 6.31 
5.96 31 9.76 58 15.07 
2.68 32 7.11 59 12.48 
3.76 33 25.54 60 13.32 
1.45 34 31.60 61 0.91 
9.22 35 3.91 62 27.43 
20.26 36 0.76 63 19.92 
10 8.53 37 3.22 64 3.85 
11 0.93 38 1.24 65 14.24 
12 1.17 39 2.17 66 2.39 
13 22.66 40 11.94 67 1.70 
14 5.92 41 2.09 68 11.16 
15 4.38 42 0.98 69 8.71 
16 10.37 43 19.04 70 2.21 
17 13.28 44 2.03 71 12.08 
18 0.92 45 1.57 72 8.04 
19 3.47 46 1.72 73 6.70 
20 14.37 47 15.56 74 3.39 
21 1.07 48 6.48 75 15.47 
22 2.71 49 17.92 76 1.39 
23 4.68 50 3.75 77 10.76 
24 2.50 51 13.86 78 11.84 
25 2.63 52 3.94 79 21.08 
26 11.13 53 1.53 80 3.43 
27 0.91 54 21.76 – – 
NumberOC (mg/L)NumberOC (mg/L)NumberOC (mg/L)
0.99 28 11.91 55 9.68 
16.76 29 6.59 56 2.40 
11.64 30 13.41 57 6.31 
5.96 31 9.76 58 15.07 
2.68 32 7.11 59 12.48 
3.76 33 25.54 60 13.32 
1.45 34 31.60 61 0.91 
9.22 35 3.91 62 27.43 
20.26 36 0.76 63 19.92 
10 8.53 37 3.22 64 3.85 
11 0.93 38 1.24 65 14.24 
12 1.17 39 2.17 66 2.39 
13 22.66 40 11.94 67 1.70 
14 5.92 41 2.09 68 11.16 
15 4.38 42 0.98 69 8.71 
16 10.37 43 19.04 70 2.21 
17 13.28 44 2.03 71 12.08 
18 0.92 45 1.57 72 8.04 
19 3.47 46 1.72 73 6.70 
20 14.37 47 15.56 74 3.39 
21 1.07 48 6.48 75 15.47 
22 2.71 49 17.92 76 1.39 
23 4.68 50 3.75 77 10.76 
24 2.50 51 13.86 78 11.84 
25 2.63 52 3.94 79 21.08 
26 11.13 53 1.53 80 3.43 
27 0.91 54 21.76 – – 
Figure 1

UV spectra of the test set.

Figure 1

UV spectra of the test set.

Close modal

Considering Figure 1, the transmission spectra of the test set showed a similar trend at wavelengths between 190 and 900 nm. The transmittance increases gradually with respect to the increase in wavelength in the range of 190–500 nm, changes a little between 500 and 850 nm, and decreases with the increment in wavelength between 850 and 900 nm. OW with a higher OC has lower transmittance, which follows the Lambert–Beer law.

Establishment of the CNN model

The characteristics of local connections and weight sharing (LeCun et al. 2015) that the CNN has can greatly reduce the number of parameters, increase the ability to extract data features, and prevent model overfitting. The convolutional layer, which is the core structure of the CNN, can locally perceive the spectral data, which is conducive to extracting the spectral information of peaks and valleys. Therefore, the convolutional layer is more suitable for the analysis and processing of one-dimensional spectral data.

The structure of the two-layer CNN is shown in Figure 2. The input layer is the blue part of the first layer. The first convolutional layer is the yellow part of the second layer. The convolution kernel size of this layer is 3, whereas the step size is 1. Therefore, the data of three blue nodes are received by one yellow node, and each convolution operation moves one step backward. The weights of the convolution kernel are shared by each convolution operation. The second convolution layer is the green part of the third layer. The convolution kernel size and the step size are, respectively, 3 and 2 at this layer. Therefore, the convolution operation of the second layer's convolution layer is performed as shown in Figure 2. The training parameters are greatly reduced by this connection method.
Figure 2

The structure of a two-layer CNN. Please refer to the online version of this paper to see this figure in colour: http://dx.doi.org/10.2166/wst.2023.097.

Figure 2

The structure of a two-layer CNN. Please refer to the online version of this paper to see this figure in colour: http://dx.doi.org/10.2166/wst.2023.097.

Close modal

In addition to the convolutional layer, the CNN also has pooling, activation, flatten, and fully connected layers. The activation layer can introduce non-linear factors to solve the problems that cannot be solved by the linear model. The function of the pooling layer is to extract the main features to simplify the complexity of network calculations. In order to flatten the multi-dimensional input into one dimension, the flatten layer function is considered. The function of the fully connected layer is to summarize the features learned from the models and map them with sample labels. The output layer is responsible for outputting the results. In addition, the CNN can also add batch normalization (BN) layer (Ioffe & Szegedy 2015), dropout function (Srivastava et al. 2014), L1 regularization, and L2 regularization to prevent overfitting of the model.

The hyperparameters, which include the kernel size, the kernel number, the hidden number of neurons in the fully connected layer, the dropout rate, and the batch size, are shown in Table 2. All the above hyperparameters can be resized to optimize the CNN model, so the optimized process is complicated in this study. Three CNN models with different structures are shown in Figure 3 to show the summary optimization process. The three models were named Models 1, 2, and 3 regarding different depths of the model. As the number of samples in the training set is small, 10-fold validation was employed to prevent overfitting. All samples except the test set were randomly divided into the training set and validation set ten times in the process of model-building.
Table 2

Hyperparameter optimization results of each CNN model

Model 1Model 2Model 3
Kernel size 
Kernel number 16 16 
Hidden number 128 64 32 
Dropout rate 0.5 0.2 0.3 
Batch size 16 
Model 1Model 2Model 3
Kernel size 
Kernel number 16 16 
Hidden number 128 64 32 
Dropout rate 0.5 0.2 0.3 
Batch size 16 
Figure 3

The structure of the three CNN models.

Figure 3

The structure of the three CNN models.

Close modal
The structure of Model 1 is shown in Figure 3, which includes one convolutional layer (Conv1), one max-pooling layer (Pooling1), one fully connected layer (F1), and one output layer (Output). The dropout function was added to the F1 layer, which can randomly kill some neurons to prevent the overfitting of the model. For the objective function, mean square error (MSE) was used to minimize the sum of squares loss and L2 regularization was used to prevent overfitting. The equation of the objective function is shown as Equation (1):
formula
(1)
where and are, respectively, the predicted and the measured values. The number of samples in the training set is indicated by n, while the weight matrix and the regularization coefficient are denoted by w, λ, respectively.

As the activation function of the convolutional layer and F1 layer, the rectified linear unit (ReLU) function was used in the model. For the training of the model and finding the local optimal solution of the objective function, the Adam optimizer is used.

Model 2 includes two convolutional layers (Conv1 and Conv2), two max-pooling layers (Pooling1 and Pooling2), one fully connected layer (F1), and one output layer (Output). The dropout function was added to the F1 layer like Model 1. The difference between Model 3 and the other two CNN models is that there are three convolutional layers and three max-pooling layers (Conv1, Conv2, Conv3, Pooling1, Pooling2, and Pooling3). The dropout function was also added to the fully connected layer.

The three models have the same objective function and activation function.

Establishment of the siPLS model

As a common model in spectral quantitative analysis, the PLS model has been widely used in many works and it has many improved versions. Interval partial least squares (iPLS) is a PLS algorithm proposed by Nørgaard et al. (2000). The principle of iPLS is to divide the entire modeled band into multiple sub-intervals of the same width, then build multiple PLS models on each sub-interval to select the modeling interval with the highest accuracy. The siPLS (Chen et al. 2008) is a model combining two or more sub-intervals based on the iPLS, which has better predictive power than the siPLS.

Based on the spectral data of the OW, the siPLS model and three CNN models were established. The training set data was utilized to establish the model, while the test set data was considered to validate it.

The evaluation parameters of the regression model

The selected evaluation parameters of the regression model mainly include the correlation coefficient of the calibration set (rc), the correlation coefficient of the prediction set (rp), RMSECV, and RMSEP. The prediction results of the regression model have the best accuracy when the rc and rp achieve the maximum, whereas the RMSECV and RMSEP achieve the minimum. The calculations of these quantities are given in the following Equations (2)–(4):
formula
(2)
formula
(3)
formula
(4)

In these equations, represents the real value; indicates the predictive value; expresses the average of ; stands for the average of ;and n and m represent the number of the correction sets and the prediction set, respectively.

The decrease in RMSECV means that the loss of model training is reduced. The lower value of the RMSEP means the lower the prediction loss of the model to the test set. The model is overfitting when RMSECV is too lower than RMSEP as well as RMSEP does not reach the requirement. On the contrary, if RMSEP, as well as RMSECV are higher than expected, the model is underfitting. The hyperparameters should be optimized under both of the above cases until RMSEP reaches the requirement.

The comparison of the three CNN models with the siPLS model

Models 1, 2, 3, and the siPLS model were used to predict the test set data one by one. The relevant correlation plots between the predicted values and the measured values on the training set and test set are shown in Figure 4 for each considered model.
Figure 4

The correlation plots between the predicted and the measured values of the CNN on the training set and test set, obtained from (a) Model 1; (b) Model 2; (c) Model 3; and (d) siPLS.

Figure 4

The correlation plots between the predicted and the measured values of the CNN on the training set and test set, obtained from (a) Model 1; (b) Model 2; (c) Model 3; and (d) siPLS.

Close modal

The prediction results of the three CNN models are shown in Figures 4(a)–(c). The three models show great prediction ability where the RMSEP is lower than 3 and the rp is higher than 0.88. Model 3 has the best prediction results among all models. The rp, RMSEP, rc, and RMSECV of Model 3 are 0.9732, 1.606, 0.9949, and 0.845, respectively. As shown in Figure 4(d), the rp, RMSEP, rc, and RMSECV of the siPLS model are 0.7375, 3.372, 0.8140, and 4.325, respectively.

The outcomes obtained from all models are given in Table 3 for a more comprehensive comparison. It can be concluded from Table 3 that Model 3 has the best prediction ability, followed by Model 1 and then Model 2. The siPLS model has the worst prediction ability. The prediction accuracy of Model 1 and Model 2 is close. Meanwhile, the prediction ability of Model 3 has been greatly improved from Model 1 and the RMSEP has been reduced by 44.20%. Compared with the siPLS model, the RMSEP of Models 1, 2, and 3 decreased by 14.65, 12.77, and 52.35%, respectively. The prediction results of the CNN are generally better than the results of the siPLS model.

Table 3

The comparison of evaluation parameters calculated by each model

RMSECVRMSEPrvrp
Model 1 1.298 2.878 0.9850 0.8996 
Model 2 2.231 2.941 0.9536 0.8801 
Model 3 0.845 1.606 0.9949 0.9732 
siPLS 4.325 3.372 0.8140 0.7375 
RMSECVRMSEPrvrp
Model 1 1.298 2.878 0.9850 0.8996 
Model 2 2.231 2.941 0.9536 0.8801 
Model 3 0.845 1.606 0.9949 0.9732 
siPLS 4.325 3.372 0.8140 0.7375 

The errors of three CNN models

The absolute and relative errors between the predicted and measured values of the three CNN models on the test set were calculated and given in Figure 5. As shown in Figure 5(a), the absolute errors of the predicted values in the range of 1.39–15.47 mg/L are lower than 2 mg/L. The CNN model has good prediction ability for samples in the range of 1.39–15.47 mg/L. The absolute errors of the predicted values of 21.08 mg/L are higher than 4 mg/L and the absolute errors of Model 1 and Model 2 are higher than 8 mg/L. This shows that the CNN model has poor prediction ability for 21.08 mg/L samples. Considering the calculation results given in Figure 5(b), the relative errors of samples with the concentration of 1.39 and 3.39 mg/L are larger, while the relative errors of samples with the concentration of 3.43–15.47 mg/L are smaller. However, the relative error of the 21.08 mg/L sample with high OC is also larger. As can be seen from Figure 5, the relative errors of Model 3 are below 30%, while those of Model 1 and Model 2 for some samples are above 40%, which indicates that Model 3 has a better fitting effect and better prediction ability compared to the other examined models.
Figure 5

(a) The absolute and (b) relative errors between the predicted and the measured values of the CNN on the test set.

Figure 5

(a) The absolute and (b) relative errors between the predicted and the measured values of the CNN on the test set.

Close modal

It should be noted that the depth of Model 3 is deeper than that of Model 2 and Model 2 is deeper than Model 1. Theoretically, on the premise that the model does not overfit, the more complex the structure, the higher the accuracy of the model. For example (Lopez-Fornieles et al. 2022), in the PLS model optimization process, the accuracy raises with the increase of the latent variable until it reaches the highest point, then the model becomes overfitting and the accuracy decreases. But Model 2 is the model with the lowest accuracy instead of Models 1 or 3 in this work, which can be attributed to the fact that the CNN model optimization process is a process of shrinking from a big circle to a small circle rather than change of the point in a curve. Furthermore, PLS is the linear model, while the CNN can deal with the non-linear correlation of data, so the CNN model can more accurately analyze the spectrum of OW with complex properties. Nevertheless, the optimization process of the CNN model is more complex and time-consuming compared with the PLS model, which should be considered in the practical application.

On the other hand, the influence of sample distribution on model accuracy cannot be ignored (Buda et al. 2018). As shown in Figure 5, the predicted error of samples with an OC concentration of more than 20 mg/L is relatively higher than samples with other concentrations for all three models, because there are few samples with a concentration of more than 20 g/L. Meanwhile, the water quality of samples with excessive OC is more complex and it is more difficult to predict with the model. As a result, in practical application, it is necessary to take samples with a uniform distribution. Also, water quality should be added into influence factors in a further study.

In this paper, the UV spectra of 80 groups of OW samples with different OCs were measured and analyzed. The CNN model and siPLS model were established for comparison. Three CNN models with different structures were selected to show the general optimization process. The following conclusions can be drawn:

  • OC in OW can be quickly detected based on UV spectrophotometry and a quantitative analysis model, in which the CNN model has higher accuracy than the siPLS model.

  • The optimization process of the CNN model is more complicated than that of the PLS model, which means more time for model establishment and better accuracy.

The authors would like to thank Daqing Oil Field Co. for the support of sample collection.

This work was supported by the Youth Innovative Talents Training Plan of General Undergraduate University in Heilongjiang Province (UNPYSCT-2020148), the Open project of MOE Key Laboratory for Enhanced Oil and Gas Recovery (No. NEPU-EOR-2022-06), and the Postdoctoral support project of Heilongjiang Province (LBH-Q21084).

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Bora
M. B.
,
Daimary
D.
,
Amitab
K.
&
Kandar
D.
2020
Handwritten character recognition from images using CNN-ECOC
.
Procedia Computer Science
167
,
2403
2409
.
Cardoso
W. J.
,
Gomes
J. G. R.
,
Roque
J. V.
,
Barbosa
M. H. P.
&
Teófilo
R. F.
2022
Dehydration as a tool to improve predictability of sugarcane juice carbohydrates using near-infrared spectroscopy based PLS models
.
Chemometrics and Intelligent Laboratory Systems
220
,
104459
.
Chen
Q.
,
Zhao
J.
,
Liu
M.
,
Cai
J.
&
Liu
J.
2008
Determination of total polyphenols content in green tea using FT-NIR spectroscopy and different PLS algorithms
.
Journal of Pharmaceutical and Biomedical Analysis
46
(
3
),
568
573
.
Cui
J.
,
Zhou
Z.
,
Xie
A.
,
Meng
M.
,
Cui
Y.
,
Liu
S.
,
Lu
J.
,
Zhou
S.
,
Yan
Y.
&
Dong
H.
2019
Bio-inspired fabrication of superhydrophilic nanocomposite membrane based on surface modification of SiO2 anchored by polydopamine towards effective oil-water emulsions separation
.
Separation and Purification Technology
209
,
434
442
.
Hamza
M. F.
,
Sinnathambi
C. M.
&
Merican
Z. M. A.
2017
Recent advancement of. hybrid materials used in chemical enhanced oil recovery (CEOR): a review
.
IOP Conference Series: Materials Science and Engineering
206
,
012007
.
He
L.
,
Yang
L.
,
Zhang
L.
,
Wang
Z.
,
Cheng
H.
,
Wang
X.
,
Lv
J.
,
Zhang
J.
,
Mo
H.
&
Shen
J.
2020
Removal of Ca2+ and Mg2+ from oilfield wastewater using reusable PEG/Fe3O4/GO-NH2 nanoadsorbents and its efficiency for oil recovery
.
Journal of Environmental Chemical Engineering
9
(
1
),
104653
.
Hu
M.
,
Wang
H.
,
Wang
X.
,
Yang
J.
&
Wang
R.
2019
Video facial emotion recognition based on local enhanced motion history image and CNN-CTSLSTM networks
.
Journal of Visual Communication and Image Representation
59
,
176
185
.
Ioffe
S.
&
Szegedy
C.
2015
Batch normalization: accelerating deep network training by reducing internal covariate shift
. In
32nd International Conference on Machine Learning
.
LeCun
Y.
,
Bengio
Y.
&
Hinton
G.
2015
Deep learning
.
Nature
521
(
7553
),
436
444
.
Liu
J.
,
Osadchy
M.
,
Ashton
L.
,
Foster
M.
,
Solomon
C. J.
&
Gibson
S. J.
2017
Deep convolutional neural networks for Raman spectrum recognition: a unified solution
.
Analyst
142
(
21
),
4067
4074
.
Lopez-Fornieles
E.
,
Brunel
G.
,
Rancon
F.
,
Gaci
B.
,
Metz
M.
,
Devaux
N.
,
Taylor
J.
,
Tisseyre
B.
&
Roger
J.-M.
2022
Potential of multiway PLS (N-PLS) regression method to analyse time-series of multispectral images: a case study in agriculture
.
RemoteSens
14
,
216
.
Ma
S.
,
Bargal
S. A.
,
Zhang
J.
,
Sigal
L.
&
Sclaroff
S.
2017
Do less and achieve more: training CNNs for action recognition utilizing action images from the Web
.
Pattern Recognition
68
,
334
345
.
Ma
M.
,
Ai
Q.
&
Xie
M.
2022
Optical properties of four types paraffin
.
Optik
249
,
168277
.
Malek
S.
,
Melgani
F.
&
Bazi
Y.
2018
One-dimensional convolutional neural networks for spectroscopic signal regression
.
Journal of Chemometrics
32
(
5
),
e2977
.
Nørgaard
L.
,
Saudland
A.
,
Wagner
J.
,
Nielsen
J. P.
,
Munck
L.
&
Engelsen
S. B.
2000
Interval partial least-squares regression (iPLS): a comparative chemometric study with an example from near-infrared spectroscopy
.
Applied Spectroscopy
54
(
3
),
413
419
.
Pande
S. D.
,
Jadhav
P. P.
,
Joshi
R.
,
Sawant
A. D.
,
Muddebihalkar
V.
,
Rathod
S.
,
Gurav
M. N.
&
Das
S.
2022
Digitization of handwritten Devanagari text using CNN transfer learning-a better customer service support
.
Neuroscience Informatics
2
(
3
),
100016
.
Qi
H.
,
Yang
L.
,
Hu
X.
,
Li
D.
,
Wang
Q.
&
Wu
G.
2018
Optical properties of oilfield wastewater and its application in measuring oil content
.
Applied Spectroscopy.
72
(
8
),
1252
1257
.
Seyyedsar
S. M.
,
Farzaneh
S. A.
&
Sohrabi
M.
2016
Experimental investigation of. tertiary CO2 injection for enhanced heavy oil recovery
.
Journal of Natural Gas Science and Engineering
34
,
1205
1214
.
Srivastava
N.
,
Hinton
G.
,
Krizhevsky
A.
,
Sutskever
I.
&
Salakhutdinov
R.
2014
Dropout: a simple way to prevent neural networks from overfitting
.
Journal of Machine Learning Research
15
,
1929
1958
.
Wei
P.
,
Pu
W.
,
Sun
L.
,
Pu
Y.
,
Wang
S.
&
Fang
Z.
2018
Oil recovery enhancement in low permeable and severe heterogeneous oil reservoirs via gas and foam flooding
.
Journal of Petroleum Science and Engineering
163
,
340
348
.
Wu
X.
,
Gao
S.
,
Niu
Y.
,
Zhao
Z.
,
Xu
B.
,
Ma
R.
,
Liu
H.
&
Zhang
Y.
2022
Identification of olive oil in vegetable blend oil by one-dimensional convolutional neural network combined with Raman spectroscopy
.
Journal of Food Composition and Analysis
108
,
104396
.
Yan
W.
,
Xu
R.
,
Wang
K.
,
Di
T.
&
Jiang
Z.
2020a
A soft sensor modeling method based on semi-supervised deep learning and its application to wastewater treatment plant
.
Industrial & Engineering Chemistry Research
59
(
10
),
4589
4601
.
Yan
X.
,
Zhang
S.
,
Fu
H.
&
Qu
H.
2020b
Combining convolutional neural networks and on-line Raman spectroscopy for monitoring the Cornu Caprae Hircus hydrolysis process
.
Spectrochimica Acta Part A-Molecular and Biomolecular Spectroscopy
226
,
117589
.
Yang
J.
,
Zhang
T.
,
Yin
Y.
,
Jiang
W.
,
Du
Y.
,
Zhu
X.
,
Jiang
B.
&
Huang
Y.
2022
Nondestructive rapid and quantitative analysis for the curing process of silicone resin by near-infrared spectra
.
Journal of Applied Polymer Science
137
(
34
),
48982
.
Zhang
X.
,
Lin
T.
,
Xu
J.
,
Luo
X.
&
Ying
Y.
2019b
Deepspectra: an end-to-end deep learning approach for quantitative spectral analysis
.
Analytica Chimica Acta
1058
,
48
57
.
Zhang
X.
,
Li
H.
,
Zhang
Y.
,
Qi
H.
,
Yang
X.
,
Wang
Q.
&
Li
D.
2021
Quantitative analysis of the oil mixture using PLS combined with spectroscopy detection
.
Optik
244
,
167611
.
Zhang
W.
,
Li
L.
,
Zhu
Y.
,
Yu
P.
&
Wen
J.
2022a
CNN-LSTM neural network model for fine-grained negative emotion computing in emergencies
.
Alexandria Engineering Journal
61
(
9
),
6755
6767
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).