Evapotranspiration (ET) is an important component of the hydrological cycle and its accurate estimation is very necessary for water resource management and agricultural precision irrigation. The direct measurement of ET is difficult and the observed data are very limited. Therefore, this study explores the possibility of a back-propagation (BP) neural network for simulating ET based on eight flux sites in China. The input variables consist of land surface temperature, the water vapor pressure of the land surface, net radiation, and photosynthetically active radiation. Four input combination categories are determined, including one input variable, two input variables, three input variables, and four input variables. The results demonstrated the following: (1) Adding more input variables generally improves model performance, with a significant gain from one to two variables, but only slight improvements from two to three. (2) While some stations show better performance with four variables, others perform best with three. (3) Daily and monthly ET estimates are achieved across all stations, with summer estimates consistently outperforming winter ones. (4) The variability in the best input combination for the stations indicates that factors such as climate zones and land cover influence ET accuracy.

  • In this study, all possible combinations of input variables are put into the back-propagation (BP) model, which is beneficial for truly achieving the best input combination.

  • The results demonstrated that the best performance of the BP model at all stations is reliable, with higher KGE and R values and low RMSE and BIAS values.

  • More input variables do not definitely produce better performances based on results at the XSBN.

Land evapotranspiration (ET), including soil evaporation and vegetation transpiration, plays a critical role in the water cycle, carbon cycle, and energy cycle, influencing water availability, agriculture, and ecosystem health (Chao et al. 2021; Ahmadi et al. 2022; Pan et al. 2022; Zhang et al. 2023). Accurate measurement or estimation of ET is very essential for water resource management, drought forecasting, and agricultural precision irrigation, especially in arid and semi-arid regions (Jaafar et al. 2022; Theng Hue et al. 2022; Lee et al. 2023).

ET is usually measured through eddy covariance flux tower and energy balance Bowen ratio (Maes et al. 2019; Walls et al. 2020; Markos & Radoglou 2021). With these observation measurements, ET data can be acquired directly, but these measurements require expensive equipment, much time, and professional technicians (Saboori et al. 2022; Ezenne et al. 2023). Moreover, ET data based on these observation measurements are relatively limited, and cannot meet the growing demand of modern hydrology and agriculture. Traditionally, ET estimation has relied on empirical formulas and physical models, such as the Penman–Monteith equation (FAO-PM) (Allen et al. 1994). The FAO-PM model became a standard of reference evapotranspiration estimation due to its accuracy under various climate conditions around the world (Pereira et al. 2021; Chen et al. 2022). Combining the FAO-PM and ET coefficient, ET can be subsequently achieved. A number of other ET models are also proposed, which only need air temperatures or solar radiation, such as the Makkink equation, Hargreaves and Samani equation, and Priestley and Taylor equation (Makkink 1957; Priestley & Taylor 1972; Hargreaves & Samani 1985). While these approaches are widely accepted, they often require extensive input data, such as maximum air temperature, minimum air temperature, wind speed, solar radiation, and relative humidity and can be limited by assumptions about the underlying processes (Allen et al. 1994, 1998; Pereira et al. 2021). Studies have shown that traditional models may struggle to accurately capture the spatial and temporal variability of ET (Sun et al. 2024) and cannot obtain acceptable results in diverse climate zones (Lee et al. 2023). Thus, new approaches are urgently needed to address this problem.

Machine learning offers a promising alternative for ET estimation by utilizing algorithms that can learn from data without explicit programming. Zhang et al. (2018) simulated reference evapotranspiration in the Shule River Basin from northwest China via support vector machines (SVMs), adaptive network-based fuzzy inference systems (ANFIS), and back-propagation (BP) neural network and found that these models were able to accurately simulate reference evapotranspiration only based on land surface temperature. Fan et al. (2021) used SVMs, extreme gradient boosting (XGBoost), single-layer artificial neural networks (ANN), and deep neural network (DNN) models to simulate the transpiration of summer maize in northwest China and the results reveal that the DNN model is more effective for maize transpiration estimation. Zhang et al. (2022) proposed six machine learning models for daily reference evapotranspiration estimation with incomplete meteorological data in eastern Inner Mongolia, north China, the results demonstrated that all six proposed machine learning models could estimate reference evapotranspiration with high accuracy. Wu et al. (2023) developed random forest (RF), extreme learning machine (ELM), SVM, and ANN models for maize ET estimation in northwest China and found that machine learning models can provide satisfactory simulation of maize ET. The hybrid-ML models are also used to simulate ET. The wavelet-artificial neural network showed good accuracy in estimating daily ET (Araghi et al. 2018).

Sensitivity analysis (SA) is crucial for understanding the influence of different input variables on model outputs. In the context of ET simulation, variable sensitivity analysis helps identify key climatic and environmental factors affecting ET rates. For instance, DeJonge et al. (2015) utilized variance-based global sensitivity analysis to assess the impact of temperature, humidity, wind speed, and solar radiation on ET, providing insights that can enhance model accuracy. The integration of ML and sensitivity analysis has also been explored. Zhao et al. (2022) used the XGBoost algorithm to analyze the contribution rate of meteorological factors to ET and obtain the input combination of a few factors with a large impact on ET. Their findings highlighted that sunshine duration, temperature, and solar radiation are the most influential factors, demonstrating the potential of this combined approach to improve ET modeling.

Hence, machine learning models will be applied to estimate evapotranspiration and recognize the sensitivity of input variables at eight flux stations in this study. These eight flux stations are evenly distributed and cover the most prevalent climate in China. The aim of this study is, therefore, threefold: (1) to identify the best input variable combinations for modeling ET through BP for four input combinations categories, respectively; (2) to analyse the differences in BP model performances among the best input combinations under each category and achieve the variation features of BP model performances under best input combinations that vary with the numbers of input variables; (3) to select the best input combinations under all input variable combinations at eight flux stations and compare their performances at daily, monthly and seasonal scale. In this study, the input variables include land surface temperature, the water vapor pressure of the land surface, net radiation, and photosynthetically active radiation.

ChinaFLUX observations and station description

The Chinese Terrestrial Ecosystem Flux Research Network (ChinaFLUX) is established to measure the exchange of carbon dioxide, water vapor, and energy between the atmosphere and biosphere with an eddy covariance flux tower (Yu et al. 2006). Yang et al. (2017) used the ET observation to validate the GLEAM ET products and found these two datasets have a high consistency. In this study, we collect latent heat flux and meteorological data from eight flux sites (as shown in Figure 1 and Table 1). Land ET from the eddy covariance flux tower (EC-based ET, ETEC) is the latent heat flux (λET) divided by the latent heat of vaporization (λ). The eight flux sites encompass multiple vegetation types, including mixed forest (Changbaishan, CBS), evergreen broadleaf forest (Dinghushan, DHS; Xishuangbanna, XSBN), grasslands (Dangxiong, DX; Haibei, HB; Inner Mongolia, NMG), evergreen needle leaf forest (Qianyanzhou, QYZ), and croplands (Yucheng, YC). In addition, these eight flux stations cover the most prevalent climate in China but the western part. Moreover, these eight flux sites have a wide range of temperature and precipitation, as presented in Table 1. Observation data from these flux sites are available at the ChinaFLUX website (http://www.chinaflux.org/). Besides DX and NMG sites, the data period of meteorological variables and ETEC is 2003–2010 at the other six sites. At DX and NMG sites, the data period is 2004–2010. The time step used in this study is daily. For appropriate data division, the data of 2009–2010 are used for the testing period, and the data before 2009 are used for the training period.
Table 1

Attributes of the eight EC flux stations used in this study

Station (Abbreviation)Latitude (°N)Longitude (°E)Köppen-Geiger ClimateIGBP landcoverElevation (m)TL (°C)WVPL (kpa)RN (W/m2)FAR (μmol/m2 s)Mean annual precipitation (mm)Mean annual ET (mm)
Changbaishan (CBS) 42.60 128.08 Temperate Continental Monsoon Climate MF 738 3.6 0.8 76.6 310.5 695 500 
Dinghushan (DHS) 23.17 112.57 Monsoon humid climate EBF 300 20.1 2.0 86.6 275.6 1,956 661 
Dangxiong (DX) 30.85 91.42 The plateau monsoon climate GRA 4,250 2.7 0.4 63.0 466.0 477 550 
Haibei (HB) 37.62 101.29 Continental climate GRA 3,216 −1.3 0.4 90.3 366.0 560 576 
Inner Mongolia (NMG) 44.50 117.17 Continental climate GRA 1,189 1.4 0.5 67.9 369.0 336 386 
Qianyanzhou (QYZ) 26.73 115.05 Subtropical monsoon climate MF 102 17.8 1.8 88.2 273.8 1,542 704 
Xishuangbanna (XSBN) 21.95 101.2 Tropical monsoon climate EBF 756 19.1 2.2 103.6 328.6 1,493 622 
Yucheng (YC) 36.95 116.6 Warm and semi-humid continental monsoon climate CRO 28 13.2 1.3 63.3 265.6 582 683 
Station (Abbreviation)Latitude (°N)Longitude (°E)Köppen-Geiger ClimateIGBP landcoverElevation (m)TL (°C)WVPL (kpa)RN (W/m2)FAR (μmol/m2 s)Mean annual precipitation (mm)Mean annual ET (mm)
Changbaishan (CBS) 42.60 128.08 Temperate Continental Monsoon Climate MF 738 3.6 0.8 76.6 310.5 695 500 
Dinghushan (DHS) 23.17 112.57 Monsoon humid climate EBF 300 20.1 2.0 86.6 275.6 1,956 661 
Dangxiong (DX) 30.85 91.42 The plateau monsoon climate GRA 4,250 2.7 0.4 63.0 466.0 477 550 
Haibei (HB) 37.62 101.29 Continental climate GRA 3,216 −1.3 0.4 90.3 366.0 560 576 
Inner Mongolia (NMG) 44.50 117.17 Continental climate GRA 1,189 1.4 0.5 67.9 369.0 336 386 
Qianyanzhou (QYZ) 26.73 115.05 Subtropical monsoon climate MF 102 17.8 1.8 88.2 273.8 1,542 704 
Xishuangbanna (XSBN) 21.95 101.2 Tropical monsoon climate EBF 756 19.1 2.2 103.6 328.6 1,493 622 
Yucheng (YC) 36.95 116.6 Warm and semi-humid continental monsoon climate CRO 28 13.2 1.3 63.3 265.6 582 683 

Note: The location, climate, and vegetation information of stations can be found on the ChinaFLUX website. The mean annual flux data are calculated from the ChinaFLUX observations from 2002/2003 to 2010.

MF, mixed forests; EBF, evergreen broadleaf forests; GRA, grasslands; CRO, croplands.

Figure 1

Location and name of eight EC-based flux sites used in this study.

Figure 1

Location and name of eight EC-based flux sites used in this study.

Close modal

BP neural network

In this study, we adopt a very common machine learning algorithm, i.e., BP neural network, to train the ET model. The BP model is a typical multilayer forward neural network and is widely used in approximating a complicated nonlinear function (Zhao et al. 2023a). As shown in Figure 2, the common structure of BP consists of an input layer, a hidden layer, and an output layer (Zhang et al. 2018). The information in the BP model is transmitted in a forward direction and the BP of the error gradient (Gong et al. 2016).

During the application of the BP model, users often need to calibrate the parameters of BP to acquire the best model. The three-layer BP model is employed in this study according to the empirical formula proposed by Li & Liu (2009), as shown in Equation (1).
(1)
where m is the number of nodes in the hidden layer, n is the number of neural cells in the network input layer, l is the number of neural cells in the output layer, and a is a number between 0 ∼ 10. The maximum number of iterations and the training rate are set as 100 and 0.01, respectively here. The hyperbolic tangent sigmoid transfer function is selected to compute the output of each neuron. The default values suggested by Beale et al. (2010) are employed in the training of the BP model for other parameters.

Identification of variable contribution

The weights method (Garson 1991; Gevrey et al. 2003) is utilized to identify the contribution of variables in the BP model. The method essentially involves partitioning the hidden-output connection weights of each hidden neuron into components associated with each input neuron. The contribution of each variable can be calculated using Equations (2) and (3).
(2)
(3)
where |Wih| is the absolute value of the input-hidden layer weight connecting the input variable i with the hidden neuron h; ni and nh are the numbers of the input variables and the hidden neurons, respectively.

Model evaluation

The capability of the BP model in predicting ET is mainly evaluated by Kling-Gupta Efficiency (KGE) (Gupta et al. 2009; Kling et al. 2012). These indices are defined as follows:
(4)
where r is the linear correlation coefficient between EC-based ET observations and ET simulated by the BP model (BP-simulated ET, ETBP); a is the ratio of the standard deviation of BP-simulated ET, to the standard deviation of EC-based ET observations; b is the ratio of the mean value of BP-simulated ET, to the mean value of EC-based ET observations. Other statistical metrics, including coefficient of determination (r2), BIAS, and root mean square error (RMSE), are also used to evaluate the performance of the model.
(5)
(6)
(7)
where ETECi and ETBPi in the above equations are EC-based ET observations and BP-simulated ET, respectively; and are the mean values of EC-based ET observations and BP-simulated ET, respectively.

Input combination and model implementation

In this study, we select several meteorological variables as input variables to BP, based on the correlation coefficient (r > 0.5) between meteorological variables and ETEC. These meteorological variables consist of land surface temperature (TL), temperature over canopy (Tc), the water vapor pressure of land surface (WVPL), water vapor pressure over canopy (WVPc), solar radiation (Rs), net radiation (Rn), and photosynthetically active radiation (FAR). The values of correlation coefficient (r) for these variables are displayed in Table 2. In Table 2, the r between TL and Tc, WVPL, and WVPc, and Rs and FAR are very close to 1, which means that the corresponding two variables have similar meanings. Here, the uncertainty of r between FAR and ETEC is slightly smaller than that between Rs and ETEC, hence, FAR is finally chosen. For temperature and water vapor pressure data, the near-surface data are selected in this study. Therefore, four variables are finally adopted as input variables (TL, WVPL, Rn, and FAR).

Table 2

Mean value of correlation coefficient (r) between different variables at eight stations

 
 

Note: The black numbers are the mean value of r between two variables of the eight stations and the red numbers are the uncertainty of r, which is the difference between the maximum and minimum values of r at the eight stations.

Table 3 shows 15 input combinations for the BP model. All the input combinations are targeted at one response, which is the ETEC. These input combinations include four categories, i.e., one input variable (Category I), two input variables (Category II), three input variables (Category III), and four input variables (Category IV). In this study, we aim to explore the best input combinations for each category. Comparing these best input combinations, the best input combinations for different sites are then obtained.

Table 3

The input combinations based on selected meteorological variables

Input combinationsInput variablesAbbreviations
Category I TL 
WVPL 
Rn 
FAR 
Category II TL, WVPL TW 
TL, Rn TR 
TL, FAR TF 
WVPL,Rn WR 
WVPL, FAR WF 
Rn, FAR RF 
Category III TL, WVPL, Rn TWR 
TL, WVPL, FAR TWF 
TL, Rn, FAR TRF 
WVPL, Rn, FAR WRF 
Category IV TL, WVPL, Rn, FAR TWRF 
Input combinationsInput variablesAbbreviations
Category I TL 
WVPL 
Rn 
FAR 
Category II TL, WVPL TW 
TL, Rn TR 
TL, FAR TF 
WVPL,Rn WR 
WVPL, FAR WF 
Rn, FAR RF 
Category III TL, WVPL, Rn TWR 
TL, WVPL, FAR TWF 
TL, Rn, FAR TRF 
WVPL, Rn, FAR WRF 
Category IV TL, WVPL, Rn, FAR TWRF 

Influences of model input combination on evapotranspiration estimation

In this section, the four categories of input combinations (Categories I to IV) are used separately for the BP model to simulate ET. The number of nodes in the input layer ranges from 1 to 4. There is one node in the output layer. According to Equation (1), the number of nodes in the hidden layer ranges from 2 to 13, approximately. Here we set the number of nodes in the hidden layer to an intermediate value of 7. The structure of the BP model is determined as n-7-1, where n is the node number of the input layer. To investigate the uncertainty of the BP model, the model is repeatedly run 100 times for each input combination. Then KGE for each experiment is calculated with the BP-simulated ET and EC-based ET in the test period. Figure 3 shows the distribution of KGE between ETBP and ETEC. In Figure 3, different input combinations in Category I show great differences in the model performance. As the number of variables increases, the performance differences between different input combinations decrease. According to the distribution of KGE in Figure 3, the best input combination of each category is selected. The combination at which the KGE reaches its maximum is selected as the best input combination.
Figure 2

Schematic diagram of the three-layer back-propagation neural network.

Figure 2

Schematic diagram of the three-layer back-propagation neural network.

Close modal

Table 4 shows the best input combinations in Categories I to IV and the maximum KGE values for these best input combinations. From Table 4, it can be observed that the best input combination in Category I is R at almost all stations except CBS. Besides XSBN stations, other stations show acceptable performances, with KGE values larger than 0.69. In Category II, the best input combinations are WF at CBS, HB, QYZ, and XSBN stations. At DX, NMG, and YC stations, the best input combinations are WR. At these seven stations, WVPL is a relatively important variable to simulate ET. However, at the DHS station, the best input combinations are TF, excluding WVPL. As presented in Table 4, the performances of the best input combinations in Category II are greatly improved compared with those in Category I. Except for XSBN stations, KGE at the other six stations range from 0.74 to 0.92. Furthermore, the performance of the XSBN station is also greatly improved. The KGE value at the XSBN station increases from 0.55 in Category I to 0.69 in Category II. It can be observed that the best input combinations in Category III exhibit substantial differences at eight stations. The differences in these stations on climate and vegetation may lead to the difference in the best input combinations. The improvement of the model performance from Category II to Category III is not as significant as that from Category I to Category II. Compared to the performances in Category III and Category IV, it can be illustrated that more input variables cannot lead to better performance from the perspective of KGE. This will be elaborated on in the discussion section.

Table 4

The best input combinations for the BP model in categories I to IV in the eight stations

StationsCategory I
Category II
Category III
Category IV
CombinationKGECombinationKGECombinationKGECombinationKGE
CBS 0.83 WF 0.92 TWR 0.92 TWRF 0.91 
DHS 0.71 TF 0.79 TRF 0.80 TWRF 0.79 
DX 0.84 WR 0.89 TWR 0.88 TWRF 0.90 
HB 0.80 WF 0.88 WRF 0.89 TWRF 0.86 
NMG 0.68 WR 0.73 TWR 0.74 TWRF 0.74 
QYZ 0.88 WF 0.92 TWF 0.92 TWRF 0.92 
XSBN 0.55 WF 0.70 WRF 0.70 TWRF 0.68 
YC 0.73 WR 0.78 TWF 0.79 TWRF 0.79 
StationsCategory I
Category II
Category III
Category IV
CombinationKGECombinationKGECombinationKGECombinationKGE
CBS 0.83 WF 0.92 TWR 0.92 TWRF 0.91 
DHS 0.71 TF 0.79 TRF 0.80 TWRF 0.79 
DX 0.84 WR 0.89 TWR 0.88 TWRF 0.90 
HB 0.80 WF 0.88 WRF 0.89 TWRF 0.86 
NMG 0.68 WR 0.73 TWR 0.74 TWRF 0.74 
QYZ 0.88 WF 0.92 TWF 0.92 TWRF 0.92 
XSBN 0.55 WF 0.70 WRF 0.70 TWRF 0.68 
YC 0.73 WR 0.78 TWF 0.79 TWRF 0.79 

Note: The bold text represents the best input combination and its corresponding KGE value.

Performance of the BP model for evapotranspiration estimation at different stations

The best input combinations at the eight stations are displayed in bold in Table 4, where KGE reaches its peak. As shown in Figures 4 and 5, ETBP is rather consistent with ETEC, with high r2 values at most stations. However, at NMG and XSBN stations, the BP model shows a relatively poor performance with an r2 value less than 0.6 in the training and testing periods. A similar conclusion can be drawn based on Figure 6, which illustrates the daily time series of ETEC and ETBP for the testing period at eight stations based on the best input combinations. Over all eight stations, it can be found that BP modeling is more accurate at CBS and QYZ stations, followed by DX and HB stations, DHS and YC stations, and NMG and XSBN stations. As for temporal variation, ETBP shows good performances in capturing the dynamics of ETEC at eight stations, where heavy evapotranspiration mainly occurs in June, July, and August. However, it is also worth noting that our model has an underestimation of ET during this period, especially at NMG and XSBN stations in Figure 6(e) and 6(g). Meanwhile, at the YC station, there is a noticeable temporal trend in ET, but our model only captured the periodic fluctuations and failed to adequately detect the trend. This implies that the four variables used in the study do not contain all the information needed to simulate ET. Other variables, such as wind speed and minimum air temperature, are also sensitive in June, July, and August (Xu et al. 2014; Yong et al. 2023). However, the above-mentioned variables are not very sensitive to evapotranspiration throughout the year, hence these variables are not included in this study.
Figure 3

Performace of the BP model with model input combinations in the eight sites, where (a) to (d) correspond to Categories I–IV.

Figure 3

Performace of the BP model with model input combinations in the eight sites, where (a) to (d) correspond to Categories I–IV.

Close modal
Figure 4

Scatter plots of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the training period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Figure 4

Scatter plots of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the training period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Close modal
Figure 5

Scatter plots of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Figure 5

Scatter plots of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Close modal
Figure 7 presents the monthly performance of the best input combinations at eight stations. It can be observed that monthly ETBP outperforms daily ETBP, with increasing KGE values and decreasing RMSE values. The KGE are bigger than 0.75 at all stations, moreover, the KGE at CBS, DX, HB and QYZ stations are larger than 0.9, and the largest KGE at CBS station even reach 0.94 and 0.97 in the training and testing period, respectively. For RMSE, besides DX, NMG, and YC stations, the values of RMSE at other stations are lower than 10 mm in the testing period. Nevertheless, in June, July, and August, monthly ETBP is also slightly underestimated, which can be derived from daily ETBP.
Figure 6

Daily time series of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, and (g) XSBN, (h) YC.

Figure 6

Daily time series of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, and (g) XSBN, (h) YC.

Close modal
At the seasonal scale, absolute values of BIAS between ETBP and ETEC at four seasons, i.e., spring (MAM), summer (JJA), autumn (SON), and winter (DJF), at eight stations are presented in Figure 8. Overall, the absolute value of BIAS in summer is the lowest, followed by spring, autumn, and winter, which is inconsistent with the conclusion drawn from the daily and monthly results. This is because the larger ET in summer results in a smaller value of relative error. Throughout all stations, the average value of absolute BIAS in summer is 3.33%. In spring and autumn, the absolute BIAS is 4.14 and 4.47%, respectively, which are slightly larger than that in summer. In addition, the absolute value of BIAS in winter is 7.01%. From these results, it can be drawn that the performance of BP simulating ET in four seasons is very different, which is mainly due to the large differences between sensitive variables for ET at four seasons.
Figure 7

Monthly time series of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, and (g) XSBN, Seasonal performance.

Figure 7

Monthly time series of EC-based ET (ETEC) and BP-simulated ET (ETBP) for the testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, and (g) XSBN, Seasonal performance.

Close modal

Contribution of variables in the best input combination

Table 5 shows the contribution of each variable in the best input combination. As shown in Supplementary material, Tables S1–S8, the contribution rates are related to the correlation between the input variables and ETEC. These correlation matrices can be used to represent hydrological and climatic characteristics at these stations. Generally speaking, variables with smaller correlation coefficients with ETEC have relatively smaller contribution rates in the model. For example, among the variables, FAR at CBS station has the smallest correlation with ETEC. Correspondingly, FAR is not included in the best input combination at CBS station. However, the correlation and contribution rate do not always correspond exactly. For example, at HB station, we found that the correlation coefficient between TL and ETEC is second only to that between Rn and ET, but TL does not appear in the best input combination. Meanwhile, we found that TL has high correlation coefficients with both Rn and WVPL. Therefore, it is supposed that the correlation between ETEC and TL might be attributed to Rn and WVPL. Similar adjustments in contribution rates also occur at the other stations. Furthermore, the correlation matrices can explain the poor performance of the BP model at the NMG and XSBN stations. This is because the correlation between the input variables and ETEC is relatively low, indicating that these stations need to introduce more variables to improve the simulation performance.

Table 5

Contribution of each variable in the best input combination

StationTLWVPLRnFAR
CBS 47.79% 38.45% 13.77% – 
DHS 43.76% – 31.74% 24.51% 
DX 21.48% 25.49% 30.44% 22.59% 
HB – 38.92% 27.18% 33.90% 
NMG 24.40% 26.45% 28.52% 20.63% 
QYZ 19.85% 29.24% 21.41% 29.50% 
XSBN – 30.78% 31.50% 37.72% 
YC 27.64% 13.57% 38.76% 20.03% 
StationTLWVPLRnFAR
CBS 47.79% 38.45% 13.77% – 
DHS 43.76% – 31.74% 24.51% 
DX 21.48% 25.49% 30.44% 22.59% 
HB – 38.92% 27.18% 33.90% 
NMG 24.40% 26.45% 28.52% 20.63% 
QYZ 19.85% 29.24% 21.41% 29.50% 
XSBN – 30.78% 31.50% 37.72% 
YC 27.64% 13.57% 38.76% 20.03% 

Comparison of different models

To show the performance of different black models, BP, deep belief network (DBN) (Hinton et al. 2006), and linear regression model (LRM) are compared. The greedy layer-wise training algorithm (Bengio et al. 2007) is used to train DBN. In our study, BP with 7 nodes in the hidden layer is used as a regression layer connecting the hidden layer with the output layer in DBN. The inputs and outputs of DBN and LRM are the same as BP. Figure 9 shows the performance of BP, DBN, and LRM at the eight stations. As shown in Figure 9, BP and DBN show a better performance than LRM at the same station. Similar performance is seen for BP and DBN. From Figure 9, it can be observed that the simulation performance of the three models at different stations is consistent. When the LRM model performs poorly at a site, the BP and DBN models also perform poorly. This implies that machine learning methods (BP and DBN) have essentially extracted the information from the input data. To further improve model efficiency, new data should be introduced. Figure 6 shows that our model has an underestimation of ET in June, July, and August. Introduction of other variables, such as wind speed and minimum air temperature, may improve the model performance in June, July, and August (Xu et al. 2014; Yong et al. 2023). As stated in Su et al. (2023), the sensitivity of reference evapotranspiration to meteorological variables in cotton regions (Yangtze Valley, Northwest inland, and Huanghe Valley cotton regions) had certain differences due to the geographical distribution and seasonal change. Furthermore, in Zhao et al. (2023b), its conclusions reveal that reference evapotranspiration is more sensitive to relative humidity (similar to water vapor pressure) and air temperature than sunshine duration (similar to solar radiation), and reference evapotranspiration is the most sensitive to air temperature in summer and autumn and relative humidity in spring and winter in Guangdong province. On the basis of previous studies and the results in this study, simulating ET for different seasons may also enhance the accuracy of modeling ET.
Figure 8

Seasonal BIAS of EC-based ET (ETEC) and BP-simulated ET (ETBP) at four seasons for the testing period of eight stations.

Figure 8

Seasonal BIAS of EC-based ET (ETEC) and BP-simulated ET (ETBP) at four seasons for the testing period of eight stations.

Close modal
Figure 9

Performance of different ET models for the training and testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Figure 9

Performance of different ET models for the training and testing period of eight stations, including (a) CBS, (b) DHS, (c) DX, (d) HB, (e) NMG, (f) QYZ, (g) XSBN, and (h) YC.

Close modal

As widely recognized, ET consists of vegetation transpiration and soil evaporation. Thus, the characteristics of soil and vegetation must have an impact on ET. The input variables of this study we used are only meteorological variables, as in most previous studies. In Fan et al. (2021), four meteorological variables, soil water content (soil-related variable) and leaf area index (vegetation-related variable) are used as input variables, the results illustrated that the incorporation of soil water content or/and leaf area index in the machine learning models contributed to improving the accuracy of modeling maize transpiration. Wu et al. (2023) used four machine learning models to simulate daily maize evapotranspiration at different growth stages in semi-humid regions of China and, the results demonstrated that the best-performed models and input variable combinations are very different. This conclusion partly reveals that vegetation-related variables are helpful to increase the accuracy of modeling ET. FAR considered in this study is certainly related to vegetation. In future studies, soil- and vegetation-related variables can be recommended for more accurate ET estimation. Additionally, multiple machine learning models are used in the above-mentioned studies, and their results showed that the superior models at different stations and at different growth stages differed very much. Thus, it is worth exploring the most suitable model and the best input variable combinations for different growth stages, different seasons, and different regions.

Physical causes of the best input combination at different stations

Table 4 shows that the best input combination in Category I is RL at almost all stations except CBS. It is also evident from Figure 3 that the performances of models with RL and FAR as input are very close at CBS station. The reason for the best input combination in Category I at eight stations is that temperature and radiation are the main driving factors for the ET amount (Priestley & Taylor 1972; Hargreaves & Samani 1985). The slightly poor performances at NMG and XSBN stations are mainly because the correlation coefficient between the input variables and ET is relatively small (Supplementary material, Tables S5 and S7). In other words, one input variable cannot simulate ET very well with the BP model at NMG and XSBN stations.

It is observed that, except for the DHS station, the best combinations of Category all contain WVPL. This is mainly due to the relevance of meteorological variables with ET and the interaction between meteorological variables (Xu et al. 2014). As stated by Xu et al. (2014), the interactions between meteorological variables are considerable and cannot be ignored. As for the DHS station, there is a low correlation coefficient between ET and WVPL, only 0.31. As a result, ET in DHS station evaporation is not sensitive to WVPL. In the DHS station, it is also observed that Rn does not appear as the best input combination in Category II, despite its strong performance in Category I. This can be explained by the high correlation between Rn and FAR (0.84 in Supplementary material, Table S2), which means they provide similar information to the model, leading to comparable results for combinations like Rn and FAR, as well as TF and RF. In Category II, the TF combination outperformed TR, because TF carried more diverse and complementary information compared to the TR combination, with a lower correlation coefficient between TL and FAR (0.44 in Supplementary material, Table S2) than that between TL and Rn (0.60 in Supplementary material, Table S2). The same phenomenon also occurred at the CBS station (Supplementary material, Table S1).

According to Table 1, these eight stations are evenly distributed and cover the most prevalent climate in China. Hence, it is significant that there is a difference among their various meteorological variables, soil and vegetation types, evapotranspiration ratio, etc. These results demonstrate obviously that the performance of BP for simulating ET is station-specific and depends on the climate zone, which confirms the conclusions found in previous studies (Diop et al. 2015; Bodian et al. 2016; Tao et al. 2018).

It can be illustrated that more input variables cannot lead to better performance based on Table 4, which is not consistent with the result of Tao et al. (2018). Tao et al. (2018) used a new hybrid ANFIS–FA model to simulate reference evapotranspiration based on various input combinations; the result revealed that the best accuracy was obtained for the sixth input combination, which contained all the available meteorological information. However, Tao et al. (2018) only used six input combinations, i.e., adding a meteorological variable to the previous input combination to obtain the next input combination. This composition of input variables cannot get all possible input combinations, which may lead to omitting the best input combination. Over the performances of all input combinations in this study, it can be drawn that more input meteorological variables are able to achieve better performance generally, but not absolutely. This result indicates that more input variable information can enhance the model performance. Hence, there is no doubt about the importance of maintaining and expanding the data collection.

This study simulates ET at eight flux stations through the BP model under four categories of input variable combinations (one input variable, two input variables, three input variables, and four input variables), including 15 different input combinations (all possible combinations of four meteorological variables, i.e., TL, WVPL, Rn, and FAR). The key findings of this study are summarized below.

  • (1) Overall, more input variables can improve the performance of the BP model at eight stations. From one input variable to two input variables, BP performs much better. This phenomenon also occurs from two to three input variables, but the improvement is not so significant.

  • (2) However, at some stations, the performances of four input variables are slightly better than those of three input variables. However, more input variables do not definitely produce better performances. At the other stations, the BP model with three input variables outperformed those with four input variables.

  • (3) Daily and monthly ETBP with the best input variable combinations show good performances at eight stations. At the seasonal scale, ETBP in summer outperforms that in winter.

  • (4) The different best input combinations of the BP model at eight stations reveal that the accuracy of modeling ET are subject to many factors, including climate zone soil and vegetation types, etc. The introduction of other variables, such as wind speed, soil moisture content, leaf area index, etc., or the unitization of more complex models, can enhance the performance of ET simulation.

We acknowledge the financial support from the National Nature Science Foundation of China (52209036; 51909233) and the Zhejiang Natural Science Foundation (LY22E090010). We thank ChinaFlux (http://www.chinaflux.org) for providing EC-based observations and meteorological observations from eight flux stations used in this study.

All relevant data are available from an online repository or repositories. The observation data of the eight flux stations in this study are available at the Chinese Terrestrial Ecosystem Flux Research Network (http://www.chinaflux.org/) and Chinese Ecosystem Research Network Data Center (http://www.nesdc.org.cn/theme/index?projectId=612458897e28172cbed3d77a).

The authors declare there is no conflict.

Ahmadi, A., Daccache, A., Snyder, R. L. & Suvocarev, K.
(
2022
)
Meteorological driving forces of reference evapotranspiration and their trends in California
,
Science of the Total Environment
,
849
,
157823
.
Allen, R. G., Smith, M., Perrier, A. & Pereira, L. S.
(
1994
)
An update for the calculation of potential evapotranspiration
,
ICID Bull
,
43
,
35
92
.
Allen, R. G., Pereira, L. S., Raes, D. & Smith, M.
(
1998
)
Crop evapotranspiration-Guidelines for computing crop water requirements-FAO irrigation and drainage paper 56
,
Fao, Rome
,
300
(
9
),
D05109
.
Araghi
A.
,
Adamowski
J.
&
Martinez
C. J.
(
2018
)
Comparison of wavelet-based hybrid models for the estimation of daily reference evapotranspiration in different climates
,
Journal of Water and Climate ChangeJournal of Water and Climate Change
,
11
(
1
),
39
53
.
Beale
M. H.
,
Hagan
M. T.
&
Demuth
H. B.
(
2010
)
Neural Network Toolbox™ User's Guide
.
Natick, MA: The MathWorks
.
Bengio, Y., Lamblin, P., Popovici, D. & Larochelle, H. (
2007
)
Greedy layer-wise training of deep networks
. In:
Schölkopf, B., Platt, J. & Hofmann, T. (eds.)
Advances in Neural Information Processing Systems
, Vancouver, Canada: MIT Press, pp.
153
160
.
Bodian, A., Dezetter, A., Deme, A. & Diop, L. (
2016
)
Hydrological evaluation of TRMM rainfall over the upper Senegal river basin
,
Hydrology
,
3
(
2
),
15
.
Chen, S., He, C., Huang, Z., Xu, X., Jiang, T., He, Z., Liu, J., Su, B., Feng, H., Yu, Q. & He, J.
(
2022
)
Using support vector machine to deal with the missing of solar radiation data in daily reference evapotranspiration estimation in China
,
Agricultural and Forest Meteorology
,
316
,
108864
.
DeJonge, K. C., Ahmadi, M., Ascough II, J. C. & Kinzli, K.
(
2015
)
Sensitivity analysis of reference evapotranspiration to sensor accuracy
,
Computers and Electronics in Agriculture
,
110
,
176
186
.
Diop
L.
,
Bodian
A.
&
Diallo
D.
(
2015
)
Use of atmometers to estimate reference evapotranspiration in Arkansas
,
African Journal of Agricultural Research
,
10
(
48
),
4376
4383
.
Ezenne, G. I., Eyibio, N. U., Tanner, J. L., Asoiro, F. U. & Obalum, S. E.
(
2023
)
An overview of uncertainties in evapotranspiration estimation techniques
,
Journal of Agrometeorology
,
25
(
1
),
173
182
.
Garson
G. D.
(
1991
)
Interpreting neural-network connection weights
,
AI Expert
,
6
(
4
),
46
51
.
Gupta, H. V., Kling, H., Yilmaz, K. K. & Martinez, G. F.
(
2009
)
Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling
,
Journal of Hydrology
,
377
(
1–2
),
80
91
.
Hargreaves
G. H.
&
Samani
Z. A.
(
1985
)
Reference crop evapotranspiration from temperature
,
Applied Engineering in Agriculture
,
1
(
2
),
96
99
.
Hinton
G. E.
,
Osindero
S.
&
Teh
Y.
(
2006
)
A fast learning algorithm for deep belief nets
,
Neural Computation
,
18
(
7
),
1527
1554
.
Jaafar, H. H., Mourad, R. M., Kustas, W. P. & Anderson, M. C.
(
2022
)
A global implementation of single- and dual-source surface energy balance models for estimating actual evapotranspiration at 30-m resolution using google earth engine
,
Water Resources Research
,
58
(
11
),
e2022WR032800
.
Kling
H.
,
Fuchs
M.
&
Paulin
M.
(
2012
)
Runoff conditions in the upper Danube basin under an ensemble of climate change scenarios
,
Journal of Hydrology
,
424–425
,
264
277
.
Li
F.
&
Liu
C.
(
2009
) ‘
Application Study of BP Neural Network on Stock Market Prediction, 2009 Ninth International Conference on Hybrid Intelligent Systems, IEEE
’,
2009 Ninth International Conference on Hybrid Intelligent Systems
.
Shenyang, China
, pp.
174
178
.
Maes, W. H., Gentine, P., Verhoest, N. E. & Miralles, D. G
. (
2019
)
Potential evaporation at eddy-covariance sites across the globe
,
Hydrology and Earth System Sciences
,
23
(
2
),
925
948
.
Makkink
G. F.
(
1957
)
Testing the penman formula by means of lysimeters
,
Journal of the Institution of Water Engineerrs
,
11
,
277
288
.
Pereira, L. S., Paredes, P., Hunsaker, D. J., López-Urrea, R., Jovanovic, N., Pereira, L. S., Paredes, P., Hunsaker, D. J., López-Urrea, R. & Jovanovic, N. (
2021
)
Updates and advances to the FAO56 crop water requirements method
,
Agricultural Water Management
,
248
,
106697
.
Priestley
C. H. B.
&
Taylor
R. J.
(
1972
)
On the assessment of surface heat flux and evaporation using large-scale parameters
,
Monthly Weather Review
,
100
(
2
),
81
92
.
Saboori, M., Mousivand, Y., Cristóbal, J., Shah-Hosseini, R. & Mokhtari, A.
(
2022
)
An automated and improved methodology to retrieve long-time series of evapotranspiration based on remote sensing and reanalysis data
,
Remote Sensing
,
14
(
24
),
6253
.
Su, Y., Wang, J., Li, J., Wang, L., Wang, K., Li, A., Gao, L. & Wang, Z. (
2023
)
Spatiotemporal changes and driving factors of reference evapotranspiration and crop evapotranspiration for cotton production in China from 1960 to 2019
,
Frontiers in Environmental Science
,
11
,
1251789
.
Sun
Y.
,
Dong
Y.
&
Chen
Y.
(
2024
)
Global evapotranspiration simulation research using a coupled deep learning algorithm with physical mechanisms
,
Irrigation and Drainage
,
73
(
4
),
1373
1390
.
Tao, H., Diop, L., Bodian, A., Djaman, K., Ndiaye, P. M. & Yaseen, Z. M.
(
2018
)
Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: regional case study in Burkina Faso
,
Agricultural Water Management
,
208
,
140
151
.
Theng Hue, H., Ng, J. L., Huang, Y. F. & Tan, Y. X.
(
2022
)
Evaluation of temporal variability and stationarity of potential evapotranspiration in Peninsular Malaysia
,
Water Supply
,
22
(
2
),
1360
1374
.
Walls, S., Binns, A. D., Levison, J. & MacRitchie, S.
(
2020
)
Prediction of actual evapotranspiration by artificial neural network models using data from a Bowen ratio energy balance station
,
Neural Computing & Applications
,
32
(
17
),
14001
14018
.
Wu, Z., Cui, N., Gong, D., Zhu, F., Xing, L., Zhu, B., Chen, X., Wen, S. & Liu, Q.
(
2023
)
Simulation of daily maize evapotranspiration at different growth stages using four machine learning models in semi-humid regions of northwest China
,
Journal of Hydrology
,
617
,
128947
.
Xu, Y. P., Pan, S., Fu, G., Tian, Y. & Zhang, X.
(
2014
)
Future potential evapotranspiration changes and contribution analysis in Zhejiang Province, East China
,
Journal of Geophysical Research. Atmospheres
,
119
(
5
),
2174
2192
.
Yang, X., Yong, B., Ren, L., Zhang, Y. & Long, D.
(
2017
)
Multi-scale validation of GLEAM evapotranspiration products over China via ChinaFLUX ET measurements
,
International Journal of Remote Sensing
,
38
(
20
),
5688
5709
.
Yong, S. L. S., Ng, J. L., Huang, Y. F., Ang, C. K., Mirzaei, M. & Ahmed, A. N.
(
2023
)
Local and global sensitivity analysis and its contributing factors in reference crop evapotranspiration
,
Water Supply
,
23
(
4
),
1672
1683
.
Yu, G., Wen, X., Sun, X., Tanner, B. D., Lee, X. & Chen, J.
(
2006
)
Overview of ChinaFLUX and evaluation of its eddy covariance measurement
,
Agricultural and Forest Meteorology
,
137
(
3–4
),
125
137
.
Zhang
Z.
,
Gong
Y.
&
Wang
Z.
(
2018
)
Accessible remote sensing data based reference evapotranspiration estimation modelling
,
Agricultural Water Management
,
210
,
59
69
.
Zhang, L., Marshall, M., Vrieling, A. & Nelson, A.
(
2023
)
The divergence of energy- and water-balance evapotranspiration estimates in humid regions
,
Journal of Hydrology
,
624
,
129971
.
Zhao, L., Xing, L., Wang, Y., Cui, N., Zhou, H., Shi, Y., Chen, S., Zhao, X. & Li, Z.
(
2023a
)
Prediction model for reference crop evapotranspiration based on the back-propagation algorithm with limited factors
,
Water Resources Management
,
37
(
3
),
1207
1222
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data