In this study, the vote algorithm used to improve the performances of three machine-learning models including M5Prime (M5P), random forest (RF), and random tree (RT) is developed (i.e. V-M5P, V-RF, and V-RT). Developed models were tested for forecasting soil temperature (TS) at 1, 2, and 3 days ahead at depths of 5 and 50 cm. All models were developed using different climatic variables, including mean, minimum, and maximum air temperatures; sunshine hours; evaporation; and solar radiation, which were evaluated. Correlation coefficients of 0.95 for the V-M5P model, 0.95 for the V-RF model, and 0.91 for the V-RT model were recorded for both 1- and 2-day ahead forecasting at a depth of 5 cm. For 3-day ahead forecasting, V-RF was the superior model with Nash–Sutcliff efficiency (NSE) values of 0.85, compared to V-M5P's value of 0.81 and V-RT's value of 0.81. The results at a depth of 5 cm indicate that V-RT was the least effective model. At a depth of 50 cm, forecasted TsS was in good agreement with measurements, and the V-RF was slightly superior. Among the limitations of the current work is that the models were unable to improve their performances by increasing the forecasting horizon.

  • Modelling soil temperature at different depths based on meteorological variables using vote algorithm.

  • Forecasting soil temperature at 1, 2, and 3 days ahead at depths of 5 and 50 cm using machine learning models.

  • M5P, RF, and RT are applied.

  • An ensemble approach with the vote algorithm (i.e. V-M5P, V-RF, and V-RT) is also proposed.

Soil temperature (TS) is a very important factor in the earth environment; it is mainly included in various meteorology studies, water soil dynamics, water evaporation, deep and slow water infiltration, and the quantification of water requirements for irrigation purposes. However, despite those important roles of TS, important challenges remain in its estimation process, including an exact quantification over time and space, which is mainly related to the soil plant atmosphere behaviour (Taheri et al. 2023). Soil temperature is affected by various external factors, especially climatic variables, plants, and soil structure (Ma et al. 2023). Due to the complexity of soil structure and composition, several approaches have been adopted for soil temperature (TS) estimation and three different approaches are mainly adopted including (i) direct in situ measurement, (ii) remote sensing estimation, and (iii) model application (Taheri et al. 2023).

Machine-learning methods, especially artificial neural networks (ANNs), have been found to provide reliable forecasts for some hydrological and meteorological variables in evapotranspiration modelling (Adnan et al. 2021), water quality indices (Abba et al. 2021), and irrigation system modelling (Kisi et al. 2021). ANNs were the most commonly used in hydrological and meteorological modelling, and numerous studies have employed ANNs to estimate TS. ANN models have been used to estimate daily and annual TS and they have demonstrated strong performance in forecasting the spatial variations of TS (Mihalakakou 2002). Tabari et al. (2011) found that ANN methods perform better than multivariate linear regression when predicting daily TS at six soil depths and they found that the most important variables for these predictions were air temperature and relative humidity. Bilgili (2010) compared linear regression, nonlinear regression, and ANN models to estimate monthly TS at five soil depths and found that the ANN model was the most effective. Kisi et al. (2015) estimated monthly TS at several soil depths by modelling with multi-layer perceptron neural networks (MLPNNs), radial basis neural networks (RBNNs), generalized regression neural networks (GRNNs), and multiple linear regression (MLR). The results indicated that the RBNN model was best for predicting TS at shallow depths (i.e., 5 and 10 cm), but the MLR and GRNN models were better at deeper soil depths (50 and 100 cm, respectively). Tabari et al. (2015) used an ANN model to forecast TS 1 day in advance at six depths and realized good accuracy in that short-term prediction.

Kim & Singh (2014) evaluated the adaptive neuro-fuzzy inference systems (ANFIS) and multi-layer perceptron (MLP) models for calculating daily TS. An extreme learning machine (ELM) was optimized by self-adaptive evolutionary modelling to predict daily TS at six depths (Nahvi et al. 2016). Both models produced accurate estimates, but the optimized ELM model performed marginally better than the original ELM model. Talaee (2014) used a co-active neuro-fuzzy inference system (CANFIS) to model TS in arid and semi-arid regions and found that CANFIS produced accurate results. Abyaneh et al. (2016) utilized both ANN and CANFIS modelling to estimate TS in moderate air temperature conditions in humid and dry climates and found that they were more effective models in arid settings. TS was predicted monthly by ANN and ANFIS models using data collected at 31 stations in Iran and ANFIS was found to be the more effective model (Mehdizadeh et al. 2017). Several other models have been tested as well. These include genetic programming (GP) (Kisi et al. 2017), gene expression programming (Samadianfard et al. 2018), and genetic-based neural networks (Kazemi et al. 2018). Khosravi et al. (2022a) developed several ML models including KStar, instance-based K-nearest learner, and locally weighted learner coupled with bagging (BA) and dagging (DA) for TS prediction in Iran. They finally stated that for soil depth of 5 cm, BA-KStar is superior while for soil depth of 50 cm, DA-KStar outperforms other algorithms. Malik et al. (2022) applied support vector machine (SVM), MLP, and ANFIS models optimized with slime mould algorithm, particle swarm optimization, and spotted hyena optimizer algorithms for TS prediction in a semi-arid region of Punjab, India. Their finding showed the higher performance of SVM models at multiple depths.

Ozbek (2023) compared long short-term memory (LSTM) deep learning, the ANFIS with fuzzy c-means clustering algorithm, the autoregressive integrated moving average, and the autoregressive moving average models for forecasting 1-h-ahead soil temperature. They reported that the LSTM was more accurate compared to the other models at different sites. Farhangmehr et al. (2023) used the convolutional neural network (CNN) and the MLPNN models for predicting hourly soil temperature using large features, namely precipitation, surface pressure, evaporation, wind gust, dewpoint temperature, surface solar radiation, surface thermal radiation, and air temperature. According to the obtained results, the authors highlighted the importance of the air temperature as the first relevant feature, while the surface thermal radiation was the poorest one. Furthermore, the CNN was the most accurate, significantly higher than the MLPNN model. Ebtehaj et al. (2023) introduced a new machine-learning model called the emotional neural network (ENN) for predicting TS at 10 and 20 cm depths. They developed the modelling framework according to two scenarios, namely (i) modelling TS using climatic variables, i.e., air temperature, wind speed, and solar radiation and (ii) time series modelling. By comparison with the least square support vector machine, the GP and the multivariate adaptive regression splines machine-learning models demonstrated the superiority of the ENN. Alizamir et al. (2020) compared ELM, the MLPNN, the classification and regression trees, and the group method of data handling for modelling monthly TS at different depths using four climatic variables, namely air temperature, solar radiation, wind speed, and relative air humidity. The authors reported that at 5, 10 and 50 cm, the ELM model was more accurate, and the best performances can be obtained using only air temperature; however, at 10 cm depth, it is necessary to include solar radiation and wind speed. Sattari et al. (2020) compared three ensemble machine-learning models, namely decision tree (DT), gradient boosted trees (GBT), and hybrid DT-GBT models for predicting daily soil temperature at 5, 10, and 20 cm depths. The proposed models were developed using mean, maximal and minimal air temperature, sunshine duration, and precipitation. From the obtained results, hybrid DT-GBT was found to be most accurate at 5 cm depth, while the DT exhibited high accuracy at 10 and 20 cm depths.

Though researchers have attempted predictions of TS on daily or monthly intervals, predicting values for shorter periods (half an hour, for instance) is uncommon. There are questions regarding the appropriateness of machine-learning modelling of TS over short intervals, but such models may yield a valuable high-resolution insight for modelling water resources and agronomy to improve crop yields (Singh et al. 2018; Xing et al. 2018). There has been a supportive discussion of these opportunities (Sanikhani et al. 2018). On the one hand, TS measuring is time-consuming, and on the other hand, finding a reliable and practical model as a cost-effective model is required. This study investigates the potential of (1) three standalone machine-learning models: M5Prime (M5P), random forest (RF), and random tree (RT), (2) their ensembles with the vote (V) algorithm: V-M5P, V-RF, and V-RT, (3) effect of different input scenarios, and (4) involving different easily available input variables for forecasting TS at two depths (5 and 50 cm) 1–3 days in advance. To the best of the author's knowledge, the vote algorithm is rarely used in geoscience and their ensembles with tree-based algorithms are new and has a high performance for predicting geoscience phenomena.

Study area and data collection

In the present study, data used for modelling soil temperature (TS) were collected from the Isfahan Regional Water Authority (IRWA) in Iran (Figure 1), and it was previously used by Sattari et al. (2017) and recently by Khosravi et al. (2022a). Daily soil temperature (TS) measured at 5 and 50 cm depths was modelled using various meteorological variables, namely mean air temperature (TM), minimum air temperature (TN) and maximum air temperature (TX), evaporation (Epan), sunshine hours (SH), and solar radiation (SR). All datasets cover a period ranging from June 1992 to December 2005 (Table 1). The first 10 years were used for training (70%), while the remaining four years were used for validation (30%). More details about the data can be found in Sattari et al. (2017) and Khosravi et al. (2022a).
Table 1

Descriptive statistics of soil temperatures and meteorological variables (Khosravi et al. 2022a)

VariablesMaximumMinimumMeanStandard deviationSkewnessKurtosis
Inputs TM (°C) 34.80 −3.30 20.78 7.50 −0.48 −0.73 
TN (°C) 28.80 −7.80 13.04 7.14 −0.39 −0.73 
TX (°C) 43 1.20 28.53 8.20 −0.6 −0.51 
Epan (mm) 30 0.1 8.11 4.12 −0.02 −0.48 
SH (h) 13.8 0.1 9.98 2.75 −1.65 2.74 
SR (Cal/cm29,695 25 1,845 1,122 1.38 8.49 
Output TS at 5 cm (°C) 45.53 0.70 26.24 9.69 −0.48 −0.89 
TS at 50 cm (°C) 35 7.33 24.37 6.62 −0.53 −0.92 
VariablesMaximumMinimumMeanStandard deviationSkewnessKurtosis
Inputs TM (°C) 34.80 −3.30 20.78 7.50 −0.48 −0.73 
TN (°C) 28.80 −7.80 13.04 7.14 −0.39 −0.73 
TX (°C) 43 1.20 28.53 8.20 −0.6 −0.51 
Epan (mm) 30 0.1 8.11 4.12 −0.02 −0.48 
SH (h) 13.8 0.1 9.98 2.75 −1.65 2.74 
SR (Cal/cm29,695 25 1,845 1,122 1.38 8.49 
Output TS at 5 cm (°C) 45.53 0.70 26.24 9.69 −0.48 −0.89 
TS at 50 cm (°C) 35 7.33 24.37 6.62 −0.53 −0.92 
Figure 1

Study area location.

Figure 1

Study area location.

Close modal

Most effective input combination

The simple Pearson's correlation coefficients (r-values) between the model's input variables of TM, TX, TN, Epan, SR, and SH and output variable (TS at two depths) were used to develop the input variable list. The simple r-value approach is commonly used to build model's potential input successfully (Khosravi et al. 2020, 2021a, 2021b, 2021c, 2021d, 2021e; Kargar et al. 2021; Meshram et al. 2021; Panahi et al. 2021). The r-values of the variable-output pairs suggested six inputs for modelling based on the strength of the relationships (Tables 2 and 3). The most effective input combination was determined and used for model training.

Table 2

The input combinations of different models

InputsOutput
TM TS 
TM, TX TS 
TM, TX, TN TS 
TM, TX, TN, Epan TS 
TM, TX, TN, Epan, SH TS 
TM, TX, TN, Epan, SH, SR TS 
InputsOutput
TM TS 
TM, TX TS 
TM, TX, TN TS 
TM, TX, TN, Epan TS 
TM, TX, TN, Epan, SH TS 
TM, TX, TN, Epan, SH, SR TS 
Table 3

Pearson's correlation between soil temperatures and meteorological variables

Soil depthAhead daysTNTXSRSHEpanTM
5 cm +1 0.887 0.879 0.301 0.564 0.788 0.903 
+2 0.891 0.885 0.300 0.586 0.790 0.915 
+3 0.916 0.938 0.299 0.615 0.796 0.948 
50 cm +1 0.918 0.938 0.305 0.613 0.791 0.953 
+2 0.910 0.939 0.296 0.597 0.705 0.951 
+3 0.907 0.941 0.275 0.588 0.749 0.946 
Soil depthAhead daysTNTXSRSHEpanTM
5 cm +1 0.887 0.879 0.301 0.564 0.788 0.903 
+2 0.891 0.885 0.300 0.586 0.790 0.915 
+3 0.916 0.938 0.299 0.615 0.796 0.948 
50 cm +1 0.918 0.938 0.305 0.613 0.791 0.953 
+2 0.910 0.939 0.296 0.597 0.705 0.951 
+3 0.907 0.941 0.275 0.588 0.749 0.946 

In the present study, all developed models are implemented in the Waikato Environment for Knowledge Analysis (WEKA 3.9). WEKA software was developed at the University of Waikato, New Zealand, and is a free software licensed under the GNU General Public License. Three machine learnings were proposed in the present study, namely M5P, RF, RT, and the V. These models have been broadly reported in the literature as having high forecasting capabilities (Hanoon et al. 2021; Irwan et al. 2023).

M5Prime

The M5P algorithm is a widely used DT algorithm first proposed by Quinlan (1992). Its prediction power for hydrological studies has been proven by numerous studies. The M5P algorithm was based on the regression tree idea and applied piece-wise linear functions for classifications (Wang & Witten 1996). The creation and pruning of a DT are the two main steps for the use of the M5P algorithm. The modelling space is divided into a set of regions/subsets (called a leaf or leaves) through divide-and-conquer methods (Quinlan 1992) and each leaf is labelled with a number (Figure 2). A linear model is created to fit the estimated values of each space. Similar to trees, each regression tree has roots, nodes, leaves, and branches. Splitting is used along branches at each node, which is a predictive variable in the regression tree. Each branch is divided by parent nodes that branch to child nodes. The phases of the M5P algorithm for regression tree construction and the resulting predictions can be summarized as follows. Separation of the dataset separation at each node is based on the standard deviation reduction (SDR) of the dataset for each child node factor (as an error criterion). SDR is used to compute the expected reduction of the standard deviation (Quinlan 1992):
(1)
where K and Ki denote training and dataset examples that receive the node and outcomes from node allocation, and SD is the standard deviation. The best branch among all branches that may result from potential separation that maximizes SDR is selected. The large number of branches can make a model very complicated and may weaken its generalization. To resolve this, branches are pruned, modifying them with the replacement of linear regressions at branches. Smoothing is applied to the predicted values through M5P at the leaf. More information about M5P can be found in Quinlan (1992) and Wang & Witten (1996).
Figure 2

M5Prime architecture.

Figure 2

M5Prime architecture.

Close modal

Vote (V) algorithm

The vote (V) combines several machine-learning models using a decision strategy based on majority voting (Figure 3). From a computational point of view, voting is to extract reliable models from multiple unreliable models (Parhami 1994). A major advantage of V is its capacity to fuse data from several sources to create an ultrareliable algorithm (Parhami 1996). V can guarantee the following advantages: (i) minimization of the cost function beyond that achieved by a single standalone model, (ii) more flexibility for unequal models, (iii) optimal use of the information from multiple sources, and (iv) achievement of higher accuracy than the single standalone model because it combines the strengths and weaknesses of several single models (Parhami 1994, 1996).
Figure 3

Vote (V) algorithm architecture.

Figure 3

Vote (V) algorithm architecture.

Close modal

Random forest

RF is a classified tree- and regression tree-based ML method that was developed by Breiman (1996). RF is an enhanced version of the bagging ensemble-based model, which was based on the construction of a large number of trees through the computation of the average of their predictions (Figure 4). When RF is applied to regression-based problems, the results of several fitted regression trees that use bootstrap sampling are averaged. This reduces variation by minimizing the correlations between the trees. But regression-based RF models use out-of-bag (OOB) sampling, a process that is similar to cross-validation, a technique that most ML models employ (Breiman 1996). OOB reflects the difference between observed and predicted values.
Figure 4

Random forest regression architecture.

Figure 4

Random forest regression architecture.

Close modal

Random tree

RT is a DT-based method. It creates a DT using either randomly adopted samples or potential input variables (Aldous 1990). RT can be applied to both classification and regression trees using the bootstrap approach. The growth of the DT using RT is similar to the steps used with M5P, but RT uses an independent bootstrap resample approach. If all measured or observed values at the leaf nodes are below a pre-specified threshold, tree growth stops (Figure 5). RT is derived from the theory behind the RF model (Breiman 2001) and integrates a single tree with the RF model. Each decision-tree model can optimize the linear method used at nodes and at leaves in the local subspace (or subsample). Each DT in the forest yields a prediction and the average of the individual trees in the forest is used to calculate a final prediction.
Figure 5

Random tree architecture.

Figure 5

Random tree architecture.

Close modal

Model parameter identification

In the current study, the identification of all model parameters was performed in the WEKA 3.9 software. First, the default value is considered and each model is trained based on the existence condition. Second, lower and higher values that are default values are generated and fed to them, and again each model is implemented. This approach is continued until the optimum value for each parameter is determined. In this case, root mean square error (RMSE) is applied for comparing and determining the optimum value, as the lower the RMSE, the better the model parameter would be.

Model evaluation

The prediction power of developed models was evaluated with several statistical measures such as RMSE, mean absolute error (MAE) the Nash–Sutcliffe efficiency (NSE) (which depends on both correlation and bias), and Pearson's correlation coefficient (R) (Moriasi et al. 2007; Khosravi et al. 2022b):
(2)
(3)
(4)
(5)
where and represent the observed and estimated daily soil temperature for ith observation, N is the number of data points, and and are the mean measured and mean estimated TS. NSE is a dimensionless metric that varies between − and 1; the optimum model's NSE = 1. The lower the RMSE and the MAE and the higher the R, the better is the model's prediction power (Osman et al. 2022).

This study compared three ensemble machine-learning models to forecast soil temperature (TS) at depths of 5 and 50 cm. The models, V-M5P, V-RF, and V-RT, were trained and validated using four performance metrics: RMSE, MAE, NSE, and R. The results were scrutinized for both depths.

Input variable effectiveness

Based on the achieved results (Table 3), Tm has the highest impact on TS for all three ahead days forecasting at two different depths, while SR is the variable with the lowest impact. In addition, input scenario No. 2, in which TM and TX are involved, is determined as the most effective input scenario with the lowest RMSE value. Only the results from the best input scenario are considered for further modelling and analysis.

1-, 2-, and 3-day forecasts of TS at 5 cm

At 5 cm soil depth, the TS predicted for 1-day forecast, RMSE ranged from 3.31 to 4.38 °C (mean 3.697 °C) and MAE ranged from 2.53 to 3.35 °C (mean 2.83 °C). The lowest values were produced by V-RF. V-M5P generated the second-best results and V-RT produced the highest errors and therefore was the least accurate. The differences between V-RF and V-M5P were slight and negligible, though their overall performances were mitigated slightly by equal values of R and NSE. V-RT, however, increased RMSE and MAE by approximately 22.37 and 22.09%, and V-M5P increased them by 24.43 and 24.48% from the V-RF values (Table 4). V-RT produced the lowest R and NSE values of 0.91 and 0.80. The scatterplot of measured versus 1-day forecasted TS at 5 cm (Figure 6) and the corresponding time variation plots of the values (Figure 7) exhibit the superiority of V-RF. The results for 2-day forecasts with these models are essentially the same (Table 4, Figures 6 and 7). The most accurate results were produced with V-RF and V-M5P: R = 0.95 and NSE = 0.88 l, though the lowest errors were produced by the V-RF model: RMSE = 3.34°C and MAE = 2.56°C. The V-RT model remains the least effective of the models in terms of fit, R = 0.91 and NSE ≍ 0.81, and errors, RMSE = 4.37°C and MAE = 3.31°C.
Table 4

Performances of different forecasting models at the depth of 5 cm

Algorithms1-day forecast of TS
2-day forecast of TS
3-day forecast of TS
R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)
V-M5P 0.95 0.88 3.40 2.61 0.95 0.88 3.46 2.64 0.91 0.81 4.26 3.19 
V-RF 0.95 0.89 3.31 2.53 0.95 0.88 0.34 2.56 0.93 0.85 3.84 2.79 
V-RT 0.91 0.80 4.38 3.35 0.91 0.81 4.37 3.31 0.91 0.80 4.44 3.36 
Algorithms1-day forecast of TS
2-day forecast of TS
3-day forecast of TS
R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)
V-M5P 0.95 0.88 3.40 2.61 0.95 0.88 3.46 2.64 0.91 0.81 4.26 3.19 
V-RF 0.95 0.89 3.31 2.53 0.95 0.88 0.34 2.56 0.93 0.85 3.84 2.79 
V-RT 0.91 0.80 4.38 3.35 0.91 0.81 4.37 3.31 0.91 0.80 4.44 3.36 
Figure 6

Scatterplots of measured against calculated soil temperature (TS) at 5 cm depth for 1- (blue), 2- (yellow), and 3-day (red) ahead forecasting (validation stage).

Figure 6

Scatterplots of measured against calculated soil temperature (TS) at 5 cm depth for 1- (blue), 2- (yellow), and 3-day (red) ahead forecasting (validation stage).

Close modal
Figure 7

Comparison between measured against calculated soil temperature (TS) at 5 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Figure 7

Comparison between measured against calculated soil temperature (TS) at 5 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Close modal

Similarly, 3-day forecasting revealed decreased performance for V-RF and V-M5P and no change in performance for V-RT (Table 4, Figures 6 and 7). Error scores produced by V-RF were 9.86% higher RMSE and 12.54% higher MAE than those values for V-M5P. In addition, these values for V-MVP were 13.51 and 16.96% better than the V-RT model. Comparing the models' 1- and 3-day forecasts, V-M5P's values for all metrics decreased: R by 4.21%, NSE by 7.95%, RMSE by 20.18%, and MAE by 18.18%. V-RF's scores declined at lower rates: R by 2.10%, NSE by 4.49%, RMSE by 13.80%, and MAE by 9.32%.

1-, 2-, and 3-day forecasts of TS at 50 cm

At the depth of 50 cm as Table 5 shows, an important remark which has to be made immediately is that, in contrast to the forecasts of TS at 5-cm depths, the forecasts of TS at 50-cm depths improved from 1- to 3-day forecasts in terms of both error and accuracy (Table 5, Figures 8 and 9). Mean RMSE and MAE improved by 4.30 and 3.20%, and mean R and NSE slightly improved. The 1-day forecast exhibits the deficiency of V-RT compared to V-M5P and V-RF. The RMSE and MAE values of the V-RT were 16.35 and 21.05% lower than those of V-M5P, and 20.44 and 25.91% lower than V-RF. The best accuracies were achieved using the V-RF having the highest R and NSE values (i.e., ≍0.94 and ≍0.86) forecasts (Table 5, Figures 8 and 9). The 2-day forecasts made by all three models were less accurate than both the 1-day and 3-day forecasts (Table 5, Figures 8 and 9). The V-RF produced the most accurate 2-day forecast model with R = 0.93 and NSE = 0.85, and it had the lowest RMSE = 2.62°C and MAE = 1.91°C. Statistics for the 3-day forecasts show that V-RF had the least error with 9.72% better RMSE and 9.42% better MAE than V-M5P. The error statistics for the V-RT forecast exceeded both: RMSE by 25.64% and MAE by 28.21% (Table 5, Figures 8 and 9). V-RF and V-M5P performed equally as well as R scores were equal and NSE scores had less than a 1% difference. Both were better than the V-RT scores.
Table 5

Performances of different forecasting models at the depth of 50 cm

Algorithms1-day TS forecast
2-day TS forecast
3-day TS forecast
R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)
V-M5P 0.93 0.84 2.66 1.95 0.93 0.84 2.74 2.02 0.94 0.85 2.57 1.91 
V-RF 0.94 0.86 2.53 1.83 0.93 0.85 2.62 1.91 0.94 0.87 2.32 1.73 
V-RT 0.90 0.78 3.18 2.47 0.89 0.78 3.21 2.50 0.90 0.79 3.12 2.41 
Algorithms1-day TS forecast
2-day TS forecast
3-day TS forecast
R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)R (/)NSE (/)RMSE (°C)MAE (°C)
V-M5P 0.93 0.84 2.66 1.95 0.93 0.84 2.74 2.02 0.94 0.85 2.57 1.91 
V-RF 0.94 0.86 2.53 1.83 0.93 0.85 2.62 1.91 0.94 0.87 2.32 1.73 
V-RT 0.90 0.78 3.18 2.47 0.89 0.78 3.21 2.50 0.90 0.79 3.12 2.41 
Figure 8

Scatterplots of measured against calculated soil temperature (TS) at 50 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Figure 8

Scatterplots of measured against calculated soil temperature (TS) at 50 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Close modal
Figure 9

Comparison between measured against calculated soil temperature (TS) at 50 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Figure 9

Comparison between measured against calculated soil temperature (TS) at 50 cm depth for 1-, 2-, and 3-day ahead forecasting (validation stage).

Close modal
In Figure 10, we reported the violin plot of measured and predicted soil temperatures at different depths for 1-, 2-, and 3-day ahead of forecasting. The figure clearly highlighted the superiority of one model in comparison to another, and more precisely, it is clear that the V-RT model was the poorest model exhibiting the lowest performances.
Figure 10

Violin plot showing distributions of the measured and forecasted soil temperature (TS, °C) at the three forecasting horizons (validation stage).

Figure 10

Violin plot showing distributions of the measured and forecasted soil temperature (TS, °C) at the three forecasting horizons (validation stage).

Close modal

In this paper, we present a new method to forecast TS measured at two different depths and we apply it to TS measured at the Isfahan region in Iran. Our method is based on the comparison between three single and hybrid models using the voting algorithm, i.e., V-M5P, V-RF, and V-RT. However, our method was inspired by several previous studies for which the TS was modelled using meteorological variables, especially, the mean, maximal, and minimal air temperatures. As commented in the previous results section, the TS was mostly influenced by air temperature, whereas combining these six meteorological variables produces different results at different depths and different forecasting horizons. Compared to the results reported in the literature, our TS estimations are consistent and encouraging. Note that, although the importance of air temperature is quite important, it seems to be sufficient for an accurate prediction of soil temperature as previously confirmed in the same site using the same data by Sattari et al. (2017) and Khosravi et al. (2022a). Comparing our numerical results with those reported in the literature, the inclusion of two temperatures, i.e., the mean and the maximal, shows a lower error at all depths and all forecasting horizons. However, given the high correlation coefficient reported in Table 3, we can consider that both variables have a positive effect on soil temperature. In addition, in this new modelling study, we demonstrate the importance of the vote algorithm in the improvement of model performances. Focusing on the proposed methods, the V-RF at 1-, 2- and 3-day forecast provides statistically high performances compared to V-M5P and V-RT. Additionally, the results of the proposed vote method provide a good forecast at 50 cm depth more than at 5 cm depth for all three forecasting horizons. Overall, the results show that with the hybrid vote method proposed in the present study, we can forecast the soil temperature with high NSE values approximately equal to 0.89.

This study investigated the use of three ensemble machine-learning models for 1-, 2-, and 3-day forecasts of TS at depths of 5 and 50 cm. The three models compared were V-M5P, V-RF, and V-RT. The novelty of this study is the use of the vote ensemble algorithm. The TS forecasting models were validated using in situ measurements from one station. The model inputs included daily air temperature measurements (i.e., minimum, maximum, and mean), solar radiation, evaporation, and sunshine hours. Though each model accurately forecasted TS, thus demonstrating that each could be a powerful tool, the accuracies of the forecasts ranged from one model to another and for each forecasting horizon. There was an apparent increase in error statistics for TS forecasts at the depth of 5 cm among all three models as forecast horizons increased from 1 to 3 days; mean RMSE and MAE for 1-day forecasts were 3.697 and 2.830°C but increased by 11.56 and 9.10% in the 3-day forecast. This increased error was less apparent for TS forecasts at 50 cm depths. The TS forecasts of the three models improved with increasing forecast horizon: RMSE and MAE decreased in the 3-day forecast by 4.30 and 3.20%, respectively. Though the performance metrics of V-M5P and V-RF were similar (V-M5P showed slight superiority), both exceeded the performance of V-RT. The strong performances of all three models, however, depend largely on the number of input variables included in the models, and the variables may be relatively high in number. It would be beneficial to test models that could achieve similar or better results using fewer input variables. It is highly recommended that other potential input variables, such as atmospheric pressure, precipitation, relative humidity, and wind speed, which may have a high impact on TS prediction and their effectiveness, need to be investigated.

The publication has been prepared with the support of the RUDN University Strategic Academic Leadership Program.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Abba
S. I.
,
Abdulkadir
R. A.
,
Sammen
S. S.
,
Pham
Q. B.
,
Lawan
A. A.
,
Esmaili
P.
&
Al-Ansari
N.
2021
Integrating feature extraction approaches with hybrid emotional neural networks for water quality index modeling
.
Applied Soft Computing
108036
.
doi:10.1016/j.asoc.2021.108036
.
Abyaneh
H. Z.
,
Varkeshi
M. B.
,
Golmohammadi
G.
&
Mohammadi
K.
2016
Soil temperature estimation using an artificial neural network and co-active neuro-fuzzy inference system in two different climates
.
Arabian Journal of Geosciences
9
(
5
),
377
.
doi:10.1007/s12517-016-2388-8
.
Adnan
R. M.
,
Mostafa
R.
,
Islam
A. R. M. T.
,
Kisi
O.
,
Kuriqi
A.
&
Heddam
S.
2021
Estimating reference evapotranspiration using hybrid adaptive fuzzy inferencing coupled with heuristic algorithms
.
Computers and Electronics in Agriculture
191
,
106541
.
doi:10.1016/j.compag.2021.106541
.
Aldous
D.
1990
A random tree model associated with random graphs
.
Random Structures Algorithms
.
doi:10.1002/rsa.3240010402
.
Alizamir
M.
,
Kisi
O.
,
Ahmed
A. N.
,
Mert
C.
,
Fai
C. M.
,
Kim
S.
,
Kim
N. W.
&
El-Shafie
A.
2020
Advanced machine learning model for better prediction accuracy of soil temperature at different depths
.
PLoS One
15
(
4
),
e0231055
.
doi:10.1371/journal.pone.0231055
.
Barezaei
A.
&
Jalali
J.
2023
A comparison of simulated runoff based on ground rain gauges and Persiann-CDR satellite precipitation records using swat model
.
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
X-4/W1-2022
,
87
94
.
Bilgili
M.
2010
Prediction of soil temperature using regression and artificial neural network models
.
Meteorology and Atmospheric Physics
110
(
1
),
59
70
.
doi:10.1007/s00703-010-0104-x
.
Breiman
L.
1996
Bagging predictors
.
Machine Learning
24
,
123
140
.
doi:10.1007/BF00058655
.
Breiman
L.
2001
Random forests
.
Machine Learning
45
,
5
32
.
doi:10.1023/A:1010933404324
.
Ebtehaj
I.
,
Bonakdari
H.
,
Samui
P.
&
Gharabaghi
B.
2023
Multi-depth daily soil temperature modeling: Meteorological variables or time series?
Theoretical and Applied Climatology
151
(
3-4
),
989
1012
.
doi:10.1007/s00704-022-04314-y
.
Farhangmehr
V.
,
Cobo
J. H.
,
Mohammadian
A.
,
Payeur
P.
,
Shirkhani
H.
&
Imanian
H.
2023
A convolutional neural network model for soil temperature prediction under ordinary and hot weather conditions: Comparison with a multilayer perceptron model
.
Sustainability
15
(
10
),
7897
.
doi:10.3390/su15107897
.
Hanoon
M. S.
,
Ahmed
A. N.
,
Zaini
N. A.
,
Razzaq
A.
,
Kumar
P.
,
Sherif
M.
,
Sefelnasr
A.
&
El-Shafie
A.
2021
Developing machine learning algorithms for meteorological temperature and humidity forecasting at Terengganu state in Malaysia
.
Scientific Reports
11
(
1
),
18935
.
doi:10.1038/s41598-021-96872-w
.
Irwan
D.
,
Ali
M.
,
Ahmed
A. N.
,
Jacky
G.
,
Nurhakim
A.
,
Ping Han
M. C.
,
Nouar
A.
&
El-Shafie
A.
2023
Predicting water quality with artificial intelligence: A review of methods and applications
.
Archives of Computational Methods in Engineering
1
20
.
doi:10.1007/s11831-023-09947-4
.
Kargar
K.
,
Safari
M. J. S.
&
Khosravi
K.
2021
Weighted instances handler wrapper and rotation forest-based hybrid algorithms for sediment transport modeling
.
Journal of Hydrology
598
,
126452
.
doi:10.1016/j.jhydrol.2021.126452
.
Kazemi
S. M. R.
,
Minaei Bidgoli
B.
,
Shamshirband
S.
,
Karimi
S. M.
,
Ghorbani
M. A.
,
Chau
K. W.
&
Kazem Pour
R.
2018
Novel genetic-based negative correlation learning for estimating soil temperature
.
Engineering Applications of Computational Fluid Mechanics
12
(
1
),
506
516
.
doi:10.1080/19942060.2018.1463871
.
Khosravi
K.
,
Cooper
J. R.
,
Daggupati
P.
,
Pham
B. T.
&
Bui
D. T.
2020
Bedload transport rate prediction: Application of novel hybrid data mining techniques
.
Journal of Hydrology
585
,
124774
.
doi:10.1016/j.jhydrol.2020.124774
.
Khosravi
K.
,
Golkarian
A.
,
Booij
M. J.
,
Barzegar
R.
,
Sun
W.
,
Yaseen
Z. M.
&
Mosavi
A.
2021a
Improving daily stochastic streamflow prediction: Comparison of novel hybrid data mining algorithms
.
Hydrological Sciences Journal
66
(
9
),
1457
1474
.
doi:10.1080/02626667.2021.1928673
.
Khosravi
K.
,
Khozani
Z. S.
&
Mao
L.
2021b
A comparison between advanced hybrid machine learning algorithms and empirical equations applied to abutment scour depth prediction
.
Journal of Hydrology
596
,
126100
.
doi:10.1016/j.jhydrol.2021.126100
.
Khosravi
K.
,
Khozani
Z. S.
&
Cooper
J. R.
2021c
Predicting stable gravel-bed river hydraulic geometry: A test of novel, advanced, hybrid data mining algorithms
.
Environmental Modelling & Software
144
,
105165
.
doi:10.1016/j.envsoft.2021.105165
.
Khosravi
K.
,
Miraki
S.
,
Saco
P. M.
&
Farmani
R.
2021d
Short-term river streamflow modeling using ensemble-based additive learner approach
.
Journal of Hydro-Environment Research
39
,
81
91
.
doi:10.1016/j.jher.2021.07.003
.
Khosravi
K.
,
Safari
M. J. S.
&
Cooper
J. R.
2021e
Clear-water scour depth prediction in long channel contractions: Application of new hybrid machine learning algorithms
.
Ocean Engineering
238
,
109721
.
doi:10.1016/j.oceaneng.2021.109721
.
Khosravi
K.
,
Golkarian
A.
,
Barzegar
R.
,
Aalami
M. T.
,
Heddam
S.
,
Omidvar
E.
&
López-Vicente
M.
2022a
Multi-step-ahead soil temperature forecasting at multiple-depth based on meteorological data: Integrating resampling algorithms and machine learning models
.
Pedosphere
.
doi:10.1016/j.pedsph.2022.06.056
.
Khosravi
K.
,
Golkarian
A.
,
Melesse
A. M.
&
Deo
R. C.
2022b
Suspended sediment load modeling using advanced hybrid rotation forest based elastic network approach
.
Journal of Hydrology
127963
.
doi:10.1016/j.jhydrol.2022.127963
.
Kim
S.
&
Singh
V. P.
2014
Modeling daily soil temperature using data-driven models and spatial distribution
.
Theoretical and Applied Climatology
118
(
3
),
465
479
.
doi:10.1007/s00704-013-1065-z
.
Kisi
O.
,
Tombul
M.
&
Kermani
M. Z.
2015
Modeling soil temperatures at different depths by using three different neural computing techniques
.
Theoretical and Applied Climatology
121
(
1
),
377
387
.
doi:10.1007/s00704-014-1232-x
.
Kisi
O.
,
Sanikhani
H.
&
Cobaner
M.
2017
Soil temperature modeling at different depths using neuro-fuzzy, neural network, and genetic programming techniques
.
Theoretical and Applied Climatology
129
,
833
848
.
Ma
T.
,
Nan
X.
,
Wu
R.
,
Yan
H.
,
Wu
N.
,
She
J.
&
Bao
Z.
2023
Quantifying the impact of canopy structural characteristics on soil temperature variations in different bamboo communities
.
Atmosphere
14
(
3
),
445
.
doi:10.3390/atmos14030445
.
Malik
A.
,
Tikhamarine
Y.
,
Sihag
P.
,
Shahid
S.
,
Jamei
M.
&
Karbasi
M.
2022
Predicting daily soil temperature at multiple depths using hybrid machine learning models for a semi-arid region in Punjab, India
.
Environmental Science and Pollution Research
1
20
.
doi:10.1007/s11356-022-20837-3
.
Mehdizadeh
S.
,
Behmanesh
J.
&
Khalili
K.
2017
Evaluating the performance of artificial intelligence methods for estimation of monthly mean soil temperature without using meteorological data
.
Environmental Earth Sciences
76
(
8
),
325
.
doi:10.1007/s12665-017-6607-8
.
Meshram
S. G.
,
Safari
M. J. S.
,
Khosravi
K.
&
Meshram
C.
2021
Iterative classifier optimizer-based pace regression and random forest hybrid models for suspended sediment load prediction
.
Environmental Science and Pollution Research
28
(
9
),
11637
11649
.
doi:10.1007/s11356-020-11335-5
.
Mihalakakou
G.
2002
On estimating soil surface temperature profiles
.
Energy and Buildings
34
(
3
),
251
259
.
doi:10.1016/S0378-7788(01)00089-5
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
Transactions of the ASABE
50
(
3
),
885
900
.
Nahvi
B.
,
Habibi
J.
,
Mohammadi
K.
,
Shamshirband
S.
&
Al Razgan
O. S.
2016
Using self-adaptive evolutionary algorithm to improve the performance of an extreme learning machine for estimating soil temperature
.
Computers and Electronics in Agriculture
124
,
150
160
.
doi:10.1016/j.compag.2016.03.025
.
Osman
A. I. A.
,
Ahmed
A. N.
,
Huang
Y. F.
,
Kumar
P.
,
Birima
A. H.
,
Sherif
M.
,
Sefelnasr
A.
,
Ebraheemand
A. A.
&
El-Shafie
A.
2022
Past, present and perspective methodology for groundwater modeling-based machine learning approaches
.
Archives of Computational Methods in Engineering
29
(
6
),
3843
3859
.
doi:10.1007/s11831-022-09715-w
.
Ozbek
A.
2023
Deep learning approach for one-hour ahead forecasting of weather data
.
Energy Sources, Part A: Recovery, Utilization, and Environmental Effects
45
(
3
),
7606
7628
.
doi:10.1080/15567036.2023.2222690
.
Panahi
M.
,
Khosravi
K.
,
Ahmad
S.
,
Panahi
S.
,
Heddam
S.
,
Melesse
A. M.
,
Omidvar
E.
&
Lee
C. W.
2021
Cumulative infiltration and infiltration rate prediction using optimized deep learning algorithms: A study in Western Iran
.
Journal of Hydrology: Regional Studies
35
,
100825
.
doi:10.1016/j.ejrh.2021.100825
.
Parhami
B.
1994
Voting algorithms
.
IEEE Transactions on Reliability
43
(
4
),
617
629
.
doi:10.1109/24.370218
.
Parhami
B.
1996
A taxonomy of voting schemes for data fusion and dependable computation
.
Reliability Engineering & System Safety
52
(
2
),
139
151
.
doi:10.1016/0951-8320(96)00012-9
.
Quinlan
J. R.
1992
Learning with continuous classes
.
Machine Learning
92
,
343
348
.
doi:10.1.1.34.885
.
Samadianfard
S.
,
Asadi
E.
,
Jarhan
S.
,
Kazemi
H.
,
Kheshtgar
S.
,
Kisi
O.
,
Sajjadi
S.
&
Manaf
A. A.
2018
Wavelet neural networks and gene expression programming models to predict short-term soil temperature at different depths
.
Soil and Tillage Research
175
,
37
50
.
doi:10.1016/j.still.2017.08.012
.
Sanikhani
H.
,
Deo
R. C.
,
Yaseen
Z. M.
,
Eray
O.
&
Kisi
O.
2018
Non-tuned data intelligent model for soil temperature estimation: A new approach
.
Geoderma
330
,
52
64
.
doi:10.1016/j.geoderma.2018.05.030
.
Sattari
M. T.
,
Dodangeh
E.
&
Abraham
J.
2017
Estimation of daily soil temperature via data mining techniques in semi-arid climate conditions
.
Earth Sciences Research Journal
21
(
2
),
85
93
.
doi:10.15446/esrj.v21n2. 49829
.
Sattari
M. T.
,
Avram
A.
,
Apaydin
H.
&
Matei
O.
2020
Soil temperature estimation with meteorological parameters by using tree-based hybrid data mining models
.
Mathematics
8
(
9
),
1407
.
doi:10.3390/math8091407
.
Singh
V. K.
,
Singh
B. P.
,
Kisi
O.
&
Kushwaha
D. P.
2018
Spatial and multi-depth temporal soil temperature assessment by assimilating satellite imagery, artificial intelligence and regression based models in arid area
.
Computers and Electronics in Agriculture
150
,
205
219
.
doi:10.1016/j.compag.2018.04.019
.
Tabari
H.
,
Sabziparvar
A. A.
&
Ahmadi
M.
2011
Comparison of artificial neural network and multivariate linear regression methods for estimation of daily soil temperature in an arid region
.
Meteorology and Atmospheric Physics
110
(
3
),
135
142
.
doi:10.1007/s00703-010-0110-z
.
Tabari
H.
,
Hosseinzadeh Talaee
P.
&
Willems
P.
2015
Short-term forecasting of soil temperature using artificial neural network
.
Meteorological Applications
22
(
3
),
576
585
.
doi:10.1002/met.1489
.
Taheri
M.
,
Schreiner
H. K.
,
Mohammadian
A.
,
Shirkhani
H.
,
Payeur
P.
,
Imanian
H.
&
Cobo
J. H.
2023
A review of machine learning approaches to soil temperature estimation
.
Sustainability
15
(
9
),
7677
.
doi:10.3390/su15097677
.
Talaee
P. H.
2014
Daily soil temperature modeling using neuro-fuzzy approach
.
Theoretical and Applied Climatology
118
(
3
),
481
489
.
doi:10.1007/s00704-013-1084-9
.
Wang
Y.
&
Witten
I. H.
1996
Induction of Model Trees for Predicting Continuous Classes. (Working Paper 96/23)
.
University of Waikato, Department of Computer Science
,
Hamilton
,
New Zealand
. https://hdl.handle.net/10289/1183.
Xing
L.
,
Li
L.
,
Gong
J.
,
Ren
C.
,
Liu
J.
&
Chen
H.
2018
Daily soil temperatures predictions for various climates in United States using data-driven model
.
Energy
160
,
430
440
.
doi:10.1016/j.energy.2018.07.004
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).