The organic soils of Holland Marsh, Ontario are used for intensive vegetable production, which demands high-phosphorus (P) fertilizer applications. Such high-fertilizer applications on these tile-drained lands lead to eutrophication in surrounding water bodies. This study investigated the application of neural network (NN) models for deriving P management strategies. Seven NN models were assessed using the following two approaches: a time series with 1-year training and 1-year testing of the models and a randomization analysis where a random 80% of data was used for model training and the remainder for model testing. The feed-forward model using the randomization and the long-short-term memory model using time-series outperformed all other models. Two strategies for P management were evaluated: a direct approach that predicts P loads using new fertilizer rates or controlled drainage discharge rates, and a particle swarm optimization (PSO) that used a percent reduction of actual P loads to predict an optimal water table management strategy. Overall, the direct approach identified a water table level of 30 cm from the soil surface during the spring and 80 cm during the summer period as optimal to reduce P loads. The PSO analysis showed that a reduction of P loads by 20% in the spring and up to 40% in the summer through water table control would not compromise crop production.

  • The FNN model had the best performance with the randomization analysis.

  • The LSTM model outperformed all models using the time-series analysis to predict total phosphorus (TP) loads.

  • NN models can accurately predict TP loading in P management scenarios.

  • A water table of 30 cm (spring) and 80 cm (summer) from soil surface is the ideal beneficial management practice.

  • Inverse approach found optimal TP load reduction of 20% (spring) and 40% (summer).

Graphical Abstract

Graphical Abstract
Graphical Abstract

Organic soils, or Histosols, are used for intensive vegetable production. These soils require high-fertilizer applications due to their inherent low nutrient status (Zheng et al. 2014). However, excess fertilizer leaches into the environment, becoming a major contributor to eutrophication in water bodies (Krasa et al. 2019). Studies have found that a reduction of phosphorus (P) in freshwater systems, more than nitrogen (N), is critical to control eutrophication (Schindler et al. 2008). Instances of excessive P loads from cultivated organic soils have been observed in New York (Longabucco & Rafferty 1989), Florida (Noe et al. 2001) and Ontario (Winter et al. 2007).

The Holland Marsh of Ontario, Canada is an area of intensively cultivated organic soil that was partially converted from a natural wetland for agricultural use in 1925. The Holland Marsh represents 1% of the Lake Simcoe watershed but contributes up to 5% of the total P (TP) load entering the lake and is therefore considered a source of anthropogenic pollution (Winter et al. 2007). Due to the high water holding capacity and higher water tables encountered on organic soils, subsurface tile drains are installed to support crop production (Hallema et al. 2015). Though there have been studies on soil nutrients in Holland Marsh (Zheng et al. 2015), research on drainage water quality is limited (Grenon et al. 2021).

Drainage water management in the form of controlled drainage (CD) has the potential to mitigate the release of nutrients by controlling the water table within a field, and therefore reduce the amount of water released. The CD functions by placing a control structure on the collector tile line within an agricultural field, close to the outlet, which allows for the control of the water table level within the field. CD has been mostly implemented on mineral soils (Williams et al. 2015a), where multiple studies have concluded that CD is a functional management system that reduces drainage and limits nutrient loads (Sanchez Valero et al. 2007; Skaggs et al. 2012; Carstensen et al. 2016). However, there have been no studies on the use of CD on temperate agricultural organic soils.

Field studies of water quality management practices involve time-intensive and costly processes of instrumenting fields and measurement of water quality. Deterministic and statistical models can reduce the costs of data collection and laboratory analysis of water samples and can be used to fill gaps in data and make hydrologic and water quality predictions under various environmental scenarios (Tiyasha et al. 2021). Neural network (NN) models can be trained to predict water quality using various hydrologic and environmental input parameters (Chang et al. 2016; Liang et al. 2020), however, there are limited studies that use P as an input/output (Tiyasha et al. 2020) and even fewer that look at the management of P (Welikhe et al. 2021). NN has an advantage over process-based models (Qi et al. 2018), which require multiple specific ancillary parametric data points to create a robust simulation. NN is a method of data analysis that uses machine learning (ML) and analytical model building to understand the relationship between multiple inputs with the output variable, without detailed knowledge of the internal process-based functions (Ha & Stenstrom 2003). This type of model has been used extensively for basin rainfall–runoff processes (Dahamsheh & Aksoy 2009), nutrient functions in lakes (Sinshaw et al. 2019), and has been used to predict P loads (Chang et al. 2016) in groundwater, streams and rivers. However, the use of NN in predicting nutrient loading in agricultural drainage runoff is limited. Furthermore, optimization algorithms are effective tools for developing cost-effective watershed designs that meet specified water quality objectives (Seppelt 1999). Environmental simulation modeling with an optimization application can include established objectives, such as pollution load reduction, into specific management plans (Wu et al. 2018; Qi et al. 2020).

The study aims to investigate NN model approaches for predicting P loads from the organic soils of the Holland Marsh. Beneficial management scenarios were assessed using the highest performing NN models, combined with an optimization algorithm, for reducing P loads under CD at a field site in the Holland Marsh.

The study was conducted across 2 years (2015–2016) on a 4.2 ha field (44.064900 Latitude, −79.587878 Longitude) with subsurface tile drains within the Holland Marsh, Ontario. Environmental, agronomic, management and climate data were collected over the course of the study and used to calibrate and validate NN models. The NN models were further used to predict water management scenarios to identify beneficial management practices (BMPs) farmers can use to reduce P pollution. This section includes the following: (1) description of the study area, instrumentation, data collection and water quality analysis; (2) NN modeling methodology, parameters, description of NN models, optimization of the models and performance metrics and (3) the direct and inverse approach and creation of the management scenarios.

Study area

The study area has a temperate climate with approximately 5 months (November–April) of the year under snow cover. During the growing season (May–October) average temperatures range from 10 to 22 °C (Government of Canada 2018). The monthly precipitation for 2015, 2016 and a 30-year average is presented in Supplementary Table S1. The annual precipitation was 590 and 552 mm for 2015 and 2016, respectively, compared to the 30-year average of 780 mm, suggesting both years were drier than average. The general soil properties for the study site were analyzed by AgriDirect (Longueuil, Quebec) using a composite soil sample taken post-harvest 2015. The soil properties consisted of an organic matter content of 68%, a bulk density of 0.31 g cm−3, a pH of 6.6, and a soil Mehlich P of 655 mg kg−1. Carrots (Daucus carota Bergen) were cultivated for the duration of the study. The mineral fertilizer application in May 2015 consisted of 20 kg-P ha−1 and 210 kg potassium (K) ha−1, while in May 2016 it consisted of 10 kg-N ha−1, 20 kg-P ha−1 and 205 kg-K ha−1. Seeding occurred soon after the fertilization, and the crop harvest started in September for both years.

Instrumentation, CD system

A CD structure (Supplementary Figure S1) was installed at the outlet of the collector tile line. This structure allowed for manual control of the water table height in the field with the placement of removable gates that stack on top of each. The water level strategy adopted was to have the water table between 30 and 40 cm from the soil surface during the non-growing season (November–April) to reduce tile flow during the spring, and between 70 and 80 cm from the soil surface during the growing season for agronomic management. Similar strategies were implemented in other studies where drainage reduction during the spring-thaw period is critical for nutrient load reduction (Gunn et al. 2015). The CD gates were removed in May to allow water to drain from the field for fertilizer application and seeding. In 2015, the height of the gates varied from 70 cm to the drain depth of 1.48 m from the soil surface between mid-June and early July to reduce surface water ponding due to excess rain. While in 2016, they were lowered mid-April to dry the field for planting and then raised in July to conserve water within the field due to lack of precipitation.

Discharge measurements were taken using a compound weir in the CD structure, which was placed as the uppermost gate. The compound weir consisted of an 11° V-notch to estimate flows less than 1 L s−1 and a rectangular weir to calculate flows above 1 L s−1. The discharge was continuously recorded at 15-min intervals using a pressure transducer within the control structure.

Water quality analysis

Water samples were collected during periods of drain discharge, using an ISCO 6712 portable auto-sampler at 4-h intervals (Teledyne ISCO, Lincoln, Nebraska). Weekly discrete grab samples were also taken. No samples were taken during periods of winter freeze or periods of no drain flow due to dry weather conditions. The water samples were stored in a 4 °C refrigerator at the Guelph University Muck Crops Research Station (MCRS) before being transported in coolers with ice to the McGill University laboratories for analysis.

Water analyses consisted of TP using the potassium persulfate digestion method (Dayton et al. 2017). For each analysis, both blank distilled water samples and standards of P were included for the calibration of the results. Calculation of P loads was done using TP concentrations (mg L−1) and continuous discharge (L s−1). The load was calculated using linear interpolation, which is one of the most common methods of load calculation with the least amount of uncertainty (Williams et al. 2015b).

NN model methodology

The non-linear or stationary feature of water in addition to its interdependent relationship with the environment and agronomic interactions creates a complex problem (Tiyasha et al. 2020). Qi et al. (2018) reviewed biophysical phosphorus models and found that the performance was dependent on a complex assessment of environmental parameters. Furthermore, these models have been calibrated for mineral soils and there is a lack of information on their performance with organic soil (Grenon et al. 2021). However, NN models allow for the incorporation of relevant and available environmental data without having to account for the entire physical environment and its processes. NN models have been extensively studied and concluded as a reliable tool to deal with non-linear water quality data, to increase decision-making capacities and to study various water quality management problems (Tiyasha et al. 2020).

An ad hoc model-free approach (Maier et al. 2010) based on domain knowledge was used to select input parameters. The daily input parameters included maximum temperature, minimum temperature, precipitation, evapotranspiration, soil moisture, the water level within the field, water height within the CD structure, CD discharge and fertilizer application. A comparison analysis was conducted to assess the interaction of the parameters with the model output, TP loads.

Neural networks

Seven NN models were chosen for this study, using two modeling approaches. The seven models consisted of feed-forward NN (FNN), deep feed-forward NN (DFNN), long-short-term memory (LSTM), bi-directional LSTM (Bi-LSTM), gated recurrent unit (GRU), general regression NN (GRNN) and radial basis function NN (RBFNN). All models were run using two different modeling approaches: a time-series and a randomization analysis. The time-series analysis was done using 2015 data to train the model and 2016 to test the accuracy of the model. This allowed for the inclusion of temporal trends in the analysis. The randomization analysis was done by randomly assigning 80% of the 2 years of values to train the model, and the other 20% of the values to test model accuracy. In total, the 14 (7 time series and 7 randomizations) models were run on a daily time scale.

FNN models

A feed-forward neural network (FNN) model includes one hidden layer, while multiple hidden layers are called a DFNN. The first and the last layers are for inputs and outputs, respectively. The operating principle of a typical FNN is based on the behavior of neurons and synaptic connections, where the input data are processed through three different operations: (i) multiplication by the synaptic connection weights; (ii) summation by adding the threshold function and (iii) transformation of results by applying an activation function. In the training phase, FNN automatically adjusts their synaptic weights and bias through back-propagation (Schmidt et al. 2018). The Bayesian optimization procedure was to select the optimal training function. Supplementary Figure S2 shows the schematic representation of typical FNN and DFNN structures while the equation in Supplementary Table S2 characterizes the relationship of one neuron.

LSTM, Bi-LSTM and GRU

Recurrent neural networks (RNNs) are based on the concept of time layering, which gives them the ability to take temporal series data as inputs and outputs. In more general terms, RNN is particularly suitable for sequence-to-sequence learning purposes (Sainath et al. 2015). The standard structure of an RNN model consists of an input layer, multiple hidden layers, and an output layer. In contrast to other RNN, memory cells, which are the main features of LSTM and Bi-LSTM networks, are incorporated into the hidden layers. The equations governing memory cell operations can be found in Supplementary Table S2. Memory cells are composed of an input (it) gate; to determine which information will be added to the cell, an output (ot) gate; to identify which information can be used as output in the cell and a forget (ft) gate; to decide which information can be removed from the cell. At each time step, the input and output are introduced into the three gates and candidate values are computed (Supplementary Figure S3).

In a similar manner to the training phase in FNN, the weights and bias terms are computed using the back-propagation algorithm with gradient descent so that an objective function is minimized across temporal target values. The idea behind the Bi-LSTM is to aggregate input signals in the past and future of a specific time step in LSTM models. The GRU is an RNN similar to LSTM. However, in contrast to LSTM, GRU has only two gates: the update and reset gates which decides what information is relevant and should be passed to the output.

RBFNN models

These ML models are neural networks with radial basis functions as activation functions. Global approximation, compact scheme, and ability to fit continuous and noisy data are features that distinguish RBFNNs and GRNNs from other NNs (Yu et al. 2011). The most used RBFNN activation function is the Gaussian function, which possesses only a few hyper-parameters. RBFNN's principle of operation does not differ from FNN. However, the training process is tuned by incorporating some clustering techniques such as decision trees (Kubat 1998), or k-means (Sing et al. 2003). The typical structure of RBFNN includes a two-layer network: the first layer comprises RBF neuron cells that compute weighted inputs and process them to the second layer with a linear transfer function. The only difference between RBFNN and GRNN is the use of biases in both layers of RBFNN and no biases are used in the case of GRNN (Ozyildirim & Avci 2013).

Optimization of models

Optimization is conducted within the model to acquire the best possible results between the training of the model and the testing of the model accuracy. The range of parameters and hyper-parameters used to build the NN models are presented in Supplementary Table S3. Hyper-parameters are vital to control parameters for ML where their appropriate choice leads to high-performance training and thus highly effective predictions. Many optimization techniques have been developed to tune ML hyper-parameters. A Bayesian optimization approach was the most suitable, as a probabilistic model with prior distribution is built through past evaluation. This meta-model surrogates the objective function for a much easier optimization (Snoek et al. 2015). The model optimization occurs through iterations. Each iteration allows for different configurations between the learning rate, and the number of epochs within the FNN, DFNN and RNN models. The GRNN and RBFNN model iterations allowed for the spread of the values to be varied and optimized. All models were run at least three times using 100 iterations to gain the best optimization possible.

Performance evaluation of models

The performance of the models was evaluated using the following statistical measures: Nash–Sutcliffe efficiency (NSE), correlation coefficient (R) and root mean squared error (RMSE). Furthermore, the comparison of the measured and the predicted TP loads were evaluated with the percentage of uncertainty to identify the accuracy of the models (Williams et al. 2015b).

Water management scenarios

Through the optimization of the various models, the best model was found for each approach: one for time-series and one for randomization. Although all models were optimized, predicting TP management strategies was performed using the best model for each approach. Two TP management strategies were used: a direct approach and an inverse approach. All management scenarios are presented in Table 1. The management scenarios were run by separating the spring (January–April) and summer (May–October) periods. This was done because the water management strategies differed between these periods due to agronomic activities. The objective was to find the BMP for water management, aimed at reducing TP.

Table 1

Scenarios used with the direct approach and the PSO analysis method to develop BMPs

SpringSummerFertilizerOther scenarios
Direct approach 
Water table height from the soil surface (cm) 20 50 No fertilizer BMP application for a year with 590 mm precipitation 
25 60 Actual fertilizer rate 
30 70 Double fertilizer rate 
35 80 BMP application for a year with 900 mm precipitation 
40 90 5× fertilizer rate 
45  
Inverse approach 
% Reduction of TP loads 10, 20, 30, 40, 50 and 60% reduction in TP loads used as input in the optimization model to estimate required gate height within the CD structure. Percent reduction of TP loads was applied in both the spring and summer periods 
SpringSummerFertilizerOther scenarios
Direct approach 
Water table height from the soil surface (cm) 20 50 No fertilizer BMP application for a year with 590 mm precipitation 
25 60 Actual fertilizer rate 
30 70 Double fertilizer rate 
35 80 BMP application for a year with 900 mm precipitation 
40 90 5× fertilizer rate 
45  
Inverse approach 
% Reduction of TP loads 10, 20, 30, 40, 50 and 60% reduction in TP loads used as input in the optimization model to estimate required gate height within the CD structure. Percent reduction of TP loads was applied in both the spring and summer periods 

Direct approach for predicting nutrient loads

The direct approach (Supplementary Figure S4) was done using the calibrated models and running new input parameters through the models to predict the TP load. The new input parameters used the nine, 2015 input datasets and varied the values within the file depending on the management scenario (Table 1). The spring and summer management scenarios were done by re-calculating what the discharge would be at different set gate levels within the CD structure. As the discharge is directly related to the water table height, this allows for a comprehensive understanding of how the management practices affect the total nutrient load. The six spring scenarios were run by varying the set gate-level between 20 and 45 cm from the soil surface, while five summer scenarios were run, maintaining a constant water height while accounting for the 9 days of free drainage needed to reduce ponding due to rainfall. Other management scenarios included four fertilizer rates, from no fertilizer to five times the actual rate, to simulate the effect of fertilizer application on TP loads. This consisted of retaining all 2015 input parameters except the fertilizer rate which was manually changed to reflect the new application rate. Finally, using the optimal management strategy found from the spring and summer scenarios, these new set gate heights were applied to a year with 900 mm of rain (wet year: 15% greater rainfall than the 30-year average), and the 2015 data which was a dry year (Supplementary Table S1). Climate data from 2010 were used as the input parameters for the wet year.

The inverse approach for optimal water table height in the CD structure

The inverse approach with particle swarm optimization (PSO) was the second management strategy used. The PSO's mechanism was inspired by the swarming behavior of biological populations (Eberhart & Shi 2001). The computational procedure of the PSO algorithm is described in the following steps: (i) initialization of the set of PSO particles with random positions and velocities; (ii) evaluation of the objective function for each particle; (iii) the value of TP load is calculated at each new particle position until a better position is achieved by the PSO particle and the value of Pbest (best position) is replaced by the current value and (iv) testing the model that if the new value of the objective function (called fitness) is better than the old, the value is updated and stored in a list. In brief, this inverse method allows for the PSO algorithm to be coupled to the NN model and predict an input parameter while maintaining a specifically wanted output.

The inverse analysis uses the PSO search algorithm in conjunction with the NN model (Supplementary Figure S5), to predict the expected water discharge input parameter for a desired TP load output. The TP load was calculated as a percent reduction rate (i.e. 10, 20%). The PSO algorithm takes this TP output value and associates it with fixed input data consisting of all parameters except the discharge. The PSO algorithm optimized the discharge value in relation to the TP load and runs the NN model until convergence of the predicted TP load to the target TP load. This convergence is a function of the following equation:
(1)
where the TP_load(real) is the percentage of the measured TP load and the TP_load(desired) is found through the PSO algorithm. A matching test is conducted within the model to compare TP_load(real) and TP_load(desired). If the model finds the two loads are equal, then this discharge data is taken as the desired data for that TP load. However, if the two loads are not equal, the PSO algorithm will generate new input data until convergence. This discharge is then used to calculate the average water table height within the CD structure. To summarize, the PSO algorithm uses a percent reduction of the actual TP loads to predict a discharge that correlates to this calculated TP load, which can then be used to calculate the water table height within the CD structure.

The proposed discharge by the PSO is applied as an input variable with the other eight 2015 input variables to create a new input dataset. This input dataset was then entered into the optimized NN model to produce the corresponding TP load. The resulting TP load is compared to the calculated TP load. If they are equal, then this computed TP load is stored as a potential solution to the inverse analysis problem (Supplementary Figure S5). In contrast, if the values of the desired and computed TP load are different, the PSO iterations are run until convergence.

The PSO approach was used with various management scenarios (Table 1) which consisted of reducing the TP loads found in 2015 by a percentage varying between 10 and 60%. These new values were then applied to predict the discharge and set the gate-level within the CD structure needed to achieve this reduction.

TP load

The TP load (Supplementary Figure S6) totaled 0.45 kg ha−1 in 2015 and 0.50 kg ha−1 in 2016. Tile drain discharge occurred during spring-thaw each year and high rainfall events. Studies have shown that some of the most significant discharge events occur during spring-thaw (Lam et al. 2016). This period of high discharge was corroborated by the 2016 results (spring load: 0.33 kg ha−1; summer load: 0.17 kg ha−1), where 67% of the tile discharge occurred from February to April resulting in 66% of the TP load, due to the freeze-thaw, as well as the increased precipitation in March 2016 (80 mm), compared to the 30-year average (46 mm). Furthermore, the 2015 TP load (spring load: 0.18 kg ha−1; summer load: 0.27 kg ha−1) was influenced by fertilizer inputs followed by high rainfall in June 2015 (160 mm), which was 112% higher than the 30-year average (75 mm), resulting in 60% of the TP load occurring during the summer. The annual TP load in the drainage water accounted for approximately 3% of the 20 kg-P ha−1 applied fertilizer for both 2015 and 2016. Therefore, only a small portion of the applied P is released through tile drain discharge (Sanchez Valero et al. 2007). Large amounts of P were sequestered within the soil or taken up by crops, as found in other studies (Easton & Petrovic 2004).

Correlation between model parameters

There was a high correlation (0.97) between the discharge and TP load (Table 2). As the TP load was calculated using both the continuous tile discharge and the TP concentration (Williams et al. 2015b), this high correlation was expected. A correlation was further found between the climate parameters (temperature, evapotranspiration) and the soil hydraulic parameters (soil moisture, field water table). Other studies have found that temperature and precipitation can impact P movement (Liu et al. 2021). The relationship between temperature, evapotranspiration and soil properties is reflected in the model. Soil hydraulic parameters and water levels were not significantly correlated with precipitation. Although climate parameters are important to understanding nutrient movement, trends are not often discernible (Coffey et al. 2019). Sinha & Michalak (2016) concluded that spatial variability of nutrient loading is driven by nutrient inputs, but intense precipitation events, greater than the average rainfall, are a driver to increased loading. The accuracy of NN models can be improved when there is an understanding of the functional relationship between the input parameters (Tiyasha et al. 2020). Therefore, the input parameters were chosen with an understanding of their inherent relationship to the output TP load.

Table 2

Correlation analysis for all model parameters

Tmax (°C)Tmin (°C)EvapotranspirationPrecipitationSoil MoistureField Water TableDischargeDaily CD water LevelP FertilizerDaily TP Load
Tmax (°C) 1.00          
Tmin (°C) 0.92 1.00         
Evapotranspiration 0.69 0.83 1.00        
Precipitation 0.12 0.06 −0.04 1.00       
Soil moisture −0.60 −0.62 −0.57 0.09 1.00      
Field water table 0.66 0.67 0.62 −0.02 −0.97 1.00     
Discharge 0.05 0.06 0.05 0.11 0.11 −0.10 1.00    
Daily CD water Level 0.38 0.34 0.28 0.02 −0.63 0.60 −0.22 1.00   
P Fertilizer −0.01 0.01 0.02 −0.02 0.00 0.00 0.05 −0.02 1.00  
Daily TP load 0.04 0.05 0.03 0.10 0.14 −0.13 0.97 −0.26 0.04 1.00 
Tmax (°C)Tmin (°C)EvapotranspirationPrecipitationSoil MoistureField Water TableDischargeDaily CD water LevelP FertilizerDaily TP Load
Tmax (°C) 1.00          
Tmin (°C) 0.92 1.00         
Evapotranspiration 0.69 0.83 1.00        
Precipitation 0.12 0.06 −0.04 1.00       
Soil moisture −0.60 −0.62 −0.57 0.09 1.00      
Field water table 0.66 0.67 0.62 −0.02 −0.97 1.00     
Discharge 0.05 0.06 0.05 0.11 0.11 −0.10 1.00    
Daily CD water Level 0.38 0.34 0.28 0.02 −0.63 0.60 −0.22 1.00   
P Fertilizer −0.01 0.01 0.02 −0.02 0.00 0.00 0.05 −0.02 1.00  
Daily TP load 0.04 0.05 0.03 0.10 0.14 −0.13 0.97 −0.26 0.04 1.00 

FNN model analyses

The time-series and randomization analysis for FNN and DFNN (Table 3) resulted in higher accuracy under the latter analysis. Others have found that FNN models have better performance compared to other NN models (Ahmed 2017). The simple structure of one hidden layer optimizes the linear parameter interactions and creates the optimal solution (Tiyasha et al. 2020). Studies have found that DFNN is an improvement over FNN models (Wilamowski & Yu 2010). As shown in Table 3 for randomization, the performance of the model increased between FNN (R=0.92) and DFNN (R=0.98). However, the uncertainty also increased under DFNN (8.54%). Training of DFNN on a small dataset caused increased uncertainty (overfitting problem) within the model prediction. Studies have found that the overfitting in training DFNN models can cause an accumulation of errors within the neurons, and decrease the model accuracy (Mhaskar & Poggio 2016).

Table 3

Feed-forward model (FNN and DFNN) analyses for both time-series and randomization as well as the RNN model (LSTM, Bi-LSTM, and GRU) analyses for time-series

Time-series
Randomization
Performance/OptimizationFNNDFNNLSTMBi-LSTMGRUFNNDFFNN
Hidden layer 19 (CGF) 19, 13 (BR) 20, 7 16, 1 14, 18 15 (CGF) 8, 18 (LM) 
Learning rate 0.271 0.0011 0.096 0.059 0.03 0.12 0.0014 
Epochs 620 196 1,980 1,974 1,356 1,000 677 
R 0.58 0.48 0.79 0.73 0.73 0.92 0.98 
NSE 0.5 0.38 0.76 0.69 0.68 0.85 0.98 
RMSE 10.5 11.67 7.3 8.3 8.34 5.47 2.53 
Y prediction 3,431 g 3,347 g 2,135 g 1,998 g 1962 g 508 g 366 g 
% Uncertainty 63% 59% 1.64% −4.90% −6.62% 0.7% 8.54% 
Time-series
Randomization
Performance/OptimizationFNNDFNNLSTMBi-LSTMGRUFNNDFFNN
Hidden layer 19 (CGF) 19, 13 (BR) 20, 7 16, 1 14, 18 15 (CGF) 8, 18 (LM) 
Learning rate 0.271 0.0011 0.096 0.059 0.03 0.12 0.0014 
Epochs 620 196 1,980 1,974 1,356 1,000 677 
R 0.58 0.48 0.79 0.73 0.73 0.92 0.98 
NSE 0.5 0.38 0.76 0.69 0.68 0.85 0.98 
RMSE 10.5 11.67 7.3 8.3 8.34 5.47 2.53 
Y prediction 3,431 g 3,347 g 2,135 g 1,998 g 1962 g 508 g 366 g 
% Uncertainty 63% 59% 1.64% −4.90% −6.62% 0.7% 8.54% 

The time-series models (Table 3) resulted in low R values (FNN: 0.58; DFNN: 0.48) and NSE (FNN: 0.5; DFNN: 0.38) and high percentages of uncertainty (FNN: 63%; DFNN: 59%). The inaccuracy of the time series can be accounted for by the type of NN model. The FNN models need more data for training (Chen et al. 2017). The time-series analysis, however, had a higher percentage of data points used for model testing, compared to the randomization analysis.

RNN model analyses

The RNN models did not perform well using a randomization analysis. The RNN models can process temporal data identifying interconnections and allowing feedback more than other NN models (Chen et al. 2020). Therefore, the temporal aspect of the time-series analysis is necessary for RNN models. The time-series analysis (Table 3) showed that the LSTM model outperformed both the Bi-LSTM and the GRU models. In addition to having better model performance in comparison to Bi-LSTM and GRU, the LSTM model had an overall lower percentage of uncertainty at 1.64% (Table 3). Other studies have found that LSTM has a good performance when analyzing nutrient parameters in water and air quality (Hamrani et al. 2020; Huan et al. 2021). Studies have found that the added ‘memory cell state’ within the LSTM model allows for a greater understanding of the input data and higher accuracy (Wang et al. 2020).

RBFNN model analyses

The results obtained with the RBFNN and GRNN models showed both underperformed with the time-series (GRNN, R=−0.45; BRFNN, R=0) and randomization (no results) analyses. Essentially RBFNN and GRNN models are very sensitive to the high variation of input and output data, as well as to the data distribution (Heydari et al. 2019). These models are, therefore, more suitable for normally distributed data. Furthermore, the low accuracy of these models can be linked to poor flexibility in terms of tunable hyper-parameters, as a single parameter can be optimized (Benoudjit & Verleysen 2003), unlike the FNN or RNN models that have multiple optimization options. Furthermore, the dataset of 2 years can be considered as the minimum data required for model predictions and could, therefore, have impacted the performance of these models as past studies have found RBFNN an accurate model to use in water quality predictions (Sharma et al. 2003).

Water management scenarios

The FNN was found to be the most accurate model under a randomization analysis, while under time-series, the LSTM outperformed the other models. The results from the NN models identified that the overall performance of the optimized FNN and LSTM models was high and allowed for the prediction of management scenarios. Two management strategies were undertaken: a direct approach and an inverse approach. The management scenarios (Table 1) allow for an understanding of BMPs and how varying these practices can affect the TP loads in the environment. The management scenarios were run to predict the most effective water table height during the spring and summer periods to reduce TP loads.

Direct approach, predicting load scenarios

The direct approach for predicting TP loads found that the LSTM predictions had an uncertainty of 13.2% and the FNN 6.2% compared to the actual TP loads of 2015 (Figure 1). Furthermore, the water table management scenarios (Table 4) were overall more accurate for spring-thaw compared to the summer period where all predictions were significantly lower than actual TP loads. Chen et al. (2020) found that at minimum 1 year of data is needed for long-term forecasting. Therefore, this study had available the minimum data required. The predicted peaks of TP loads (Figure 1) from the FNN model were similar to the measured, though not as high a daily value.

Table 4

Direct approach predictive management scenarios for the spring-thaw and summer periods

Water table depth from the soil surfaceLSTM(kg ha–1)FNN(kg ha–1)
 Spring-thaw management scenarios 
20 cm 0.33 0.30 
25 cm 0.34 0.31 
30 cm 0.35 0.34 
35 cm 0.38 0.40 
40 cm 0.44 0.53 
45 cm 0.54 0.71 
 Summer period management scenarios 
50 cm 0.25 0.25 
60 cm 0.25 0.26 
70 cm 0.26 0.27 
80 cm 0.39 0.28 
90 cm 0.40 0.42 
 Fertilizer and management scenarios 
No fertilizer 0.40–0.41 0.42–0.46 
Actual fertilizer rate 0.45 0.45 
Double fertilizer rate 0.38–0.39 0.43–0.57 
5× fertilizer rate 0.41–0.60 0.57–0.89 
Dry year (2015) 0.24–0.30 0.21–0.22 
Wet year (2010) 0.35–0.46 0.23–0.41 
Water table depth from the soil surfaceLSTM(kg ha–1)FNN(kg ha–1)
 Spring-thaw management scenarios 
20 cm 0.33 0.30 
25 cm 0.34 0.31 
30 cm 0.35 0.34 
35 cm 0.38 0.40 
40 cm 0.44 0.53 
45 cm 0.54 0.71 
 Summer period management scenarios 
50 cm 0.25 0.25 
60 cm 0.25 0.26 
70 cm 0.26 0.27 
80 cm 0.39 0.28 
90 cm 0.40 0.42 
 Fertilizer and management scenarios 
No fertilizer 0.40–0.41 0.42–0.46 
Actual fertilizer rate 0.45 0.45 
Double fertilizer rate 0.38–0.39 0.43–0.57 
5× fertilizer rate 0.41–0.60 0.57–0.89 
Dry year (2015) 0.24–0.30 0.21–0.22 
Wet year (2010) 0.35–0.46 0.23–0.41 
Figure 1

Predictive performance of the LSTM and FNN models for the 2015 data.

Figure 1

Predictive performance of the LSTM and FNN models for the 2015 data.

Close modal

Table 4 suggests that setting the water table at 35 cm or lower from the soil surface can increase TP loads compared to a water table of 30 cm. The management of water levels during the spring is crucial, as most of the nutrient discharge occurs during this period (Williams et al. 2015a). Therefore, maintaining a higher water level can reduce the annual nutrient loads. The spring FNN results were more consistent than the summer, where there was less perceived change in TP loads between 80 and 50 cm. The summer LSTM water table management scenarios showed that 70 cm from the soil surface can reduce the TP loads by 32%, compared to 80 cm or deeper. McDonald & Chaput (2010) noted that carrots require the water table to be between 75 and 90 cm for optimum growth. Therefore, given the difference in hydraulic gradient between the water level in the CD structure and the upstream tile lines, our predictions for a water table height of 80 cm from the soil surface in the CD structure are well in line with the recommendations of McDonald & Chaput (2010).

Direct approach management scenarios for fertilizer and BMPs

The results (Table 4) found no variation in TP loads between actual fertilizer rate and no application of fertilizer. As all other climate and hydrological parameters were the same, the models are unable to predict fertilizer effects when the rate is decreased. The results identify increased TP loads with the FNN model at double the fertilizer rate with a range of 0.43–0.57 kg ha−1 and at five times the fertilizer rate, with a range of 0.57–0.89 kg ha−1. Studies have shown that nutrient inputs are the driving factor to nutrient load increase (Sinha & Michalak 2016), which is seen in increased loading with increased fertilizer application.

Table 4 shows that the BMP under the direct approach is a spring water table set at 30 cm from the soil surface, and 80 cm during the summer period. The FNN model predicted a TP load of approximately 0.21–0.22 kg ha−1 during a dry year with a slight increase in the rate (0.23–0.41 kg ha−1) during a wet year. The LSTM predictions for the dry (0.24–0.30 kg ha−1) and wet years (0.35–0.46 kg ha−1) were higher compared to the FNN. There is a 15% increase in rainfall between the dry and wet years, signifying an increase in nutrient loads. However, the predictions identify similar loads between the wet and the actual 2015 practice. This identifies that free drainage and precipitation, even for a short period (as seen in 2015), increased nutrient load. Studies have found that RNN models can better handle historical time-series input data to predict water quality (Li et al. 2019). Therefore, according to past studies, the LSTM predictions may be more accurate.

Inverse approach water management scenarios

The inverse approach with the PSO algorithm quantified the TP load reduction rates and estimated the optimal corresponding discharge. The optimum values of water discharge corresponding to the TP reduction rates (10–60%) obtained from FNN-PSO and LSTM-PSO models during the summer season are presented in Figure 2. The predicted discharges were not able to identify the peaks caused by excessive rain in June/July 2015. The mean predicted discharge under TP load reduction (%) and the subsequently calculated gate levels are given in Table 5. In the spring season, the reduction in average discharge is higher with the FNN models, however, the percent reduction of TP was similar for each water table height. A reduction in TP load of 20% showed similar results under the direct approach (Table 4) reflecting that the optimal water table level from the soil surface is approximately 30 cm. Water quality management methodologies based on the PSO approach have been successfully applied for optimizing BMPs (Liu et al. 2020; Rohmat et al. 2021). However, only a few studies used the combination of ML models with optimization approaches (He et al. 2020; Taghizadeh et al. 2021). This inverse analysis resulted in the calculation of the necessary water table height for the desired TP load and can therefore facilitate the adoption of BMPs at a field scale.

Table 5

Results of the mean optimal discharge control with TP load reduction (%) by both FNN-PSO and LSTM-PSO models

Season% TP load reductionMean value of discharge (mm/day)
Average predicted water table height from the soil surface (cm)
FNN-PSOLSTM-PSOFNN-PSOLSTM-PSO
Spring Original (0%) 2.11 2.11 36 36 
10% 2.47 2.49 32.23 32.21 
20% 2.18 2.44 29.03 28.88 
30% 2.12 2.37 25.87 25.73 
40% 1.94 2.24 22.93 22.77 
50% 1.90 1.99 20.04 19.99 
60% 1.75 1.80 17.30 17.27 
Summer Original (0%) 7.60 7.60 88.65 88.65 
10% 4.95 7.51 86.58 85.13 
20% 4.60 7.45 83.42 81.81 
30% 4.39 7.33 80.35 78.69 
40% 3.66 7.15 77.72 75.75 
50% 3.22 6.89 75.05 72.98 
60% 2.70 5.96 72.53 70.68 
Season% TP load reductionMean value of discharge (mm/day)
Average predicted water table height from the soil surface (cm)
FNN-PSOLSTM-PSOFNN-PSOLSTM-PSO
Spring Original (0%) 2.11 2.11 36 36 
10% 2.47 2.49 32.23 32.21 
20% 2.18 2.44 29.03 28.88 
30% 2.12 2.37 25.87 25.73 
40% 1.94 2.24 22.93 22.77 
50% 1.90 1.99 20.04 19.99 
60% 1.75 1.80 17.30 17.27 
Summer Original (0%) 7.60 7.60 88.65 88.65 
10% 4.95 7.51 86.58 85.13 
20% 4.60 7.45 83.42 81.81 
30% 4.39 7.33 80.35 78.69 
40% 3.66 7.15 77.72 75.75 
50% 3.22 6.89 75.05 72.98 
60% 2.70 5.96 72.53 70.68 
Figure 2

Daily optimal discharge under various TP load reductions (%) by (a) FNN-PSO and (b) LSTM-PSO models during the summer season 2015.

Figure 2

Daily optimal discharge under various TP load reductions (%) by (a) FNN-PSO and (b) LSTM-PSO models during the summer season 2015.

Close modal

The summer period results (Table 5), with the FNN-PSO analysis, showed that decreasing the TP load by 10% is obtained by a reduction of 35% in the average drainage discharge. This reduction in discharge, however, was not found with LSTM-PSO. Both FNN-PSO and LSTM-PSO found that the TP load reduction of 20–60% identified a gradual decline in discharge. The PSO results predicted that reducing the TP load by 10% would set the water table level within the CD structure to approximately 85 cm from the soil surface, while with a reduction of 60% TP loads, the water table level would be about 71 cm. As carrot crops need a water table level between 75 and 90 cm from the soil surface (McDonald & Chaput 2010), the inverse analysis predicted that a reduction of 30–40% TP loads would still be obtained with a water table depth of approximately 78 cm and would be able to meet the crop aeration requirement by adequate drainage.

This study investigated the optimization of seven NN models (FNN, DFNN, LSTM, Bi-LSTM, GRU, RBFNN, GRNN) under two different modeling approaches (randomization and time-series). Our results show that the FNN and LSTM models are very reliable with only a few years of field data. Continued analysis into parameter distribution and longer-term datasets can obviously strengthen model performance. The results identified LSTM as the model with the best performance under the time-series approach with temporal time step being important to the performance of RNN models. In addition, the FNN model outperformed all other models under the randomization approach. Two management strategies, the direct and the inverse approach, were used to predict the best water management strategy needed to reduce TP loads. Both the direct and inverse approach predicted similar recommendations that can be adopted by farmers. The direct approach found that the optimal strategy was to set the water table at 30 cm from the soil surface during the spring-thaw period and at 80 cm during the summer period, to reduce the TP load to the environment, while also not negatively affecting crop production. The direct approach was also used to assess fertilizer management scenarios and found that any increase in the fertilizer rate from actual (20 kg-P ha−1), will increase the TP load. The second management strategy, the inverse approach found that a reduction of 20% in the spring TP load can be achieved by raising the gate-level to 30 cm from the soil surface. Furthermore, in the summer period, the analysis identified that a reduction of up to 40% of the TP loads was possible while maintaining a water table height of approximately 78 cm for crop production.

The broader applicability of our research lies in the ability to apply the LSTM or FNN models to multiple locations within an organic soil agricultural area, thereby allowing for the development of appropriate water management and fertilizer application strategies to achieve P reduction in lakes and rivers. NN models can optimize fertilizer rates and water table levels for different crops and fields, based on inputs of climate and soil moisture data for P management strategies.

We thank Hicham Benslim, Hélène Lalande, David Meek and Khosro Mousavi for their technical assistance and are grateful to Aidan De Sena, Paddy Enright, Naresh Gaj, Samuel Ihuoma, Marjorie Macdonald, Kerri Edwards, Kenton Ollivierre, Naeem Abbasi and Rebecca Seltzer for their field/lab assistance. We also thank the University of Guelph-Muck Crops Research Station staff and the landowners who provided access to the fields during this study.

Funding for this project was provided by the Natural Sciences and Engineering Research Council of Canada (NSERC) under the Strategic Projects Grant fund # 447528-13.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Ahmed
A. A. M.
2017
Prediction of dissolved oxygen in Surma River by biochemical oxygen demand and chemical oxygen demand using the artificial neural networks (ANNs)
.
Journal of King Saud University – Engineering Sciences
29
(
2
),
151
158
.
https://doi.org/10.1016/j.jksues.2014.05.001
.
Benoudjit
N.
&
Verleysen
M.
2003
On the kernel widths in radial-basis function networks
.
Neural Processing Letters
.
https://doi.org/10.1023/A:1026289910256
.
Carstensen
M.
,
Poulsen
J.
,
Ovesen
N.
,
Børgesen
C.
,
Hvid
S.
&
Kronvang
B.
2016
Can controlled drainage control agricultural nutrient emissions? evidence from a BACI experiment combined with a dual isotope approach
.
Hydrology and Earth System Sciences Discussions
1
17
.
https://doi.org/10.5194/hess-2016-303
.
Chang
F.-J.
,
Chen
P.-A.
,
Chang
L.-C.
&
Tsai
Y.-H.
2016
Estimating spatio-temporal dynamics of stream total phosphate concentration by soft computing techniques
.
Science of The Total Environment
562
,
228
236
.
https://doi.org/10.1016/j.scitotenv.2016.03.219
.
Chen
L.
,
Sun
C.
,
Wang
G.
,
Xie
H.
&
Shen
Z.
2017
Modeling multi-event non-point source pollution in a data-scarce catchment using ANN and entropy analysis
.
Entropy
19
(
6
),
265
.
https://doi.org/10.3390/e19060265
.
Chen
Y.
,
Song
L.
,
Liu
Y.
,
Yang
L.
&
Li
D.
2020
A review of the artificial neural network models for water quality prediction
.
Applied Sciences
10
(
17
),
5776
.
https://doi.org/10.3390/app10175776
.
Coffey
R.
,
Paul
M. J.
,
Stamp
J.
,
Hamilton
A.
&
Johnson
T.
2019
A review of water quality responses to air temperature and precipitation changes 2: nutrients, algal blooms, sediment, pathogens
.
JAWRA Journal of the American Water Resources Association
55
(
4
),
844
868
.
https://doi.org/10.1111/1752-1688.12711
.
Dahamsheh
A.
&
Aksoy
H.
2009
Artificial neural network models for forecasting intermittent monthly precipitation in arid regions
.
Meteorological Applications
16
(
3
),
325
337
.
https://doi.org/10.1002/met.127
.
Dayton
E. A.
,
Whitacre
S.
&
Holloman
C.
2017
Comparison of three persulfate digestion methods for total phosphorus analysis and estimation of suspended sediments
.
Applied Geochemistry
78
,
357
362
.
https://doi.org/10.1016/j.apgeochem.2017.01.011
.
Easton
Z. M.
&
Petrovic
A. M.
2004
Fertilizer source effect on ground and surface water quality in drainage from turfgrass
.
Journal of Environmental Quality
33
(
2
),
645
655
.
https://doi.org/10.2134/jeq2004.6450
.
Eberhart
R. C.
&
Shi
Y.
2001
Particle swarm optimization: developments, applications and resources
. In
Proceedings of the IEEE Conference on Evolutionary Computation
.
ICEC
. pp.
81
86
.
https://doi.org/10.1109/cec.2001.934374
Government of Canada
2018
Daily Data Report for Toronto International Airport, Ontario
.
Historical Data. Available from: http://climate.weather.gc.ca/historical_data/search_historic_data_e.html (accessed 2 December 2018)
Grenon
G.
,
Singh
B.
,
De Sena
A.
,
Madramootoo
C. A.
,
von Sperber
C.
,
Goyal
M. K.
&
Zhang
T.
2021
Phosphorus fate, transport, and management on subsurface drained agricultural organic soils: a review
.
Environmental Research Letters
16
(
1
),
013004
.
https://doi.org/10.1088/1748-9326/abce81
.
Gunn
K. M.
,
Fausey
N. R.
,
Shang
Y.
,
Shedekar
V. S.
,
Ghane
E.
,
Wahl
M. D.
&
Brown
L. C.
2015
Subsurface drainage volume reduction with drainage water management: case studies in Ohio, USA
.
Agricultural Water Management
149
,
131
142
.
https://doi.org/10.1016/j.agwat.2014.10.014
.
Ha
H.
&
Stenstrom
M. K.
2003
Identification of land use with water quality data in stormwater using a neural network
.
Water Research
37
(
17
),
4222
4230
.
https://doi.org/10.1016/S0043-1354(03)00344-0
.
Hallema
D. W.
,
Périard
Y.
,
Lafond
J. A.
,
Gumiere
S. J.
&
Caron
J.
2015
Characterization of water retention curves for a series of cultivated histosols
.
Vadose Zone Journal
14
(
6
).
https://doi.org/10.2136/vzj2014.10.0148
.
Hamrani
A.
,
Akbarzadeh
A.
&
Madramootoo
C. A.
2020
Machine learning for predicting greenhouse gas emissions from agricultural soils
.
Science of The Total Environment
741
,
140338
.
http://doi.org/10.1016/j/scitotenv.2020.140338
.
He
S.
,
Chen
W.
,
Mu
X.
&
Cui
W.
2020
Constrained optimization model of the volume of initial rainwater storage tank based on ANN and PSO
.
Environmental Science and Pollution Research
27
(
17
),
21057
21070
.
http://doi.org/10.1007/s11356-020-08630-6
.
Heydari
A.
,
Garcia
D. A.
,
Keynia
F.
,
Bisegna
F.
&
Santoli
L. D.
2019
Renewable energies generation and carbon dioxide emission forecasting in microgrids and national grids using GRNN-GWO methodology
.
Energy Procedia
159
,
154
159
.
https://doi.org/10.1016/j.egypro.2018.12.044
.
Huan
J.
,
Chen
B.
,
Xu
X. G.
,
Li
H.
,
Li
M. B.
&
Zhang
H.
2021
River dissolved oxygen prediction based on random forest and LSTM
.
Applied Engineering in Agriculture
0
.
https://doi.org/10.13031/aea14496
.
Krasa
J.
,
Dostal
T.
,
Jachymova
B.
,
Bauer
M.
&
Devaty
J.
2019
Soil erosion as a source of sediment and phosphorus in rivers and reservoirs – watershed analyses using WaTEM/SEDEM
.
Environmental Research
171
,
470
483
.
https://doi.org/10.1016/j.envres.2019.01.044
.
Kubat
M.
1998
Decision trees can initialize radial-basis function networks
.
IEEE Transactions on Neural Networks
9
(
5
),
813
821
.
https://doi.org/10.1109/72.712154
.
Lam
W. V.
,
Macrae
M. L.
,
English
M. C.
,
O'Halloran
I. P.
,
Plach
J. M.
&
Wang
Y.
2016
Seasonal and event-based drivers of runoff and phosphorus export through agricultural tile drains under sandy loam soil in a cool temperate region
.
Hydrological Processes
30
(
15
),
2644
2656
.
https://doi.org/10.1002/hyp.10871
.
Li
L.
,
Jiang
P.
,
Xu
H.
,
Lin
G.
,
Guo
D.
&
Wu
H.
2019
Water quality prediction based on recurrent neural network and improved evidence theory: a case study of Qiantang River, China
.
Environmental Science and Pollution Research
26
(
19
),
19879
19896
.
https://doi.org/10.1007/s11356-019-05116-y
.
Liang
Z.
,
Soranno
P. A.
&
Wagner
T.
2020
The role of phosphorus and nitrogen on chlorophyll a: evidence from hundreds of lakes
.
Water Research
185
,
116236
.
https://doi.org/10.1016/j.watres.2020.116236
.
Liu
G.
,
Chen
L.
,
Wang
W.
,
Sun
C.
&
Shen
Z.
2020
A water quality management methodology for optimizing best management practices considering changes in long-term efficiency
.
Science of The Total Environment
725
,
138091
.
https://doi.org/10.1016/j.scitotenv.2020.138091
.
Liu
W.
,
Birgand
F.
,
Tian
S.
&
Chen
C.
2021
Event-scale hysteresis metrics to reveal processes and mechanisms controlling constituent export from watersheds: a review
.
Water Research
117254
.
https://doi.org/10.1016/j.watres.2021.117254
.
Longabucco
P.
&
Rafferty
M. R.
1989
Delivery of nonpoint-source phosphorus from cultivated mucklands to Lake Ontario
.
Journal of Environmental Quality
18
(
2
),
157
163
.
https://doi.org/10.2134/jeq1989.00472425001800020005x
.
Maier
H. R.
,
Jain
A.
,
Dandy
G. C.
&
Sudheer
K. P.
2010
Methods used for the development of neural networks for the prediction of water resource variables in river systems: current status and future directions
.
Environmental Modelling & Software
25
(
8
),
891
909
.
https://doi.org/10.1016/j.envsoft.2010.02.003
.
McDonald
M. R.
&
Chaput
J.
2010
Management of Organic Soils
.
OMAFRA
.
Available from: http://www.omafra.gov.on.ca/english/crops/facts/93–053.htm (accessed 10 November 2019)
Mhaskar
H.
&
Poggio
T.
2016
Deep vs. shallow networks: an approximation theory perspective
.
Analysis and Applications
14
(
06
),
829
848
.
Noe
G. B.
,
Childers
D. L.
&
Jones
R. D.
2001
Phosphorus biogeochemistry and the impact of phosphorus enrichment: why is the everglades so unique?
Ecosystems
4
(
7
),
603
624
.
https://doi.org/10.1007/s10021-001-0032-1
.
Ozyildirim
B. M.
&
Avci
M.
2013
Generalized classifier neural network
.
Neural Networks
39
,
18
26
.
https://doi.org/10.1016/j.neunet.2012.12.001
.
Qi
H.
,
Qi
Z.
,
Zhang
T. Q.
,
Tan
C. S.
&
Sadhukhan
D.
2018
Modeling phosphorus losses through surface runoff and subsurface drainage using ICECREAM
.
Journal of Environmental Quality
47
(
2
),
203
211
.
https://doi.org/10.2134/jeq2017.02.0063
.
Qi
Z.
,
Kang
G.
,
Wu
X.
,
Sun
Y.
&
Want
Y.
2020
Multi-objective optimization for selecting and siting the cost-effective BMPs by coupling revised GWLF model and NSGAII algorithm
.
Water
12
(
235
),
1
12
.
https://doi.org/10.3390/w.12010235
.
Rohmat
F. I.
,
Gates
T. K.
&
Labadie
J. W.
2021
Enabling improved water and environmental management in an irrigated river basin using multi-agent optimization of reservoir operations
.
Environmental Modelling & Software
135
,
104909
.
https://doi.org/10.1016/j.envsoft.2020.104909
.
Sainath
T. N.
,
Vinyals
O.
,
Senior
A.
&
Sak
H.
2015
Convolutional, long short-term memory, fully connected deep neural networks
. In
2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
. pp.
4580
4584
.
https://doi.org/10.1109/ICASSP.2015.7178838
Sanchez Valero
C.
,
Madramootoo
C. A.
&
Stämpfli
N.
2007
Water table management impacts on phosphorus loads in tile drainage
.
Agricultural Water Management
89
(
1–2
),
71
80
.
https://doi.org/10.1016/j.agwat.2006.12.007
.
Schindler
D. W.
,
Hecky
R. E.
,
Findlay
D. L.
,
Stainton
M. P.
,
Parker
B. R.
,
Paterson
M. J.
,
Beaty
K. G.
,
Lyng
M.
&
Kasian
S. E. M.
2008
Eutrophication of lakes cannot be controlled by reducing nitrogen input: results of a 37-year whole-ecosystem experiment
.
Proceedings of the National Academy of Sciences
105
(
32
),
11254
11258
.
https://doi.org/10.1073/pnas.0805108105
.
Seppelt
R.
1999
Applications of optimum control theory to agroecosystem modelling
.
Ecological Modelling
121
(
2-3
),
161
183
.
https://doi.org/10.1016/S0304-3800(99)00082-4
.
Sharma
V.
,
Negi
S. C.
,
Rudra
R. P.
&
Yang
S.
2003
Neural networks for predicting nitrate-nitrogen in drainage water
.
Agricultural Water Management
63
(
3
),
169
183
.
https://doi.org/10.1016/S0378-3774(03)00159-8
.
Sing
J. K.
,
Basu
D. K.
,
Nasipuri
M.
&
Kundu
M.
2003
Improved k-means algorithm in the design of RBF neural networks. TENCON 2003
. In
Conference on Convergent Technologies for Asia-Pacific Region
. pp.
841
845
.
https://doi.org/10.1109/TENCON.2003.1273297
Sinha
E.
&
Michalak
A. M.
2016
Precipitation dominates interannual variability of riverine nitrogen loading across the continental United States
.
Environmental Science & Technology
50
(
23
),
12874
12884
.
https://doi.org/10.1021/acs.est.6b04455
.
Sinshaw
T. A.
,
Surbeck
C. Q.
,
Yasarer
H.
&
Najjar
Y.
2019
Artificial neural network for prediction of total nitrogen and phosphorus in US lakes
.
Journal of Environmental Engineering
145
(
6
),
04019032
.
http://doi.org/10.1061/(ASCE)EE.1943-7870.0001528
.
Skaggs
R. W.
,
Fausey
N. R.
&
Evans
R. O.
2012
Drainage water management
.
Journal of Soil and Water Conservation
67
(
6
),
167A
172A
.
https://doi.org/10.2489/jswc.67.6.167A
.
Snoek
J.
,
Ripped
O.
,
Swersky
K.
,
Kiros
R.
,
Satish
N.
,
Sundaram
N.
,
Patwary
M.
,
Prabhat
M.
&
Adams
R.
2015
Scalable Bayesian optimization using deep neural networks
. In
32nd International Conference on Machine Learning, ICML 2015
. Vol.
37
, pp.
2171
-
2180
.
Taghizadeh
S.
,
Khani
S.
&
Rajaee
T.
2021
Hybrid SWMM and particle swarm optimization model for urban runoff water quality control by using green infrastructures (LID-BMPs)
.
Urban Forestry & Urban Greening
60
,
127032
.
https://doi.org/10.1016/j.ufug.2021.127032
.
Tiyasha
T.
,
Tung
T. M.
&
Yaseen
Z. M.
2020
A survey on river water quality modelling using artificial intelligence models: 2000–2020
.
Journal of Hydrology
585
,
124670
.
https://doi.org/10.1016/j.jhydrol.2020.124670
.
Tiyasha
T.
,
Tung
T. M.
,
Bhagat
S. K.
,
Tan
M. L.
,
Jawad
A. H.
,
Mohtar
W. H. M. W.
&
Yaseen
Z. M.
2021
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: development of hybrid tree-based artificial intelligence models
.
Marine Pollution Bulletin
170
,
112639
.
https://doi.org/10.1016/j.marpolbul.2021.112639
.
Wang
F.
,
Xuan
Z.
,
Zhen
Z.
,
Li
K.
,
Wang
T.
&
Shi
M.
2020
A day-ahead PV power forecasting method based on LSTM-RNN model and time correlation modification under partial daily pattern prediction framework
.
Energy Conversion and Management
.
https://doi.org/10.1016/j.enconman.2020.112766
.
Welikhe
P.
,
Brouder
S. M.
,
Volenec
J. J.
,
Gitau
M.
&
Turco
R. F.
2021
Using artificial neural networks to improve phosphorus indices
.
Journal of Soil and Water Conservation
76
(
6
),
513
526
.
http://doi.org/10.2489/jswc.2021.00153
.
Wilamowski
B. M.
&
Yu
H.
2010
Improved computation for Levenberg Marquardt training
.
IEEE Transactions on Neural Networks
.
https://doi.org/10.1109/TNN.2010.2045657
.
Williams
M. R.
,
King
K. W.
&
Fausey
N. R.
2015a
Drainage water management effects on tile discharge and water quality
.
Agricultural Water Management
148
,
43
51
.
https://doi.org/10.1016/j.agwat.2014.09.017
.
Williams
M. R.
,
King
K. W.
,
Macrae
M. L.
,
Ford
W.
,
Van Esbroeck
C.
,
Brunke
R. I.
,
English
M. C.
&
Schiff
S. L.
2015b
Uncertainty in nutrient loads from tile-drained landscapes: effect of sampling frequency, calculation algorithm, and compositing strategy
.
Journal of Hydrology
530
,
306
316
.
https://doi.org/10.1016/j.jhydrol.2015.09.060
.
Winter
J. G.
,
Eimers
C. M.
,
Dillon
P. J.
,
Scott
L. D.
,
Scheider
W. a.
&
Willox
C. C.
2007
Phosphorus inputs to lake simcoe from 1990 to 2003: declines in tributary loads and observations on lake water quality
.
Journal of Great Lakes Research
33
(
2
),
381
396
.
https://doi.org/10.3394/0380-1330(2007)33[381:PITLSF]2.0.CO;2
.
Wu
H.
,
Zhu
A.-X.
,
Liu
J.
,
Liu
Y.
&
Jiang
J.
2018
Best management practices optimization at watershed scale: incorporating spatial topology among fields
.
Water Resource Management
32
(
1
),
155
177
.
https://doi.org/10.1007/s11269-017-1801-8
.
Yu
H.
,
Xie
T.
,
Paszczynski
S.
&
Wilamowski
B. M.
2011
Advantages of radial basis function networks for dynamic system design
.
IEEE Transactions on Industrial Electronics
58
(
12
),
5438
5450
.
https://doi.org/10.1109/TIE.2011.2164773
.
Zheng
Z. M.
,
Zhang
T. Q.
,
Wen
G.
,
Kessel
C.
,
Tan
C. S.
,
O'Halloran
I. P.
,
Reid
D. K.
,
Nemeth
D.
&
Speranzini
D.
2014
Soil testing to predict dissolved reactive phosphorus loss in surface runoff from organic soils
.
Soil Science Society of America Journal
78
(
5
),
1786
1796
.
https://doi.org/10.2136/sssaj2014.02.0065
.
Zheng
Z. M.
,
Zhang
T. Q.
,
Kessel
C.
,
Tan
C. S.
,
O'Halloran
I. P.
,
Wang
Y. T.
,
Speranzini
D.
&
Van Eerd
L. L.
2015
Approximating phosphorus leaching from agricultural organic soils by soil testing
.
Journal of Environmental Quality
44
(
6
),
1871
1882
.
https://doi.org/10.2134/jeq2015.05.0211
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Supplementary data