A surrogate modeling framework is developed in this study to circumvent the computational burden of high-fidelity numerical groundwater models for arid coastal aquifers. Two different surrogate models, namely, artificial neural network (ANN) and Gaussian process model (GPM) are trained to replace the computationally expensive numerical flow and transport model OpenGeoSys. A novel time-dependent training scheme is introduced which helps the surrogates in tracking the discrete-time state-space trajectories of the high-fidelity model, thereby making them suitable for variable-time simulations. The surrogates are also tested in the extrapolation range corresponding to some extreme boundary conditions such as a very high rate of extraction. Both the surrogates show comparable accuracy in efficiently approximating the numerical model response; however, ANN is found to be much faster than GPM for the size of the data used. The trained surrogates are then used in developing a long-term planning and management framework for analyzing feasible management scenarios in the coastal aquifer of Oman.
INTRODUCTION
Increasing water demand in different consumer sectors has contributed to higher water stresses in the aquifers in various parts of the world (WBCSD 2006). The consequences of this alteration vary from place to place depending on water consumption, aquifer characteristics, and climate conditions. Aquifers in coastal arid or semi-arid regions are often vulnerable to water table drawdown and seawater intrusion as a result of increased water pumping. As water is extracted from an aquifer, the water table drops at the pumping location, triggering drawdown. A sustainable development entails extraction to be equal to the ‘capture’ which is a balance between the change in virgin rate of recharge and/or the change in virgin rate of discharge (Bredehoeft 2002). Seawater intrudes into the aquifer subsequent to the negative pressure gradients created due to high extraction, and contaminates the fresh water regions. This curtails drinking water availability and also jeopardizes agricultural production. Robust management strategies are therefore indispensable in this kind of situation for ensuring both profit from agriculture as well as aquifer sustainability.
In recent practice, management frameworks often couple a numerical groundwater model with an optimization algorithm to reach the optimal solutions of the control variables (Das & Datta 2001; Nicklow et al. 2010; Yeh 2015). This method is called simulation-optimization. The flow and transport processes can be simulated by various physically based high-fidelity (highly realistic) numerical models (e.g., Voss 1984; Lin et al. 1997; Guo & Langevin 2002; Liu et al. 2002; Hamidi et al. 2006; Delfs et al. 2009). For the optimization part, various nature-inspired metaheuristic algorithms, such as genetic algorithm (Wang & Zheng 1998; Yan & Minsker 2006; Ataie-Ashtiani et al. 2014), evolution strategies (Bayer & Finkel 2004, 2007; Kollat & Reed 2006; Papadopoulou et al. 2010), particle-swarm-optimization (Mousavi & Shourian 2010; Gaur et al. 2011; Sedki & Ouazar 2011), ant-colony-optimization (Ataie-Ashtiani & Ketabchi 2011), differential evolution (de Paly et al. 2013), and honey-bee-mating-optimization (Haddad & Marino 2007, 2011) have gained vast popularity.
There are two main issues with a simulation-optimization framework that couples a high-fidelity numerical model with an optimization algorithm. First, the numerical models are often computationally expensive and second, the optimization algorithm requires the numerical model to run several thousands of times which makes the whole process overly time-consuming. In addition, the computational complexity can increase further if the considered optimal management problem has multiple criteria and/or a high number of decision variables (see Table 1). This computational burden can be reduced significantly by using different surrogate modeling techniques (Rao et al. 2004; Bhattacharjya & Datta 2009; Dhar & Datta 2009; Papadopoulou et al. 2010; Sreekanth & Datta 2011, 2015; Ketabchi & Ataie-Ashtiani 2015). A surrogate model is an empirical or lower-fidelity alternative to either (A) the response of the high-fidelity numerical simulation model itself or (B) the objective function of the considered optimization problem or (C) the operating policy model in case of optimal control problems (Ratto et al. 2012). A surrogate model is simpler, considerably faster to run, and is generally derived through a training framework from the original simulation model (Razavi et al. 2012b).
Reference . | Modela . | Problemb . | Objective functionc . | #DVd . | SMe . | OAf . | STg . | TDh . | CEi . |
---|---|---|---|---|---|---|---|---|---|
Bhattacharjya & Datta (2009) | 3D DA-GW | GW (q) | max(Qout), min(C), min(Qinj) | 33 | MLP | NSGA-II | HC | TR3 | 1,000 × 2,500 |
Bray & Yeh (2008) | FEMWATER | GW (q,x,y) | min(Qinj) | 215 | – | GA | RC | TR5 | Parallel |
Dhar & Datta (2009) | FEMWATER | GW(q) | max(Qout), min(Qinj) | 33 | MLP | NSGA-II | HC | TR3 | 24 × 1,800 |
Haddad & Marino (2011) | 3D DA-GW | GW(q) | max(C) | 26 | – | HBMO | HC, RC | TI | 6,000 |
Kourakos & Mantoglou (2010) | SEAWAT (2D) | GW(q) | min(Cök), min(Cumw) | 4 | – | NSGA-II | HC | TI | ? × 400 |
Papadopoulou et al. (2010) | PTC | GW(q) | max(Qout) | 5 | RBF | DE | RC | TI | 20 × 200 |
Rao et al. (2004) | SEAWAT | GW(q) | max(Qout), min(Qinj) | 27 | MLP | SA | HC | TR3 | 10,000 |
Sreekanth & Datta (2011) | FEMWATER | GW(q) | max(Qout), min(Qinj) | 33 | GP | NSGA-II | HC | TR3 | 200 × 750 |
Reference . | Modela . | Problemb . | Objective functionc . | #DVd . | SMe . | OAf . | STg . | TDh . | CEi . |
---|---|---|---|---|---|---|---|---|---|
Bhattacharjya & Datta (2009) | 3D DA-GW | GW (q) | max(Qout), min(C), min(Qinj) | 33 | MLP | NSGA-II | HC | TR3 | 1,000 × 2,500 |
Bray & Yeh (2008) | FEMWATER | GW (q,x,y) | min(Qinj) | 215 | – | GA | RC | TR5 | Parallel |
Dhar & Datta (2009) | FEMWATER | GW(q) | max(Qout), min(Qinj) | 33 | MLP | NSGA-II | HC | TR3 | 24 × 1,800 |
Haddad & Marino (2011) | 3D DA-GW | GW(q) | max(C) | 26 | – | HBMO | HC, RC | TI | 6,000 |
Kourakos & Mantoglou (2010) | SEAWAT (2D) | GW(q) | min(Cök), min(Cumw) | 4 | – | NSGA-II | HC | TI | ? × 400 |
Papadopoulou et al. (2010) | PTC | GW(q) | max(Qout) | 5 | RBF | DE | RC | TI | 20 × 200 |
Rao et al. (2004) | SEAWAT | GW(q) | max(Qout), min(Qinj) | 27 | MLP | SA | HC | TR3 | 10,000 |
Sreekanth & Datta (2011) | FEMWATER | GW(q) | max(Qout), min(Qinj) | 33 | GP | NSGA-II | HC | TR3 | 200 × 750 |
a3D DA-GW = 3D density driven groundwater model, FEMWATER, SEAWAT, PTC = see reference.
bGW = groundwater management, q = abstraction (injection) rate, x = x-coordinate, y = y-coordinate of the well.
cQout = pump volume, Qinj = injection volume, C= costs.
dNumber of decision variables.
eSurrogate model: GP = genetic programming, MLP = multilayer perceptron, RBF = radial basis function network.
fOptimization algorithm: GA = genetic algorithm, HBMO = honey-bee mating algorithm, NSGA-II = nondominated sorting genetic algorithm II, DE = differential evolution, SA = simulated annealing.
gType of study: HC = hypothetical case, RC = real case.
hTime dependence of decision variables: TI = time-invariant, TR <x> transient over x periods.
iComputational effort: <population size > x < nr. of generations > .
A comprehensive review of the state-of-the-art of surrogate modeling of type (B) – the approximation of the objective function – can be found in Razavi et al. (2012b) where the authors analyzed 48 studies dealing with surrogate models directly from the water resources field and over 100 from a broader research community and summarized various concepts, methodological developments, and challenges involved in the field of surrogate modeling. They divided the surrogate model development techniques into four different categories: (1) basic ‘off-line’ sequential framework (e.g., Sreekanth & Datta 2011), (2) adaptive-recursive framework (e.g., Fen et al. 2009), (3) metamodel-embedded evolution framework (e.g., Yan & Minsker 2011), and (4) approximation uncertainty based framework (e.g., Razavi et al. 2012a). In a basic sequential framework, the surrogates are globally fitted to the evaluated objective function values of some fixed design datasets. For the three other categories, the approximation framework is ‘on-line’; i.e., the design datasets and thus the surrogate model are updated during the course of the optimization. The approximation uncertainty based framework is the probabilistic version of the adaptive-recursive framework. Instead of considering the surrogate approximations as true deterministic values, this framework further incorporates the uncertainty associated with those approximations (Razavi et al. 2012a).
Table 1 shows some selected examples of simulation-optimization frameworks for the management and monitoring of coastal aquifers using surrogate models, and that are also reviewed by Ketabchi & Ataie-Ashtiani (2015) and Sreekanth & Datta (2015). All the reviewed studies employ surrogate modeling of type (B) – the approximation of the objective function – which thus require a fixed time horizon for training of the surrogates. Therefore, these surrogate models can be considered as time-independent surrogate models and an additional training is required if the given time horizon of the problem changes.
In this study, we introduce a new surrogate training framework which focuses on surrogate modeling of type (A); i.e., an approximation of the simulator, which makes the trained surrogate model time-dependent. These surrogates are able to track the discrete-time state-space trajectories of the original model and avoid an additional training when the simulation time is altered. This training scheme is ‘off-line’ in nature and exploits the information available in the variable space through a comprehensive set of feasible scenarios. We use two different surrogate models, namely, artificial neural network (ANN) and Gaussian process model (GPM). ANN is widely used in various hydrological applications (Hsu et al. 1995; Liriano & Day 2001; Jain et al. 2004; Shourian et al. 2008; Bhattacharjya & Datta 2009; Dhar & Datta 2009; Fu et al. 2010) whereas GPM is an emerging technique of non-parametric surrogate modeling (MacKay 1998; Williams 2002; Rasmussen & Williams 2006).
METHODS
The state-space surrogate modeling framework
Groundwater model OGS
OGS is a numerical simulation package for thermo-hydro-mechanical-chemical processes in porous and fractured media. It is based on the finite element method to solve the governing differential equations for different physical processes. Variable density flow simulation in porous media is highly dependent on the changes in solute concentration as well as temperature. OGS is used in this study to consider changes in density; however, the medium is assumed to be isothermal. The three fundamental equations for variable density flow are (a) continuity equation, (b) momentum equation, and (c) contaminant transport equation, which are also linked to the bulk fluid density and hydrodynamic dispersion equations (Kolditz & Shao 2010).
Continuity equation
Momentum equation
Contaminant transport equation
The basic idea behind OGS dates back to the mid-1980s (Kolditz et al. 2012). It was formerly known as Rockflow/Geosys (Kolditz & Bauer 2004), and was first coupled with optimization by Bayer & Finkel (2007). Application of OGS can be found in some relatively new papers (e.g., Kolditz et al. 2008; Delfs et al. 2009; McDermott et al. 2009; Grundmann et al. 2012; Walther et al. 2012, 2014). The model was validated by Walther et al. (2012) against the Goswami–Clement benchmark problem (Goswami & Clement 2007) which is similar to the Henry's problem (Henry 1960) for salt water intrusion. The problem is posed as an initially freshwater environment with horizontal saltwater intrusion and withdrawal. The steady state density-dependent simulations of OGS were compared against the simulation results of the SEAWAT model (Langevin & Guo 2006) and the observations. It was found that OGS matched the observations more closely as compared to SEAWAT. The same experiment was extended to the transient state by Walther et al. (2014), and the ability of OGS to represent the transient density-dependent flow processes was also verified.
ANN
ANNs are biologically inspired mathematical models that imitate the basic structure and function of the human brain. Several interconnected artificial neurons form the neural network. A simple network consists of three different layers, namely, input layer, hidden layer, and output layer. Networks can be classified as feed-forward network and feed-back network based on the direction of the flow of information. Feed-forward network is a popular choice in the field of hydrologic modeling where the flow of information is unidirectional. The connections between the neurons in a network are assigned with some weights which get updated during the training process based on the minimization of an error function. In this study, we apply the scaled conjugate gradient (SCG) algorithm (Møller 1993) for the training of ANNs.
GPM
In simple regression methods, a function is fitted between the input and the output variables. These methods are constrained by the choice of any particular fitting function (e.g., linear, higher order polynomial, sigmoidal, etc.). In an ideal regression framework, all possible function forms must be included, but there are infinite sets of feasible function forms. The concept of Gaussian process (GP) helps overcome this problem. A GP is defined as a distribution over functions, unlike a Gaussian distribution which is a distribution over vectors, or a Dirichlet process denoting distribution over distributions. GP can therefore be fully specified by a mean function and a covariance function, and once these two functions are specified, GP can be treated with the basic rules of probability applied to multivariate Gaussian distributions.
The covariance function controls the collective variability in different function values for a given input to the functions. Learning/Training in GP implies calculating the optimal values of the covariance function parameters (also called hyperparameters) in light of the training data. Since GP is a non-parametric modeling approach, it requires the whole training dataset for prediction purpose. The main disadvantage of GP is its high computational time requirement for large datasets resulting from the inversion of the covariance matrix during training. To overcome this issue, various sparse pseudo-input GP techniques are proposed in which the actual input data are represented by a smaller subset (called active set) of the total input data selected based on some specific criteria (Smola & Bartlett 2000; Seeger et al. 2003; Quinonero-Candela & Rasmussen 2005; Snelson & Ghahramani 2006).
Model performance evaluation
For a model to be called well performing, the , , and values should be as low as possible. Values of R lie within a range of −1 to 1 where a value close to 1 denotes strong positive correlation and a value close to −1 indicates strong negative correlation. Values of vary from negative infinity to 1 where any value close to 1 is considered to be good. A negative value indicates that the performance of the model is worse than a simple persistence model. s give an idea about the overall (or any other error) distribution. For example, a value of 90% at 25% level () would imply that for 90% of the total points, is less than 25%. Hence, larger value of at lower level is always desired.
Water management system for coastal regions
The surrogates are applied in the frame of a simulation-based integrated water management system published by Grundmann et al. (2012, 2013). This system aims at generating optimal strategies for the management of water and agricultural resources considering water quantity and quality in the coastal regions that are infected by saltwater intrusion. A simulation-optimization approach is used where an evolution strategy is linked with an integrated simulation model that describes the coupled groundwater-agriculture hydrosystem. The trained surrogate models are used within the integrated simulation model to simulate the aquifer behavior including the dynamics of the seawater interface. Behavior of the highly productive agricultural farms is simulated by a database of generated stochastic crop water production functions (SCWPF) which describe the relationship between crop yield, salinity of irrigation water, and water amount for irrigation assuming an optimal water application. The SCWPF are derived by employing soil-vegetation-atmosphere-transfer models together with optimal irrigation scheduling and control algorithms (Schütze & Schmitz 2010; Grundmann et al. 2012). The objective function of the management problem combines two contradicting objectives of agricultural profit of the farmers and sustainability of the groundwater system.
DEVELOPMENT OF SURROGATE MODELS FOR COASTAL AQUIFER MANAGEMENT
Study area and problem description
An increasing trend of groundwater extraction and a large drawdown of the water table as well as saltwater intrusion have been evident in the region since the 1960s, although the amount of the total extraction was lower at the beginning (Stranger 1985). The extraction rate in the Barka region was reported to be 30 Mm3/year in 1970 which increased to about 120 Mm3/year in the recent past (Al-Shoukri 2008). The coastal aquifer system is mainly replenished by rainfall in the southern hard rock mountains. Rainfall characteristics are highly variable between the years. The mean annual precipitation is about 250 mm/year in the upper mountains and 50 mm/year along the coastline. Therefore, direct recharge of the coastal aquifer from rainfall is negligible. An average lateral recharge rate from the mountain range into the coastal aquifer was estimated to be 68 Mm3/year (Walther et al. 2012). Since the groundwater extraction exceeded the recharge rate, an inversion of the groundwater's natural flow gradient occurred together with an ongoing process of saltwater intrusion into the aquifer. As a result, the water for irrigation became saline which led to economic losses for the agricultural sector and many abandoned farms near the coastline.
Numerical groundwater model setup
Table 2 shows the parameter values of the OGS model used in this study. For variable density flow simulation, the difference between the density of saltwater and the fresh water is assumed to be 26 kg/m3 and the permeability reduction factor (Goswami & Clement 2007) is taken to be 0.001.
Parameters . | Upper layer . | Lower layer . |
---|---|---|
Porosity | 0.20 | 0.15 |
Tortuosity | 1 | 1 |
Isotropic permeability | 1 × 10−10 m2 | 1 × 10−12 m2 |
Longitudinal dispersion length | 100 m | 100 m |
Transverse dispersion length | 50 m | 50 m |
Diffusion coefficient | 5 × 10−9 m2/s | 5 × 10−9 m2/s |
Solid density | 2 × 103 kg/m3 | 2 × 103 kg/m3 |
Viscosity | 1 × 10−3 kg/m/s | 1 × 10−3 kg/m/s |
Parameters . | Upper layer . | Lower layer . |
---|---|---|
Porosity | 0.20 | 0.15 |
Tortuosity | 1 | 1 |
Isotropic permeability | 1 × 10−10 m2 | 1 × 10−12 m2 |
Longitudinal dispersion length | 100 m | 100 m |
Transverse dispersion length | 50 m | 50 m |
Diffusion coefficient | 5 × 10−9 m2/s | 5 × 10−9 m2/s |
Solid density | 2 × 103 kg/m3 | 2 × 103 kg/m3 |
Viscosity | 1 × 10−3 kg/m/s | 1 × 10−3 kg/m/s |
The transient model with the aforementioned settings was run in three steps; i.e., (1) steady state simulation, (2) initial state simulation, and (3) future state simulation. In order to achieve the steady state condition, OGS was first run for 300 years with an inflow rate of 68 Mm3/year and an extraction rate of 36.9 Mm3/year, as reported by Walther et al. (2012). The upper permeable layer reached the steady state condition within the first 50 years but the lower layer took almost 200 years. The initial state was derived by further running the model for another 40 years with a higher extraction rate and the same inflow rate as before. The extraction rate had to be kept high in order to ensure reasonable water table drawdown and progress of the seawater front that match the current state of the aquifer. The end point of the initial state simulation (i.e., water table heights and salinity after 40 years) was considered to be the starting point of the future state simulations, which were carried out to generate a scenario database for developing the surrogates. Water table heights and salinity values were extracted from the model simulation results at nine different model observation locations (MOLs) at 1.5, 2, 2.5, 2.8, 3.4, 4.3, 5.5, 12, and 16 km from the coastline (Figure 3). All inflow and extraction rates were divided by the third dimension of the model domain (30 km, see Walther et al. (2012)) to make them representative of the 2D model.
Generation of scenario database
The scenario database for developing the surrogate models was created by running future state simulations (see the section ‘Numerical groundwater model setup’) with OGS for 60-year periods with different combinations of inflow and extraction rates. In total, four different datasets were created for training, testing, validation, and extrapolation. ANNs were trained using the training set and some superior models were selected based on their performance in testing. The final ANN model was selected from the validation results. GPM was trained using the training set and was also run with the validation set. Finally, both the surrogates were run with the extrapolation set to investigate the model consistency for extreme values. Note that the term ‘extrapolation’ throughout this paper corresponds to any inflow or extraction rate that is outside its range used in training. It is not a temporal extrapolation.
For the training set preparation, five different extraction rates and five different inflow rates (in order to account for the variability in groundwater replenishment) were selected such that they include values from low to high ranges. Additionally, an adaptive pumping scheme was applied during the higher extraction rates by deactivating the sea-side pumps in order to reproduce farmers' adaptation on saline water intrusion. If five extraction rates were to be distributed over the 60-year period considering yearly time step, the total number of unique patterns would be 560. The same number of patterns could be generated for five inflow rates as well. If we consider both inflow and extraction patterns together, the number of total patterns would become 2560. Clearly, there was an obvious need to reduce the number of patterns. For that purpose, inflow and extraction rates were treated as fixed pairs, meaning, the first extraction pattern would always concur with the first inflow pattern, the second extraction pattern with the second inflow pattern, and so on. This reduced the total number of patterns from 2560 to 560. These patterns were then applied to the first 12 years of the 60-year period. In order to further reduce the size of the data, we considered a 3-year time step for the management problem, which essentially implies that the inflow and the extraction rates are constant for 3 successive years. This was a valid assumption in our case mainly for three reasons: (1) we were interested in long-term planning and management, (2) constant pumping rates can arise from water quota systems, (3) since inflow rates depend on the longer time scales, the 3-year duration follows the dynamics of the groundwater recharge more closely. These 12-year patterns with 3-year time step were then repeated four more times to extend the entire management period of 60 years. This technique reduced the number of patterns drastically to 54 = 625. Data from OGS were extracted at a fixed interval of 3 years, thereby creating 20 points for each 60-year pattern. Thus, the final size of the training data became 625 × 20 = 12,500. Similar methodology was applied for generating the other three datasets as well. For testing and validation sets, four different inflow and four different extraction rates were chosen, and the size of each one of the datasets was 44 × 20 = 5,120. Furthermore, the inflow and the extraction rates for testing and validation were chosen such that they lie within their range used in training. The extrapolation set was created with two values of inflow and two values of extraction rates such that one value is lower than the lowest and the other value is higher than the highest used in training. Thereby, the size of the extrapolation set was 24 × 20 = 320. Different inflow and extraction rates used for the data preparation in this study are summarized in Table 3.
. | Inflow rates . | Extraction rates . |
---|---|---|
Training | 50, 60, 70, 80, 90 | 50, 150, 300, 400, 600 |
Testing | 55, 65, 75, 85 | 100, 200, 350, 500 |
Validation | 52, 68, 72, 88 | 100, 250, 450, 500 |
Extrapolation | 40, 100 | 40, 700 |
. | Inflow rates . | Extraction rates . |
---|---|---|
Training | 50, 60, 70, 80, 90 | 50, 150, 300, 400, 600 |
Testing | 55, 65, 75, 85 | 100, 200, 350, 500 |
Validation | 52, 68, 72, 88 | 100, 250, 450, 500 |
Extrapolation | 40, 100 | 40, 700 |
There were still repetitions in the datasets that needed to be filtered out. For example, the first extraction pattern for the 60-year management period in the training dataset would look like:
E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 |
E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 | E1 |
where each of the 20 square boxes represents a time period of 3 years and E1 denotes the first extraction rate out of the five rates selected during training. The second pattern looks like:
E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 |
E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 |
This time it has also included the second extraction rate E2. As can be seen, the first three values of the second pattern are redundant once the first pattern is already fed to the model. Removing the first three values would create a modified second pattern as:
E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 |
E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 | E1 | E1 | E1 | E2 |
This technique was applied to all the four datasets which in addition to reducing the number of data points, also helped in avoiding overfitting (Table 4).
. | With repetitions . | Without repetitions . |
---|---|---|
Training | 12,500 | 10,780 |
Testing | 5,120 | 4,436 |
Validation | 5,120 | 4,436 |
Extrapolation | 320 | 286 |
. | With repetitions . | Without repetitions . |
---|---|---|
Training | 12,500 | 10,780 |
Testing | 5,120 | 4,436 |
Validation | 5,120 | 4,436 |
Extrapolation | 320 | 286 |
OGS had to be run once for each individual pattern. The training data had 625 patterns, each of the testing and validation data had 256 patterns, and the extrapolation data had 16 patterns, which produced a total number of 625 + 256 + 256 + 16 = 1,153 patterns. Therefore, there were 1,153 OGS runs which were distributed over 14 dual core computers, each running two OGS programs simultaneously. The entire data preparation process took around 16 hours.
Datasets for the surrogates were generated from the OGS model based on different inflow–extraction scenarios for a management period of 60 years. Water table heights were extracted from all nine MOLs whereas only the first seven MOLs were considered for the salinity because of the fact that salinity was negligible at the last two MOLs. One inflow rate, one extraction rate, nine water table heights, and seven salinity values from any given time step were considered to be the inputs for that time step whereas nine water table heights and seven salinity values from the next time step were used as outputs. Table 5 shows the first three rows of any dataset with superscripts indicating the time steps. As is seen, for each time step there are 18 inputs from that time step and 16 outputs from the next time step. Training based on the pattern of the data shown in Table 5 makes the surrogates suitable for variable-time simulations.
Time step . | Inputs . | Targets . | ||||
---|---|---|---|---|---|---|
1 | Inflow1 | Extraction1 | 9 WT1 | 7 Sal1 | 9 WT2 | 7 Sal2 |
2 | Inflow2 | Extraction2 | 9 WT2 | 7 Sal2 | 9 WT3 | 7 Sal3 |
3 | Inflow3 | Extraction3 | 9 WT3 | 7 Sal3 | 9 WT4 | 7 Sal4 |
Time step . | Inputs . | Targets . | ||||
---|---|---|---|---|---|---|
1 | Inflow1 | Extraction1 | 9 WT1 | 7 Sal1 | 9 WT2 | 7 Sal2 |
2 | Inflow2 | Extraction2 | 9 WT2 | 7 Sal2 | 9 WT3 | 7 Sal3 |
3 | Inflow3 | Extraction3 | 9 WT3 | 7 Sal3 | 9 WT4 | 7 Sal4 |
ANN development
GPM development
A total of 16 single-output GPMs were trained in this study for 16 different outputs and they were combined together to build a unified modeling framework. As for the GPM configuration, linear mean function and composite covariance function (linear and squared exponential function forms) were used. The fully independent training conditional (Naish-Guzman & Holden 2007) approximation scheme with 500 pseudo inputs was applied as a sparse GP technique. The GPML (Gaussian Processes for Machine Learning) Toolbox (Rasmussen & Williams 2006) was used for the implementation of GPM.
Setup of water management system application
For demonstrating the ability of the surrogate models to emulate the behavior of a physically based numerical groundwater model, an application of the surrogate models within the frame of a water management system for coastal regions is presented as an example (see Figure 1, Step 3). The long-term evaluation of the environmental and economic development over 60 years in a hypothetical farm located in the vicinity of the sea at MOL 3, was investigated for a more profit oriented management scenario with w1 = 0.8 and w2 = 0.2 (see Equation (17)). For the sake of simplicity, cultivation of two different crops growing in two seasons per year was considered: (1) maize, a salt-sensitive cash crop with a market price of $175/t, and (2) sorghum, a lower priced but more salt-resistant crop with a market price of $100/t. Additionally, regional specific cost models for the irrigated agriculture and the groundwater abstraction were incorporated (Malik & Al-Zubeidi 2006). The three decision variables considered in this study were the cropping pattern, groundwater extraction rate, and cultivated area, which would produce a total number of 3 × 60 = 180 variables if they were to be applied to a 60-year period with yearly time stepping. However, since a 3-year time step was established throughout the course of this study, the total number of variables reduced to 3 × 20 = 60. In order to keep the optimization problem manageable for this example application, it was assumed that the variables were constant for each set of non-overlapping 12 consecutive years (i.e., four time steps). Thus, the final number of decision variables reduced to 60/4 = 15. The CMA-ES algorithm needed 2,914 iterations and a population size of 12 to solve the optimization problem.
RESULTS
Generation of surrogate models
The average validation error statistics for all 16 outputs of the final model are given in Table 6. The performance of ANN was also investigated for the extrapolation dataset in order to check the model consistency for extreme values. As seen in Table 6, the model performance was quite satisfactory even in the extrapolation range.
. | NRMSE . | NQ95 . | NSE . | R . | NMBE . | TS25 . | TS100 . |
---|---|---|---|---|---|---|---|
Validation | 0.0159 | 0.0258 | 0.9784 | 0.9974 | −0.7772 | 65.34 | 85.23 |
Extrapolation | 0.0762 | 0.1516 | 0.6941 | 0.9993 | 0.5339 | 38.83 | 67.19 |
. | NRMSE . | NQ95 . | NSE . | R . | NMBE . | TS25 . | TS100 . |
---|---|---|---|---|---|---|---|
Validation | 0.0159 | 0.0258 | 0.9784 | 0.9974 | −0.7772 | 65.34 | 85.23 |
Extrapolation | 0.0762 | 0.1516 | 0.6941 | 0.9993 | 0.5339 | 38.83 | 67.19 |
The average validation and extrapolation error statistics of the combined GPM framework are given in Table 7. Similar to ANN, the performance of GPM in the extrapolation also turns out to be quite satisfactory. Both the surrogates show comparable performance as captured by the performance metrics. They both have very low values of NRMSE, NQ95, and NMBE, and very high values of NSE, R, and threshold statistics. Performance of GPM is relatively better, particularly in the extrapolation range.
. | NRMSE . | NQ95 . | NSE . | R . | NMBE . | TS25 . | TS100 . |
---|---|---|---|---|---|---|---|
Validation | 0.0136 | 0.0244 | 0.9865 | 0.9969 | −0.1254 | 74.18 | 90.19 |
Extrapolation | 0.0064 | 0.0110 | 0.9977 | 0.9993 | −0.3288 | 79.31 | 90.78 |
. | NRMSE . | NQ95 . | NSE . | R . | NMBE . | TS25 . | TS100 . |
---|---|---|---|---|---|---|---|
Validation | 0.0136 | 0.0244 | 0.9865 | 0.9969 | −0.1254 | 74.18 | 90.19 |
Extrapolation | 0.0064 | 0.0110 | 0.9977 | 0.9993 | −0.3288 | 79.31 | 90.78 |
Behavior of surrogate models for interpolation
Behavior of surrogate models at their boundaries
As seen in Figure 6, both ANN and GPM perform quite satisfactorily and there is a good agreement in their outputs. They follow almost a similar temporal pattern while applied to the model domain. For the low extraction case, the water table drawdown is quite small and it always stays above the MSL whereas the salinity goes up to almost 35 dS/m. Water table drawdown is predominant for high extraction rate where it goes down to 2.5 m below MSL. The salinity value in this case also reaches its maximum at the end of the simulation period.
Although GPM showed comparable accuracy as compared to ANN, the computational time was not improved much. GPM was around five to six times faster than OGS for this particular application whereas ANN was more than 10,000 times faster. While these results substantiate the successful use of ANN as a surrogate model, they also indicate that GPM has the potential to become a prominent candidate for surrogate modeling if the computational time can be reduced through future research.
Application of the water management system
Cropping pattern (fraction of maize) . | Extraction rates (Mm3/year) . | Cultivated area (hectare) . |
---|---|---|
1 | 50 | 4.85 |
1 | 274 | 26.57 |
1 | 106 | 10.25 |
1 | 139 | 13.51 |
1 | 50 | 4.85 |
Cropping pattern (fraction of maize) . | Extraction rates (Mm3/year) . | Cultivated area (hectare) . |
---|---|---|
1 | 50 | 4.85 |
1 | 274 | 26.57 |
1 | 106 | 10.25 |
1 | 139 | 13.51 |
1 | 50 | 4.85 |
DISCUSSION
The training scheme applied in this study produces time-dependent surrogate models which do not require any additional training if the simulation time is altered (see Figure 5). Optimization with varying planning horizon is rather very common in several groundwater management applications such as remediation or restoration of coastal aquifers. A variable period affects the pumping rates and the removal efficiency of the remediation systems (Singh & Chakrabarty 2011; Peralta & Kalwij 2012). Thus, time is an important decision variable, or it can be included as an additional objective when trade-offs between costs and time are of interest. In all those applications, surrogate modeling approaches assuming fixed time horizons are not that useful because for each model run, a new training of the surrogate becomes compulsory. Moreover, in cases where the objective function is not fully defined at the beginning and needs to be refined during the investigation for a satisfying solution, the usefulness of variable-time surrogate modeling and decoupling of the model and the objective function is quite obvious. The modeler can use a single trained model to carry out this interactive exploration and also several other tasks such as sensitivity analyses, uncertainty analyses at different time scales, comparing long- and short-term management decisions, etc., without carrying out a new training procedure.
Therefore, our goals in this investigation are to understand: (1) how closely the state-space surrogates follow the results calculated by the numerical groundwater model; (2) how well the surrogates behave in extrapolation which corresponds to some extreme boundary conditions not considered during training; (3) how similar the optimal management decisions are when trained surrogates are used within the optimization routine instead of the actual numerical groundwater model; and (4) how computationally efficient the surrogates are as compared to the numerical groundwater model. Both the surrogates show comparable accuracy in validation and extrapolation (Tables 6 and 7). The NRMSE, NQ95, and NMBE statistics are all quite low whereas the values of NSE, R, and threshold statistics are on the higher range for both the models, which collectively indicate superior model performance. GPM also tends to perform better in the extrapolation range. During interpolation (Figure 5), GPM follows OGS more closely for salinity at MOL 1. For water table heights at MOL 4, the performance of ANN is quite satisfactory for the first 60 years, after which it shows some underestimations. Overall, the results are quite similar and they also substantiate a successful application of the variable-time simulation framework since the surrogates in this case were trained for 60 years but were run for 84 years. Comparison of the surrogates at the boundaries of their training space (Figure 7) also shows good agreement in the outcomes. Both the surrogates approximate water table heights closely for the low as well as high extraction rates. In the case of low extraction, GPM imitates the salinity profile more closely, but when the extraction rate is kept high, it underestimates the salt concentration near the sea-side pumps.
Although GPM has shown high accuracy in approximating the OGS model, it still suffers from its relatively higher computational time requirement. It did perform faster than OGS, however, it was considerably slower as compared to ANN. Improving the GPM speed should be regarded as an avenue for future research. Attempts should be made through a more rigorous training with increased number of inflow and extraction rates to solve the issue with ANN in Figure 5. It is worth mentioning that ANN in this case runs out of the valid state-space of the training dataset, which could be one of the reasons for such an abnormal behavior. One solution approach that might work would be to train the ANNs for the longest time period and then carry out variable-time simulations within that temporal range. Another probable reason behind the behavior of ANN in Figure 5 could be the implementation of the adaptive pumping scheme, because there is a chance that it might have introduced discontinuities in the ANN. We address this particular issue more elaborately in our ongoing research.
SUMMARY AND CONCLUSIONS
In this study, we present a simulation-optimization framework for solving coupled groundwater-agriculture management problems in the arid coastal aquifer of Oman which is infected with saline water as a result of uncontrolled pumping over the last few decades. The framework couples a groundwater model with an optimization routine to estimate the optimal values of the decision variables of the management problem. This process usually involves thousands of runs of the computationally expensive groundwater model which eventually leads to a significantly high overall computational time requirement. To overcome this issue, faster surrogate models are used as alternatives to the numerical groundwater model. In this study, we replace the high-fidelity numerical groundwater model OGS with two state-space surrogate models (ANN and GPM) in a water management framework for the coastal aquifer of Oman. The surrogates are trained using the scenario database generated from the physically based OGS model considering 2D transient density-dependent groundwater flow and salt water transport processes in a two-layer aquifer system. Keeping ANN as the main candidate for the surrogate modeling, we further analyze the comparative benefits and drawbacks of GPM. We then demonstrate an example of a long-term (60-year) management problem and analyze the agricultural profit as well as aquifer sustainability through the application of our methodology.
Although both ANN and GPM showed comparable accuracy in approximating the response of OGS, the main drawback of GPM was its relatively higher computational time requirement for the particular problem presented in this study. It did perform faster than the high-fidelity numerical model; however, the computational speed needs to be improved further, which should be regarded as an avenue for future research. ANN, on the other hand, approximated the numerical model quite closely and was also significantly fast. The training scheme applied in this study made the surrogates suitable for variable-time simulations. This kind of a training scheme is highly beneficial for various applications where simulation period needs to be altered frequently; i.e., for exploring optimal water management strategies on different time horizons (Grundmann et al. 2012). The applied multi-objective water management framework facilitates the evaluation of different management scenarios by changing the weights on profit and sustainability terms in the objective function. Furthermore, it also allows the exploration of the entire trade-off between the contradicting objectives within a multi-criteria optimization (Grundmann et al. 2013) in a reasonable computational time.
ACKNOWLEDGEMENTS
The first author of this manuscript was supported by the DAAD Master's Scholarship. The manuscript was prepared within the research project IWAS funded by the German Federal Ministry of Education and Research (BMBF) under grant no. 02WM1028. We would like to thank the Ministry of Regional Municipalities and Water Resources of the Sultanate of Oman for supporting the IWAS-IWRM project. We thank Marc Walther (TU Dresden/UFZ Leipzig) for his help with the OGS model.