Abstract
A Soil and Water Assessment Tool (SWAT) model was used to model streamflow in a tropical humid basin in the Cerrado biome, southeastern Brazil. This study was undertaken in the Upper São Francisco River basin, because this basin requires effective management of water resources in drought and high-flow periods. The SWAT model was calibrated for the period of 1978–1998 and validated for 1999–2007. To assess the model calibration and uncertainty, four indices were used: (a) coefficient of determination (R2); (b) Nash–Sutcliffe efficiency (NS); (c) p-factor, the percentage of data bracketed by the 95% prediction uncertainty (95PPU); and (d) r-factor, the ratio of average thickness of the 95PPU band to the standard deviation of the corresponding measured variable. In this paper, average monthly streamflow from three gauges (Porto das Andorinhas, Pari and Ponte da Taquara) were used. The results indicated that the R2 values were 0.73, 0.80 and 0.76 and that the NS values were 0.68, 0.79 and 0.73, respectively, during the calibration. The validation also indicated an acceptable performance with R2 = 0.80, 0.76, 0.60 and NS = 0.61, 0.64 and 0.58, respectively. This study demonstrates that the SWAT model provides a satisfactory tool to assess basin streamflow and management in Brazil.
INTRODUCTION
Brazil, along with many other countries, has been severely and repeatedly affected by droughts in terms of extent, intensity and economic impact (Silva et al. 2013). Brazil is a country of continental scale and has a broad range of environments (Amazônia, Mata Atlântica, Pantanal, Cerrado (Brazilian savanna), Caatinga and Pampa) (IBGE 2016). However, in Brazil few studies for estimating streamflow have been performed on the Cerrado biome, with most attention dedicated to the other biomes, despite the Cerrado being the second most extensive biome in South America (Sano et al. 2010), and one of the world's biodiversity hotspots (Grecchi et al. 2014). The Cerrado biome covers approximately 204 million ha, which corresponds to 24% of Brazil (Medrado & Lima 2014), and 57% of the Upper São Francisco River basin. Since the 1970s, the Cerrado biome has suffered loss of vegetation due to agriculture and pasture expansion (Silva et al. 2006), although understanding of these land cover changes and their impacts on streamflow in the Cerrado biome is limited (Beuchle et al. 2015). In contrast, the Amazônia, Mata Atlântica and Pantanal biomes have been the focus of significant monitoring since the 1980s. Only recently have some studies focused on assessing hydrologic modelling in the southeastern region. These have been in small basins (Durães et al. 2011; Pinto et al. 2013; Pereira et al. 2014, Pereira 2016a), which do not represent streamflows at the biome-scale and, particularly, impact on inflows to the Três Marias reservoir. Improving the knowledge base on hydrological processes in the Cerrado biome is therefore important for the national water resources management.
Mapping of land-cover change and hydrologic modelling are important tools for water resource planning and management in river basins in a changing climate (Beskow et al. 2009), particularly in basins with large rainfall variability and different land uses, such as the Upper São Francisco River basin. This basin of 48,679 km2 drains the main watercourse of Minas Gerais State and is located in the southeastern region of Brazil, characterized by a tropical humid climate. The Upper São Francisco River basin is a strategic area in terms of water resources and includes the Três Marias reservoir, with a capacity of 20 billion m3 that plays an important role in regulating river discharge, and is used for multiple purposes: (a) hydropower generation; (b) navigation; and (c) water supply for population, industries and irrigation projects (Fan et al. 2014). Furthermore, this river basin is of the utmost importance for regional development, supporting diverse agricultural and industrial enterprises (Versiani et al. 2009). The basin has a particularity: a non-intensive, but constant, cattle farming, which occurs largely in the Cerrado biome, in addition to the intensive agriculture, which influences the surface soil characteristics and consequently influences the hydrological regime of the basin. There are approximately 300,000 ha of irrigated land in the region and fruit production for export (grapes and mangoes, among others), dry-land farming, cattle-raising and mining are among the main economic activities (CEMIG 2015). This land use, together with intensive agriculture, impacts surface soil characteristics and influences the hydrological regime of the basin.
Recently the southeastern region of Brazil has experienced significant rainfall deficits, and consequently, recurring droughts in the Upper São Francisco River basin (Coelho et al. 2016), e.g., 1998, 1999, 2001 and 2012. In these years, the region suffered from severe water shortages leading to depleted water levels in reservoirs such as Três Marias, which had, in turn, a direct impact on hydropower energy generation and water availability for public consumption and agriculture. Water scarcity refers to the relative water shortage of the supply system, which can be caused by climate or by human actions due to demand exceeding supply – something that can also be linked to population growth and higher living standards (Chen & Li 2016). Quantification of current and future water availability is an important aspect of water management in order to ensure food security, human health, environmental health and economic development.
Modelling techniques have been developed to quantify water scarcity and availability (Chen & Li 2016), and hydrological models can be used for simulating past, present and future land-use change impact at basin scales based on different scenarios (Li et al. 2015). The Soil and Water Assessment Tool (SWAT) model has been used in a wide range of studies across Brazil (Durães et al. 2011), but there has been limited testing in Brazilian savannah conditions. Lima et al. (2014) also note that further development of the SWAT soil parameter database to support a greater range of Brazilian physical characteristics was needed, and especially for the Cerrado. Therefore, the aims of this paper are to firstly develop a consistent soil parameter database and secondly to assess performance of a model in simulating streamflow in the Upper São Francisco River basin of the Cerrado biome, using the SWAT model.
MATERIALS AND METHODS
Study area
The research was conducted in the Upper São Francisco River basin (48,679 km2), located in the central region of Minas Gerais state, in southeastern Brazil, between latitudes 21°10′S and 18°10′S, and longitudes 46°40′W and 43°40′W (Figure 1). Local climate types according to the Köppen classification (Alvares et al. 2013) include Cwa (with dry winter and hot summer), Cwb (with dry winter and temperature summer) and Aw (tropical zone with dry winter), with a well defined rainy season between December and March and average annual rainfall ranges from 800 to 1,200 mm (Rao et al. 2016).
Geographical location of the São Francisco River basin and Upper São Francisco sub-basin, Brazil.
Geographical location of the São Francisco River basin and Upper São Francisco sub-basin, Brazil.
The local biome is Cerrado, which is a seasonal ecosystem characterized by distinct dry and wet seasons. The typical vegetation of this biome is tall and dense evergreen gallery forests, ranging from closed to open canopy deciduous and semi-deciduous forest with a maximum height of 15 m (Klink & Machado 2005). The main land uses in the Upper São Francisco River basin have been classified as cerrado and agriculture.
Datasets and input data
The rainfall database was made up of historical daily rainfall series from 12 rainfall stations. Three streamflow stations and three weather stations were also used (Table 1 and Figure 2). Rainfall and streamflow data were obtained from the National Water Resources Information System (SNIRH) (available at http://www.ana.gov.br/portalsnirh). Observed meteorological data (daily values of total rainfall, mean wind speed, maximum and minimum temperatures, mean relative humidity and solar radiation) measured from 2 m height from the three weather stations (Table 1 and Figure 2) were obtained from the Instituto Nacional de Meteorologia (INMET) (available at http://www.inmet.gov.br).
Geographic characteristics of the rainfall, streamflow and weather stations used in the study
Stations . | Type . | Latitude (degree) . | Longitude (degree) . | Elevation (m) . | Period of data . |
---|---|---|---|---|---|
Araxá | Rainfall | −19.60 | −46.94 | 1,023 | 1978–2007 |
Bambuí | Rainfall | −20.03 | −45.00 | 661 | 1978–2007 |
Barbacena | Rainfall | −21.25 | −43.76 | 1,126 | 1978–2007 |
Belo Horizonte | Rainfall | −19.93 | −43.93 | 915 | 1978–2007 |
Florestal | Rainfall | −19.88 | −44.41 | 760 | 1978–2007 |
Ibirité | Rainfall | −20.01 | −44.05 | 814 | 1978–2007 |
João Pinheiro | Rainfall | −17.73 | −46.17 | 760 | 1978–2007 |
Patos de Minas | Rainfall | −18.51 | −46.43 | 940 | 1978–2007 |
Pirapora | Rainfall | −17.35 | −44.91 | 505 | 1978–2007 |
Pompeu | Rainfall | −19.21 | −45.00 | 690 | 1978–2007 |
São Sebastião do Paraíso | Rainfall | −20.91 | −47.11 | 820 | 1978–2007 |
Sete Lagoas | Rainfall | −19.46 | −44.25 | 732 | 1978–2007 |
Porto das Andorinhas | Streamflow | −19.28 | −45.29 | 532 | 1978–2007 |
Pari | Streamflow | −20.18 | −44.89 | 710 | 1978–2007 |
Ponte da Taquara | Streamflow | −19.42 | −44.55 | 627 | 1978–2007 |
Bambuí | Weather | −20.03 | −45.00 | 661 | 1978–2007 |
Araxá | Weather | −19.60 | −46.94 | 1,023 | 1978–2007 |
São Sebastião do Paraíso | Weather | −20.91 | −47.11 | 820 | 1978–2007 |
Stations . | Type . | Latitude (degree) . | Longitude (degree) . | Elevation (m) . | Period of data . |
---|---|---|---|---|---|
Araxá | Rainfall | −19.60 | −46.94 | 1,023 | 1978–2007 |
Bambuí | Rainfall | −20.03 | −45.00 | 661 | 1978–2007 |
Barbacena | Rainfall | −21.25 | −43.76 | 1,126 | 1978–2007 |
Belo Horizonte | Rainfall | −19.93 | −43.93 | 915 | 1978–2007 |
Florestal | Rainfall | −19.88 | −44.41 | 760 | 1978–2007 |
Ibirité | Rainfall | −20.01 | −44.05 | 814 | 1978–2007 |
João Pinheiro | Rainfall | −17.73 | −46.17 | 760 | 1978–2007 |
Patos de Minas | Rainfall | −18.51 | −46.43 | 940 | 1978–2007 |
Pirapora | Rainfall | −17.35 | −44.91 | 505 | 1978–2007 |
Pompeu | Rainfall | −19.21 | −45.00 | 690 | 1978–2007 |
São Sebastião do Paraíso | Rainfall | −20.91 | −47.11 | 820 | 1978–2007 |
Sete Lagoas | Rainfall | −19.46 | −44.25 | 732 | 1978–2007 |
Porto das Andorinhas | Streamflow | −19.28 | −45.29 | 532 | 1978–2007 |
Pari | Streamflow | −20.18 | −44.89 | 710 | 1978–2007 |
Ponte da Taquara | Streamflow | −19.42 | −44.55 | 627 | 1978–2007 |
Bambuí | Weather | −20.03 | −45.00 | 661 | 1978–2007 |
Araxá | Weather | −19.60 | −46.94 | 1,023 | 1978–2007 |
São Sebastião do Paraíso | Weather | −20.91 | −47.11 | 820 | 1978–2007 |
Upper São Francisco River basin with its sub-basins, and rainfall and streamflow stations.
Upper São Francisco River basin with its sub-basins, and rainfall and streamflow stations.
Figure 3(a) shows the land use and cover map derived from satellite imagery, captured by the TM/Landsat 5 satellite (orbit/point 218/74), with 30 × 30 m spatial resolution, acquired on September 21, 2003, accessible at the Instituto Nacional de Pesquisas Espaciais – INPE (available at: http://www.inpe.br). The images were classified using the K-means unsupervised digital image classification method (Santana et al. 2014), with field verification to identify and confirm land use. The identified land uses were: (a) cerrado; (b) agriculture; (c) pasture; (d) bare soil; (e) urban area; and (f) water bodies. Reclassification of the land-use map was undertaken and the respective model parameters were selected from the SWAT database corresponding to the specific land-use and cover (LUC) types. The areas of each LUC classification system are shown in Table 2.
LUC and soil classes for the Upper São Francisco River basin
SWAT class . | Description . | Type . | Area (km2) . | % of Area . |
---|---|---|---|---|
RNGB | Cerrado | LUC | 26,128 | 53.67 |
AGRL | Agriculture | LUC | 11,452 | 23.53 |
PAST | Pasture | LUC | 7 848 | 16.12 |
WATR | Water | LUC | 1 519 | 3.12 |
BARR | Bare soil | LUC | 1 167 | 2.40 |
URBN | Urban area | LUC | 565 | 1.16 |
Total | − | − | 48,679 | 100.00 |
CAMB | Cambisol | Soil | 21,046 | 43.23 |
RLAT | Red Latosol | Soil | 11,327 | 23.27 |
RYAC | Red-yellow Acrisol | Soil | 7 523 | 15.45 |
RYLA | Red-yellow Latosol | Soil | 6 099 | 12.53 |
WATR | Water | 1 519 | 3.12 | |
RACR | Red Acrisol | Soil | 1 009 | 2.07 |
LITH | Litholic | Soil | 156 | 0.32 |
Total | − | − | 48,679 | 100.00 |
SWAT class . | Description . | Type . | Area (km2) . | % of Area . |
---|---|---|---|---|
RNGB | Cerrado | LUC | 26,128 | 53.67 |
AGRL | Agriculture | LUC | 11,452 | 23.53 |
PAST | Pasture | LUC | 7 848 | 16.12 |
WATR | Water | LUC | 1 519 | 3.12 |
BARR | Bare soil | LUC | 1 167 | 2.40 |
URBN | Urban area | LUC | 565 | 1.16 |
Total | − | − | 48,679 | 100.00 |
CAMB | Cambisol | Soil | 21,046 | 43.23 |
RLAT | Red Latosol | Soil | 11,327 | 23.27 |
RYAC | Red-yellow Acrisol | Soil | 7 523 | 15.45 |
RYLA | Red-yellow Latosol | Soil | 6 099 | 12.53 |
WATR | Water | 1 519 | 3.12 | |
RACR | Red Acrisol | Soil | 1 009 | 2.07 |
LITH | Litholic | Soil | 156 | 0.32 |
Total | − | − | 48,679 | 100.00 |
(a) LUC map, (b) digital elevation model, (c) soil map, and (d) hydrologic response units (HRUs), of the study area.
(a) LUC map, (b) digital elevation model, (c) soil map, and (d) hydrologic response units (HRUs), of the study area.
Figure 3(b) presents a 30 m resolution digital elevation model (DEM) of the Upper São Francisco River basin, obtained from the INPE (available at: http://www.topodata.inpe.br). The EMBRAPA soil map at a scale of 1:250,000 (Figure 3(c)) (available at: http://mapoteca.cnps.embrapa.br) was applied and the soil data properties obtained from the EMBRAPA soil properties database (EMBRAPA 2015) were used. The soil properties data (soil texture, available water content, hydraulic conductivity, bulk density and organic carbon content) for different layers of each soil type were collected and inserted in a database of SWAT model. The major soil groups in the basin are Cambisol (43%) and Red Latosol (23%).
From the natural river network, the topography of the basin and the distribution of rainfall stations, the study area was divided into 22 sub-basins and 180 HRUs (Figure 3(d)) for the purposes of routing water and sediment (Santos et al. 2011). SWAT divides a basin into sub-basins, each of which is linked through a stream channel and further discretized into hydrologic response units (HRUs), that enables the model to reflect the hydrologic conditions for different slope, land cover and soil types (Patel & Srivastava 2013). A HRU is a unique combination of soil and vegetation type in a sub-basin, and the model simulates hydrological processes at the HRU level and aggregates these results to the catchment scale by applying a weighted average to the HRU results (Devkota & Gyawali 2015). In defining the HRUs, the minor land use and cover, soil types and slope were ignored by setting a threshold of 15% to avoid an unnecessarily large number of HRUs (Neitsch et al. 2005). The drainage areas accounting for each one of the three runoff gauges used in the simulation are: (a) Porto Andorinhas = 13,005 km2; (b) Pari = 4,521 km2; and (c) Ponte da Taquara = 7,179 km2.
Application of the SWAT model and automatic calibration
The SWAT model has more than 40 calibration parameters and effective calibration begins by developing a proper mechanism for reducing the number of parameters to be calibrated (Xu et al. 2009). Parameter values that were not derived from measurements or the literature were assigned based on either automated or manual calibration. In the initial phase, parameter sensitivity was evaluated combining the methods One-factor-At-a-Time (OAT) and Latin Hypercube (LH) (van Griensven et al. 2006), identifying and classifying the parameters that have the most significant impact on model outputs (Panagopoulos et al. 2011; Wu & Chen 2015). Subsequently, an automatic calibration based on the sensitivity analysis of model parameters called SWAT-CUP (SWAT Calibration Uncertainty Programs) (Abbaspour et al. 2004), was used. In the SWAT-CUP, the SUFI-2 algorithm (Abbaspour 2007) is used to investigate the sensitivity and uncertainty in various SWAT parameters related to streamflow. SUFI-2 is a widely used algorithm that estimates the sensitivity and uncertainty of a hydrological model, and is powerful in communicating the reliability of model predictions (Narsimlu et al. 2015). Latin Hypercube sampling is carried out and the corresponding objective function was used to calculate the responsive parameter sensitivities (Yang et al. 2008). In order to estimate the level of significance of each parameter, the t-test and the p-values were used to provide a measure and the significance of the sensitivity, respectively.
For automatic calibration and uncertainty analysis, a total of 19 SWAT model parameters with high calibration sensitivity for streamflow prediction were adjusted. The calibrated parameters and the ranges used are shown in Table 3, ranked from the most to least sensitive parameters. Once the model parameters were optimized for calibration, model validation was performed based on monthly streamflow data.
SWAT parameters used for calibration and the higher ranks of the parameters
Parameters . | Parameter definition . | Ranks of the parameters . |
---|---|---|
GW_REVAP | Revap coefficient | 1 |
GW_DELAY | Groundwater delay time | 2 |
GWQMN | Threshold depth of water in the shallow aquifer required for return flow to occur | 3 |
SOL_AWC | Soil available water storage capacity | 4 |
SOL_Z | Depth from soil surface to bottom of layer | 5 |
CN2 | Curve number for moisture condition II | 6 |
ALPHA_BF | Base flow alpha factor | 7 |
ESCO | Soil evaporation compensation factor | 8 |
CH_N2 | Manning's n value for the main channel | 9 |
CH_K2 | Main channel conductivity | 10 |
SLSUBBSN | Average slope length | 11 |
CANMX | Maximum canopy storage | 12 |
SOL_ALB | Moist soil albedo | 13 |
SOL_K | Soil hydraulic conductivity | 14 |
RCHRG_DP | Deep aquifer percolation fraction | 15 |
SURLAG | Surface runoff lag coefficient | 16 |
BIOMIX | Biological mixing efficiency | 17 |
EPCO | Plant uptake compensation factor | 18 |
REVAPMN | Threshold water in shallow aquifer | 19 |
Parameters . | Parameter definition . | Ranks of the parameters . |
---|---|---|
GW_REVAP | Revap coefficient | 1 |
GW_DELAY | Groundwater delay time | 2 |
GWQMN | Threshold depth of water in the shallow aquifer required for return flow to occur | 3 |
SOL_AWC | Soil available water storage capacity | 4 |
SOL_Z | Depth from soil surface to bottom of layer | 5 |
CN2 | Curve number for moisture condition II | 6 |
ALPHA_BF | Base flow alpha factor | 7 |
ESCO | Soil evaporation compensation factor | 8 |
CH_N2 | Manning's n value for the main channel | 9 |
CH_K2 | Main channel conductivity | 10 |
SLSUBBSN | Average slope length | 11 |
CANMX | Maximum canopy storage | 12 |
SOL_ALB | Moist soil albedo | 13 |
SOL_K | Soil hydraulic conductivity | 14 |
RCHRG_DP | Deep aquifer percolation fraction | 15 |
SURLAG | Surface runoff lag coefficient | 16 |
BIOMIX | Biological mixing efficiency | 17 |
EPCO | Plant uptake compensation factor | 18 |
REVAPMN | Threshold water in shallow aquifer | 19 |
Calibration and validation
The SWAT model was applied for three flow stations (Porto das Andorinhas, Pari and Ponte da Taquara). Calibration and validation of SWAT, facilitated by the sensitivity analysis, were performed for streamflow on a monthly basis. Model calibration was performed for the period of January 1978 to December 1998 (i.e. 20 years of data for calibration), with three years of warm up (1975–1977). SWAT was validated for the Upper São Francisco by carrying out a Split Sample Test, where the model was calibrated for the period 1999 to 2006, and the optimized parameters were applied to the period of January 1999 to December 2007 (validation).


The Latin hypercube sampling method was employed for 95PPU and for obtaining the final cumulative distribution of the model outputs. These are calculated at the level of 2.5 and 97.5% prediction limit. At the start of calibration, SUFI-2 assumes a large parameter uncertainty and then decreases this uncertainty through the p-factor and the r-factor performance statistics. The range of the p-factor varies from 0 to 1, with values close to 1 indicating a very high model performance and efficiency, while the r-factor is the average width of the 95PPU band divided by the standard deviation of the measured variable and varies in the range 0–1 (Yang et al. 2008).


RESULTS AND DISCUSSION
Sensitivity analysis
Table 4 presents details of the parameters used for the sensitivity analysis and the most sensitive parameters. The results of the sensitivity analysis confirmed that all 19 sensitive parameters are applicable to surface runoff, groundwater, channel routing, and soil properties. Several SWAT parameters were set during the model calibration: SLSUBBSN, GWQMN, CN2, CH_K2, CH_N2, ALPHA_BF, ESCO, EPCO, REVAPMN, SOL_AWC, SOL_K, SOL_BD, GW_DELAY, GW_REVAP, and HRU_SLP. The larger absolute values are more sensitive than the lower ones, while a value closer to zero has more significance. In this study, the result of the global sensitivity analysis with the t-test suggests that the most sensitive parameters are GW_REVAP, GW_DELAY, GWQMN, SOL_AWC, SOL_Z, CN2, ALPHA_BF and ESCO. The results show that the most sensitive parameters are those representing the groundwater, base flow, surface runoff and soil properties, linked to basin rainfall seasonality, and diversity of lithology, landscapes, soil, and land use, all of which influence soil available water storage capacity.
Sensitive SWAT model parameters included in the final calibration, with their variations (initial and final), p-values and t-stat for the studied gauges
Parameters . | t-stat . | p-value . | Range . | Fitted values . | Method . | |||
---|---|---|---|---|---|---|---|---|
Min. value . | Max. value . | Porto das Andorinhas . | Pari . | Ponte de Taquara . | ||||
GW_REVAP | −46.39 | 0 | 0.02 | 0.2 | 0.18614 | 0.03926 | 0.15698 | Replace |
GW_DELAY | −11.59 | 0 | −30 | 60 | 45.33 | −5.25 | −1.83 | Added |
GWQMN | −10.40 | 0 | 0 | 1000 | 415 | 165 | 839 | Replace |
SOL_AWC | −8.633 | 0 | −0.25 | 0.25 | −0.0195 | −0.1625 | 0.1585 | Multiply |
SOL_Z | −8.065 | 0 | −0.25 | 0.25 | 0.1565 | 0.1865 | 0.1585 | Multiply |
CN2 | 6.283 | 0 | −0.1 | 0.1 | 0.0354 | −0.0978 | −0.0106 | Multiply |
ALPHA_BF | 7.335 | 0 | 0 | 1 | 0.023 | 0.671 | 0.005 | Replace |
ESCO | 23.34 | 0 | 0.5 | 1 | 0.5435 | 0.9985 | 0.6865 | Replace |
CH_N2 | −2.837 | 0.004 | 0 | 0.3 | 0.1743 | 0.0051 | 0.2481 | Replace |
CH_K2 | −2.724 | 0.006 | 0 | 5 | 1.195 | 4.415 | 1.965 | Replace |
SLSUBBSN | −1.299 | 0.194 | −0.25 | 0.25 | 0.2045 | 0.0755 | −0.0665 | Multiply |
CANMX | 1.261 | 0.207 | 2 | 5 | 4.241 | 2.915 | 2.453 | Replace |
SOL_ALB | −0.919 | 0.358 | −0.25 | 0.25 | 0.0705 | 0.2135 | 0.1955 | Multiply |
SOL_K | −0.897 | 0.37 | −0.25 | 0.25 | 0.2085 | −0.2465 | 0.0575 | Multiply |
RCHRG_DP | 0.734 | 0.463 | −0.04 | 0.05 | 0.03929 | 0.02327 | −0.01669 | Multiply |
SURLAG | 0.732 | 0.464 | 0 | 24 | 1.896 | 11.304 | 16.632 | Replace |
BIOMIX | −0.659 | 0.509 | 0 | 1 | 0.893 | 0.139 | 0.573 | Replace |
EPCO | 0.431 | 0.666 | 0 | 1 | 0.577 | 0.839 | 0.229 | Replace |
REVAPMN | −0.221 | 0.824 | 0 | 10 | 2.11 | 2.65 | 5.51 | Replace |
Parameters . | t-stat . | p-value . | Range . | Fitted values . | Method . | |||
---|---|---|---|---|---|---|---|---|
Min. value . | Max. value . | Porto das Andorinhas . | Pari . | Ponte de Taquara . | ||||
GW_REVAP | −46.39 | 0 | 0.02 | 0.2 | 0.18614 | 0.03926 | 0.15698 | Replace |
GW_DELAY | −11.59 | 0 | −30 | 60 | 45.33 | −5.25 | −1.83 | Added |
GWQMN | −10.40 | 0 | 0 | 1000 | 415 | 165 | 839 | Replace |
SOL_AWC | −8.633 | 0 | −0.25 | 0.25 | −0.0195 | −0.1625 | 0.1585 | Multiply |
SOL_Z | −8.065 | 0 | −0.25 | 0.25 | 0.1565 | 0.1865 | 0.1585 | Multiply |
CN2 | 6.283 | 0 | −0.1 | 0.1 | 0.0354 | −0.0978 | −0.0106 | Multiply |
ALPHA_BF | 7.335 | 0 | 0 | 1 | 0.023 | 0.671 | 0.005 | Replace |
ESCO | 23.34 | 0 | 0.5 | 1 | 0.5435 | 0.9985 | 0.6865 | Replace |
CH_N2 | −2.837 | 0.004 | 0 | 0.3 | 0.1743 | 0.0051 | 0.2481 | Replace |
CH_K2 | −2.724 | 0.006 | 0 | 5 | 1.195 | 4.415 | 1.965 | Replace |
SLSUBBSN | −1.299 | 0.194 | −0.25 | 0.25 | 0.2045 | 0.0755 | −0.0665 | Multiply |
CANMX | 1.261 | 0.207 | 2 | 5 | 4.241 | 2.915 | 2.453 | Replace |
SOL_ALB | −0.919 | 0.358 | −0.25 | 0.25 | 0.0705 | 0.2135 | 0.1955 | Multiply |
SOL_K | −0.897 | 0.37 | −0.25 | 0.25 | 0.2085 | −0.2465 | 0.0575 | Multiply |
RCHRG_DP | 0.734 | 0.463 | −0.04 | 0.05 | 0.03929 | 0.02327 | −0.01669 | Multiply |
SURLAG | 0.732 | 0.464 | 0 | 24 | 1.896 | 11.304 | 16.632 | Replace |
BIOMIX | −0.659 | 0.509 | 0 | 1 | 0.893 | 0.139 | 0.573 | Replace |
EPCO | 0.431 | 0.666 | 0 | 1 | 0.577 | 0.839 | 0.229 | Replace |
REVAPMN | −0.221 | 0.824 | 0 | 10 | 2.11 | 2.65 | 5.51 | Replace |
SWAT calibration and validation
Figure 5 compares the measured and simulated monthly runoff values at the Porto das Andorinhas station for both the calibration and validation periods. The simulated and measured monthly streamflows at the Porto das Andorinhas stations are similar during most of the calibration period. The simulated streamflow follows a similar trend for both periods. Monthly simulation results are satisfactory and the model was able to track the seasonal trends very well. The results reflect the seasonal change and match the daily streamflow to a reasonable degree. However, the results indicate intermittent over-/underestimation. Most peaks in the summer (September, October and November) were significantly underestimated, while the mean monthly streamflow in March to May was obviously overestimated in most cases. The values of the streamflow reveal the greatest variation of magnitude in the monitored and simulated events, clearly indicating an imbalance in the infiltration process and formation of streamflow.
Simulated and observed monthly runoff and observed hyetograph at the Porto das Andorinhas station.
Simulated and observed monthly runoff and observed hyetograph at the Porto das Andorinhas station.
The hydrographs (Figure 5) in the validation period also highlighted this typical phenomenon. Flood peaks in summer were greatly overestimated. The largest differences were observed for months in which high rainfall totals were registered. In these cases, the model tended to overestimate the runoff and, as a consequence, to increase the simulated discharge peaks.
Figure 5 clearly shows that the model satisfactorily simulates the monthly runoff and that the hydrographs are in reasonable agreement with the rainfall pattern. The values of the Nash–Sutcliffe efficiency (NS equal to 0.68 for calibration and 0.61 for validation) and the coefficient of determination (R2 equal to 0.73 for calibration and 0.80 for validation) and the small volume difference between the observed and simulated values of 9% (87.64 m3s−1 for observed monthly runoff and 79.74 m3s−1 for simulated monthly runoff) show a good predictive capability of the model (according to Moriasi et al. 2007) at the Porto das Andorinhas station. Other studies using the SWAT model related NS values ranged from 0.3 to 0.9, depending on the drainage area of basin, the time interval of the simulation and the available database (Fukunaga et al. 2015).
Overall, the streamflow simulated by the SWAT model fits the observed well, albeit with some difficulties in simulating some streamflow peaks both in the calibration and validation periods (Figure 5). This same problem is related to the representation of the temporal runoff and was found by many authors in basins near to Upper São Francisco (Durães et al. 2011; Pinto et al. 2013; Pereira et al. 2014, 2016a).
Figure 6 shows the comparison between the observed and simulated streamflow for the calibration and validation periods at the Pari station. This figure shows that the simulated values are again in good agreement with the observed values for calibration and validation periods. This finding is justified by the high NS of 0.79 and R2 of 0.81 and the small volume difference of 8.6%. However, for the validation, the model overestimates the runoff (NS = 0.64 and R2 = 0.76). The results with the SWAT model showed that the baseflow component in the basin was not modeled well. This low model predictability is also due to: (a) changes in the land use, and climate variability in the basin, which were not able to be incorporated into the hydrologic simulations, as also shown by Rodrigues et al. (2015); (b) changes in the soil water infiltration and retention patterns; and (c) increase in extreme events which caused a volume reduction in the streamflow time series (Pruski et al. 2004).
Simulated and observed monthly runoff and observed hyetograph at the Pari station.
Simulated and observed monthly runoff and observed hyetograph at the Pari station.
The results show that the model had a base flow underestimation bias during the calibration period. Combined with the temporal rainfall variation, which is difficult to represent through a daily time step, this made the response of the SWAT model in simulating some peak streamflow difficult. Nevertheless, this is considered a reasonable monthly model performance, especially when compared with the values cited in the studies mentioned earlier.
Figure 7 presents the hydrograph of the calibration and validation periods for the Ponte da Taquara station. A close relationship between the simulated and observed monthly streamflow indicates a good performance of the model (Moriasi et al. 2007), with R2 values of 0.76 and 0.60, and NS values of 0.73 and 0.58 for calibration and validation, respectively. Again, the monthly simulation results are satisfactory and the model was also able to track the seasonal trends very well. Although the results reveal that the model satisfactorily represented the simulated flow, underestimations are noted in the recession limb of the hydrograph, possibly related to the high spatial variability of climatic data produced with the weather generator to replace missing values and also due to the influence of the aquifer systems, which are necessary for the model application. Such marked differences in the case of our study could be due to the location and low number of hydrometeorological stations used in relation to the size of the study area, as they would not have been able to record some local high-intensity rainfall events. Alternatively this could be the result of intra-annual variability of rainfall during the calibration years. Mello et al. (2008), applying the same model for sub-basins in Alto Rio Grande basin, found NS values above 0.7 in the calibration and validation phases. Rostamian et al. (2008) reported that a SWAT model is not designed to simulate such an extreme event and the model usually under predicts the largest flow events increasing the uncertainty concerning the calibration of extreme events.
Simulated and observed monthly streamflow and observed hyetograph at the Ponte da Taquara station.
Simulated and observed monthly streamflow and observed hyetograph at the Ponte da Taquara station.
The modeled hydrograph showed goodness of fit to observations, which means there was good estimation of discharge in that phase, observing model sensitivity to peak discharges. Thus, the model succeeded in capturing oscillations brought about by rainfall. The model slightly underestimated some low flows and the agreement between observed and simulated flows was stronger in the monthly time scale. The model also performed well for the validation period, and these results showed that the hydrological processes were reasonably represented, suggesting that this model could be used for water impact assessment, as also obtained for a daily scale by Durães et al. (2011) for Paraopeba basin (sub-basin of Upper São Francisco) and Pereira et al. (2016b) for Pomba River Basin, which are located in southeastern region of Brazil.
Rainfall distribution in SWAT is undertaken by associating each sub-basin with the nearest rainfall gauging station closest to its centroid, and because the sub-basin has only four rainfall gauging stations upstream from the control section in Ponte da Taquara, actual spatial distribution of rainfall may not have been representative.
Reasonably high values of NS and R2 for monthly streamflow in the Porto das Andorinhas and Pari stations indicate better prediction for both stations during the calibration period. Although an overestimation of monthly streamflow by the model was beyond the satisfactory level of acceptance, the model can be considered good considering the overall statistics. In general, the model accurately tracked the observed streamflow for the time periods, although some monthly runoff peaks were over-predicted during calibration and validation, with smaller over-prediction the during validation.
Uncertainty in streamflow prediction
We used the default lower and upper bound parameter values, which were taken directly from the SWAT users’ manual (Neitsch et al. 2005). Uncertainty analysis results of SUFI-2 during the calibration and validation periods at the three flow stations are shown in Figure 8. In these figures, the shaded region (95PPU) contains all uncertainties from the different sources.
95% probability uncertainty plot observed and simulated streamflow for stations: (a) Porto das Andorinhas, (b) Pari and (c) Ponte da Taquara.
95% probability uncertainty plot observed and simulated streamflow for stations: (a) Porto das Andorinhas, (b) Pari and (c) Ponte da Taquara.
The results for Pari flow station (Figure 8(b)) show that 84% of the observed data were bracketed by the 95PPU. Most of the data were captured by the 95PPU. The SWAT model was capable of simulating large flows and extreme events in the Upper São Francisco River basin, except for Ponte da Taquara flow station. Some peak values were missing from the 95PPU band. This shows that SUFI-2 did not capture the observed data well for Ponte da Taquara and Porto das Andorinhas stations and had high uncertainty for simulated peak values, which shows that calibration of the SWAT model in this basin is challenging due to the uncertainties that are driven by the runoff process, which are not totally understood.
The results indicated that the p-factor values were 0.89, 0.81 and 0.79 and that the r-factor values were 1.53, 2.34 and 2.95, respectively, during the calibration. The validation also indicated an acceptable performance with p-factor = 0.87, 0.86, 0.81 and r-factor = 1.72, 2.35, 3.44, respectively. In all stations, the model mostly shows large uncertainties at extreme events during the calibration and validation periods. These results were generally better for the Pari station in terms of the percentage of data being bracketed (p-factor), but the uncertainties are larger as expressed by the r-factor in all stations. This has resulted in large uncertainty in discharge peaks for all stations. Although the simulation of monthly runoff for the Porto das Andorinhas station was satisfactory during the calibration period, the SWAT model exhibited large uncertainties for the validation period.
The parameter uncertainties were tolerable when the parameter ranges of the NS and R2 reached the desired limits. When the NS value is >0.60, the results are satisfactory, and once NS is >0.75, the simulation results are good (Nash & Sutcliffe 1970). For the results during the calibration, the values of R2 and NS were 0.77 and 0.74, respectively, while in the validation, the same performance index values obtained were 0.71 and 0.69, respectively. The results indicate that the model can be accepted for the Upper São Francisco River basin.
CONCLUSIONS
In this study, the SWAT model was used in the Alto São Francisco River basin, Brazil, to simulate streamflow from 1978 to 2007. Modelling studies should always include assessment of uncertainty, and in this study the SUFI-2 technique was used. The calibration and validation results show that the simulated monthly runoff was in reasonable agreement with the measured values, suggesting that the SWAT model could be successfully used to accurately simulate runoff in the study area. The sensitivity analysis of SWAT parameters indicated that runoff is the most sensitive to the curve number and base flow alpha factor. The model overestimates the monthly runoff in the dry periods and underestimates the runoff in the wet periods, but modeling efficiency for monthly runoff was satisfactory. Uncertainty analysis in SUFI-2 indicated that 95PPU could capture more than 80% of the observed streamflow at Pari flow station. The corresponding p-factor and r-factor suggested that there were larger uncertainties at Porto das Andorinhas and Ponte da Taquara. The model also had large uncertainty in peak-flow prediction. These results can be used in further assessments of climate change and land use and cover impact appraisal on water resources for this basin.
ACKNOWLEDGEMENTS
The authors would like to express their sincere thanks to the Brazilian National Research Council (CNPq) and to the Coordination for the Improvement of Higher Education Personnel (CAPES) for their financial support. The authors are also thankful to the Brazilian National Water Agency (ANA) for providing the runoff and rainfall data.