As human activities increasingly impact river runoff, measured runoff often fails to reflect the natural state of surface water resources. This study addresses the limitations of traditional natural runoff reconstruction methods, such as the high workload of the itemized survey method and the difficulty in obtaining data for hydrologic modeling-based methods. An innovative approach is proposed. By establishing a basin-scale Soil and Water Assessment Tool model, calibrating parameters of near-natural sub-basins, analyzing sub-basin similarity, and spreading parameter sets, natural runoff can be reconstructed. In the case of the Upper Yangtze River Basin, significant spatial differences exist in the average annual runoff of its 99 sub-basins. The basin's average annual natural runoff is 438.18 km³, while the measured runoff is 428.91 km³, demonstrating the impact of human activities. Among the sub-basins, the highest average annual runoff in sub-basin 67 is 267.93 km³, and the lowest in sub-basin 16 is only 0.28 km³, indicating a large-scale variation range. The reconstructed natural runoff provides a basis for water resource planning, ecological protection, and adaptation to climate and land-use changes.

  • An innovative natural runoff reconstruction method is proposed to overcome the shortcomings of current methods.

  • The similarity between the unnatural sub-basins and the near-natural sub-basins is analyzed, and the model parameter sets are spread to the whole basin according to the degree of similarity.

  • A case study is conducted using real-world data from the Upper Yangtze River Basin in China.

With the continuous growth of the population and the ongoing development of the socio-economy, the issue of water resource shortages has become increasingly severe (Elahi et al. 2024). At the same time, under the influence of climate change and human activities, the problem of uneven spatiotemporal distribution of water resources has intensified (Abbas et al. 2022). This raises higher demands for water conservation and efficient utilization of water resources. To alleviate the water resource crisis, it is necessary to develop reasonable water resource allocation plans based on the spatiotemporal distribution and evolution characteristics of water resources, thereby improving water resource utilization efficiency. As a major component of the hydrological cycle, runoff responds to the impacts of climate change and human activities, and its evolution can lead to changes in regional water resource quantities. Surface water resources constitute the main body of total water resources and can be represented by natural river runoff. Natural runoff refers to the dynamic amount of water that can be renewed year by year in rivers, lakes, glaciers, and other surface water bodies formed by local precipitation (Miao et al. 2022). Under the influence of human activities, due to socio-economic water use, water storage, and other factors, the measured runoff of many sections can no longer truly reflect the natural water resources of the basin (Zhang et al. 2021a; Elahi et al. 2022; Abbas et al. 2023). In order to achieve rational allocation and efficient utilization of regional water resources, it is necessary to reconstruct the natural runoff series to find out the runoff volume in its natural state (Wang et al. 2022).

The traditional method for reconstructing natural runoff is the itemized survey method (Razavi & Coulibaly 2013). According to the water balance principle, for a closed watershed, the natural runoff from a control section is equal to the measured runoff from that control section plus the amount of each reconstructed water above that control section (Ren et al. 2012). Therefore, on the basis of the measured runoff, the natural runoff is obtained by investigating items such as water consumption by each water-using sector, cross-basin diversions, and changes in reservoir storage and reconstructing them item by item. Although the itemized survey method is easy to understand, the collection of survey information brings about a huge workload, requires a lot of human and material resources, and the accuracy of the survey results is difficult to guarantee, which is defective in practical application. Take the research on the Jinsha River Basin (Feng et al. 2024), Yalong River Basin (Wang et al. 2022), Tongtian River Basin (Liu et al. 2024), and Yangtze River Basin (Zhang et al. 2021a) in China as an example. Since it involves numerous water-using departments, the data collection channels for each department are complex, and the statistical standards vary, making it difficult to ensure data accuracy. At the same time, it is extremely difficult to obtain long-time-series data. Some historical data are missing or have vague records, resulting in deviations in the restoration of past water resource conditions. When dealing with complex river basins, the workload of this method increases exponentially, with high costs and low efficiency, which seriously restricts its wide application in practice.

With the gradual deepening of the study of hydrological cycle mechanisms, natural runoff reconstruction methods based on rainfall–runoff relationships have emerged. These methods use rainfall to calculate natural runoff based on the relationship between rainfall and runoff under approximately the same surface conditions. Sajikumar & Thandaveswara (1999) modeled monthly rainfall–runoff relationships for watersheds with short data lengths using artificial neural network techniques commonly used to deal with nonlinear problems. Ozelkan & Duckstein (2001) solved the problem of parameter uncertainty in rainfall–runoff modeling by developing a fuzzy rainfall–runoff model. Whigham & Crapper (2001) introduced the genetic algorithm into a rainfall–runoff model for natural runoff reconstruction. Methods based on the rainfall–runoff relationship have been widely used in natural runoff reconstruction, but there are shortcomings such as incomplete consideration of influencing factors, and the calculation results are not easy to infer. For example, Zhang et al. (2018) merely considered the relationship between rainfall and runoff in a simplistic way, neglecting the crucial impact of topography and geomorphology on the formation of runoff and the confluence process. In mountainous river basins, where the terrain undulates significantly, the velocity and direction of water flow are remarkably controlled by the terrain. There are huge differences in runoff generation and concentration in areas with different slopes and aspects, yet such methods cannot accurately reflect these differences. Additionally, the impacts of soil types and vegetation cover on water infiltration and evapotranspiration have not been fully considered, making it difficult for the calculation results to accurately reflect the actual natural runoff conditions.

In order to fully consider the physical mechanisms in the hydrologic cycle process in natural runoff reconstruction, hydrologic modeling-based natural runoff reconstruction methods have been developed. The hydrologic modeling-based methods have the advantages of strong physical mechanisms, few constraints, and ease of use (Tran et al. 2022). Hydrologic modeling is a product of the combination of hydrology science and computer science, which is a physical or conceptual structure constructed according to the formation process of rainfall–runoff in a basin to meet the principle of water balance (Clark et al. 2015). The hydrologic models in common use today can be divided into two groups: lumped models and distributed models. Lumped models simulate the runoff formation process of a basin as a whole. They are also called conceptual models because they are formed by combining conceptual elements according to the runoff formation process. Famous lumped models include the TANK model in Japan, the Sacramento model in the United States, and the Xinanjiang model in China. Distributed models divide the basin into multiple spatial units, each with different inputs and outputs. This can reflect the heterogeneous spatial distribution of runoff formation conditions in watersheds, making the simulation results more realistic and accurate. The study of distributed hydrologic modeling originated from the original concept and framework proposed by Freeze & Harlan (1969). Since then, scholars have worked on a variety of distributed hydrologic models. Abbott et al. (1986) proposed the System Hydrologic European model, which synthesizes numerous hydrologic processes and is the first representative distributed model. In 1994, the Soil and Water Assessment Tool (SWAT) was jointly developed by the USDA Agricultural Research Service and Texas A&M AgriLife Research. The SWAT has a strong physical mechanism for simulating hydrologic processes in complex watersheds with a wide range of soil, land use, and management conditions (Sarrazin et al. 2016). Various distributed hydrologic models play an important role in revealing the relationship between runoff formation and topography, geomorphology, soil, vegetation, hydrogeology, and land use. The SWAT model, as a small watershed to river basin-scale model over long-time periods, has a wide range of applications around the world, with natural runoff reconstruction being a major area of application (Wang et al. 2023). Bouraoui et al. (2005) used SWAT to simulate daily and monthly runoff in the Medjerda Basin, and the results showed that the accuracy of the simulated runoff is high and the natural runoff reconstruction is reliable.

In this study, natural runoff reconstruction is based on the SWAT model. After collecting digital elevation model (DEM) data information, land use information, soil spatial distribution information, and meteorological information, the river basin-scale SWAT model is established. The river basin is divided into sub-basins (watersheds), and the runoff of the watershed is reconstructed to its natural state after calibrating the model parameters based on hydrologic data. The traditional approach to calibrating model parameters is to adjust the basin-wide parameters to make the simulated runoff close to the early measured runoff, reasoning that the early measured runoff is less influenced by human activities and can be approximated as natural runoff (Zhang et al. 2021b). However, this parameter calibration method has two notable drawbacks. First, there are challenges in data collection. The period when runoff is least affected by human activities often dates back to a distant past. As the time span extends, the impact of human activities diminishes. Nevertheless, this also means that acquiring measured runoff data becomes increasingly arduous due to issues like data scarcity over long time periods, limited data preservation, and potential inaccuracies in historical records. Second, the spatial variability within the basin is overlooked. Employing a uniform parameter calibration across the entire basin and assigning identical parameter values to each sub-basin fails to account for the diverse influences of different surface conditions on runoff. This approach contradicts the fundamental principle of the distributed hydrologic model, which aims to represent the spatial heterogeneity of hydrological processes (Xu et al. 2023). This study proposes an innovative approach to overcome the shortcomings. A case study is conducted using real-world data from the Upper Yangtze River Basin (UYRB) in China. The natural runoff reconstruction method proposed in this research differs greatly from traditional ones. Integrating advanced data-driven algorithms and considering multiple environmental variables allows for a more accurate analysis of the natural runoff process. Validating this method in the complex and data-rich UYRB has two main aims. First, it significantly improves the accuracy and reliability of natural runoff reconstruction, providing a more precise quantification of the natural hydrological cycle. Second, it aims to create a generalized model that can be easily applied to other basins, overcoming the limitations of a single-study area. The findings from this research are expected to be crucial for developing more scientific water resource management strategies. Given the changing water availability due to climate change and land-use change, these strategies are designed to enhance the adaptability of water resource management systems. They aim to ensure sustainable water use and protect the ecological balance of river basins, thus contributing to the long-term stability of aquatic ecosystems.

Figure 1 shows the flow chart of the methodology. First, a basin-scale SWAT model is established, and the parameters of near-natural sub-basins are calibrated to obtain multiple sets of model parameters. These near-natural sub-basins that are less affected by human activities are selected, and the measured runoff of each near-natural sub-basin can be approximated as its natural runoff. Then, the similarity between non-near-natural and near-natural sub-basins is analyzed, and the parameter sets are spread across the whole basin according to the degree of similarity. Finally, the calibrated model is used to simulate the natural runoff of each non-near-natural sub-basin.
Figure 1

Flow chart of the methodology.

Figure 1

Flow chart of the methodology.

Close modal

River basin-scale SWAT establishment

The establishment of river basin-scale SWAT requires generating rivers and sub-basins based on DEM data, then generating hydrologic response units based on the spatial distribution of land use types, soil types, and slope types, and finally inputting meteorological data to run the model (Xu et al. 2023; Lotfirad et al. 2025).

Basic data preparation

The SWAT model requires the support of a variety of data from different sources and in different forms, which need to be processed and transformed before being added to the model. A detailed description of the various types of basic data is shown in Table 1.

Table 1

SWAT basic data description

Data categoriesDescription
Topography DEM data 
Land use Spatial distribution of land use types, runoff curve number, and other attribute data 
Soil Spatial distribution of soil types, soil physical properties, chemical properties, and other attribute data 
Meteorology Daily precipitation, maximum/minimum temperature, wind speed, humidity, radiation data 
Hydrology Measured runoff data from river gauging stations 
Data categoriesDescription
Topography DEM data 
Land use Spatial distribution of land use types, runoff curve number, and other attribute data 
Soil Spatial distribution of soil types, soil physical properties, chemical properties, and other attribute data 
Meteorology Daily precipitation, maximum/minimum temperature, wind speed, humidity, radiation data 
Hydrology Measured runoff data from river gauging stations 

Sub-basin (watershed) delineation

Based on the basin DEM and human-set watershed thresholds, the SWAT generates streams and a number of watershed inlets and outlets. The total outlet of the basin is determined after adding additional inlets and outlets artificially according to the actual needs. Sub-basins (watersheds) are divided based on hydraulic connections and topographic parameters, geometric parameters, and flow paths, are analyzed and calculated for each sub-basin (watershed).

Hydrological response unit generation

After delineating the sub-basins (watersheds), each sub-basin is further delineated into hydrological response units (HRUs). HRUs are areas within a sub-basin that have the same land-use type, soil type, and slope type (within a certain threshold), and they are the smallest computational units in the model run. Loading spatially distributed land use, soil, and slope datasets into the SWAT model generates a number of HRUs based on their combination and distribution.

Model inputs and model runs

Various types of soil data and meteorological data are input into the model, and input files, such as basin input file (.bsn), sub-basin input file (.sub), HRU input file (.hru), management input file (.mgt), soil input file (.sol), and groundwater input file (.gw), are created in the model database, and then the model can be run. The results produced by the model run mainly include the summary information of each HRU, each sub-basin, and each calculated river segment in the basin, which are stored in the HRU output file (output.hru), sub-basin output file (output.sub), and main river channel output file (output.rch), respectively. The results of the model run are analyzed and calculated to produce the natural runoff for each part of the watershed.

Parameter calibration considering spatial variations

Model parameter calibration is the process of adjusting model parameter values so that model calculations match measured data. Any model that mathematically represents a simulated process cannot be based entirely on the physical substance of the process, and therefore the accuracy and reliability of the model are limited. Parameter calibration is a critical step before applying the model to reveal and compensate for deficiencies in model design and implementation and is useful when real parameter values are not available or are difficult to obtain. Typically, calibrated model parameters can only be used to simulate hydrologic processes. After calibration of the model parameters is completed, validation with data other than the calibration needs to be applied to assess the applicability of the model parameters.

Parameter sensitivity analysis

The SWAT model has a large number of parameters, 28 of which are related to runoff. In order to improve the efficiency of parameter calibration, after performing parameter sensitivity analysis, parameters with less impact on model results are excluded, and parameters with higher sensitivity are selected for calibration. The parameter sensitivity analysis is performed using the Latin-hypercube one-at-a-time method (Wang et al. 2023).

Representative sub-basin (watershed) selection

The traditional approach to calibrating model parameters is to adjust the basin-wide parameters to make the simulated runoff close to the early measured runoff, reasoning that the early measured runoff is less influenced by human activities and can be approximated as natural runoff. However, this parameter calibration method has two shortcomings. First, it is difficult to collect information: the period in which runoff is minimally affected by human activities generally goes back a very long time, and the longer the time, the smaller the impact of human activities, but the more difficult it is to obtain information on measured runoff. Second, the spatial differences of different locations within the basin are not considered: the uniform parameter calibration for the whole basin and the same parameter values for each sub-basin (watershed) do not reflect the influence of different surface conditions on runoff, which is contrary to the original intention of the distributed hydrologic model.

In order to overcome these two shortcomings, a new idea is proposed in this study. Several near-natural sub-basins (watersheds) in the watershed that are less affected by human activities are selected for parameter calibration, and the measured runoff of each near-natural sub-basin (watershed) can be approximated as its natural runoff. In this way, the parameter calibration can be completed by collecting the measured runoff data of these near-natural sub-basins in recent decades, which avoids the difficulty of collecting hydrological data of old age. At the same time, calibrating the parameters for different near-natural sub-basins can fully take into account the spatial variability of the parameters and provide corresponding parameter sets for sub-basins with different surface conditions.

Therefore, M near-natural sub-basins (watersheds) are selected as representative sub-basins before the parameter calibration. The selection is based on the extent to which runoff from each sub-basin is affected by human activities. Generally, sub-basins in the mountainous areas in the upper part of the basin can be selected as representative sub-basins, where the population density is very low and human activities are minimal.

Parameter calibration algorithm

A Differential Evolution Adaptive Metropolis (DREAM)-based parameter calibration algorithm proposed by Wang et al. (2023) is applied in this study. This algorithm takes into account the multifactorial uncertainty of input data, model parameters, measured data, and is superior in model parameter estimation. Based on Bayesian inference, a set of parameters is randomly generated by sampling from the prior distribution of the parameters, from which the SWAT model is set up and simulated, and the difference between the simulated and measured values is compared to prefer the parameter ranges. The algorithm reflects the uncertainty in the calibrated parameter ranges, which is visualized by plotting a 95% prediction uncertainty graph. The 95 Percent Prediction Uncertainty (PPU) interval after parameter calibration contains most of the measured data. Both sides of the cumulative distribution of the output variables at the 2.5 and 97.5% levels are eliminated as the 5% with highly unsatisfactory simulation results. The uncertainty range of the parameters is continuously reduced through multiple iterations so that the simulated values are constantly close to the measured values.

Parameter calibration evaluation indicators

The coefficients of determination () and Nash–Sutcliffe efficiencies () are selected in this study to evaluate the applicability of SWAT model parameters (Moriasi et al. 2007).

Parameter spreading considering the spatial similarity

After completing the parameter calibration for the M representative sub-basins, the M sets of parameters need to be spread out to the full basin scale. To account for spatial differences across the basin, parameter spreading is based on similarities between sub-basins.

We assume that the whole basin is divided into N sub-basins and analyzed the characteristics of the NM non-representative sub-basins one by one. For the Kth non-representative sub-basin, by comparing its similarity with M representative sub-basins, the representative sub-basin with the highest similarity is selected, and the parameter set corresponding to this representative sub-basin is spread to the Kth non-representative sub-basin. The above steps are implemented for each non-representative sub-basin to complete the basin-wide parameter spread.

Sub-basin (watershed) characteristics analysis

The spatial differences considered in the model parameters spread are mainly in the effect of different surface conditions on runoff. Therefore, in this study, indicators reflecting surface conditions, including land use type and soil type, are selected for the analysis of sub-basin characteristics.

The direct characterization indicator for each sub-basin is the composition of the different land use types and soil types, i.e., the percentage of area occupied by each land use type and the percentage of area occupied by each soil type. In order to consistently apply land use and soil indicators for the integrated evaluation of sub-basin characteristics, the indicators are normalized for each characteristic.

Normalization is a dimensionless processing method that converts values with dimensions into dimensionless relative value relationships. In statistics, the specific role of normalization is to summarize the statistical distribution properties of a uniform sample. The normalized function form includes linear function, logarithmic function, inverse tangent function. In this study, the more commonly used linear function is adopted for normalization (Zhang et al. 2018), and the formula is as follows:
(1)
where and are the values of the sample data before and after conversion, respectively; and are the maximum and minimum values in the sample data, respectively; n is the number of data in the sample.

The percentage of area occupied by each land use type and the percentage of area occupied by each soil type in each sub-basin are used as sample data. These are normalized by substituting them into Equation (1) to obtain uniformly comparable dimensionless values between 0 and 1 to quantitatively characterize the N sub-basins in the basin.

Sub-basin (watershed) similarity analysis

After clarifying the characteristics of each sub-basin, the non-representative sub-basins are analyzed for similarity with each representative sub-basin, and parameters are spread according to the degree of similarity. The measure of similarity between sub-basins is to calculate the ‘distance’ between them.

There are many methods of describing distances that are used in different fields; common ones are Minkowski distance, Mahalanobis distance, Hamming distance, and Jaccard distance. A well-used method for describing the distance between two points and in an n-dimensional space is the Minkowski distance (Xian et al. 2016), which is defined as follows:
(2)
where q is a variable parameter, when q takes different values, can specifically represent the different types of distances in Minkowski distance: when q = 1, it represents Manhattan distance, also known as the absolute value distance; when q = 2, it represents Euclidean distance; when , it represents Chebyshev distance. The distance calculation method chosen for this study is the easily understood Euclidean distance, which can represent the distance between two points in Euclidean space. When q = 2, The Euclidean distance between two points and in n-dimensional space is calculated as follows (Xian et al. 2016):
(3)
We assume that there are n land use type indicators and soil type indicators describing the characteristics of each sub-basin, so that the N sub-basins with different characteristics within a basin can be regarded as N points with different locations in n-dimensional space: , , … , , where the first M points , , … , represent the M representative sub-basins and the second N–M points , , … , represent the N–M non-representative sub-basins. To analyze the similarity between each non-representative sub-basin and the M representative sub-basins, the Euclidean distance between the latter N–M points and the first M points needs to be calculated as shown in Figure 2.
Figure 2

Schematic diagram of Euclidean distance calculation for sub-basin similarity analysis.

Figure 2

Schematic diagram of Euclidean distance calculation for sub-basin similarity analysis.

Close modal

For the Kth non-representative sub-basin, the Euclidean distances between it and the M representative sub-basins are compared to find the closest representative sub-basin. This means that this representative sub-basin has the highest degree of similarity with the Kth non-representative sub-basin, so the parameter set calibrated for this representative sub-basin is spread to the Kth non-representative sub-basin. The above steps are implemented for each non-representative sub-basin to complete the basin-wide parameter spread.

In this paper, a basin-scale SWAT model was developed for the UYRB as an example, and the proposed hydrological parameter spreading method was applied to reconstruct the natural runoff in the basin.

Study area

The UYRB extends 4,511 km from Geladandong Mountain in the west to Yichang City in the east, with a watershed area of 1 million km2 between 90 °E ∼ 112 °E and 23 °N ∼ 35 °N, as shown in Figure 3. The topography of the UYRB is high in the west and low in the east, with a difference of 6,900 m. The UYRB has a complex terrain, with the Tibetan Plateau and Hengduan Mountains in the west, high mountains in the center, and a hilly basin and valley in the southeast. The hydrological characteristics of the UYRB can be referred to in the studies by Zhang et al. (2018) and Wang et al. (2021).
Figure 3

Map of the UYRB (Wang et al. 2023).

SWAT model for the UYRB

Basic data preparation

The spatial data (DEM) were obtained from Shuttle Radar Topography Mission data measured jointly by the National Aeronautics and Space Administration and the National Imagery and Mapping Agency at 30 m × 30 m resolution. The UYRB part was extracted from it, as shown in Figure 3. The geographic feature data, including land use data and soil-type data, were obtained from the Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences at scales of 1:100,000 and 1:1,000,000, respectively, from which the UYRB were extracted, as shown in Figure 4. A detailed description of the above geographic feature data can be found in Zhang et al. (2021b).
Figure 4

Geographic feature data used for modeling in the UYRB: (a) land use data and (b) soil-type data.

Figure 4

Geographic feature data used for modeling in the UYRB: (a) land use data and (b) soil-type data.

Close modal

Meteorological data from 80 meteorological stations in the UYRB were selected for this modeling, and the location of each station is shown in Figure 3. Meteorological data include daily precipitation, average/maximum/minimum temperature, wind speed, humidity, and radiation data for the period 1951–2013.

River basin-scale SWAT establishment

The SWAT model of the UYRB was established using the ArcSWAT module under the ArcGIS platform. Based on the spatial data of the UYRB, rivers and watersheds were identified and generated, and the watersheds were adjusted to a suitable degree of subdivision through threshold settings. Then, according to the actual situation or demand, the inlet and outlet were added or removed, and the total outlet was determined so that the UYRB could be subdivided into a number of sub-basins, as shown in Figure 5. The UYRB was divided into 99 sub-basins, and parameters such as sub-basin geometric features, topographic features, and water flow paths were calculated to generate sub-basin reports.
Figure 5

Sub-basin delineation of the SWAT model in the UYRB.

Figure 5

Sub-basin delineation of the SWAT model in the UYRB.

Close modal

The land use data and soil-type data were loaded. Land use, soil, and slope types were superimposed, appropriate thresholds were set, and different combinations of their distributions were obtained, from which HRUs were generated. A total of 702 HRUs were defined in 99 sub-basins of the UYRB for this modeling, and land use, soil and slope distribution reports and HRU distribution reports, were generated.

The prearranged meteorological data were input into the model, and various model database files were created to run the model. The Soil Conservation Service (SCS) runoff curve method was chosen to simulate surface runoff, the Hargreaves method was chosen to simulate evapotranspiration, and the Muskingum method was chosen to perform the river channel algorithm.

Parameter calibration for representative sub-basins

In the UYRB, we selected nine sub-basins with low population and construction densities and minimal human activities as near-natural sub-basins (Zhang et al. 2021a, b; Wang et al. 2023). As shown in Figure 6, the near-natural sub-basins are sub-basins #1, #6, #17, #22, #33, #44, #45, #95, and #99. The above sub-basins are used as representative sub-basins for parameter calibration.
Figure 6

Representative sub-basin locations.

Figure 6

Representative sub-basin locations.

Close modal

Based on the results of parameter sensitivity analyses, the top 10 parameters with high sensitivity were selected: ALPHA_BF, CANMX, EPCO, ESCO, GW_DELAY, GW_REVAP, GWQMN, CN2, SOL_AWC. The exact description of the parameters can be found in Zhang et al. (2021b). Wang et al. (2023) and Xu et al. (2023) showed that the runoff in the UYRB changed significantly after 1993 due to the influence of human activities. In order to minimize the error of natural runoff reconstruction caused by anthropogenic disturbance of runoff in the near-natural sub-basins, the measured monthly runoff series from 1960 to 1980 in sub-basins #1, #6, #17, #22, #33, #44, #45, #95, and #99 were selected as the parameter calibration data, and those from 1981 to 1993 were used as the parameter validation data in this study.

A set of parameters was calibrated for each of the nine representative sub-basins using the DREAM-based parameter calibration algorithm proposed by Wang et al. (2023). The comparison of the simulated runoff processes with the measured runoff processes for each representative sub-basin during the calibration and validation periods is shown in Figure 7. The coefficients of determination () and Nash–Sutcliffe efficiencies () of the simulated and measured values of runoff are also presented. As can be seen in Figure 6, the nine sets of parameters calibrated for the nine representative sub-basins are well simulated.
Figure 7

Comparison of simulated and measured runoff processes in representative sub-basins.

Figure 7

Comparison of simulated and measured runoff processes in representative sub-basins.

Close modal

Parameter spreading in similar watersheds

Based on the calibration of the parameters for the representative sub-basins, a parameter spreading was carried out to extend the nine sets of parameters to each non-representative sub-basin. The 99 sub-watersheds in the model were standardized for six land use type shares and 18 soil-type shares. The Euclidean distance between the 90 non-representative sub-basins and the nine representative sub-basins was then calculated one by one based on the sub-basin characteristic indicators. A smaller Euclidean distance indicates a higher degree of similarity. The nine sets of parameters calibrated for sub-basins #1, #6, #17, #22, #33, #44, #45, #95, and #99 are noted as parameter sets #A to #I, respectively. When a non-representative sub-basin has the highest degree of similarity to sub-basin #1, parameter set #A is used for that sub-basin; when a non-representative sub-basin has the highest degree of similarity to sub-basin #6, parameter set #B is used for that sub-basin; when a non-representative sub-basin has the highest degree of similarity to sub-basin #17, parameter set #C is used for that sub-basin; and so on. The results of the basin-wide parameter spread are shown in Figure 8.
Figure 8

Results of the basin-wide parameter spread.

Figure 8

Results of the basin-wide parameter spread.

Close modal

The use of Euclidean distance as a similarity metric allows for a quantitative assessment of the relationships between sub-basins. A smaller Euclidean distance implies a higher degree of similarity in terms of land-use type shares and soil-type shares, which are key surface-condition indicators. This means that sub-basins with similar surface characteristics are grouped together. For example, sub-basins within the same set in Figure 8 likely have comparable impacts on runoff due to their similar land use and soil compositions. This helps in identifying sub-basin clusters that may respond in a similar way to precipitation and other hydrological factors. Conversely, larger Euclidean distances highlight the differences between sub-basins. These differences can be attributed to variations in land-use patterns (such as differences between urban, agricultural, and forested areas) and soil types (which affect water infiltration and storage capacities). By quantifying these differences, we can better understand how specific surface-condition factors influence runoff generation and distribution across the basin. For instance, sub-basins with a high proportion of urban land-use (impervious surfaces) are likely to have different runoff characteristics compared to those dominated by forested areas, and the similarity metrics help in precisely identifying such disparities.

In the context of natural runoff reconstruction in the UYRB, similarity-based parameter spreading is a fundamental approach. Since the model parameters are calibrated for representative sub-basins, the ability to extend these parameters to non-representative sub-basins based on similarity is essential. By using the parameter sets of the most similar representative sub-basins for non-representative ones, we can more accurately simulate the runoff processes in different parts of the basin. This method takes into account the spatial heterogeneity of the basin, which is critical for natural runoff reconstruction. The surface-condition indicators, as reflected by the land-use and soil-type shares, are closely related to the hydrological behavior of sub-basins. For example, areas with different soil types will have different water-holding capacities and infiltration rates, which directly affect the amount and timing of runoff. By grouping sub-basins based on similarity and applying appropriate parameter sets, we can improve the accuracy of runoff simulations, thereby providing more reliable data for understanding the natural runoff regime in the UYRB. Moreover, these similarity metrics and the associated parameter – spreading method – can also assist in predicting the impact of future land-use changes or soil-related modifications on natural runoff. If there are planned changes in land use (such as urban expansion or reforestation projects) in a particular sub-basin, we can refer to the similar sub-basins and their runoff responses to estimate the potential changes in the target sub-basin's runoff. This has important implications for water resource management, flood control, and ecological protection in the UYRB.

Natural runoff reconstruction in the UYRB

The calibrated parameter sets were input into the model to simulate the runoff of the UYRB from 1960 to 2013 on a monthly scale. The average annual runoff depths for the 99 sub-basins are shown in Table 2. The average annual runoff of the UYRB was calculated to be 438.18 km3.

Table 2

Average annual runoff in sub-basins of the UYRB

Sub-basinRunoff (km3)Sub-basinRunoff (km3)Sub-basinRunoff (km3)
2.34 34 13.88 67 267.93 
0.70 35 1.16 68 15.25 
5.75 36 11.98 69 2.40 
8.18 37 0.54 70 256.94 
1.00 38 12.88 71 164.52 
1.64 39 7.83 72 5.85 
4.11 40 2.74 73 7.46 
5.02 41 7.30 74 20.58 
2.36 42 4.34 75 164.66 
10 0.78 43 438.18 76 2.95 
11 1.25 44 5.04 77 3.19 
12 8.61 45 11.22 78 3.86 
13 0.90 46 24.07 79 5.44 
14 1.18 47 26.19 80 2.32 
15 11.01 48 50.16 81 2.33 
16 0.28 49 17.39 82 11.79 
17 3.45 50 424.11 83 4.25 
18 0.20 51 19.29 84 41.99 
19 0.54 52 2.89 85 90.56 
20 4.32 53 17.55 86 20.07 
21 6.12 54 33.28 87 2.49 
22 8.26 55 20.81 88 29.43 
23 5.03 56 72.05 89 38.59 
24 1.49 57 362.47 90 67.27 
25 1.62 58 15.82 91 16.60 
26 12.50 59 274.08 92 3.88 
27 1.55 60 55.42 93 93.24 
28 1.79 61 7.04 94 36.10 
29 2.09 62 14.15 95 2.59 
30 3.63 63 0.22 96 93.43 
31 12.16 64 2.25 97 99.33 
32 4.35 65 97.47 98 0.98 
33 6.95 66 265.06 99 2.86 
Sub-basinRunoff (km3)Sub-basinRunoff (km3)Sub-basinRunoff (km3)
2.34 34 13.88 67 267.93 
0.70 35 1.16 68 15.25 
5.75 36 11.98 69 2.40 
8.18 37 0.54 70 256.94 
1.00 38 12.88 71 164.52 
1.64 39 7.83 72 5.85 
4.11 40 2.74 73 7.46 
5.02 41 7.30 74 20.58 
2.36 42 4.34 75 164.66 
10 0.78 43 438.18 76 2.95 
11 1.25 44 5.04 77 3.19 
12 8.61 45 11.22 78 3.86 
13 0.90 46 24.07 79 5.44 
14 1.18 47 26.19 80 2.32 
15 11.01 48 50.16 81 2.33 
16 0.28 49 17.39 82 11.79 
17 3.45 50 424.11 83 4.25 
18 0.20 51 19.29 84 41.99 
19 0.54 52 2.89 85 90.56 
20 4.32 53 17.55 86 20.07 
21 6.12 54 33.28 87 2.49 
22 8.26 55 20.81 88 29.43 
23 5.03 56 72.05 89 38.59 
24 1.49 57 362.47 90 67.27 
25 1.62 58 15.82 91 16.60 
26 12.50 59 274.08 92 3.88 
27 1.55 60 55.42 93 93.24 
28 1.79 61 7.04 94 36.10 
29 2.09 62 14.15 95 2.59 
30 3.63 63 0.22 96 93.43 
31 12.16 64 2.25 97 99.33 
32 4.35 65 97.47 98 0.98 
33 6.95 66 265.06 99 2.86 

Upon further analysis, it was found that there are significant spatial variations in the average annual runoff among different sub-basins. For instance, sub-basin 67 has the highest runoff of 267.93 km³, while sub-basin 16 has the lowest runoff of only 0.28 km³. Such large differences can be attributed to multiple factors. Climatic factors play a crucial role, with sub-basins in areas with higher precipitation generally having higher runoff. For example, sub-basins located in the southeastern part of the UYRB, which are influenced by the monsoon climate and receive abundant rainfall, tend to have larger runoff volumes.

In addition, underlying surface conditions also contribute significantly to the runoff differences. Sub-basins with more forest cover and better-preserved natural landscapes usually have a more stable runoff, as vegetation can enhance water infiltration and storage, reducing surface runoff and maintaining a relatively constant flow. On the contrary, sub-basins with intensive human activities, such as large-scale urbanization or industrial development, often show abnormal runoff patterns. The construction of impervious surfaces in urban areas reduces water infiltration, leading to increased surface runoff during rainfall events.

Surface water resources constitute the main body of total water resources and can be characterized by natural river runoff. Their evolution is influenced by climatic conditions and underlying surface conditions, while the differences between measured runoff and natural runoff can reflect the impact of human water extraction. Based on the temporal and spatial evolution patterns of the obtained natural runoff and measured runoff, as well as their causes, it is possible to further identify changes in regional water resources against the backdrop of climate change and human activities. The study of the temporal and spatial evolution analysis methods of natural runoff in watersheds holds significant scientific importance and practical value for the rational planning and utilization of water resources.

In the context of climate change and land use change, accurately reconstructing the natural runoff in the Upper Yangtze River is of great significance. The average annual natural runoff reconstructed in this study for the Upper Yangtze River is 438.18 km³, while the measured runoff is 428.91 km³ (Wang et al. 2023). The difference between the two clearly demonstrates the impact of human activities on runoff.

Climate change has led to alterations in precipitation patterns, with an increasing frequency of extreme precipitation events. In the UYRB, changes in the precipitation pattern directly affect the total runoff volume and its distribution within a year. Accurately understanding natural runoff can provide a scientific basis for coping with water resource fluctuations caused by climate change. For example, advance planning of water resource storage and allocation strategies can help store water rationally during the wet season and ensure water security during the dry season, effectively alleviating the problem of uneven spatial and temporal distribution of water resources.

Land use change also significantly impacts runoff. The rapid urbanization process in the UYRB has led to the expansion of urban land, resulting in hardened underlying surfaces. This causes surface runoff to converge rapidly, reducing infiltration and evapotranspiration and altering the runoff formation mechanism. Large-scale agricultural development has changed the soil structure and vegetation cover, affecting the natural water cycle. By reconstructing natural runoff, it is possible to quantify the impact of these land use changes on runoff and thus develop targeted ecological protection and restoration strategies. For instance, increasing green infrastructure in urban areas can restore the hydrological regulation function of cities, and optimizing irrigation methods in agricultural areas can reduce interference with natural runoff.

The reconstruction of natural runoff provides crucial basic data for water resource planning. Based on the reconstructed natural runoff data, more scientific water resource allocation plans can be developed to ensure the rational allocation of domestic, industrial, and ecological water use, and to achieve the sustainable utilization of water resources. At the same time, it is also an important indicator for evaluating the health of the ecosystem. It helps to maintain the stability and balance of the ecosystem in the UYRB, protect biodiversity, and promote the sustainable development of the basin.

Limitations and future research directions

In the case study, nine sub-basins with low population and construction densities and minimal human activities were selected as near-natural sub-basins Besides the degree of human activity influence, factors like topographic and geomorphic features, vegetation and ecosystems, hydrological characteristics, and soil properties can all potentially serve as criteria for selecting natural sub-basins. In the future, it is possible to further study the evaluation index system for assessing whether a sub-basin is in a natural or near-natural state to make the selection criteria more scientific and reasonable.

This study investigates the spatial spreading of SWAT model parameter sets. It uses the similarity of sub-basins based on land use and soil types as a key reference. Land use types, including forest, agricultural, and urban areas, significantly impact runoff generation. For example, forested areas have more infiltration and less surface runoff compared to urban regions. Similarly, different soil types vary in water-holding capacity and permeability, which also affect hydrological processes. By analyzing these factors, we can group sub-basins with similar characteristics and spread the parameter sets accordingly. However, this approach might not cover all aspects influencing parameter distribution. Future research could further explore whether there are other more scientifically grounded bases for the distribution of model parameters. This could involve studying the role of topography at a more detailed level, like micro-relief features. These factors could provide a more comprehensive understanding and potentially lead to more accurate parameter distribution methods in the SWAT model.

The reconstruction of natural runoff in a river basin serves as the cornerstone of scientific water resources management, playing a crucial role in maintaining ecological balance and ensuring the sustainable development of the social economy. Focusing on the shortcomings of existing methods, this study proposes a novel natural runoff reconstruction method based on the spreading of hydrological parameters in similar watersheds. In-depth case analyses are carried out using the actual data of the UYRB, yielding a series of valuable results:

Innovative parameter calibration approach: Breaking through the limitations of traditional parameter calibration methods, this study innovatively selects near-natural sub-basins within the river basin, which are less affected by human activities and can better reflect the natural runoff conditions, for parameter calibration. In the research of the UYRB, nine such sub-basins were identified, and multiple sets of accurate model parameter sets were successfully obtained. This strategy effectively addresses the data acquisition difficulties in traditional methods and fully considers the spatial variability of parameters, laying a solid foundation for accurate runoff process simulation.

Establishment of parameter spreading system: Land use types and soil types are used as key indicators to quantify the characteristics of sub-basins. By scientifically calculating the Euclidean distance, the similarity between non-representative sub-basins and near-natural sub-basins is accurately analyzed. Based on the similarity, the parameter sets of near-natural sub-basins are reasonably spread to the entire basin. This method fully takes into account the impact of underlying surface conditions on runoff, enabling the model parameters to better adapt to the actual situations of different sub-basins and significantly improving the accuracy and reliability of model simulation.

High-precision runoff reconstruction: With the help of the established SWAT model at the basin scale of the UYRB and the innovative parameter calibration and spreading methods, the high-precision reconstruction of the basin's natural runoff is successfully achieved. The study reveals significant spatial differences in the average annual runoff of the 99 sub-basins in the basin, which are mainly caused by different climate conditions and underlying surface characteristics. By comparing the reconstructed average annual natural runoff (438.18 km³) with the measured runoff (428.91 km³), the impact of human activities on runoff is clearly demonstrated. In addition, the method proposed in this study provides new ideas and methods for the construction of runoff series in ungauged areas, playing an important role in the scientific planning and efficient management of water resources, as well as the protection and sustainable development of the river basin ecosystem.

Conceptualized by L.L., Y.Z., Y.M.; developed the methodology by L.L., X.C., R.C.; rendered support in formal analysis and investigated by L.L., X.C., R.C.; wrote the original draft preparation by L.L., Y.Z.; wrote and reviewed and edited by L.L., X.C., R.C., X.Z., Y.Z., Y.M.; rendered support in funding acquisition by X.Z., Y.Z.; contributed in resources R.C., X.Z.; supervised by Y.M.

This work is supported by the National Natural Science Foundation of China (Grant No. 52209032) and the China Three Gorges Corporation Project (Grant No. 0704226).

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Abbas
A.
,
Ullah
S.
,
Ullah
W.
,
Waseem
M.
,
Dou
X.
,
Zhao
C. Y.
,
Karim
A.
,
Zhu
J. T.
,
Hagan
D. F. T.
,
Bhatti
A. S.
&
Ali
G.
(
2022
)
Evaluation and projection of precipitation in Pakistan using the Coupled Model Intercomparison Project Phase 6 model simulations
,
Int. J. Climatol.
,
42
(
13
),
6665
6684
.
https://doi.org/10.1002/joc.7602
.
Abbas
A.
,
Bhatti
A. S.
,
Ullah
S.
,
Ullah
W.
,
Waseem
M.
,
Zhao
C. Y.
,
Dou
X.
&
Ali
G.
(
2023
)
Projection of precipitation extremes over south Asia from CMIP6 GCMs
,
J. Arid. Land
,
15
(
3
),
274
296
.
https://doi.org/10.1007/s40333-023-0050-3
.
Abbott
M. B.
,
Bathurst
J. C.
,
Cunge
J. A.
,
O'Connell
P. E.
&
Rasmussen
J.
(
1986
)
An introduction to the European hydrological system – systeme hydrologique Europeen, ‘SHE’, 2: structure of a physically-based, distributed modelling system
,
J. Hydrol.
,
87
(
1–2
),
61
77
.
https://doi.org/10.1016/0022-1694(86)90115-0
.
Bouraoui
F.
,
Benabdallah
S.
,
Jrad
A.
&
Bidoglio
G.
(
2005
)
Application of the SWAT model on the Medjerda river basin (Tunisia)
,
Phys. Chem. Earth.
,
30
(
8–10
),
497
507
.
https://doi.org/10.1016/j.pce.2005.07.004
.
Clark, M. P., Fan, Y., Lawrence, D. M., Adam, J. C., Bolster, D., Gochis, D. J., Hooper, R. P., Kumar, M., Leung, L. R., Mackay, D. S., Maxwell, R. M., Shen, C. P., Swenson, S. C. & Zeng, X. B.
(
2015
)
Improving the representation of hydrologic processes in Earth System Models
,
Water Resour. Res.
,
51
(
8
), 5929–5956.
https://doi.org/5929-5956. 10.1002/2015WR017096
.
Elahi
E.
,
Khalid
Z.
&
Zhang
Z. X.
(
2022
)
Understanding farmers’ intention and willingness to install renewable energy technology: a solution to reduce the environmental emissions of agriculture
,
Appl. Energy
,
309
,
118459
.
https://doi.org/10.1016/j.apenergy.2021.118459
.
Elahi
E.
,
Li
G. J.
,
Han
X. R.
,
Zhu
W. B.
,
Liu
Y.
,
Cheng
A.
&
Yang
Y. D.
(
2024
)
Decoupling livestock and poultry pollution emissions from industrial development: a step towards reducing environmental emissions
,
J. Environ. Manage.
,
350
,
119654
.
https://doi.org/10.1016/j.jenvman.2023.119654
.
Feng
B. F.
,
Zhang
X.
,
Yan
F. J.
&
Zhang
Y.
(
2024
)
Determination of ecological flow based on probability distributions of annual and monthly flows
,
J. Water Clim. Change
,
15
(
10
),
5242
5257
.
https://doi.org/10.2166/wcc.2024.315
.
Freeze
R. A.
&
Harlan
R. L.
(
1969
)
Blueprint for a physically-based, digitally-simulated hydrologic response model
,
J. Hydrol.
,
9
(
3
),
237
258
.
https://doi.org/10.1016/0022-1694(69)90020-1
.
Liu
X.
,
Yang
W. X.
,
Zhang
Y.
&
Fang
G. H.
(
2024
)
A copula-based approach to instream ecological flow determination considering inter- and intra-annual runoff variability
,
J. Water Clim. Change
,
15
(
5
),
2415
2428
.
https://doi.org/10.2166/wcc.2024.025
.
Lotfirad
M.
,
Mahmoudi
M.
,
Esmaeili-Gisavandani
H.
&
Adib
A.
(
2025
)
Allocation of water resources with management approaches and under climate change scenarios in an arid and semi-arid watershed (study area: hablehroud watershed in Iran)
,
Appl. Water Sci.
,
15
(
4
),
74
.
https://doi.org/10.1007/s13201-025-02416-2
.
Miao, C. Y., Gou, J. J., Fu, B. J., Tang, Q. H., Duan, Q. Y., Chen, Z. S., Lei, H. M., Chen, J., Guo, J. L., Borthwick, A. G. L., Ding, W. F., Duan, X. W., Li, Y. G., Kong, D. X., Guo, X. Y. & Wu, J. W.
(
2022
)
High-quality reconstruction of China's natural streamflow
,
Sci. Bull.
,
67
(
5
),
547
556
.
https://doi.org/10.1016/j.scib.2021.09.022
.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
(
2007
)
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
,
Trans. ASABE
,
50
(
3
),
885
900
.
https://doi.org/10.13031/2013.23153
.
Ozelkan
E. C.
&
Duckstein
L.
(
2001
)
Fuzzy conceptual rainfall–runoff models
,
J. Hydrol.
,
253
(
1–4
),
41
68
.
https://doi.org/10.1016/S0022-1694(01)00430-9
.
Razavi
T.
&
Coulibaly
P.
(
2013
)
Streamflow prediction in ungauged basins: review of regionalization methods
,
J. Hydrol. Eng.
,
18
(
8
),
958
975
.
https://doi.org/10.1061/(ASCE)HE.1943-5584.0000690
.
Ren
L. L.
,
Liu
X. F.
,
Yuan
F.
,
Xu
J.
&
Liu
W.
(
2012
)
Quantitative analysis of runoff reduction in the Laohahe basin
,
Hydrol. Res.
,
43
(
1–2
),
38
47
.
https://doi.org/10.2166/nh.2011.135
.
Sajikumar
N.
&
Thandaveswara
B. S.
(
1999
)
A non-linear rainfall–runoff model using an artificial neural network
,
J. Hydrol.
,
216
(
1–2
),
32
55
.
https://doi.org/10.1016/S0022-1694(98)00273-X
.
Sarrazin
F.
,
Pianosi
F.
&
Wagener
T.
(
2016
)
Global sensitivity analysis of environmental models: convergence and validation
,
Environ. Modell. Software
,
79
,
135
152
.
https://doi.org/10.1016/j.envsoft.2016.02.005
.
Tran, H., Zhang, J., O'Neill, M. M., Ryken, A., Condon, L. E. & Maxwell, R. M.
(
2022
)
A hydrological simulation dataset of the Upper Colorado River Basin from 1983 to 2019
,
Sci. Data
,
9
(
1
),
16
.
https://doi.org/10.1038/s41597-022-01123-w
.
Wang, M. L., Zhang, Y., Lu, Y., Gong, X. L. & Gao, L.
(
2021
)
Detection and attribution of reference evapotranspiration change (1951–2020) in the Upper Yangtze River Basin of China
,
J. Water Clim. Change
,
12
(
6
),
2624
2638
.
https://doi.org/10.2166/wcc.2021.011
.
Wang, M. L., Zhang, Y., Lu, Y.,Wan, X., Xu, B. & Yu, L.
(
2022
)
Comparison of multi-objective genetic algorithms for optimization of cascade reservoir systems
,
J. Water Clim. Change
,
13
(
11
),
4069
4086
.
https://doi.org/10.2166/wcc.2022.290
.
Wang
M. L.
,
Zhang
Y.
,
Lu
Y.
,
Gao
L.
&
Wang
L. Z.
(
2023
)
Attribution analysis of streamflow changes based on large-scale hydrological modeling with uncertainties
,
Water Resour. Manage.
,
37
(
2
),
713
730
.
https://doi.org/10.1007/s11269-022-03396-7
.
Whigham
P. A.
&
Crapper
P. F.
(
2001
)
Modelling rainfall–runoff using genetic programming
,
Math. Comput. Modell.
,
33
(
6–7
),
707
721
.
https://doi.org/10.1016/S0895-7177(00)00274-0
.
Xian
S. D.
,
Sun
W. J.
,
Xu
S. H.
&
Gao
Y. Y.
(
2016
)
Fuzzy linguistic induced OWA Minkowski distance operator and its application in group decision making
,
Pattern Anal. Appl.
,
19
(
2
),
325
335
.
https://doi.org/10.1007/s10044-014-0397-3
.
Xu
Y. S.
,
Chen
Y. B.
,
Ren
Y. F.
,
Tang
Z. Y.
,
Yang
X.
&
Zhang
Y.
(
2023
)
Attribution of streamflow changes considering spatial contributions and driver interactions based on hydrological modeling
,
Water Resour. Manage.
,
37
,
1859
1877
.
https://doi.org/10.1007/s11269-023-03459-3
.
Zhang
Y.
,
Zhong
P. A.
,
Wang
M. L.
,
Xu
B.
&
Chen
J.
(
2018
)
Changes identification of the Three Gorges reservoir inflow and the driving factors quantification
,
Quat. Int.
,
475
,
28
41
.
https://doi.org/10.1016/j.quaint.2016.02.064
.
Zhang
Y.
,
Wang
M. L.
,
Chen
J.
,
Zhong
P. A.
,
Wu
X. F.
&
Wu
S. Q.
(
2021b
)
Multiscale attribution analysis for assessing effects of changing environment on runoff: case study of the Upstream Yangtze River in China
,
J. Water Clim. Change
,
12
(
2
),
627
646
.
https://doi.org/10.2166/wcc.2020.155
.
Zhang, Y., Yu, L., Wu, S. Q., Wu, X. F., Dai, J. Y., Xue, W. Y. & Yang, Q. Q.
(
2021a
)
A framework for adaptive control of multi-reservoir systems under changing environment
,
J. Cleaner Prod.
,
316
,
128304
.
https://doi.org/10.1016/j.jclepro.2021.128304
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC 4.0), which permits copying, adaptation and redistribution for non-commercial purposes, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc/4.0/).