The need to develop new and computationally efficient artificial intelligence models that accurately simulate river flows in data-scarce regions, considering not only current but also projected future climate change conditions is vital. In this study, a hybrid artificial neural network (ANN) model that combines HEC-HMS and the feed-forward neural network (FFNN) was developed in the Python programming language and applied to simulate future stream flows in the River Mayanja catchment in Central Uganda. The study results suggest that the performance of the validated hybrid HEC-HMS-ANN model during calibration and validation (NSE and R2 > 0.99) was more superior to the corresponding performance obtained using individual HEC-HMS (NSE and R2 > 0.50), MIKE HYDRO (NSE and R2 > 0.42), and ANN models (NSE and R2 > 0.56). Using the developed hybrid ANN model, future average daily stream flows are projected to increase by up to 17.3% [2.2–39.5%] and 18.5% [0.8–42.7%] considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios. The study demonstrates that well-trained hybrid ANN models could provide more computationally efficient models for the simulation of future stream flow and for undertaking water resource assessments in catchments with limited in situ observed data.

  • Development and application of a hybrid HEC-HMS-ANN model to simulate future stream flows in a data-scarce River Mayanja catchment.

  • The performance of the developed hybrid model is superior when compared with individual HEC-HMS, MIKE HYDRO, and ANN models.

  • Future average daily stream flows are projected to increase by 17.3% [2.2–39.5%] and 18.5% [0.8–42.7%] for the 2040 SSP2-4.5 and SSP5-8.5 climate change scenarios.

AI

artificial intelligence

ANN

artificial neural network

CMIP6

Coupled Model Intercomparison Project Phase 6

DEM

digital elevation model

EO

evolutionary optimisation

FFNN

feed-forward neural network

FL

fuzzy logic

GRNN

generalised regression neural network

HEC-HMS

Hydrologic Engineering Centre Hydrological Modelling System

LS

linear scaling

LSTM

long short-term memory

MAE

mean absolute error

ML

machine learning

NAM

Nedbor-Afstromnings model

NSE

Nash–Sutcliffe efficiency index

OAT

one-factor-at-a-time

RBFNN

radial basis function neural network

RMSE

root mean square error

SHAP

SHapley additive explanations

SMA

soil moisture accounting

SOM

self-organising map

SSP

shared socioeconomic pathways

SVM

support vector machine

SWAT

soil and water assessment tool

WEAP

water evaluation and planning system

Globally, emerging threats such as climate change, rapid population growth, and urbanisation have been projected to increase the frequency and magnitude of extreme events that will expose larger populations to more severe flooding, drought, and reduction of available water resources (Eccles et al. 2019; UNDRR 2020; Papa et al. 2023). In East Africa, global climate and hydrological model predictions suggest increases in annual rainfall, river flows, fluctuations in lake levels, and increased frequency and intensity of floods and droughts (Olaka et al. 2019; Trisos et al. 2022; Gebrechorkos et al. 2023; Mugume et al. 2024; Sempewo et al. 2024). In addition, most conventional physically based or conceptual hydrologic models often lead to inaccurate estimation of stream flows in catchments in sub-humid tropical regions that are characterised by highly varying topography, daily and seasonal rainfall, and runoff patterns (Jimeno-Sáez et al. 2018; Tibangayuka et al. 2022). This has been attributed to insufficient in situ observed hydrometeorological data specifically in many data-scarce regions in Sub-Saharan Africa, inherent hydrological model uncertainties, insufficient understanding of underlying hydrological processes, and uncertainties in the projection of future climate change impacts (Dile et al. 2020; Sempewo et al. 2022).

In recent research, many studies have assessed the performance of existing physically based or conceptual hydrological models such as the Hydrologic Engineering Centre Hydrological Modelling System (HEC-HMS), MIKE HYDRO, soil and water assessment tool (SWAT), Hydrologiska Byråns Cattenbalansavdelning (HBV), and Water Evaluation and Planning System (WEAP) with a view of evaluating location-specific model performance (Vo et al. 2018; Chathuranika et al. 2022; Ghosh et al. 2022; Sempewo et al. 2022). However, in most studies, the performance of these models in simulating the peak flows has been constrained by insufficient catchment (topography, soil, vegetation, and geological) in situ observed rainfall and stream flow data for model calibration and validation, inherent uncertainties in the model structures and a strong requirement for the modellers' a priori understanding of inherent physical processes in the hydrological system being studied (Mount et al. 2016; Dile et al. 2020; Reddy & Pramada 2022; Pareta 2023). Therefore, it is vital to develop new, computationally efficient and less data-intensive models to facilitate more accurate prediction of stream flows in natural rivers including their low and peak flows. Accurate estimation of stream flows is vital for reliable and robust design, retrofit and operational control of engineered systems such as water treatment plants, irrigation schemes, dams, hydropower plants, spillways, weirs, and other flood control structures (Jimeno-Sáez et al. 2018; Kurian et al. 2020; Ghimire et al. 2021).

In recent years, the need to develop computationally efficient data-driven artificial intelligence (AI) models for simulating stream flows in remote data-scarce regions has been underscored (Kim et al. 2021; Rahimzad et al. 2021; Chebii et al. 2022; Rahman et al. 2022; Tibangayuka et al. 2022). Well-trained AI models provide a more computationally efficient and accurate approach for simulating stream flows in complex non-linear hydrological systems without the need for detailed knowledge of the underlying hydrological processes and where the available data may be insufficient to set up more conventional physically based or conceptual models (Javan et al. 2015; Mount et al. 2016). In addition, AI models can adequately represent noise complexity in observed data and have an inherent ability to simultaneously discover the ‘model structure’ and to optimise its parameters (Yaseen et al. 2015; Mount et al. 2016; Young et al. 2017). Furthermore, after calibration, validation, and testing, such models can be used to simulate stream flows in adjacent ungauged hydrologically similar catchments, given the increased access to open-source reanalysis climate datasets (Nakkazi et al. 2022).

The most commonly applied AI methods can be broadly categorised as artificial neural network (ANN), support vector machine (SVM), fuzzy logic, evolutionally optimisation, and wavelet conjunction models (Tiwari & Chatterjee 2011; Huang et al. 2014; Yaseen et al. 2015; Sahoo et al. 2018, 2023). ANN models are further classified into supervised (back propagation multi-layer perceptron feed-forward neural network (FFNN), radial basis function, and generalised regression neural network (GRNN)) models and unsupervised models such as the self-organising map (Yaseen et al. 2015). ANN models, which are inspired by the functionality of the human nervous system, with a network of interconnected neurons created to process information (Chen et al. 2013; Nourani et al. 2019), have been widely applied for the prediction of stream flows due to their computational efficiency and cost-effectiveness (Ali & Shahbaz 2020). A review summary of recent studies that have developed and applied ANN models is presented in Supplementary Table S1.

In more recent studies, deep learning techniques such as long short-term memory (LSTM) models have been developed for the prediction of stream flows and sediments in natural rivers (Hu et al. 2018; Ghimire et al. 2021; Sahoo et al. 2023). Ghorbani et al. (2016) compared the performance of the multi-layer perceptron FFNN, radial basis function neural network (RBFNN), and SVM models in simulating stream flows for the Zarrinehrud River located in North-Western Iran. The study results suggested that both the FFNN and RBFNN models performed better than the SVM model in simulating monthly stream flows. However, the study results suggested that the uncertainty in the SVM model was less than the corresponding values in the RBFNN and FFNN models. Koycegiz & Buyukyildiz (2022) also investigated the performance of FFNN and RBNN models in estimating monthly stream flows in the Seyhan Basin in the south of Turkey. The study results suggested that the FFNN model resulted in more superior performance when compared with the RBNN model. In addition, Awchi (2014) compared the performance of an FFNN, GRNN, and the RBFNN model in simulating stream flows in the upper and lower Zab River, which is a tributary of the Tigris River in northern Iraq. The study results indicated that their performance varied with different cases involved in the study; however, the FFNN model resulted in better performance when compared with the other neural networks utilised in the study. However, most of these ANN-based modelling studies have been undertaken considering current climate conditions. In addition, there are very few studies that have investigated the performance of ANN-based models in simulating stream flows considering the projected impacts of future climate change (Aryal et al. 2019), despite the well-understood view that hydrological model uncertainty is comparatively higher than climate model uncertainty (Karlsson et al. 2016).

Furthermore, the use of hybrid modelling approaches that combine ANN models and physically based models have been a subject of current research (Young et al. 2017; Reddy & Pramada 2022). Hybrid models have been preferred because they are less data-intensive, facilitate the representation of different aspects of the input data, which leads to more superior estimation of stream flows when compared with individual HEC-HMS, MIKE HYDRO, or ANN models (Young & Liu 2015; Young et al. 2017). The developed hybrid model overcomes the challenges of inaccurate estimation of peak flows that have been observed in studies utilising individual physically based or conceptual models. Furthermore, within the hybrid HEC-HMS-ANN modelling approach, the stream flows generated using HEC-HMS serve as a predictor for the hybrid HEC-HMS-ANN model, which facilitates the elimination of unwanted accumulated errors in the hybrid model outputs (Young et al. 2017).

Humphrey et al. (2016) compared the performance of a hybrid GR4J conceptual rainfall-runoff model and a Bayesian ANN in simulating stream flows at three key locations in the southeast of South Australia. The study results suggest that the hybrid model outperformed the individual models in simulating high flows. Young & Liu (2015) compared the performance of a hybrid HEC-HMS and ANN model for simulating peak flows in the Sandimen basin in southern Taiwan. The study results suggested that the hybrid model performance was superior when compared with the individual HEC-HMS, SWAT, and ANN models. In a recent study, Kurian et al. (2020) showed that two hybrid models, that is, HEC-HMS-ANN and SWAT-ANN, were more superior in flood forecasting when compared with individual models. Furthermore, Reddy & Pramada (2022) compared the performance of a hybrid HEC-HMS-ANN model for simulating daily stream flow for the Kallada River catchment in India. The study results suggested that the hybrid model performed better than the individual models and resulted in reduced simulation errors of 66 and 26.5% when compared with the individual HEC-HMS and ANN models.

In this study, a hybrid model that combines the physically based HEC-HMS and the multi-layer perceptron feed-forward ANN model was developed and applied to simulate current and future stream flows and water availability for the River Mayanja catchment. The performance of the developed hybrid model was compared with corresponding results obtained using individual physically based HEC-HMS, conceptual MIKE HYDRO, and ANN models considering both current and future climate change conditions. In addition, a water availability assessment was undertaken considering current and projected future water demand in the case study catchment. The study findings will inform water resource modellers, managers, and decision-makers in the choice of suitable hydrological models and approaches for undertaking climate change impact assessments and evaluation of potential mitigation and adaptation strategies when faced with limited in situ observed hydrometeorological datasets.

In this research, a new hybrid model that combines the physically based HEC-HMS and the multi-layer perceptron feed-forward ANN model developed in the Python programming language was applied to the study. The hybrid HEC-HMS-ANN model utilises two input datasets that include the daily rainfall and stream flow data.

In this study, the feed-forward ANN model was chosen for utilisation in the hybrid model as opposed to the more recent deep learning approaches such as the LSTM approaches, which require more time and resource-consuming hyperparameter tuning methods for model training (Ghimire et al. 2021) and input datasets with shorter lead times e.g. 1 h which would necessitate coupling of LSTM models with other data-driven models such as wavelet transform to decompose time series into simpler sub-timeseries components (Hu et al. 2018; Ni et al. 2020). Furthermore, the ANN-based models have a simple and straightforward architecture that requires less computational power, have fewer hyperparameters that can be optimised, and are less prone to overfitting problems when compared with LSTM models (Ghimire et al. 2021).

To validate the developed hybrid HEC-HMS-ANN model, three different models, that is HEC-HMS, MIKE HYDRO, and the multi-layer perceptron FFNN (referred to as the ANN model in this research), were also developed and used to simulate both current and projected future stream flows. The general form of the proposed hybrid model is given by Equation (1) (Kurian et al. 2020). A detailed description of the HEC-HMS, MIKE HYDRO, ANN, and the hybrid HEC-HMS-ANN models that were applied in this research are described in Sections 2.1–2.4.
(1)
where Qm(t+i) denotes the predicted stream flow using the hybrid model at time step t + i; QSim represents the simulated flow using HEC-HMS at any given time; i represents the lead time; t is the current time step; f is the non-linear function fitted by ANN; P denotes rainfall values, and j and k are the lags considered for stream flow and rainfall.
The adopted study approach is presented in Figure 1.
Figure 1

Adopted study approach.

Figure 1

Adopted study approach.

Close modal

Physically based HEC-HMS model

The HEC-HMS model is one of the most widely applied physically based, semi-distributed rainfall-runoff modelling software developed by the US Army Corps of Engineers, Hydrologic Engineering Centre (Chathuranika et al. 2022; Reddy & Pramada 2022; Sempewo et al. 2022; Tibangayuka et al. 2022). In the HEC-HMS model, the catchment (basin) is subdivided into sub-catchments connected by channel links (Reddy & Pramada 2022). In addition, effective rainfall on previous land surfaces is computed using an empirical equation (Young et al. 2017) described using Equation (2), by subtracting the volume of water that is transpired or evaporated, intercepted, stored, and infiltrated, from the total rainfall (Bartles et al. 2022).
(2)
where Pe and P are the precipitation excess and accumulated rainfall depth at time t, respectively; Ia is the initial abstraction (e.g. infiltration loss), and S is the potential maximum retention.
The losses in Equation (2) were estimated using the soil moisture accounting (SMA) loss model, which simulates the movement and storage of water through the soil profile and the multiple groundwater layers (Bartles et al. 2022). Furthermore, the transformation of effective rainfall into runoff was undertaken using the Clark Unit Hydrograph method (Sabol 1988), whose parameters include the time of concentration (Tc) and the storage coefficient, R. The Tc was estimated using Kirpich's formula described in Equation (3) in which L is the reach length and S is the slope (−).
(3)
Based on the time of concentration, Tc, and maximum travelling distance, dmax, and the travel time (or translation lag) to each watershed cell, the tcell can be calculated using Equation (4) (Young et al. 2017):
(4)
where dcell is the distance from the cell to the catchment outlet.
The cell outflow hydrograph is then routed using a linear reservoir concept described using Equation (5) (Young et al. 2017):
(5)
where Q(t) and Q(t − 1) are the outflows at current and previous time levels t and t − 1, respectively; I(t) is the average inflow at time t; R is the storage coefficient used to represent discharge attenuation; and t is the time increment.
Furthermore, the baseflow is computed using an exponential decrease function specified in Equation (6) (Young et al. 2017):
(6)
where Q0 is the averaged initial baseflow before a storm and k is an exponential decay constant.
In addition, the stream flow, formed by combining the near-surface flow and overland flows (Young et al. 2017), was routed using the Muskingum equation (Birkhead & James 2002). In this method (Equation (7)), two parameters are the travel time of the flood wave through the routing reach, K, and the dimensionless coefficient X [0, 0.5], representing the wedge storage or the attenuation of the flood wave on its way through the reach must be evaluated (Birkhead & James 2002; Reddy & Pramada 2022). The X and K values were derived through model calibration using observed river flow values (Bartles et al. 2022; Birkhead & James 2002):
(7)
where S is the net storage along the river reach; I is the inflow to the reach; and O is the outflow from the reach.

Conceptual MIKE HYDRO model

The conceptual and lumped MIKE HYDRO model, developed by the Danish Hydraulic Institute is an integrated rainfall-runoff and hydrodynamic model that utilises the Nedbor-Afstromnings model (NAM) to simulate rainfall-runoff (surface), subsurface, and baseflows and to undertake hydrodynamic modelling in river reaches (Ghosh et al. 2022; Pareta 2023). The model comprises a series of mathematical statements that describe the characteristics of the land area of the hydrological cycle in a simplified quantitative form (Pareta 2023).

The NAM model is a deterministic and lumped conceptual model, utilising the continuous moisture accounting method that requires relatively low input data to predict the long-term water balance and runoff as well as predict short-term inflows (Pareta 2023). The input data required to set up the NAM model includes rainfall, temperature, evapotranspiration, the definition of catchment and initial conditions, and stream flows or water levels for model calibration and validation (Pareta 2023). Some of the parameters can therefore be estimated from physical catchment data, but the final parameter estimation must be performed by calibration using concurrent input and output time series (DHI 2017). Evapotranspiration was determined using the Penman–Monteith method using daily sunshine hours, wind speed, and daily maximum and minimum temperature as input data (Zhang et al. 2008). Furthermore, hydrodynamic modelling in the MIKE HYDRO model is based on the finite-difference numerical solution of the St. Venant (dynamic wave) equation, which combines the conservation of mass (Equation (8)) and the conservation of momentum (Equation (9)) to enhance the accuracy of the 1D river flow simulation results (DHI 2017; Ghosh et al. 2022):
(8)
(9)
where Q is the discharge (m3/s); x is the longitudinal distance along the channel (m); t is the time (s or h); A is the cross-sectional flow area (m2); y is the water depth (m); g is the acceleration due to gravity (m/s2); q is the lateral inflow per unit length (m2/s); S0 is the bottom slope, and Sf is the friction slope.

Multi-layer perceptron FFNN model

In this study, an ANN model utilising the back propagation multi-layer perceptron feed-forward (FFNN) algorithm was developed in the Python programming language version 3.11 and applied to model the rainfall–stream flow relationship for current and projected future stream flows. This type of neural network is one of the most widely used ANN in hydrological modelling and has shown promising results when compared with other types of ANN models (Ali & Shahbaz 2020). The FFNN algorithm has a non-recurrent architecture and provides a superior model for non-linear hydrological processes including stream flow (Rahman et al. 2022).

The architecture of the FFNN model used comprises an input layer, a hidden layer, and an output layer (Tibangayuka et al. 2022). The process of determining the ANN model architecture includes specifying the number of network layers, neurons in the layers and iterations, the learning rate, the activation function, the momentum of the coefficient, and determination of the normalisation method (Koycegiz & Buyukyildiz 2022). A model with one hidden layer was utilised because various studies have determined that having more than one hidden layer yields no significant improvement in performance (Turhan 2021) rather these increase the complexity of the model and can lead to overfitting (Tanteliniaina et al. 2021).

In undertaking data normalisation, large variations that exist within the input data can slow down or even prevent the training of the network (Pramanik et al. 2011).

The mathematical representation of the ANN model based on FFNN is presented in Equation (10):
(10)
where y is the output of the jth neuron, wij represents the connection weight from the ith neuron in the preceding layer to the jth neuron of the current layer, xi represents the sequence of inputs, bj is the bias of the jth neuron, and f represents the activation function.
Furthermore, in the developed ANN model, the number of input and output parameters is equal to the number of neurons contained in the input and output layers, respectively (Javan et al. 2015). The net input to each neuron in the hidden layer is the total summation of all the weighted connections between the input layer and that neuron (wijxi) plus the bias (bj) associated with that neuron (Priddy & Keller 2005). The inputs received by the neurons are then passed through an activation function, f(x) (Equation (11)), to produce output values (Elsafi 2014; Turhan 2021). The logistic sigmoid activation function (Equation (11)) was utilised in the ANN model (Shiau & Hsu 2016; Jimeno-Sáez et al. 2018).
(11)
where x is the weighted sum of inputs to the neuron and f(x) is the neuron's output.
To train the network, a backpropagation algorithm, where the output data from the ANN is compared with the actual observed data was applied. The model training outputs were evaluated for the mean square error (MSE) occurring within the network (Jimeno-Sáez et al. 2018). Prior to the commencement of model training, weights are randomly initialised to each neuron and adjusted using backpropagation so as to minimise the MSE values computed using Equation (12) (Rahman et al. 2022).
(12)

Hybrid HEC-HMS-ANN model

In order to improve the accuracy and reliability of future stream flow projections, previous studies have recommended the development of hybrid models (multi-model combinations) so as to improve overall model predictive performance (Mount et al. 2016).

Hybrid models are combined hydrologic models in which physically based or conceptual modelling is fused with data-driven modelling. The physically based or conceptual models are based on a simplified watershed system and mathematical equations that describe the physical processes involved in the movement and storage of water (Young et al. 2017). Conversely, the data-driven models establish a direct relationship between hydrologic variables that are extracted from observed (measured) data using algorithms developed in the fields of statistics, computational intelligence, machine learning, and data mining (Young et al. 2017). Data-driven models are used to model the elements of the system where there is insufficient a priori knowledge to support physically based or conceptual modelling (Mount et al. 2016).

Hybrid models can result in improved predictions of system responses because they circumvent the challenge of having to derive predictions of a system's response from models that are inadequate or incomplete conceptualizations of it (Mount et al. 2016). Furthermore, hybrid modelling approaches can exploit the strengths and minimise the weaknesses of either modelling approach, as opposed to the isolated use of individual modelling techniques (Humphrey et al. 2016; Mount et al. 2016).

In this study, a hybrid ANN model based on the multi-layer perceptron FFNN, which utilises two input variables, i.e. observed daily rainfall and stream flows was developed in the Python programming language. It is hypothesised that the hybrid model takes advantage of the complementary strengths of both physically based semi-distributed models and ANN models to provide more accurate and reliable stream flow forecasts for case study catchment (e.g. Humphrey et al. 2016). The adopted network structure for the hybrid ANN model structure is presented in Table 1.

Table 1

Model parameters of the hybrid ANN

Basic parametersValues
Number of neurons 
Number of iterations 30 
Learning rate 0.001 
Training dataset 2,103 (Rainfall), 2,103 (Stream flow) 
Learning dataset 526 (Rainfall), 526 (Stream flow) 
Testing dataset 658 (Rainfall), 658 (Stream flow) 
Basic parametersValues
Number of neurons 
Number of iterations 30 
Learning rate 0.001 
Training dataset 2,103 (Rainfall), 2,103 (Stream flow) 
Learning dataset 526 (Rainfall), 526 (Stream flow) 
Testing dataset 658 (Rainfall), 658 (Stream flow) 

Study area

The River Mayanja is located in central Uganda in the Albert Water Management Zone between longitudes 31°40′E and 32°40′E and latitudes 0°10′N and 1°30′N, with a length of 150 km draining a catchment area of approximately 6,160 km2. The catchment experiences two rainy seasons, with an average annual rainfall of 1,287 [1,081–1,516] mm and potential evapotranspiration that ranges from 1,237 to 1,672 mm (MWE 2024). The River Mayanja catchment drains the highly urbanised Lubigi and Nalukolongo catchments within Kampala City, parts of Wakiso, Nakaseke, Kiboga, Kyankwanzi, Mpigi, Luwero, Butambala, Mpigi and Mityana districts and drains into the River Kafu, which is a tributary of the Kyoga Nile. A large part of the upper sub-catchment is covered by the Mayanja wetland system. The river is gauged at Kapeeka located in Nakaseke district (Figure 2). Figure 3(a) presents the land use and land cover variation in the catchment, while Figure 3(b) presents the average slopes for the study area.
Figure 2

Location of the study area.

Figure 2

Location of the study area.

Close modal
Figure 3

(a) Land use and land cover map and (b) average slopes for the River Mayanja catchment.

Figure 3

(a) Land use and land cover map and (b) average slopes for the River Mayanja catchment.

Close modal

Datasets and data preprocessing

Datasets

Table 2 presents the various datasets that were used in the study and includes the observed daily rainfall recorded at Makerere University meteorological station, observed daily stream flow recorded at the Kapeeka gauging station along River Mayanja, remote sensed topographical, land use and land cover, soil, historical rainfall, and projected future climate change anomalies.

Table 2

Description of the datasets used in the study

Data typeDescription and useData sourcePurpose
Stream flow Observed daily river flow data for the Kapeeka hydrological monitoring station for the period 2011–2021 Ministry of Water and Environment Uganda Model calibration and validation 
Uganda DEMs 30 × 30 m resolution digital elevation model United States Geological Survey (USGS), https://earthexplorer.usgs.gov/ Catchment delineation 
Land use and land cover map 10 × 10 m map showing various land use and land cover types Esri Land Cover Atlas (2021), https://livingatlas.arcgis.com/landcover/ Used to obtain parameters for SMA in HEC-HMS 
World digital soil map 10 × 10 m map showing variations in soil types Food and Agricultural Organization (FAO) Database (2007), https://www.fao.org/ Used to obtain parameters for the SMA method in HEC-HMS 
Rainfall data Observed daily rainfall data for Makerere University rain gauge station for the period 1991–2019 Uganda National Meteorological Authority (UNMA) Hydrological modelling 
Remote sensed satellite climate data Daily rainfall, minimum and maximum temperature, wind speed and sunshine hours for the period 1991–2019 NASA Power, https://power.larc.nasa.gov/ Hydrological modelling 
Future climate change anomalies Monthly average values are used to change base year data to future projected daily data World Bank Climate Change Knowledge Portal, https://climateknowledgeportal.worldbank.org Hydrological modelling of future stream flows 
Data typeDescription and useData sourcePurpose
Stream flow Observed daily river flow data for the Kapeeka hydrological monitoring station for the period 2011–2021 Ministry of Water and Environment Uganda Model calibration and validation 
Uganda DEMs 30 × 30 m resolution digital elevation model United States Geological Survey (USGS), https://earthexplorer.usgs.gov/ Catchment delineation 
Land use and land cover map 10 × 10 m map showing various land use and land cover types Esri Land Cover Atlas (2021), https://livingatlas.arcgis.com/landcover/ Used to obtain parameters for SMA in HEC-HMS 
World digital soil map 10 × 10 m map showing variations in soil types Food and Agricultural Organization (FAO) Database (2007), https://www.fao.org/ Used to obtain parameters for the SMA method in HEC-HMS 
Rainfall data Observed daily rainfall data for Makerere University rain gauge station for the period 1991–2019 Uganda National Meteorological Authority (UNMA) Hydrological modelling 
Remote sensed satellite climate data Daily rainfall, minimum and maximum temperature, wind speed and sunshine hours for the period 1991–2019 NASA Power, https://power.larc.nasa.gov/ Hydrological modelling 
Future climate change anomalies Monthly average values are used to change base year data to future projected daily data World Bank Climate Change Knowledge Portal, https://climateknowledgeportal.worldbank.org Hydrological modelling of future stream flows 

Data preprocessing

Data cleaning and filling of missing observed rainfall, temperature, and stream flow data were undertaken prior to its use in hydrological modelling. Specifically, the observed rainfall data was checked for inconsistencies, duplicates, outliers, repeated dates, and missing data. The missing data in this dataset was filled using bias-corrected NASA Power remote sensed rainfall data (Tan et al. 2023). The NASA Power rainfall data was corrected for magnitude bias using the linear scaling (LS) bias correction method to match the monthly mean of the corrected dataset as close as possible to that of the observed dataset. The bias correction factor was computed as the quotient of the observed monthly mean and the remote sensed rainfall data (Girma 2021). Furthermore, the linear interpolation method was used to fill in the missing stream flow values.

In addition, the cleaned data used in the ANN model underwent another preprocessing step to ensure accurate predictions. This data was normalised using the statistical normalisation, which rescales the features to have a mean of zero and a variance of one (Priddy & Keller 2005). Data normalisation ensures that large variations that exist within the input data do not slow down or prevent the training process from going ahead (Pramanik et al. 2011).

In addition, the raw digital elevation model (DEM) obtained was pre-processed through the removal of errors and artificial sinks so as to ensure accurate mapping of streams (Lidberg et al. 2017). In this study, raw DEM preprocessing entailed a series of steps that included: (a) filling of artificial sinks and generation of flow direction and accumulation grids (Troolin & Clancy 2016), (b) extraction of channel networks from the grid, stream segmentation, and catchment grid delineation. The generated catchment grid was then converted to vector format using the catchment polygon processing tool. Finally, the obtained stream network was also converted to vector format using a drainage line processing tool (based on a segmented grid and flow direction grid as inputs) (Akram et al. 2012; Venkatesh 2012).

Physically based and conceptual model sensitivity analysis, calibration, and validation

Model sensitivity analysis is applied to assess how variables impact the model inputs on the model outputs (Tang et al. 2007). Identifying and estimating the most sensitive parameters (those that can cause significant changes in the simulated stream flow) is vital for enhancing model prediction accuracy. In this study, one-factor-at-a-time local sensitivity analysis was applied to identify key parameters and the parameter precision required for calibration. This was implemented by changing parameter values by up to ±50% using intervals of 10% (Tang et al. 2007; Sweetapple et al. 2014). Sensitivity analysis is preceded by running the model using initial estimates of model parameters. Thereafter, one parameter is varied at a time and analysed from −50 to 50% with increments of 10% while keeping the rest of the parameters constant. This process is repeated for all other parameters and the output values are compared with the initial estimates to determine the variation. The greater the percentage change in simulated stream flow indicates the greater sensitivity of that specific model parameter.

Based on the results of the sensitivity analysis, manual calibration was carried out for the HEC-HMS model. No autocalibration was done in this study as it was considered ineffective during the calibration process (Dariane et al. 2016). On the other hand, autocalibration was undertaken for the MIKE HYDRO model. The models were calibrated for the period between 2011 and 2014 and the obtained parameters and initial conditions were validated for the period between 2015 and 2017.

ANN model training, validation, and testing

The ANN model was trained, validated, and tested using observed daily rainfall for Makerere University rain gauge station and observed daily stream flow for Kapeeka stream flow gauging stations (Table 2). In the study, seven training, validation, and testing data split percentage ratios, that is 50:25:25; 60:20:20; 60:10:30; 60:30:10; 70:10:20; 70:20:10; and 80:10:10, were investigated (Table 3). The performance of each data split ratio was evaluated using Nash–Sutcliffe efficiency (NSE), R2, and RMSE measures of goodness of fit. The 70% (training):10% (validation):20% (testing) data split ratio, which resulted in more superior performance when compared with all the tested data split was applied in this study (Table 3). The data were split into three (3) sets that is: training (2,300); validation (328); and testing (657), respectively.

Table 3

Effect of data split percentage ratio on ANN model performance

Data split percentage ratios (Training:Validation:Testing)NSER2RMSE
50:25:25 0.31 0.57 7.25 
60:20:20 0.59 0.67 6.10 
60:10:30 0.33 0.58 6.62 
60:30:10 0.56 0.58 6.72 
70:10:20 0.65 0.67 5.64 
70:20:10 0.60 0.60 6.44 
80:10:10 0.61 0.62 6.33 
Data split percentage ratios (Training:Validation:Testing)NSER2RMSE
50:25:25 0.31 0.57 7.25 
60:20:20 0.59 0.67 6.10 
60:10:30 0.33 0.58 6.62 
60:30:10 0.56 0.58 6.72 
70:10:20 0.65 0.67 5.64 
70:20:10 0.60 0.60 6.44 
80:10:10 0.61 0.62 6.33 

The bold values show the data split ratio that resulted into more superior performance when compared to all other considered data split ratios.

Rainfall data were the only input for the model and different input variables of antecedent rainfall were tested in the model. The input variables that gave the optimum model were selected and a total of 100 days of previous rainfall were determined to predict the stream flow at one lead day.

The input data were then pre-processed through normalisation to remove large variations and to enable smooth training of the model. Data normalisation was carried out using the Min–Max and statistical method and the model efficiency was compared using the NSE and RMSE values, as shown in Table 4. The statistical normalisation method was more superior when compared with the Min–Max method (Priddy & Keller 2005). Based on this, the statistical normalisation method was applied to standardise the input and output variables.

Table 4

Comparison of the Min–Max and statistical normalisation methods for data pre-processing

Normalisation methodNSE indexRMSE
Min–Max method 0.63 5.80 
Statistical method 0.65 5.63 
Normalisation methodNSE indexRMSE
Min–Max method 0.63 5.80 
Statistical method 0.65 5.63 

To avoid model overfitting, the trial-and-error method (Javan et al. 2015) was applied to determine the optimal number of neurons within the hidden layer. If fewer neurons are used, the network may not be able to capture the very detailed relationship between the indicator and computed output parameters whereas if too many neurons are used, this may result in overtraining of the network (Pramanik et al. 2011; Javan et al. 2015). The model was run with both the training and validation data and their errors monitored during the training stage. The optimal parameters for the ANN model were obtained in the region where validation error was observed to start increasing as the training error reduced. The number of iterations that resulted in the best performance when the model was run with an unseen dataset (testing) was determined as the most optimal.

Furthermore, in this study, a range of neurons (4–30) was tested to determine the suitable number of neurons to use in the model. Results showed that there was no significant change in the accuracy of the model when the number of neurons exceeded 4, which was in agreement with the results of Demirel et al. (2009). Based on this, the ANN model was programmed to use four neurons.

Lastly, the optimal number of epochs (iterations) that adequately generalised the data was determined as 50 (Table 5). The trained ANN was later run for the calibration (2011–2014) and validation (2014–2017) periods which used for the HEC-HMS and MIKE HYDRO models to compare all the models on the same dataset.

Table 5

ANN model development parameters

Basic parametersValues
Number of neurons 
Number of iterations 50 
Learning rate 0.001 
Training dataset 2011–2017 (2,300 data points) 
Validation dataset 2017–2018 (328 data points) 
Testing dataset 2018–2019 (657 data points) 
Basic parametersValues
Number of neurons 
Number of iterations 50 
Learning rate 0.001 
Training dataset 2011–2017 (2,300 data points) 
Validation dataset 2017–2018 (328 data points) 
Testing dataset 2018–2019 (657 data points) 

Interpretation of ANN model predictions

Due to the inherent uncertainties associated with ANNs, the interpretability of the model results was investigated using the SHapley additive explanations (SHAP) technique. SHAP originates from ‘game theory’ and constructs an additive interpretation model where all features are regarded as contributors (Wang et al. 2022; Mushtaq et al. 2024). SHAP can quantify the contribution of an input feature to the prediction for an individual sample. The SHAP value obtained for a given prediction describes the contribution to that prediction compared with the average prediction of the given dataset (Molnar 2022). SHAP has been successfully applied to explain the importance of input features to the predictions attained both on the local (individual) and global frame (whole dataset) (Lundberg & Lee 2017). The technique facilitates the correct interpretation of a prediction model's output.

Model performance evaluation and comparison

The prediction accuracy of hydrological models must be evaluated in both the calibration and validation phases (Gelete et al. 2023). The use of at least one goodness of fit and one error measure can sufficiently evaluate model performance (Legates & Mccabe 1999). Moriasi et al. (2007) undertook a comprehensive review of hydrological model performance evaluation metrics and recommended the use of NSE, percent bias, and the ratio of the root mean square error (RMSE) to the standard deviation of measured data (RSR). Legates & Mccabe (1999) recommended a combination of relative error that is NSE and the index of agreement and absolute error measures (mean absolute error (MAE) and RMSE) to overcome the weaknesses of correlation-based measures such as the coefficient of determination (R2).

In this study, the performance of the hydrological models was evaluated using NSE (Equation (13)), R2 (Equation (14)), and the RMSE measures (Equation (15)) (Seenath et al. 2016; Gelete et al. 2023; Mugume et al. 2024). The NSE, which ranges from minus infinity to 1.0, is a normalised statistic that determines the relative magnitude of the residual variance (‘noise’) compared with the observed data variance (‘information’). NSE indicates how well the plot of observed versus simulated data fits the 1:1 line. Higher values of NSE indicate better agreement, as to the variance in the observed data, subtracted from unity (Nash & Sutcliffe 1970; Legates & Mccabe 1999; Moriasi et al. 2007). NSE values between 0 and 1 are generally acceptable, with 1 as the best value (Gupta & Kling 2011). The NSE helps to capture errors based on the long-term calibration of a model.

On the other hand, R2 is the square of Pearson's product-moment correlation coefficient. It describes the proportion of the total variance in the observed data that can be explained by the model. It ranges from 0.0 to 1.0 with higher values indicating better performance of the hydrological model (Legates & Mccabe 1999; Hanington et al. 2017). Last but not least, RMSE provides a measure of the difference between the predicted values and the actual, observed stream flow (Kovačević et al. 2018), with values close to zero representing the highest model efficiency.
(13)
(14)
(15)
where Qi is the observed discharge, Qa is the average value of observed discharge, Qm is the simulated discharge, and N is the total number of simulations.

Climate change impact assessment

The anomalies for rainfall, minimum, and maximum temperature expected in the near-term future (2020–2039) were downloaded for an open-source multi-model ensemble of global climate model results for the case study area that is available on the Climate Change Knowledge Portal (CCKP) (World Bank 2021). The CCKP is an online platform created by the World Bank to provide climate-related information at global and regional scales based on the outputs of the Coupled Model Intercomparison Project Phase 6 (CMIP6). In this study, two future climate change scenarios that are SSP2-4.5 and SSP5-8.5 were chosen and the corresponding anomalies were downloaded for the period 2020–2039 using the period 1995–2014 as the baseline period. The impact of climate change on monthly rainfall, minimum temperature, and maximum temperature was computed for the future 2020–2039 period, consideration of the 10th, 50th, and 90th percentiles (e.g. Patterson et al. 2022). The use of the 10th and 90th percentiles facilitated considering the uncertainty band inherent in future climate change projections. The trained/validated ANN model and the calibrated HEC-HMS models were used for the simulation of stream flow under projected climate change scenarios.

Furthermore, considering that the Kapeeka gauging station used in the study is located upstream of the River Mayanja catchment outlet, after the model calibration and validation steps, the observed stream flow data required scaling to compute the corresponding stream flow for the entire catchment. This was achieved by applying a scaling factor that relates the area and percentage imperviousness of the catchment upstream of the gauging station to the total area and percentage imperviousness of the catchment (Equation (16)):
(16)
where Ac is the total area of the catchment, PIMPc is the percentage imperviousness of the whole catchment, Ac,u is the area of the catchment upstream of the gauging station, and PIMPc,u is the percentage imperviousness of the catchment upstream of the gauging station.

This chapter describes the performance of the individual HEC-HMS, MIKE HYDRO, ANN, and the hybrid HEC-HMS-ANN models during calibration and validation. Furthermore, the ANN model performance during testing and interpretation of ANN model predictions is described. In addition, the results of the climate change impact assessment and the projected stream flows for the River Mayanja catchment are described.

HEC-HMS model performance

The HEC-HMS was calibrated for the period 2011–2014 and validated for the period 2015–2017. The optimised parameters of the HEC-HMS model are presented in Supplementary Table S3. The performance of the HEC-HMS model during the calibration and validation periods was evaluated using the NSE, coefficient of determination (R2), and RMSE coefficients (Table 6). The HEC-HMS model exhibited satisfactory results during both the calibration and validation (NSE and R2 > 0.5). Furthermore, the HEC-HMS model performed well in the simulation of low flows when compared with the MIKE HYDRO model. In addition, the model tended to underestimate the peak flows during some events (refer to Figure 4 and Supplementary Table S5). Furthermore, the calibration time for the HEC-HMS model was considerably higher than the corresponding period for the MIKE HYDRO model.
Figure 4

Observed rainfall and stream flow and simulated stream flow hydrograph for HEC-HMS, MIKE HYDRO, and the ANN model for (a) the calibration period (2011–2014) and (b) the validation period (2015–2017).

Figure 4

Observed rainfall and stream flow and simulated stream flow hydrograph for HEC-HMS, MIKE HYDRO, and the ANN model for (a) the calibration period (2011–2014) and (b) the validation period (2015–2017).

Close modal

MIKE HYDRO model performance

Similar to the HEC-HMS model, the MIKE HYDRO model was calibrated for the period 2011–2014 and validated for the period 2015–2017. The optimised parameters of the MIKE HYDRO model are presented in Supplementary Table S4. The MIKE HYDRO model performed poorly during the calibration period (NSE and R2 < 0.5) but produced satisfactory results during the validation period (NSE and R2 > 0.5), suggesting that the performance of MIKE HYDRO was less superior when compared with the HEC-HMS model. Furthermore, the MIKE HYDRO model performance in the simulation of low flows was less satisfactory when compared with the corresponding performance obtained using the HEC-HMS model. In addition, the MIKE HYDRO model tended to underestimate the peak flows during some events (refer to Figure 4 and Supplementary Table S5).

ANN model performance

The ANN model performed more satisfactorily during the calibration period (NSE and R2 > 0.64) and the validation period (NSE and R2 > 0.56) when compared with both the HEC-HMS and MIKE HYDRO models (Table 3). However, the ANN model performance, based on the R2 coefficient (0.57), was lower than the corresponding value obtained using the MIKE HYDRO model (0.59) during the validation period (refer to Supplementary Figure S2). The study results suggest that the ANN model performed generally well in simulating the general pattern of flow. The model was generally able to simulate the low flows with high levels of accuracy. However, the model did not perform well in simulating peak flows occurring during the simulation period, as the maximum flow is limited to 30 m3/s in the ANN model, compared with the maximum flow of ∼40 m3/s in the observed stream flow data.

Hybrid HEC-HMS-ANN model performance

The study results suggest that the hybrid HEC-HMS-ANN model performed better than the individual HEC-HMS, MIKE HYDRO, and ANN models during both calibration (NSE and R2 > 0.991) and validation (NSE and R2 > 0.982) (Table 6). The study results suggest that by combining the strengths of the physically based HEC-HMS model and the ANN models, the resulting performance of the hybrid model is more superior when compared with the individual HEC-HMS and ANN models.

Table 6

Performance evaluation of the HEC-HMS, MIKE HYDRO, ANN, and hybrid ANN models

ModelPeriodNSER2RMSE
HEC-HMS Calibration 0.56 0.59 7.49 
Validation 0.50 0.60 6.51 
MIKE HYDRO Calibration 0.42 0.45 8.67 
Validation 0.54 0.59 6.26 
ANN Calibration 0.64 0.66 6.81 
Validation 0.56 0.57 6.14 
Hybrid ANN Calibration 0.991 0.992 1.07 
Validation 0.982 0.988 1.25 
ModelPeriodNSER2RMSE
HEC-HMS Calibration 0.56 0.59 7.49 
Validation 0.50 0.60 6.51 
MIKE HYDRO Calibration 0.42 0.45 8.67 
Validation 0.54 0.59 6.26 
ANN Calibration 0.64 0.66 6.81 
Validation 0.56 0.57 6.14 
Hybrid ANN Calibration 0.991 0.992 1.07 
Validation 0.982 0.988 1.25 

Interpretation of ANN model predictions

The results of the SHAP interpretability analysis for the ANN model for a random individual output indicate that rainfall from the previous 14, 19, 69, 67, and 80 days had the highest impact on the model outputs at a local scale for moderate rainfall (Figure 5). Furthermore, the results indicate that rainfall from the previous 17, 14, 5, 1, 7, 3, 4, and 91 days had the highest impact on model outputs at a local scale for extreme rainfall (Figure 6). On the other hand, rainfall from the previous 14, 13, 15, 18, 17, 16, 12, 11, and 19 days had a significant impact on the model results at a global scale (Figure 7).
Figure 5

Effect (SHAP values) of ANN input variables for a single random event of moderate stream flow.

Figure 5

Effect (SHAP values) of ANN input variables for a single random event of moderate stream flow.

Close modal
Figure 6

Effect (SHAP values) of ANN input variables for a single random event of relatively high stream flow.

Figure 6

Effect (SHAP values) of ANN input variables for a single random event of relatively high stream flow.

Close modal
Figure 7

Global effect (SHAP values) of the input variables on the output received from the ANN model.

Figure 7

Global effect (SHAP values) of the input variables on the output received from the ANN model.

Close modal

From the results, it is noted that SHAP values for the days when rainfall was received have a positive impact on the output generated while for days where no significant rainfall was received the SHAP values were negative. The results suggest that days that received rainfall led to increased stream flow generation while days that did not receive rainfall led to reduced stream flows. The results further suggest that runoff generated in the River Mayanja catchment system takes approximately 11–19 days to make a significant impact at the point of interest (Kapeeka gauging station).

Furthermore, the results in Figure 6 suggest that although rainfall from the previous 17 and 1 days is relatively the same (41.4 and 40.3 mm, respectively), the impact made on the output from the previous 17 days is significantly greater than that registered by the rainfall received in the previous 1 day (6.07 and 1.71, respectively). In addition, the results of the SHAP analysis also suggest that rainfall from the previous 2–3 months also had a significant influence on the model output (Figures 5 and 6).

Furthermore, for the hybrid HEC-HMS-ANN model, the results suggest that stream flow has a stronger influence on the model outputs when compared with rainfall for both local (Figure 8) and global scales (Figure 9). This could be attributed to the presence of papyrus vegetation in the river, which enables the river to retain water for longer periods and slowly release it into the environment. These results provide an explanation for the difference in ANN and hybrid HEC-HMS-ANN model performance and are also in agreement with findings from previous studies that suggested that ANN models that utilise both stream flow and rainfall data as inputs performed better than those that utilised rainfall data alone (Javan et al. 2015).
Figure 8

A graph indicating the effect (SHAP values) of the input variables for a random single output of the hybrid HEC-HMS-ANN model.

Figure 8

A graph indicating the effect (SHAP values) of the input variables for a random single output of the hybrid HEC-HMS-ANN model.

Close modal
Figure 9

A graph indicating the global effect (SHAP values) for the input variables on the output received from the hybrid HEC-HMS-ANN model.

Figure 9

A graph indicating the global effect (SHAP values) for the input variables on the output received from the hybrid HEC-HMS-ANN model.

Close modal

Climate change projections

In this study, the impacts of climate change on future temperature, rainfall, and stream flow for the near future period (2020–2039) were assessed with reference to the baseline period (1995–2014) and considering the SSP2-4.5 and SSP5-8.5 emission scenarios. Figure 10 shows the relative changes in monthly rainfall, and the monthly minimum and maximum temperature for the 10th, 50th, and 90th percentiles of the multi-model ensemble CIMP6 global climate model data. The presented relative changes obtained from the CCKP database are based on monthly data, which was attributed to the non-availability of data on corresponding relative changes presented on a daily time step. This could lead to an underestimation of the projected future daily rainfall peaks. However, the monthly values are more accurate in estimating future average daily rainfall and were therefore applied in this study.
Figure 10

Impact of climate change on monthly rainfall, minimum temperature, and maximum temperature. The black and red solid lines represent the 50th percentiles for SSP2-4.5 and SSP5-8.5 future climate change scenarios. The red line represents the 50th percentile for the uncertainty band (10th percentile and 90th percentile values) for the SSP2-4.5 scenario (black line and darker area) and SSP5-8.5 (red line and brighter area) future climate change scenarios.

Figure 10

Impact of climate change on monthly rainfall, minimum temperature, and maximum temperature. The black and red solid lines represent the 50th percentiles for SSP2-4.5 and SSP5-8.5 future climate change scenarios. The red line represents the 50th percentile for the uncertainty band (10th percentile and 90th percentile values) for the SSP2-4.5 scenario (black line and darker area) and SSP5-8.5 (red line and brighter area) future climate change scenarios.

Close modal

Furthermore, the results in Figure 10 suggest that monthly rainfall is expected to increase in January, February, March, August, October, November, and December for both SSP2-4.5 and SSP5-8.5 scenarios. The results also suggest the prediction uncertainty is highest in the dry months of July (−1.6% [−22.4–26.1%]) and December (1.2% [−20.4–72.8%]). For SSP5-8.5, monthly rainfall is anticipated to decrease in May, June, and July with the rest of the months registering increasing rainfall. The results also suggest the prediction uncertainty for the SSP5-8.5 scenario is highest in July (−0.4% [−19.7–31.1%]) and December (0.9% [−21.3–80.0%]). The computed change factors for monthly rainfall, minimum monthly temperature, and maximum monthly temperature are presented in Supplementary Figure S4. The monthly rainfall change factors for the SSP2-4.5 scenario range from a minimum of 0.97 [0.77–1.07] to a maximum of 1.06 [0.96–1.72], while corresponding values for the SSP5-8.5 scenario range from a minimum of 0.95 [0.77–1.05] to a maximum of 1.09 [0.98–1.80]. Furthermore, the study results suggest that the highest climate change factors greater than 1.2 were obtained during the months of July, September, and December for the SSP5-8.5 scenario. On the other hand, the computed change factors for the minimum and maximum monthly temperature were relatively low [0.998–1.002] suggesting that future changes in temperature extremes will be very low.

Overall, a comparison of the projected average monthly rainfall to that of the baseline period shows that the rainfall is projected to increase by 2.1% [−11.4–20.4%] and 3.1% [−14.1–22.9%] for the SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively. These results are relatively consistent with the results obtained by Onyutha et al. (2021) who found that rainfall was projected to increase from January to February and decrease in March to May and June to September and increase in October to December for the SSP2-4.5. A slight decrease in rainfall was observed for the period June to September with an increase in other months for the SSP5-8.5. The projected monthly minimum temperature and maximum temperature slightly increase throughout the year for both scenarios. However, the relative change in minimum and maximum temperature values are low [–0.–5–0.30%] for both SSP2-4.5 and SSP5-8.5 scenarios (Figure 10).

Projected stream flows

The results of the impact of climate change on future stream flow are shown in Figure 11 (HEC-HMS model), Figure 12 (scaled ANN model), and Figure 13 (hybrid ANN model) and summarised in Supplementary Table S6.
Figure 11

The HEC-HMS model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Figure 11

The HEC-HMS model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Close modal
Figure 12

Scaled ANN model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Figure 12

Scaled ANN model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Close modal
Figure 13

The hybrid ANN model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Figure 13

The hybrid ANN model projected monthly stream flow showing the median and the 10th–90th percentile uncertainty band for (a) SSP2-4.5 and (b) SSP5-8.5 scenarios.

Close modal

The HEC-HMS model results indicate an increase in stream flows of 18.0% [2.7–41.4%] and 19.3% [1.4–45.2%] considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios. The scaled ANN model results indicate an increase/decrease in stream flows of 15.3% [−17.8–27.2%] and 3.1% [−19.3–30.4%] considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios. On the other hand, the hybrid ANN model results suggest an increase in stream flows of 17.3% [2.2–39.5%] and 18.5% [0.8–42.7%] considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios.

The study results further suggest that both HEC-HMS and the hybrid ANN models led to a slight increase in the low flows (10th percentiles) of up to 2.7 and 1.4% considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively. Furthermore, both models led to significant increases in the peak flows (90th percentiles) of up to 41.4 and 45.2% considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively. The scaled ANN model, unlike the HEC-HMS and hybrid ANN models, resulted in significant decreases in the low flows (10th percentiles) of up to 17.8 and 19.3% considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively.

Comparison of HEC-HMS, MIKE HYDRO, ANN, and hybrid HEC-HMS-ANN model performance

The physically based HEC-HMS model resulted in satisfactory performance during the calibration and validation periods in contrast to the corresponding results obtained using the conceptual and lumped MIKE HYDRO model. The superior performance of the HEC-HMS model in a tropical region could be attributed to its ability to account for the contribution of the baseflow in the catchment and consideration of canopy processes that influence runoff generation in the catchment (Gumindoga et al. 2017; Sanjay Shekar & Vinay 2021; Tibangayuka et al. 2022). In addition, both HEC-HMS and MIKE HYDRO were able to accurately simulate the low flows during both the calibration and validation periods.

However, both the HEC-HMS and MIKE HYDRO models tended to underestimate the peak flows for some events during the calibration period, which is consistent with the findings of previous studies (Aredo et al. 2021; Gunathilake et al. 2021). In addition, the MIKE HYDRO model also overestimated some peak flows during the validation period. These discrepancies have also been reported in previous studies such as Slieman & Kozlov (2023) and are attributed to the lack of stream flow gauging stations at outlets of catchments, insufficient spatial resolution of existing rainfall, temperature, and soil datasets, relatively low resolution of the utilised digital elevation models, which are vital for estimation of model parameters and limited data on the split between rainfall-runoff and baseflow in gauged rivers (Vilanova et al. 2019).

In future studies, the physically based or conceptual model performance could be enhanced through improved estimation of sensitive model parameters that include percentage imperviousness, storage coefficient, and time of concentration in addition to densification of the meteorological and stream flow gauging stations within catchments to improve the spatial distribution of the observed rainfall, temperature, and stream flow datasets (e.g. Pareta 2023).

In contrast, the study results suggested that the developed ANN model performed slightly better than HEC-HMS and MIKE HYDRO during the calibration and validation periods and was able to sufficiently simulate most of the low flows occurring in the River Mayanja catchment. This was attributed to the presence of a high number of low flows in the training dataset, e.g. Pramanik et al. (2011). However, the ANN model also did not perform satisfactorily well in capturing peak flows for the River Mayanja catchment. This could be attributed to the use of point rainfall that did not effectively represent the spatial rainfall distribution in the catchment. Furthermore, the inadequacy of ANN models in capturing peak flows has been reported in past studies, which attributed the phenomenon to an insufficient number of peak flows in the calibration dataset, leading to the failure of the ANN to establish a strong pattern (Jimeno-Sáez et al. 2018). This, therefore, suggests that the ANN model performance can be significantly improved through the use of longer calibration datasets that contain multiple peaks that are sufficient to establish a strong pattern in the ANN model (Tibangayuka et al. 2022).

Furthermore, the developed hybrid HEC-HMS-ANN model resulted in more superior performance during both calibration and validation periods when compared with the individual HEC-HMS, MIKE HYDRO, and ANN models. In addition, the results of the SHAP model interpretability analysis facilitated an explanation of the significant difference in the resulting ANN and hybrid HEC-HMS-ANN model performance. This difference was attributed to the use of rainfall and stream flow as inputs for the hybrid model, in contrast to the use of a single input (rainfall) for the individual ANN model. In addition, the hybrid model combined the strengths of the physically based HEC-HMS model (detailed mathematical description of physical processes involved in movement and storage of water in a catchment) and the ANN model (ability to accurately simulate complex non-linear processes without the need for detailed knowledge on underlying hydrological processes) and the resulting into more superior performance (e.g. Humphrey et al. 2016).

These results are in agreement with recent studies that suggested that hybrid ANN models are able to exploit the strengths of the individual physically based or conceptual models that utilise mathematical equations to describe the physical processes entailed in the movement and storage of water in catchments and ANN models that have the ability to accurately simulate complex non-linear processes for stream flow prediction without the need for detailed knowledge on underlying hydrological processes (Mount et al. 2016). More so, the performance of the developed hybrid ANN model showed that by improving the model structure to facilitate the use of multiple input datasets, the performance of the ANN models can be significantly improved. The study findings also demonstrate that hybrid ANN models present an opportunity for improved hydrological modelling in data-scarce regions, located in areas with similar climatic zones. This would require prior calibration and validation of hybrid ANN models using available observed rainfall and stream flow data for adjacent gauged catchments located in the same climatic zone as the ungauged catchments being studied.

Projected stream flows

The study results suggest that using the HEC-HMS model, stream flow is projected to increase by up 18.0% [2.7–41.4%] and 19.3% [1.4–45.2%] by 2039 considering SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively. For the scaled ANN model, the results indicate both increase/decrease stream flow of 15.3% [−17.8–27.2%] and 3.1% [−19.3–30.4%] SSP2-4.5 and SSP5-8.5, respectively, suggesting less satisfactory model performance for future stream flow prediction. The decrease in future stream flow for the scaled ANN model could be attributed to training the ANN using only one model input parameter, which is the projected future rainfall data, as opposed to the HEC-HMS model where future rainfall, temperature, and soil and land use data were utilised as input parameters. The hybrid HEC-HMS-ANN model, on the other hand, addresses this challenge by using input data that consists of both projected future rainfall and the HEC-HMS projected stream flow as inputs for future stream flow simulation.

Furthermore, for the hybrid HEC-HMS-ANN model, the results indicate increases of up to 17.3% [2.2–39.5] and 18.5% [0.8–42.7%] by 2039 considering SSP2-4.5 and SSP5-8.5 future climate change scenarios, respectively. The study results suggest that the developed hybrid ANN model adequately represents the low flows but underpredicts the peak flows. This could be attributed to the training of the ANN model using observed stream flow data with a relatively low number of observed peak flows and lower values of the peak flow magnitudes when compared with the simulated future peak flows.

The results obtained therefore suggest that for the River Mayanja catchment, the developed hybrid ANN model provides highly satisfactory results (model calibration: NSE and R2 > 0.99 and validation: NSE and R2 > 0.98) and can be used for the simulation of future stream flow in catchments with similar topographic and climatic characteristics. Furthermore, the results agree with similar studies undertaken in the Nile Basin (Gebrechorkos et al. 2023; Tusingwiire et al. 2023; Sempewo et al. 2024).

This study, therefore, suggests that climate change will lead to an increase in available water resources for water supply, irrigation, hydropower production, and other water uses in the catchment. However, the results also confirmed a future increase in the variability of water resources at monthly and seasonal time scales (relatively low increase in low flows of 1.4–2.7% visa vice the significant increase in high flows of 41.4–45.2% for both SSP2-4.5 and SSP5-8.5 future climate change scenarios). These findings therefore point to the need for prioritisation of investments in water storage infrastructure such as dams and reservoirs so as to meet the projected water demands all year round within the case study catchment.

In this study, a hybrid ANN model that combines HEC-HMS and the back propagation multi-layer perceptron FFNN was developed and applied to assess the impact of climate change on future stream flow and water resources availability in a data-scarce River Mayanja catchment in Central Uganda. The performance of the developed hybrid HEC-HMS-ANN model was validated through comparison with individual physically based semi-distributed (HEC-HMS), conceptual and lumped (MIKE HYDRO), and ANN models considering both current and future climate change conditions. Future stream flows were simulated for the near future period using all the models and considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios. Anomalies for future climate change were obtained from the CIMP6 ensemble of global climate models compiled by the World Bank Climate Research Programme and used to project future daily climate conditions.

The study results suggest that the performance of the validated hybrid HEC-HMS-ANN model during calibration and validation (NSE and R2 > 0.99) was more superior in predicting stream flows of the River Mayanja catchment when compared with the corresponding performance obtained using individual HEC-HMS (NSE and R2 > 0.50), MIKE HYDRO (NSE and R2 > 0.42), and ANN models (NSE and R2 > 0.56). Using the developed hybrid ANN model, future average daily stream flows are projected to increase by up to 17.3% [2.2–39.5%] and 18.5% [0.8–42.7%] considering the SSP2-4.5 and SSP5-8.5 future climate change scenarios.

In addition, the study has demonstrated that the performance of the hybrid HEC-HMS-ANN model can be improved through the use of multiple and relatively long input datasets so as to establish strong ANN model output patterns that can sufficiently represent both low and peak flows. The study results also suggest that the developed hybrid model combined the strengths of the physically based HEC-HMS model (detailed mathematical description of physical processes involved in movement and storage of water in a catchment) and the ANN model (ability to accurately simulate complex non-linear processes without the need for detailed knowledge on underlying hydrological processes) and the resulting into more superior performance.

The findings of this study demonstrate that hybrid ANN models are suitable for modelling stream flow in catchments with limited observed hydrometeorological datasets. From a modelling perspective, it is also worth noting that MIKE HYDRO is a lumped model with a straightforward structure that is easier to use while HEC-HMS requires a lot more datasets and computations. On the other hand, whereas the developed hybrid HEC-HMS-ANN model is more straightforward, it requires iterative training, testing, and validation. More so, the hybrid ANN model requires both the set-up of the HEC-HMS model and in addition ANN model that uses stream flow output from the HEC-HMS model as one of two inputs. This, therefore, means that ease of model use, modeller experience and set-up should not be used as the main criteria in the selection of a model for a catchment. The selection of a model should also consider other factors such as data availability, length, and spatial distribution of hydrometeorological data. Nonetheless, the calibrated and validated hybrid HEC-HMS-ANN model can be used to more accurately simulate stream flows in adjacent ungauged catchments located in the same climatic zone as the River Mayanja catchment, given access to open-source reanalysis climate datasets (Nakkazi et al. 2022), open-source digital elevation models, soil, and land use and land cover maps.

In addition, this research has demonstrated that ANN models can be developed and applied to support water balance and allocation modelling considering both the current and projected impacts of future climate change on stream flow. The study findings can potentially support the development of water resource modelling guidelines in data-scarce regions that are faced with the challenges of choosing suitable models for different terrains and varying lengths of in situ observed data. Furthermore, this research could inform the identification and prioritisation of potential mitigation and adaptation strategies that are necessary for ensuring water security in view of emerging climate change and urbanisation threats.

However, the performance of the developed HEC-HMS-ANN model was constrained by a limited number of input datasets and an insufficient number of peaks in the calibration dataset. Therefore, future research should focus on enhancing the performance of hybrid models using varying model structures and relatively long calibration and validation datasets that include an adequate number of peaks. Furthermore, future research should investigate the effectiveness of non-linear ensemble models (Gelete et al. 2023) created by training the hybrid ANN using the stream flow from obtained from physically based models in enhancing the performance of hybrid ANN models. Furthermore, future research should focus on the investigation of the effectiveness of hyperparameter tuning methods (Ghimire et al. 2021) for determining the most optimal ANN model parameters during ANN model calibration and validation.

Acknowledgement is given to the Directorate of Water Resources Management (DWRM), Ministry of Water and Environment, Uganda for the provision of the datasets used in the study. The insights of four anonymous reviewers are also gratefully acknowledged.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Akram
F.
,
Rasul
M. G.
,
Khan
M. M. K.
&
Amir
M. S. I. I.
2012
Automatic delineation of drainage networks and catchments using DEM data and GIS capabilities: A case study
. In:
Proc. 18th Australas. Fluid Mech. Conf. AFMC
,
2012 1
, pp.
3
6
.
Ali
S.
&
Shahbaz
M.
2020
Streamflow forecasting by modeling the rainfall–streamflow relationship using artificial neural networks
.
Model. Earth Syst. Environ.
6
.
https://doi.org/10.1007/s40808-020-00780-3
.
Aredo
M. R.
,
Hatiye
S. D.
&
Pingale
S. M.
2021
Modeling the rainfall-runoff using MIKE 11 NAM model in Shaya catchment, Ethiopia
.
Model. Earth Syst. Environ.
7
,
2545
2551
.
https://doi.org/10.1007/s40808-020-01054-8
.
Aryal
A.
,
Shrestha
S.
&
Babel
M. S.
2019
Quantifying the sources of uncertainty in an ensemble of hydrological climate-impact projections
.
Theor. Appl. Climatol.
135
,
193
209
.
https://doi.org/10.1007/s00704-017-2359-3
.
Awchi
T. A.
2014
River discharges forecasting in northern Iraq using different ANN techniques
.
Water Resour. Manag.
28
,
801
814
.
https://doi.org/10.1007/s11269-014-0516-3
.
Bartles
M.
,
Brauer
T.
,
Ho
D.
,
Fleming
M.
,
Karlovits
G.
,
Pak
J.
,
Van
N.
&
Willis
J.
2022
HEC-HMS User's Manual
.
Birkhead
A. L.
&
James
C. S.
2002
Muskingum river routing with dynamic bank storage
.
J. Hydrol.
264
,
113
132
.
Chathuranika
I. M.
,
Gunathilake
M. B.
,
Baddewela
P. K.
,
Sachinthanie
E.
,
Babel
M. S.
,
Shrestha
S.
,
Jha
M. K.
&
Rathnayake
U. S.
2022
Comparison of two hydrological models, HEC-HMS and SWAT in runoff estimation: Application to Huai Bang Sai Tropical Watershed, Thailand
.
Fluids
7
,
1
14
.
Chebii
S. J.
,
Mukolwe
M. M.
&
Ong'or
B. I.
2022
River flow modelling for flood prediction using artificial neural network in ungauged Perkerra catchment, Baringo County, Kenya
.
Water Pract. Technol.
17
,
914
929
.
https://doi.org/10.2166/wpt.2022.034
.
Chen
S. M.
,
Wang
Y. M.
&
Tsou
I.
2013
Using artificial neural network approach for modelling rainfall-runoff due to typhoon
.
J. Earth Syst. Sci.
122
.
https://doi.org/10.1007/s12040-013-0289-8
.
Dariane
A. B.
,
Javadianzadeh
M. M.
&
James
L. D.
2016
Developing an efficient auto-calibration algorithm for HEC-HMS program
.
Water Resour. Manag.
30
,
1923
1937
.
https://doi.org/10.1007/s11269-016-1260-7
.
Demirel
M. C.
,
Venancio
A.
&
Kahya
E.
2009
Flow forecast by SWAT model and ANN in Pracana basin, Portugal
.
Adv. Eng. Softw.
40
,
467
473
.
https://doi.org/10.1016/j.advengsoft.2008.08.002
.
DHI
2017
MIKEHYDRO River User Guide
.
Dile
Y. T.
,
Ayana
E. K.
,
Worqlul
A. W.
,
Xie
H.
,
Srinivasan
R.
,
Lefore
N.
,
You
L.
&
Clarke
N.
2020
Evaluating satellite-based evapotranspiration estimates for hydrological applications in data-scarce regions: A case in Ethiopia
.
Sci. Total Environ.
743
,
140702
.
https://doi.org/10.1016/j.scitotenv.2020.140702
.
Eccles
R.
,
Zhang
H.
&
Hamilton
D.
2019
A review of the effects of climate change on riverine flooding in subtropical and tropical regions
.
J. Water Clim. Chang.
10
,
687
707
.
https://doi.org/10.2166/wcc.2019.175
.
Elsafi
S. H.
2014
Artificial neural networks (ANNs) for flood forecasting at Dongola Station in the River Nile, Sudan
.
Alex. Eng. J.
53
,
655
662
.
https://doi.org/10.1016/j.aej.2014.06.010
.
Gebrechorkos
S. H.
,
Taye
M. T.
,
Birhanu
B.
,
Solomon
D.
&
Demissie
T.
2023
Future changes in climate and hydroclimate extremes in East Africa
.
Earth's Futur.
11
,
1
21
.
https://doi.org/10.1029/2022EF003011
.
Gelete
G.
,
Nourania
V.
,
Gokcekus
H.
&
Gichamo
T.
2023
Ensemble physically based semi-distributed models for the rainfall-runoff process modeling in the data-scarce Katar catchment, Ethiopia
.
J. Hydroinf.
25
,
567
592
.
https://doi.org/10.2166/hydro.2023.197
.
Ghimire
S.
,
Yaseen
Z. M.
,
Farooque
A. A.
&
Deo
R. C.
2021
Streamflow prediction using an integrated methodology based on convolutional neural network and long short-term memory networks
.
Sci. Rep.
11
,
1
26
.
https://doi.org/10.1038/s41598-021-96751-4
.
Ghorbani
M. A.
,
Zadeh
H. A.
,
Isazadeh
M.
&
Terzi
O.
2016
A comparative study of artificial neural network (MLP, RBF) and support vector machine models for river flow prediction
.
Environ. Earth Sci.
75
,
1
4
.
https://doi.org/10.1007/s12665-015-5096-x
.
Ghosh
A.
,
Roy
M. B.
&
Roy
P. K.
2022
Evaluating the performance of MIKE NAM model on rainfall–runoff in lower Gangetic floodplain, West Bengal, India
.
Model. Earth Syst. Environ.
8
,
4001
4017
.
https://doi.org/10.1007/s40808-021-01347-6
.
Gumindoga
W.
,
Rwasoka
D. T.
,
Nhapi
I.
&
Dube
T.
2017
Ungauged runoff simulation in Upper Manyame Catchment, Zimbabwe: Application of the HEC-HMS model
.
Phys. Chem. Earth, Parts A/B/C
100
,
371
382
.
Gunathilake
M. B.
,
Karunanayake
C.
,
Gunathilake
A. S.
,
Marasingha
N.
,
Samarasinghe
J. T.
,
Bandara
I. M.
&
Rathnayake
U.
2021
Hydrological models and artificial neural networks (ANNs) to simulate streamflow in a tropical catchment of Sri Lanka
.
Appl. Comput. Intell. Soft Comput.
2021
,
1
9
.
https://doi.org/10.1155/2021/6683389
.
Gupta
H. V.
&
Kling
H.
2011
On typical range, sensitivity, and normalization of mean squared error and Nash-Sutcliffe efficiency type metrics
.
Water Resour. Res.
47
,
2
4
.
https://doi.org/10.1029/2011WR010962
.
Hanington
P.
,
To
Q. T.
,
Van
P. D. T.
,
Doan
N. A. V.
&
Kiem
A. S.
2017
A hydrological model for interprovincial water resource planning and management: A case study in the Long Xuyen Quadrangle, Mekong Delta, Vietnam
.
J. Hydrol.
547
.
https://doi.org/10.1016/j.jhydrol.2017.01.030
.
Hu
C.
,
Wu
Q.
,
Li
H.
,
Jian
S.
,
Li
N.
&
Lou
Z.
2018
Deep learning with a long short-term memory networks approach for rainfall-runoff simulation
.
Water (Switzerland)
10
,
1
16
.
https://doi.org/10.3390/w10111543
.
Huang
S.
,
Chang
J.
,
Huang
Q.
&
Chen
Y.
2014
Monthly streamflow prediction using modified EMD-based support vector machine
.
J. Hydrol.
511
,
764
775
.
https://doi.org/10.1016/j.jhydrol.2014.01.062
.
Humphrey
G. B.
,
Gibbs
M. S.
,
Dandy
G. C.
&
Maier
H. R.
2016
A hybrid approach to monthly streamflow forecasting: Integrating hydrological model outputs into a Bayesian artificial neural network
.
J. Hydrol.
540
,
623
640
.
https://doi.org/10.1016/j.jhydrol.2016.06.026
.
Javan
K.
,
Lialestani
M. R. F. H.
&
Nejadhossein
M.
2015
A comparison of ANN and HSPF models for runoff simulation in Gharehsoo River watershed, Iran
.
Model. Earth Syst. Environ.
1
,
1
13
.
https://doi.org/10.1007/s40808-015-0042-1
.
Jimeno-Sáez
P.
,
Senent-Aparicio
J.
,
Pérez-Sánchez
J.
&
Pulido-Velazquez
D.
2018
A comparison of SWAT and ANN models for daily runoff simulation in different climatic zones of Peninsular Spain
.
Water
10
,
1
19
.
https://doi.org/10.3390/w10020192
.
Karlsson
I. B.
,
Sonnenborg
T. O.
,
Refsgaard
J. C.
,
Trolle
D.
,
Børgesen
C. D.
,
Olesen
J. E.
,
Jeppesen
E.
&
Jensen
K. H.
2016
Combined effects of climate models, hydrological model structures and land use scenarios on hydrological impacts of climate change
.
J. Hydrol.
535
,
301
317
.
https://doi.org/10.1016/j.jhydrol.2016.01.069
.
Kim
T.
,
Yang
T.
,
Gao
S.
,
Zhang
L.
,
Ding
Z.
,
Wen
X.
,
Gourley
J. J.
&
Hong
Y.
2021
Can artificial intelligence and data-driven machine learning models match or even replace process-driven hydrologic models for streamflow simulation?: A case study of four watersheds with different hydro-climatic regions across the CONUS
.
J. Hydrol.
598
,
1
20
.
Kovačević
M.
,
Ivanišević
N.
,
Dašić
T.
&
Marković
L.
2018
Primjena neuronskih mreža za hidrološko modeliranje u krškom području
.
Gradjevinar
70
,
1
10
.
https://doi.org/10.14256/JCE.1594.2016
.
Koycegiz
C.
&
Buyukyildiz
M.
2022
Estimation of streamflow using different artificial neural network models
.
OKU J. Inst. Sci. Technol.
5
,
1141
1154
.
Kurian
C.
,
Sudheer
K. P.
,
Vema
V. K.
,
Sahoo
D.
&
Vema
K.
2020
Effective flood forecasting at higher lead times through hybrid modelling framework
.
J. Hydrol.
587
,
1
14
.
Lidberg
W.
,
Nilsson
M.
,
Lundmark
T.
&
Ågren
A. M.
2017
Evaluating preprocessing methods of digital elevation models for hydrological modelling
.
Hydrol. Process.
31
,
4660
4668
.
https://doi.org/10.1002/hyp.11385
.
Lundberg
S. M.
&
Lee
S. I.
2017
A unified approach to interpreting model predictions
. In:
31st Conference on Neural Information Processing Systems (NIPS 2017)
,
Long Beach, CA, USA
, pp.
4766
4775
.
https://doi.org/10.48550/arXiv.1705.07874.
Moriasi
D. N.
,
Arnold
J. G.
,
Van Liew
M. W.
,
Bingner
R. L.
,
Harmel
R. D.
&
Veith
T. L.
2007
Model evaluation guidelines for systematic quantification of accuracy in watershed simulations
.
Trans. ASABE
50
,
885
900
.
https://doi.org/10.13031/2013.23153
.
Mount
N.
,
Maier
H.
,
Toth
E.
,
Elshorbagy
A.
,
Solomatine
D.
,
Chang
F.-J.
&
Brahart
R.
2016
Data-driven modelling approaches for socio-hydrology: Opportunities and challenges within the Panta Rhei Science Plan
.
Hydrol. Sci. J.
61
,
1192
1208
.
https://doi.org/10.1080/02626667.2016.1159683
.
Molnar
C.
2022
Interpretable Machine Learning - A guide for making black box models explainable, 2nd Editio. ed. Leanpub
.
Mugume
S. N.
,
Abasabyoona
G.
,
Engwau
J.
,
Sempewo
J. I.
,
Van de Sande
B.
&
Butler
D.
2024
Assessing the impact of the rise in Lake Victoria water levels on urban flooding using a GIS-based spatial flood modelling approach
.
Urban Water J.
21
,
219
233
.
https://doi.org/10.1080/1573062X.2023.2284960
.
Mushtaq
H.
,
Akhtar
T.
,
Hashmi
M. Z. U. R.
,
Masood
A.
&
Saeed
F.
2024
Hydrologic interpretation of machine learning models for 10-daily streamflow simulation in climate sensitive upper Indus catchments
.
Theor. Appl. Climatol.
20
.
https://doi.org/10.1007/s00704-024-04932-8
.
MWE
2024
Comprehensive Situation Assessment and Preparation of Albert Water Management Zone Strategy and Action Plan – Final Water Resources Assessment Report.
Kampala, Uganda
.
Nakkazi
M. T.
,
Sempewo
J. I.
,
Tumutungire
M. D.
&
Byakatonda
J.
2022
Performance evaluation of CFSR, MERRA-2 and TRMM3B42 data sets in simulating river discharge of data-scarce tropical catchments: A case study of Manafwa, Uganda
.
J. Water Clim. Chang.
13
,
522
541
.
https://doi.org/10.2166/wcc.2021.174
.
Nash
J. E.
&
Sutcliffe
J. V.
1970
River flow forecasting through conceptual models part I – A discussion of principles
.
J. Hydrol.
10
.
https://doi.org/10.1016/0022-1694(70)90255-6
.
Ni
L.
,
Wang
D.
,
Singh
V. P.
,
Wu
J.
,
Wang
Y.
,
Tao
Y.
&
Zhang
J.
2020
Streamflow and rainfall forecasting by two long short-term memory models
.
J. Hydrol.
583
,
1
10
.
Nourani
V.
,
Jabbarian Paknezhad
N.
,
Sharghi
E.
&
Khosravi
A.
2019
Estimation of prediction interval in ANN-based multi-GCMs downscaling of hydro-climatologic parameters
.
J. Hydrol.
579
.
https://doi.org/10.1016/j.jhydrol.2019.124226
.
Onyutha
C.
,
Asiimwe
A.
,
Ayugi
B.
,
Ngoma
H.
,
Ongoma
V.
&
Tabari
H.
2021
Observed and future precipitation and evapotranspiration in water management zones of Uganda: CMIP6 projections
.
Atmosphere (Basel)
12
.
https://doi.org/10.3390/atmos12070887
.
Papa
F.
,
Crétaux
J. F.
,
Grippa
M.
,
Robert
E.
,
Trigg
M.
,
Tshimanga
R. M.
,
Kitambo
B.
,
Paris
A.
,
Carr
A.
,
Fleischmann
A. S.
,
de Fleury
M.
,
Gbetkom
P. G.
,
Calmettes
B.
&
Calmant
S.
2023
Water resources in Africa under global change: Monitoring surface waters from space
.
Surv. Geophys.
https://doi.org/10.1007/s10712-022-09700-9
.
Patterson
N. K.
,
Sandoval-solis
S.
&
Persad
G. G.
2022
Projected effects of temperature and precipitation variability change on streamflow patterns using a functional flows approach earth's future
.
Earth's Futur.
1
19
.
https://doi.org/10.1029/2021EF002631
.
Pramanik
N.
,
Panda
R. K.
&
Singh
A.
2011
Daily river flow forecasting using wavelet ANN hybrid models
.
J. Hydroinf.
13
,
49
63
.
https://doi.org/10.2166/hydro.2010.040
.
Priddy
L. K.
&
Keller
E. P.
2005
Artificial Neural Networks, an Introduction
.
SPIE Press
,
Bellingham, Washington
.
Rahimzad
M.
,
Nia
A. M.
,
Zolfonoon
H. K.
,
Soltani
J.
,
Mehr
A. D.
&
Hyun-Han
.
2021
Performance comparison of an LSTM-based deep learning model versus conventional machine learning algorithms for streamflow forecasting
.
Water Resour. Manag.
35
,
4167
4187
.
https://doi.org/10.1007/s11269-021-02937-w
.
Rahman
K. U.
,
Pham
Q. B.
,
Jadoon
K. Z.
,
Shahid
M.
,
Kushwaha
D. P.
,
Khedher
Z. D.
,
Mohammadi
B.
,
Mohamed
K.
&
Anh
D. T.
2022
Comparison of machine learning and process-based SWAT model in simulating streamflow in the Upper Indus Basin
.
Appl. Water Sci.
12
.
https://doi.org/10.1007/s13201-022-01692-6
.
Reddy
B. S. N.
&
Pramada
S. K.
2022
A hybrid artificial intelligence and semi-distributed model for runoff prediction
.
Water Supply
22
,
6181
6194
.
https://doi.org/10.2166/ws.2022.239
.
Sabol
G. V.
1988
Clark unit hydrograph and R-parameter estimation
.
J. Hydraul. Eng.
114
,
103
111
.
Sahoo
B. B.
,
Jha
R.
,
Singh
A.
&
Kumar
D.
2018
Application of support vector regression for modeling low flow time series
.
KSCE J. Civ. Eng.
923
934
.
https://doi.org/10.1007/s12205-018-0128-1
.
Sahoo
B. B.
,
Sankalp
S.
&
Kisi
O.
2023
A novel smoothing-based deep learning time-series approach for daily suspended sediment load prediction
.
Water Resour. Manag.
37
,
4271
4292
.
https://doi.org/10.1007/s11269-023-03552-7
.
Sanjay Shekar
N. C.
&
Vinay
D. C.
2021
Performance of HEC-HMS and SWAT to simulate streamflow in the sub-humid tropical Hemavathi catchment
.
J. Water Clim. Chang.
12
,
3005
3017
.
https://doi.org/10.2166/wcc.2021.072
.
Seenath
A.
,
Wilson
M.
&
Miller
K.
2016
Hydrodynamic versus GIS modelling for coastal flood vulnerability assessment: Which is better for guiding coastal management?
Ocean Coast. Manag.
120
,
99
109
.
https://doi.org/10.1016/j.ocecoaman.2015.11.019
.
Sempewo
J. I.
,
Twite
D.
,
Nyenje
P.
&
Mugume
S. N.
2022
Comparison of SWAT and HEC-HMS model performance in simulating catchment runoff
.
J. Appl. Water Eng. Res.
11
.
https://doi.org/10.1080/23249676.2022.2156401
.
Sempewo
J. I.
,
Kyeyune
J.
,
Nyenje
P. M.
,
Nkwasa
A.
,
Mugume
S. N.
,
Tsegaye
S.
&
Eckart
J.
2024
Distinct and combined impacts of future climate and land use change on the flow of River Rwizi in Uganda, East Africa
.
J. Water Clim. Chang.
15
,
1667
1692
.
https://doi.org/10.2166/wcc.2024.542
.
Shiau
J. T.
&
Hsu
H. T.
2016
Suitability of ANN-based daily streamflow extension models: A case study of Gaoping River Basin, Taiwan
.
Water Resour. Manag.
30
,
1499
1513
.
https://doi.org/10.1007/s11269-016-1235-8
.
Slieman
A.
&
Kozlov
D.
2023
Evaluating performance of MIKE 11 NAM model for runoff modeling on upper basin of Orontes River in Syria
. In
E3S Web of Conferences
.
EDP Sciences
.
https://doi.org/10.1051/e3sconf/202336503004.
Sweetapple
C.
,
Fu
G.
&
Butler
D.
2014
Identifying sensitive sources and key control handles for the reduction of greenhouse gas emissions from wastewater treatment
.
Water Res.
62
,
249
259
.
https://doi.org/10.1016/j.watres.2014.06.002
.
Tan
M. L.
,
Armanuos
A. M.
,
Ahmadianfar
I.
,
Demir
V.
,
Heddam
S.
,
Al-Areeq
A. M.
,
Abba
S. I.
,
Halder
B.
,
Cagan Kilinc
H.
&
Yaseen
Z. M.
2023
Evaluation of NASA POWER and ERA5-Land for estimating tropical precipitation and temperature extremes
.
J. Hydrol.
624
.
https://doi.org/10.1016/j.jhydrol.2023.129940
.
Tang
Y.
,
Reed
P.
,
Wagener
T.
&
van Werkhoven
K.
2007
Comparing sensitivity analysis methods to advance lumped watershed model identification and evaluation
.
Hydrol. Earth Syst. Sci.
11
,
793
817
.
https://doi.org/10.5194/hess-11-793-2007
.
Tanteliniaina
M. F. R.
,
Rahaman
M. H.
&
Zhai
J.
2021
Assessment of the future impact of climate change on the hydrology of the Mangoky river, Madagascar using ANN and SWAT
.
Water (Switzerland)
13
.
https://doi.org/10.3390/w13091239
.
Tibangayuka
N.
,
Mulungu
D. M. M.
&
Izdori
F.
2022
Evaluating the performance of HBV, HEC-HMS and ANN models in simulating streamflow for a data scarce high-humid tropical catchment in Tanzania
.
Hydrol. Sci. J.
67
,
2191
2204
.
https://doi.org/10.1080/02626667.2022.2137417
.
Tiwari
M. K.
&
Chatterjee
C.
2011
A new wavelet-bootstrap-ANN hybrid model for daily discharge forecasting
.
J. Hydroinf.
13
,
500
519
.
https://doi.org/10.2166/hydro.2010.142
.
Trisos
C. H.
,
Adelekan
I. O.
,
Totin
E.
,
Ayanlade
A.
,
Efitre
J.
,
Gemeda
A.
,
Kalaba
K.
,
Lennard
C.
,
Masao
C.
,
Mgaya
Y.
,
Ngaruiya
G.
,
Olago
D.
,
Simpson
N. P.
,
Zakieldeen
S.
,
2022
Climate change 2022: Impacts, adaptation and vulnerability
. In:
Climate Change 2022: Impacts, Adaptation and Vulnerability
(
Pörtner
H.-O.
,
Roberts
D. C.
,
Tignor
M.
,
Poloczanska
E. S.
&
Mintenbeck
K.
, eds).
IPCC
, pp.
1285
1455
.
https://doi.org/10.1017/9781009325844.011.1286
.
Troolin
W. D.
&
Clancy
K.
2016
Comparison of three delineation methods using the curve number method to model runoff
.
J. Water Resour. Prot.
08
,
945
964
.
https://doi.org/10.4236/jwarp.2016.811077
.
Tusingwiire
M. A.
,
Tumutungire
M. D.
,
Sempewo
J. I.
&
Semiyaga
S.
2023
Impacts of climate and land use/cover change on mini-hydropower generation in River Kyambura watershed in South Western part of Uganda
.
Water Pract. Technol.
18
,
1576
1597
.
https://doi.org/10.2166/wpt.2023.079
.
UNDRR
2020
Human Cost of Disasters, Human Cost of Disasters. An Overview of the Last 20 Years 2000–2019
.
https://doi.org/10.18356/79b92774-en
.
Venkatesh
M.
2012
Watershed and Stream Network Delineation Using ArcHydro Tools
.
Purdue University, West Lafayette, Indianapolis, USA
, pp.
1
22
.
Vilanova
R. S.
,
Zanetti
S. S.
&
Cecílio
R. A.
2019
Assessing combinations of artificial neural networks input/output parameters to better simulate daily stream flow: Case of Brazilian Atlantic Rainforest watersheds
.
Comput. Electron. Agric.
167
,
1
12
.
https://doi.org/10.1016/j.compag.2019.105080
.
Vo
N. D.
,
Nguyen
Q. B.
,
Le
C. H.
,
Doan
T. D.
,
Le
V. H.
&
Gourbesville
P.
,
2018
Comparing model effectiveness on simulating catchment hydrological regime
. In:
Advances in Hydroinformatics
In: (
Gourbesville
P.
, ed.).
Springer Nature Singapore Pte Ltd
,
Singapore
, pp.
401
418
.
https://doi.org/10.1007/978-981-10-7218-5
.
Wang
S.
,
Peng
H.
,
Hu
Q.
&
Jiang
M.
2022
Analysis of runoff generation driving factors based on hydrological model and interpretable machine learning method
.
J. Hydrol. Reg. Stud.
42
,
101139
.
https://doi.org/10.1016/j.ejrh.2022.101139
.
World Bank
2021
User Manual Climate Change Knowledge Portal (CCKP)
.
Yaseen
Z. M.
,
El-shafie
A.
,
Jaafar
O.
,
Afan
H. A.
&
Sayl
K. N.
2015
Artificial intelligence based models for stream-flow forecasting: 2000–2015
.
J. Hydrol.
530
,
829
844
.
https://doi.org/10.1016/j.jhydrol.2015.10.038
.
Young
C. C.
&
Liu
W. C.
2015
Prediction and modelling of rainfall-runoff during typhoon events using a physically-based and artificial neural network hybrid model
.
Hydrol. Sci. J.
60
,
2102
2116
.
https://doi.org/10.1080/02626667.2014.959446
.
Young
C.-C.
,
Liu
W.-C.
&
Wu
M.-C.
2017
A physically based and machine learning hybrid approach for accurate rainfall-runoff modeling during extreme typhoon events
.
Appl. Soft Comput.
53
,
205
216
.
https://doi.org/10.1016/j.asoc.2016.12.052
.
Zhang
Y. Q.
,
Chiew
F. H. S.
,
Zhang
L.
,
Leuning
R.
&
Cleugh
H. A.
2008
Estimating catchment evaporation and runoff using MODIS leaf area index and the Penman-Monteith equation
.
Water Resour. Res.
44
,
1
15
.
https://doi.org/10.1029/2007WR006563
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).

Supplementary data