Effective prediction of runoff is a substantial feature for the successful management of hydrological phenomena in arid regions. The present research findings reveal that a rainfall simulator (RS) can be a valuable instrument to estimate runoff as the intensity of rainfall is modifiable in the course of an experimental process, which turns out to be of great advantage. The rainfall-runoff process is a complex physical phenomenon caused by the effect of various parameters. In this research, a new hybrid technique integrating PSR (phase space reconstruction) with FFA (firefly algorithm) and SVM (support vector machine) has gained recognition in various modelling investigations in contrast to the principle of empirical risk minimization through ANN practices. Outcomes of SVM are contrasted against SVM-FFA and PSR-SVM-FFA models. The improvements in NSE (Nash–Sutcliffe efficiency), RMSE (Root Mean Square Error), and WI (Willmott's Index) by PSR-SVM-FFA over SVM models specify that the prediction accuracy of the hybrid model is better. The established PSR-SVM-FFA model generates preeminent WI values that range from 0.97 to 0.98, while the SVM and SVM-FFA models encompass 0.93–0.95 and 0.96–0.97, respectively. The proposed PSR-SVM-FFA model gives more accurate results and error limiting up to 2–3%.

  • The models developed in this study using three algorithms are SVM, SVM-FFA, and hybrid PSR-SVM-FFA models.

  • SVM, SVM-FFA, and PSR-SVM-FFA were used, while the earlier studies used simple machine learning approaches for predicting runoff.

  • Interactions of all techniques along with five different scenarios are presented.

  • Watershed has different slopes; hence, six different types of slope conditions have been considered.

The principal purpose of a rainfall simulator (RS) is to simulate natural precipitation precisely and specifically. Simultaneously, RSs regulate the intensity and duration of rainfall. They are beneficial as rainfall can be created swiftly on demand wherever essential, without waiting for natural rainfall at a desired intensity and interval, thus eradicating the unpredictable and erratic unevenness of natural rain (Wildhaber et al. 2012; Iserloh et al. 2013). The necessity for short- and long-term runoff simulation is significant for managing watersheds which comprises surplus runoff control, managing and using runoff for detailed tenacities. The complexity of the rainfall–runoff process and its unpredictability depending on watershed features and rainfall patterns makes it difficult to predict and estimate with desired accuracy (Abudi et al. 2012; Samantaray et al. 2019, 2021). Yet, hydrologists have recently developed several approaches and models varying from empirical- to physical-based relationships. Physical models have been ascertained to be improved for simulating runoff; however, they require a huge quantity of data. Hence, the necessity to develop alternate models for runoff and sediment yield simulation utilizing accessible data has taken primacy (Samantaray & Sahoo 2020a; Xu et al. 2020; Jimmy et al. 2021; Mohanta et al. 2021). The runoff modelling process is a key constituent in climatological engineering. Therefore, an improved and consistent smart model has to be explored for runoff forecasting that is vital for water resources engineering.

Pérez-Latorre et al. (2010) developed two diverse RS designs for obtaining rainfall intensities having drop size and energy like that of natural rainfall. Comparing the efficiency of two single simulators, RS-2 gave a more accurate drop-size distribution in the middle part and also a lesser budget than that of RS1. Akbarzadeh et al. (2009) investigated the decline of overland flow and splash detachment of clayey soil after casing it with various synthetic geotextiles utilizing RS as well as neuro-fuzzy and ANN. It was observed that the neuro-fuzzy model gives better estimates followed by ANN and multivariate regression for runoff and splash prediction, respectively. Elhakeem & Papanicolaou (2009) provided comprehensive operational phases for estimating in situ runoff curve number (CN) for certain agronomic fields in Iowa State using RS. Observed results revealed that RSs are valuable devices to estimate in situ runoff CN because the intensity of rainfall was modifiable at the time of an experimental run. Wildhaber et al. (2012) assessed the appropriateness of a new field hybrid RS which combined spray nozzle and dropped former features of RS for soil erosion study and for quantifying the effect of vegetation and soil arrangement permanency on runoff and erosion in sub-alpine parkland region. Outcomes demonstrated that vegetation cover plays a vital part in the soil structure stability of alpine soils. Abudi et al. (2012) designed and tested a movable RS to simulate precipitations that persuade runoff and soil erosion. They observed that the field RS acts as a potent device for field penetration, crusting, and erosion of soil. Iserloh et al. (2013) characterized non-natural rain produced by 13 RSs, constructed in different European research organizations, utilizing Laser Precipitation Monitor and rainfall accumulators in every simulation for ensuring comparability of outcomes. Aksoy et al. (2012) developed a laboratory-gauge non-movable RS to spray rain on a flume which can be provided with both lateral and longitudinal slopes. Performance of RS is assessed based on the intensity of rainfall and its consistency, raindrop size dissemination, and kinetic energy of produced rainfall. Cea et al. (2010) analysed the accuracy and efficiency of 2D vibrant wave models for computing rainfall–runoff directly from rainfall data sets within distinct urban geometries. They also determined different model constraints that are appropriate while using an experimental set-up in a laboratory RS. Fister et al. (2012) evaluated wind and precipitation features of Movable Wind and RS. The adequacy of simulation eminence and condition of suitability for relative measurement of soil erosion in field was assessed. Jin et al. (2009) assessed the impact of residue cover and intensity of rainfall on the runoff loss of soil organic matter directly, by quantifying sediment loss over time and variations in sediments' particle size distribution, using a field RS in the laboratory.

Sivapragasam et al. (2001) suggested a simple and effective technique based on Singular Spectrum Analysis integrated with SVM for predicting runoff data of the Tryggevaelde catchment, Denmark. Results obtained were contrasted with that of the non-linear prediction (NLP) model and observed that the projected method gives considerably greater accuracy in predicting runoff compared with NLP. Bray & Han (2004) utilized the SVM model, for forecasting flood events, focusing on identifying a suitable model arrangement and its important constraints to model rainfall–runoff process. Behzad et al. (2009) proposed a new machine learning tool called SVM, for runoff modelling of the Bakhtiyari River watershed, Iran. Outcomes of the proposed model were compared with ANN and that of ANN combined with GA and it was observed that SVM performs the best as compared to other models. Okkan & Serbes (2012) examined the applicability of least squares SVM for predicting runoff of the Gordes and Tahtali catchments, located in Turkey, and compared its outcomes with ANN and other regression models. Findings from their study showed that the hybrid model helps in making run time substantially quicker with greater accuracy in runoff prediction. Misra et al. (2009) and Sharma et al. (2015) applied SVM and ANN models for predicting both runoff and sediment concentration of different watersheds. An outcome of SVM was assessed against ANN and simple regression, and it was observed that the SVM model was more competent for predicting runoff and sediment concentration under equivalent prediction accuracy. Jajarmizadeh et al. (2015) compared outcomes of the Soil and Water Assessment Tool (SWAT) with SVM for predicting the monthly streamflow of the Roodan catchment situated in Southern Iran. The SVM model gave better values for average flow as compared with SWAT, while SWAT performed better for predicting entire runoff volume having lesser error during validation. Granata et al. (2016) comparatively studied the modelling of the rainfall–runoff process amid SVM technique and Storm Water Management Model (SWMM) at two gauging stations situated in the northern part of Italy. SVR showed abundant application prospectives for applications in the urban hydrologic field as compared to SWMM. Ouyang et al. (2016) applied SVR for forecasting monthly precipitation at a climate station in Changchun, China, and implemented the phase space reconstruction (PSR) method for designing input vectors. Outcomes indicated that the projected amalgam model is reasonable for forecasting monthly precipitation and performs better. Sedighi et al. (2016) studied the performance of ANN and SVM models for simulating the rainfall–runoff process prejudiced by the height of snow water equivalent (SWE) in Roodak catchment, Tehran region, Iran. Contemplation of SWE improves the performance and accuracy of SVM. Wang et al. (2017) projected a novel amalgam model based on variation mode decomposition, PSR, and wavelet NN augmented by genetic algorithm (GA) to forecast multistep ahead wind speed. Experimental outcomes showed that the projected model outclasses all other models used in the study and performs the best. Sun & Wang (2018) proposed a pioneering amalgam model to forecast wind speed, comprising fast collaborative empirical mode disintegration, model entropy, PSR, and back-propagation NN for enhancing the accuracy of wind speed prediction, and observed that the projected model performed better to forecast short-term wind speed. Lekscha & Donner (2018) compared time interval entrenching and differential entrenching techniques aimed at PSR by examining two fine-premeditated paleo climate sets of data from Ecuador and Mexico. Liu et al. (1998) developed a PSR map from observed streamflow indicators based on the dynamic system concept, for modelling and predicting streamflow on a daily basis. Results revealed that the suggested PSR model could be utilized for making a timid peculiarity amid a noisy and deterministic frenzied indicator. Tao et al. (2018a) projected the hybridization of the SVM model with FFA coupled with PSR for a monthly precipitation forecast in Chhattisgarh, India. The projected model significantly improved the forecasting accuracy of the model and it was found to be a very vigorous smart model, which can be used for the Indian provincial precinct. Mehr et al. (2019) developed a new approach based on the integration of SVR and FFA to forecast monthly precipitation of Tabriz and Urmia substations, Iran. SVR-FFA gives promising accuracy in forecasting precipitation in the semi-arid province as compared to other models. Soleymani et al. (2016) developed an amalgam method comprising Radial Basis Function and FFA for predicting the water level of Selangor River, Malaysia. The performance of RBF-FFA was examined using simulated and real-time water level data and it was found that RBF-FFA is a proficient technique for accurately predicting river water level. Samantaray & Ghose (2020) applied RBFN and BPNN techniques for runoff prediction at the Ong river basin, India. Xu et al. (2020) used FFA with deep learning for improving the prediction efficacy of hydrologic processes, optimizing parameters of SVR spontaneously, and establishing a prediction model for Huangfuchuan in the Fugu region of Shanxi State. Experimental outcomes revealed that the hybrid model accomplished enhanced prediction values. Bozorg-Haddad et al. (2018) implemented GA, SVM, ANN, and dynamic programming for calculating optimal release of hydropower reservoirs of Karoon dam, Iran, under forecasting and non-forecasting scenarios. The SVM model output was compared with other applied models and it was found that SVM and ANN performed best to produce optimal reservoir operation. Tao et al. ( 2018b) investigated the ability of a hybrid model combining fuzzy model with FA to predict daily reference evapotranspiration over Burkina Faso province. Outcomes showed the effect of using FFA for significantly improving the performance of the conventional ANFIS model.

The objective of this research is to develop a novel hybrid PSR-SVM-FFA model for runoff prediction considering runoff data generated through experimental set-up (RS). The prediction performance of the proposed robust model is assessed against the hybrid SVM-FFA and benchmark SVM model based on different rainfall intensities with respect to basin slope. The application of PSR-SVM-FFA for runoff prediction is a novel approach of the present study.

Support vector machine

SVM is a set of connected supervised learning techniques utilized for classifications and regressions (Vapnik 1998). SVM can be characterized in a double layer network (where given weights are non-linear in principal and linear in the succeeding layer). SVMs choose constraints for the first layer as training input vectors, as this minimizes Vapnik Chervonenkis (VC) dimension. The network structure of SVM is specified in Figure 1.

Figure 1

Architecture of SVM.

Figure 1

Architecture of SVM.

Close modal
In mathematical terms, an elementary function for statistical learning procedure is:
(1)
where output is a linear sum of M. Non-linear alteration in Equation (1) is done by . This comprises a big range of models. The decision function of SVM, which is one among the model range, is signified in Equation (2)
(2)
where K is the kernel function, and b are constraints obtained by maximizing its objective function, N is the number of training data, are vectors utilized during training, and x is the independent vector.

It utilizes training data to calibrate the model for estimating model constraints, but also retains most vital parts of the input vector in its model (Samantaray & Sahoo 2020b). This vector is known as a support vector. Distinctive structures of kernel functions utilized to transform non-linear input vectors allow SVM to be free from many training vectors, so as to obtain a very small resultant model.

Firefly algorithm

Yang (2010) introduced a firefly algorithm (FFA) motivated by the societal conduct of fireflies, based on their blinking characteristics. Three important norms in FFA architecture depend on the features of real-world fireflies irrespective of their sex; (i) they are presumed to be unisex, therefore, every firefly can attract another; (ii) luminous intensity regulates the attraction of a single firefly to another; and (iii) flashing intensity is proportionate to the magnitude of light produced by a firefly. On the basis of these norms, the FFA model's objective function is signified by brightness and light intensity produced from a firefly. Intensity and desirability are given in Equations (3) and (4), respectively. Every firefly has its own desirability β with regard to its strength and how it entices other swarm members (Yang 2010).
(3)
(4)
where and I signify desirability and light intensity at distance r from firefly, in respective order. and are desirability and light intensity at distance r = 0 from firefly and is the coefficient of light immersion. Distance amid any two fireflies i and j is articulated in Equation (5) (Yang 2010):
(5)
where and are positions of fireflies i and j. FFA is modest, supple, and adaptable, which is most effective to solve a varied array of miscellaneous real-world glitches. FFA has the capability to control its modality and control the scale constraint for adapting to a problematic background (Sahoo et al. 2021). For any meta-heuristic algorithm, a moral equilibrium amid utilization and investigation throughout pursuit processes must be sustained for achieving better outcomes. FFA acts as a global method in optimization problems, and hence is designed for exploring search space and probably to give an optimum/near-optimum result, if utilized solitarily (Fister et al. 2013). The proposed hybrid SVM-FFA model flowchart is presented in Figure 2.
Figure 2

Flowchart of the SVM-FFA model.

Figure 2

Flowchart of the SVM-FFA model.

Close modal

Phase space reconstruction

The theory of the PSR approach helps in the possible recognition of the chaotic, deterministic, and stochastic nature of dynamic classifications (Sivakumar et al. 2001). PSR is primarily utilized for reconstructing single-variable time series to avoid any univariable in time series. The system's state is signified at each point, while its evolution time is signified at each flight route, corresponding to changing preliminary constraints. Each point or group of points in a system consists of distinctive patterns to attract flight routes toward themselves. This arrangement of pattern designs is identified as attractors. For estimating the intricacy of attractors in the system, the reconstruction of the phase system is very much needed; hence, attractor behaviour could be defined and enumerated as being multifaceted or not. With assistance of the time delay entrenching concept, a PSR could be fabricated based on univariable dimension time series. PSR of any specified time series can be articulated as follows:
(6)
where t is the time delay, m represents embedding dimensions, while M signifies the number of points in PSR. PSR amid Xt and Xt−(m−1)τ, when rebuilt, can graphically depict the existence of an attractor aimed at deterministic chaos in a time series. A phase space sequence of m-dimensions can be abridged in the subsequent manner (Delafrouz et al. 2017):
(7)

Assortment of an optimum delay time t along with dimensions of embedding m is a very significant stage during the construction of a PSR. Mutual information function (MIF) is one of the common methods to determine optimal delay time in context to the scheming of an attractor. An illustration of the hybrid extravagance model is demonstrated in Figure 3.

Figure 3

Graphical representation of the hybrid PSR-SVM-FFA.

Figure 3

Graphical representation of the hybrid PSR-SVM-FFA.

Close modal

Experimental set-up (description of RS)

RS consists of a sand tank that is straddling on a sustenance frame with essential services, features, and arrangements for facilitating studies of groundwater flow, groundwater abstraction, flood hydrographs, and fluvial mechanics. The sump tank and centrifugal pump straddle a casing underneath the sand tank that delivers water for numerous experimentations. Water leaving the sand tank from various outlets proceeds to the sump tank under gravitational force for reprocessing (Pérez-Latorre et al. 2010). A centrifugal pump draws water from the sump tank via drumming at the tank bottom. A pressure regulator in every provision ensures that the flow is unaffected by changes in the other feed provided that the regulator is adjusted to suit. The outlet from the centrifugal pump incorporates a pressure relief valve to limit the system pressure to a maximum of approximately 3.0 barg and prevent the pump from overheating, if the flow from the two outlets is restricted or stopped. Water discharging from the relief valve is returned to the sump tank via a connection in the sidewalls of the tank (Elhakeem & Papanicolaou 2009). When demand from the two outlets is high, the relief valve will remain closed to maintain the pressure in the system. When demand is reduced and the system pressure rises above 3.0 barg, the valve will open to relieve the excess flow.

The shallow sand tank is contrived from stainless steel to resist corrosion and should be filled with sand or other granular material as appropriate to the studies. Collection of tapping sockets in the sand tank is linked with multiple tube manometer which permits the determination of water table surface. The level in each tube can be read by sliding the common scale along the track at the top of the manometer. Before using the manometer to measure water levels, it is important to expel air from flexible tubes connecting manometer tubes to tapping points. Each tapping in the sand tank incorporates a filter mesh to retain the sand while allowing the water to flow. Valves and pipework below the sand tank allow water draining from each well to flow back to the sump tank. Supple outlet tubes permit water to be extracted to an accumulating container with the motivation of gauging rates of volumetric flow. The two wells are purposely designed to be short in length so that they can be left in position without affecting the surface flow experiments. The plug of sand directly above each well can be removed if required for abstraction experiments, but the effect on the results will be negligible if the sand is left in place.

Water and sediment exiting the sand tank via the weir chute/diffuser is dropped to the outlet accumulating container that measures water flow and assembles any sediment eroded from the sand chamber. This tank is fabricated from clear acrylic. Rain is delivered to the watershed region by two rows of four sprig spouts overhead the chamber, straddling a sustenance structure. The height of the sprig spouts overhead the sand chamber can be speckled for optimizing experimentation by regulating sustenance structure height. Solitary persons at both ends of the equipment must grip sustenance structure during modification performance. A separating valve upstream of every nozzle permits essential design alteration. Since the flow rate through each nozzle is dependent on the pressure, if the appropriate pressure regulator is adjusted to give the required flow rate, then the flow through each nozzle will remain constant when other nozzles are turned on or off. To achieve this, the feed flow control valve should be opened fully and the pressure regulator adjusted to give the required flow through the nozzles. The flexible tube from the arrangement of spray nozzles is connected to one of the water feeds, when required, using the self-sealing quick-release connector. The height of nozzles should be adjusted at a required flow rate to give adequate coverage over the surface of sand without excessive spray over the sides of the sand tank.

River inlet tank straddling on the right side of sand tank permits water stream flowing to the surface of the sand, simulating stream from a watercourse upstream. The river inlet tank is fabricated from stainless steel and is bolted to the end wall of the sand tank adjacent to the shallow cut-out. Water comes in at the tank bottom, courses uphill through a glass marble bed for minimizing any turbulence, and then streams sideward to the sand surface through a rectangular section. An anti-erosion mat is supplied to reduce any local scour where the water enters the sand tank. This mat is buried just beneath the sand surface adjacent to the river inlet tank outlet. The flexible tube from the base of the sand tank is connected to one of the water feeds when required, using the self-sealing quick-release connector.

It has been found through the experimentation that well-graded sand in the range 16/30 mesh (1,000 to 500 μm) will allow all of the experiments described in the teaching manual to be carried out without the need to change the sand between runs. To minimize the cost of filling the tank, using a 16/30 mesh swimming pool silica filter grit is suggested. To fill the tank, 550 kg of sand will be required. Figure 4(b) shows the pictographic view of the RS during the experiment. Statistical parameters of the experimental data set (Burgan & Aksoy 2018) for both training and testing phases are given in Table 1.

Table 1

Statistical parameters of experimental data set

Statistical parametersTraining set (480)Testing set (240)Total data set (720)Training set (480)Testing set (240)Total data set (720)
Slope 00 Slope 10 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.79 0.81 0.81 0.94 0.92 0.94 
Mean 0.41325 0.432199 0.419361 0.420958 0.396792 0.412903 
Kurt −0.96964 −1.04858 −1.01077 −0.98548 −0.967 −0.9918 
Skew −0.02542 −0.21722 −0.08664 −0.01955 0.150961 0.037441 
SD 0.211402 0.215723 0.212565 0.217727 0.213765 0.216546 
Slope 20 Slope 30 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.82 0.79 0.82 0.79 0.79 0.79 
Mean 0.416792 0.4445 0.426028 0.412417 0.438452 0.420639 
Kurt −1.11488 −1.1812 −1.16005 −1.14219 −1.30635 −1.19653 
Skew 0.024677 −0.23992 −0.06299 −6.4E-05 −0.04127 −0.00169 
SD 0.219708 0.225516 0.221911 0.217343 0.235875 0.223684 
Slope 40 Slope 50 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.88 0.79 0.88 0.79 0.79 0.79 
Mean 0.400417 0.454268 0.417889 0.400333 0.462218 0.420472 
Kurt −0.97373 −1.26826 −1.13257 −1.00105 −1.32073 −1.16801 
Skew 0.146643 −0.264 0.014725 0.110759 −0.25404 0.001526 
SD 0.213393 0.233795 0.220812 0.211888 0.240142 0.222677 
Statistical parametersTraining set (480)Testing set (240)Total data set (720)Training set (480)Testing set (240)Total data set (720)
Slope 00 Slope 10 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.79 0.81 0.81 0.94 0.92 0.94 
Mean 0.41325 0.432199 0.419361 0.420958 0.396792 0.412903 
Kurt −0.96964 −1.04858 −1.01077 −0.98548 −0.967 −0.9918 
Skew −0.02542 −0.21722 −0.08664 −0.01955 0.150961 0.037441 
SD 0.211402 0.215723 0.212565 0.217727 0.213765 0.216546 
Slope 20 Slope 30 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.82 0.79 0.82 0.79 0.79 0.79 
Mean 0.416792 0.4445 0.426028 0.412417 0.438452 0.420639 
Kurt −1.11488 −1.1812 −1.16005 −1.14219 −1.30635 −1.19653 
Skew 0.024677 −0.23992 −0.06299 −6.4E-05 −0.04127 −0.00169 
SD 0.219708 0.225516 0.221911 0.217343 0.235875 0.223684 
Slope 40 Slope 50 
Min 0.04 0.04 0.04 0.04 0.04 0.04 
Max 0.88 0.79 0.88 0.79 0.79 0.79 
Mean 0.400417 0.454268 0.417889 0.400333 0.462218 0.420472 
Kurt −0.97373 −1.26826 −1.13257 −1.00105 −1.32073 −1.16801 
Skew 0.146643 −0.264 0.014725 0.110759 −0.25404 0.001526 
SD 0.213393 0.233795 0.220812 0.211888 0.240142 0.222677 
Figure 4

Comprehensible delineation of the PSR-SVM-FFA model. (a) Graphical view of S12-MKII rainfall simulator and (b) pictographic view of watershed during the experiment.

Figure 4

Comprehensible delineation of the PSR-SVM-FFA model. (a) Graphical view of S12-MKII rainfall simulator and (b) pictographic view of watershed during the experiment.

Close modal

Processing and preparation of data

Runoffs are collected from RS with respect to various rainfall intensities of the pump. To enrich the maximum amount of runoff, first switch on the power supply of the pump, and then switch it off. Data are collected in 5 s intervals up to a constant limit of runoff generation. For a period of 1 h, runoff data are taken for model formulation. Data composed from 40 min (480 data) are employed for training and the remaining 20 min (240 data) for the testing model. The following arrangements are applied as input:

ModelInputSVMSVM*FFA
M*1  SVM1 SVMI*FFA 
M*2  SVM2 SVMII*FFA 
M*3  SVM3 SVMIII*FFA 
M*4  SVM4 SVMIV*FFA 
M*5  SVM5 SVMV*FFA 
ModelInputSVMSVM*FFA
M*1  SVM1 SVMI*FFA 
M*2  SVM2 SVMII*FFA 
M*3  SVM3 SVMIII*FFA 
M*4  SVM4 SVMIV*FFA 
M*5  SVM5 SVMV*FFA 

where is rainfall when pump input is 1 L/min; is rainfall when pump input is 1.5 L/min; is rainfall when pump input is 2 L/min; is rainfall when pump input is 2.5 L/min; and is rainfall when pump input is 3 L/min.

Input and output data before training are put between a quantified range (0–1). This process is identified as normalization. The equation utilized for putting data in the range is
(8)
where is the converted data set, Y is the actual data set, is the minimum of actual data set, and is the maximum of the actual data set.

Model performance evaluation

Indicators are NSE, RMSE, and WI used to assess the performance of model efficiency. The formulae can be articulated as
(9)
(10)
(11)
where Ai and Bi are measured and predicted ith runoffs and is the mean of measured runoff.

Experimental result

Runoff is generated by using RS for six various slopes (Slope = 00, Slope = 10, Slope = 20, Slope = 30, Slope = 40, Slope = 50) of watersheds in context to different input pump efficiencies (1, 1.5, 2, 2.5, and 3 L/min). As rainfall intensity has an impact on runoff, here a detailed description of runoff for axiom rainfall intensity condition (3 L/min) is given.

When rainfall occurs at maximum intensity in a sloppy horizontal bed modelled with sand, the peak hyetograph is established and achieves the highest flow rate. After achieving the peak steady flow rate with the highest epoch, the rising limb fall and rainfall intensity decrease and reach the minimum flow rate with a step-down decrement of rainfall intensity. Here, the hyetograph achieves the highest flow rate of 0.8 L/min for the 50 slope of the designed bed. Similarly, peak hyetograph is established with the highest flow rate of 0.88 m3/s for 40 slope, 0.78 m3/s for 30, 0.82 m3/s for 20, 0.94 m3/s for 10 slope, and 0.81 m3/s 00 slope.

Model performance analysis

The performance assessment of various input scenarios utilizing the SVM algorithm is presented in Table 2. Statistical measures like RMSE, WI, and NSE are used for the performance evaluation of models (training and testing phase). Amid five models, SVM5 provides the best performance value for all sloping conditions. While considering the testing period, the best WI values are 0.9412, 0.9344, 0.9378, 0.9516, 0.9518, and 0.9408 for SL-00, SL-10, SL-20, SL-30, SL-40, and SL-50, respectively. Similarly, preeminent values of WI are 0.9392, 0.9266, 0.9297, 0.9408, 0.9415, and 0.9353 for various slopes in training phases.

Table 2

SVM model results

PositionsModelTraining
Testing
NSERMSEWINSERMSEWI
Slope 00 SVM1 0.521 92.51 0.8276 0.493 90.38 0.8462 
SVM2 0.555 88.23 0.8491 0.517 85.34 0.8697 
SVM3 0.591 84.72 0.8717 0.557 81.25 0.8936 
SVM4 0.629 79.91 0.8913 0.596 77.77 0.9233 
SVM5 0.691 76.55 0.9392 0.661 73.16 0.9412 
Slope 10 SVM1 0.505 93.64 0.8049 0.459 91.81 0.8326 
SVM2 0.535 89.28 0.8382 0.504 87.98 0.8618 
SVM3 0.569 85.43 0.8651 0.545 83.71 0.8891 
SVM4 0.613 81.48 0.8857 0.562 78.56 0.8995 
SVM5 0.662 77.59 0.9266 0.625 73.86 0.9344 
Slope 20 SVM1 0.513 92.71 0.8145 0.486 90.54 0.8401 
SVM2 0.539 88.77 0.8398 0.509 86.48 0.8654 
SVM3 0.582 85.04 0.8667 0.549 82.45 0.8909 
SVM4 0.615 81.34 0.8871 0.584 78.24 0.9144 
SVM5 0.671 76.84 0.9297 0.628 73.79 0.9378 
Slope 30 SVM1 0.526 92.47 0.8313 0.494 89.89 0.8541 
SVM2 0.556 88.08 0.8518 0.521 84.94 0.8719 
SVM3 0.605 84.65 0.8784 0.561 80.45 0.8947 
SVM4 0.651 78.93 0.8959 0.609 75.79 0.9256 
SVM5 0.701 75.36 0.9408 0.667 69.01 0.9516 
Slope 40 SVM1 0.529 91.79 0.8358 0.498 88.78 0.8581 
SVM2 0.559 87.03 0.8622 0.525 84.79 0.8739 
SVM3 0.606 83.81 0.8834 0.561 80.38 0.8966 
SVM4 0.656 78.89 0.8991 0.621 75.77 0.9272 
SVM5 0.704 73.58 0.9415 0.669 69.59 0.9518 
Slope 50 SVM1 0.518 92.61 0.8179 0.492 90.41 0.8415 
SVM2 0.544 88.37 0.8443 0.515 85.87 0.8668 
SVM3 0.591 84.79 0.8717 0.549 81.65 0.8915 
SVM4 0.619 81.32 0.8902 0.592 78.12 0.9162 
SVM5 0.689 76.69 0.9353 0.656 73.27 0.9408 
PositionsModelTraining
Testing
NSERMSEWINSERMSEWI
Slope 00 SVM1 0.521 92.51 0.8276 0.493 90.38 0.8462 
SVM2 0.555 88.23 0.8491 0.517 85.34 0.8697 
SVM3 0.591 84.72 0.8717 0.557 81.25 0.8936 
SVM4 0.629 79.91 0.8913 0.596 77.77 0.9233 
SVM5 0.691 76.55 0.9392 0.661 73.16 0.9412 
Slope 10 SVM1 0.505 93.64 0.8049 0.459 91.81 0.8326 
SVM2 0.535 89.28 0.8382 0.504 87.98 0.8618 
SVM3 0.569 85.43 0.8651 0.545 83.71 0.8891 
SVM4 0.613 81.48 0.8857 0.562 78.56 0.8995 
SVM5 0.662 77.59 0.9266 0.625 73.86 0.9344 
Slope 20 SVM1 0.513 92.71 0.8145 0.486 90.54 0.8401 
SVM2 0.539 88.77 0.8398 0.509 86.48 0.8654 
SVM3 0.582 85.04 0.8667 0.549 82.45 0.8909 
SVM4 0.615 81.34 0.8871 0.584 78.24 0.9144 
SVM5 0.671 76.84 0.9297 0.628 73.79 0.9378 
Slope 30 SVM1 0.526 92.47 0.8313 0.494 89.89 0.8541 
SVM2 0.556 88.08 0.8518 0.521 84.94 0.8719 
SVM3 0.605 84.65 0.8784 0.561 80.45 0.8947 
SVM4 0.651 78.93 0.8959 0.609 75.79 0.9256 
SVM5 0.701 75.36 0.9408 0.667 69.01 0.9516 
Slope 40 SVM1 0.529 91.79 0.8358 0.498 88.78 0.8581 
SVM2 0.559 87.03 0.8622 0.525 84.79 0.8739 
SVM3 0.606 83.81 0.8834 0.561 80.38 0.8966 
SVM4 0.656 78.89 0.8991 0.621 75.77 0.9272 
SVM5 0.704 73.58 0.9415 0.669 69.59 0.9518 
Slope 50 SVM1 0.518 92.61 0.8179 0.492 90.41 0.8415 
SVM2 0.544 88.37 0.8443 0.515 85.87 0.8668 
SVM3 0.591 84.79 0.8717 0.549 81.65 0.8915 
SVM4 0.619 81.32 0.8902 0.592 78.12 0.9162 
SVM5 0.689 76.69 0.9353 0.656 73.27 0.9408 

Among six slopes, the SVM5-FFA model indicates the best result having a WI value of 0.9543 during training and 0.9716 during the testing phase of SL-40. For rest slopes, the SVM5-FFA model provides the best WI values during both periods among all simulations as presented in Table 3. Model outcomes of SVM-FFA based on NSE, RMSE, and WI for both periods are provided in Table 3.

Table 3

SVM-FFA model results

PositionsModelTraining period
Testing period
NSERMSEWINSERMSEWI
Slope 00 SVM*FFA 0.696 90.24 0.8442 0.664 88.67 0.8521 
SVMI*FFA 0.719 86.01 0.8698 0.694 83.82 0.8777 
SVMII*FFA 0.754 82.45 0.9084 0.729 78.91 0.9193 
SVMIII*FFA 0.789 77.77 0.9378 0.755 75.52 0.9482 
SVMIV*FFA 0.821 73.81 0.9487 0.791 69.75 0.9668 
Slope 10 SVM*FFA 0.687 91.69 0.8273 0.656 90.41 0.8349 
SVMI*FFA 0.702 87.82 0.8574 0.677 86.14 0.8703 
SVMII*FFA 0.745 83.72 0.8946 0.713 80.77 0.9085 
SVMIII*FFA 0.777 79.33 0.9329 0.741 76.53 0.9444 
SVMIV*FFA 0.802 74.22 0.9473 0.766 70.58 0.9606 
Slope 20 SVM*FFA 0.689 91.55 0.8313 0.657 89.14 0.8453 
SVMI*FFA 0.705 87.64 0.8672 0.681 84.82 0.8736 
SVMII*FFA 0.751 83.54 0.8975 0.718 80.66 0.9111 
SVMIII*FFA 0.781 78.03 0.9338 0.744 75.79 0.9468 
SVMIV*FFA 0.808 74.14 0.9474 0.768 69.97 0.9611 
Slope 30 SVM*FFA 0.699 89.88 0.8537 0.671 88.51 0.8652 
SVMI*FFA 0.722 85.89 0.8714 0.697 83.67 0.8793 
SVMII*FFA 0.761 80.89 0.9162 0.731 78.64 0.9227 
SVMIII*FFA 0.791 76.68 0.9418 0.759 73.54 0.9493 
SVMIV*FFA 0.823 70.41 0.9491 0.799 68.35 0.9684 
Slope 40 SVM*FFA 0.701 89.57 0.8553 0.674 86.34 0.8681 
SVMI*FFA 0.725 85.55 0.8786 0.706 82.57 0.8947 
SVMII*FFA 0.767 80.41 0.9231 0.734 78.49 0.9382 
SVMIII*FFA 0.798 76.54 0.9489 0.762 73.35 0.9554 
SVMIV*FFA 0.831 70.23 0.9543 0.802 67.74 0.9716 
Slope 50 SVM*FFA 0.693 90.42 0.8343 0.662 88.87 0.8482 
SVMI*FFA 0.716 86.67 0.8682 0.688 84.21 0.8757 
SVMII*FFA 0.753 83.39 0.9016 0.724 79.02 0.9123 
SVMIII*FFA 0.783 77.78 0.9352 0.751 75.62 0.9478 
SVMIV*FFA 0.815 73.95 0.9477 0.787 69.93 0.9632 
PositionsModelTraining period
Testing period
NSERMSEWINSERMSEWI
Slope 00 SVM*FFA 0.696 90.24 0.8442 0.664 88.67 0.8521 
SVMI*FFA 0.719 86.01 0.8698 0.694 83.82 0.8777 
SVMII*FFA 0.754 82.45 0.9084 0.729 78.91 0.9193 
SVMIII*FFA 0.789 77.77 0.9378 0.755 75.52 0.9482 
SVMIV*FFA 0.821 73.81 0.9487 0.791 69.75 0.9668 
Slope 10 SVM*FFA 0.687 91.69 0.8273 0.656 90.41 0.8349 
SVMI*FFA 0.702 87.82 0.8574 0.677 86.14 0.8703 
SVMII*FFA 0.745 83.72 0.8946 0.713 80.77 0.9085 
SVMIII*FFA 0.777 79.33 0.9329 0.741 76.53 0.9444 
SVMIV*FFA 0.802 74.22 0.9473 0.766 70.58 0.9606 
Slope 20 SVM*FFA 0.689 91.55 0.8313 0.657 89.14 0.8453 
SVMI*FFA 0.705 87.64 0.8672 0.681 84.82 0.8736 
SVMII*FFA 0.751 83.54 0.8975 0.718 80.66 0.9111 
SVMIII*FFA 0.781 78.03 0.9338 0.744 75.79 0.9468 
SVMIV*FFA 0.808 74.14 0.9474 0.768 69.97 0.9611 
Slope 30 SVM*FFA 0.699 89.88 0.8537 0.671 88.51 0.8652 
SVMI*FFA 0.722 85.89 0.8714 0.697 83.67 0.8793 
SVMII*FFA 0.761 80.89 0.9162 0.731 78.64 0.9227 
SVMIII*FFA 0.791 76.68 0.9418 0.759 73.54 0.9493 
SVMIV*FFA 0.823 70.41 0.9491 0.799 68.35 0.9684 
Slope 40 SVM*FFA 0.701 89.57 0.8553 0.674 86.34 0.8681 
SVMI*FFA 0.725 85.55 0.8786 0.706 82.57 0.8947 
SVMII*FFA 0.767 80.41 0.9231 0.734 78.49 0.9382 
SVMIII*FFA 0.798 76.54 0.9489 0.762 73.35 0.9554 
SVMIV*FFA 0.831 70.23 0.9543 0.802 67.74 0.9716 
Slope 50 SVM*FFA 0.693 90.42 0.8343 0.662 88.87 0.8482 
SVMI*FFA 0.716 86.67 0.8682 0.688 84.21 0.8757 
SVMII*FFA 0.753 83.39 0.9016 0.724 79.02 0.9123 
SVMIII*FFA 0.783 77.78 0.9352 0.751 75.62 0.9478 
SVMIV*FFA 0.815 73.95 0.9477 0.787 69.93 0.9632 

Results of the PSR-SVM-FFA technique during training and testing periods are given in Table 4. When SL-40 is considered, the obtained WI value shows the best performing results. In SL-40, the best values of WI for training and testing periods are 0.9623 and 0.9812. The WI values for other sloping conditions (SL-00, SL-10, SL-20, SL-30, and SL-50) are presented in Table 4. Based on obtained results, a possible inference regarding the dominance of variables on runoff dynamics can be that dynamics reveal a low-dimension chaotic behaviour, and hence, appropriateness of a low-dimension chaotic method. Manifestation of an ideal embedding dimension, i.e. prediction accuracy increases with an increase in embedding dimension to reach its best, i.e. mopt = 14, and thereafter, reduces with further increase in embedding dimension and best prediction outcomes attained with a small number of neighbours support this argument.

Table 4

Values of with results of PSR-SVM-FFA model

ModelOptimal parameters of for PSRTraining period
Testing period
NSERMSEWINSERMSEWI
PSR-SVM-FFA-SL-00 ( = 14, = 5) 0.926 52.89 0.9565 0.878 60.38 0.9757 
PSR-SVM-FFA-SL-10 ( = 14, = 5) 0.883 59.34 0.9546 0.836 63.21 0.9731 
PSR-SVM-FFA-SL-20 ( = 14, = 5) 0.899 57.09 0.9555 0.854 62.49 0.9742 
PSR-SVM-FFA-SL-30 ( = 14, = 5) 0.943 49.77 0.9584 0.889 59.23 0.9767 
PSR-SVM-FFA-SL-40 ( = 14, = 5) 0.958 45.01 0.9623 0.904 55.56 0.9812 
PSR-SVM-FFA-SL-50 ( = 14, = 5) 0.901 55.13 0.9556 0.865 61.65 0.9751 
ModelOptimal parameters of for PSRTraining period
Testing period
NSERMSEWINSERMSEWI
PSR-SVM-FFA-SL-00 ( = 14, = 5) 0.926 52.89 0.9565 0.878 60.38 0.9757 
PSR-SVM-FFA-SL-10 ( = 14, = 5) 0.883 59.34 0.9546 0.836 63.21 0.9731 
PSR-SVM-FFA-SL-20 ( = 14, = 5) 0.899 57.09 0.9555 0.854 62.49 0.9742 
PSR-SVM-FFA-SL-30 ( = 14, = 5) 0.943 49.77 0.9584 0.889 59.23 0.9767 
PSR-SVM-FFA-SL-40 ( = 14, = 5) 0.958 45.01 0.9623 0.904 55.56 0.9812 
PSR-SVM-FFA-SL-50 ( = 14, = 5) 0.901 55.13 0.9556 0.865 61.65 0.9751 

Assessment of results for applied models

For all slopes, the assessment of the PSR-SVM-FFA model during training and testing periods is represented in Figures 5 and 6. Among all slopes, the best value of WI is obtained in Slope 40. Here, WI values for SVM5, SVM5-FFA, and PSR-SVM-FFA-SL-40 models are 0.9412, 0.9668, and 0.9812, respectively. Hence, it can be observed that prediction models proposed in the present research achieve enhanced prediction performances in comparison to models discussed in the literature. As in the case of any model, runoff predictions provided by a newly applied PSR-SVM-FFA model are related to a certain degree of uncertainty that needs to be enumerated. Uncertainty related to the model structure is assessed for three data-driven models (SVM, SVM-FFA, and PSR-SVM-FFA) considered for best input combination model M#5.

Figure 5

Actual versus predicted runoff for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 during training phase.

Figure 5

Actual versus predicted runoff for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 during training phase.

Close modal
Figure 6

Actual versus predicted runoff for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 during testing phase.

Figure 6

Actual versus predicted runoff for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 during testing phase.

Close modal

A linear scale plot of actual versus predicted runoff for the proposed model of projected slopes is shown in Figure 7. Results illustrate that estimated peak runoffs are 0.7624, 0.7832, and 0.7903 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA against an actual peak 0.81 m3/s for the slope 00. The approximated peak runoff is 0.8783, 0.903, and 0.9147 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA adjacent to actual peak 0.94 m3/s for slope 10 condition. Similarly, for slope 20, tangible runoff is 0.82 m3/s aligned with predicted runoffs 0.7689, 0.7881, and 0.7988 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA, respectively. Correspondingly, apprised peak runoffs are 0.7517, 0.765, and 0.7715 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA against the actual peak 0.79 m3/s for the slope 30. Similarly, approached peak runoffs are 0.8375, 0.855, and 0.8634 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA adjacent to actual peak 0.88 m3/s for slope 40. Consistently, for slope 50, the evaluated peak runoff is 0.79 m3/s associated with predicted runoffs 0.7432, 0.7609, and 0.7703 m3/s for SVM, SVM-FFA, and PSR-SVM-FFA, respectively. Simulations exhibit that the proposed technique can efficiently improve the forecasting accurateness of many conventional techniques in terms of all applied evaluation indices. Therefore, a robust data-driven model with powerful performances is proposed for forecasting experimental runoff.

Figure 7

Deviation of actual and predicted runoff at (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 condition during testing phase.

Figure 7

Deviation of actual and predicted runoff at (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 condition during testing phase.

Close modal

The box plot of actual and estimated runoff values was drawn as presented in Figure 8. It can be observed that data ranges, quantiles, and medians for predicted runoff values were analogous to those of actual runoff values. Interquartile ranges of all models show similarity with actual runoff. In the present study, outliers were not mentioned in Figure 8. They signify an extreme runoff value (maximum and minimum peak) – that is, runoff in the course of high flow and low flow periods. PSR-SVM-FFA performed better in apprehending extreme runoff, while SVM and SVM-FFA could not capture either top, bottom, or both extremes. The reason is that SVM and SVM-FFA techniques applied in the present study had a comparatively less complex structure and fewer operational parameters than the PSR-SVM-FFA model.

Figure 8

Box plots of runoff estimates of various ML models and observed runoff.

Figure 8

Box plots of runoff estimates of various ML models and observed runoff.

Close modal

To have better knowledge with regard to the prediction accuracy of three challenging techniques, SVM, SVM-FFA, and PSR-SVM-FFA, a comparison is done between runoff values of varied observations, ranges, and predictions. Histogram plots of observed and estimated runoff values are demonstrated in Figure 9. Plots in Figure 9 signify the occurrence probability of runoff between a precise range. The estimation of runoff for all slope conditions shows that for the lowest (0.0–0.1 m3/s) and the highest (0.7–0.8 m3/s) ranges of runoff. The frequency (number of events) of precise estimations by the PSR-SVM-FFA model illustrates improved agreement with the frequency of observed values than that of the frequency of precise estimations by the SVM-FFA and stand-alone SVM model.

Figure 9

Histogram plot of actual versus SVM, SVM-FFA, and PSR-SVM-FFA for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 condition.

Figure 9

Histogram plot of actual versus SVM, SVM-FFA, and PSR-SVM-FFA for (a) Slope 00, (b) Slope 10, (c) Slope 20, (d) Slope 30, (e) Slope 40, and (f) Slope 50 condition.

Close modal

Assessment of various models

Statistical indicators like NSE, RMSE, and WI are used to evaluate the performance of SVM, SVM-FFA, and PSR-SVM-FFA models for various slope conditions. The assessment of performance measures is indicated in Table 5, which shows the effectiveness of each model. Runoff evaluation is significant, and hence, methods applied here are important to demonstrate runoff information. It is evident that PSR-SVM-FFA performs the best when compared with SVM-FFA and SVM. The assessment of each model is demonstrated in Figure 10 in terms of linear bar scheme. These experimental results revealed that the PSR-SVM-FFA model obtained more reliable prediction runoff values and had a clear advantage over other proposed models in runoff prediction. Hence, the proposed PSR-SVM-FFA is fit for the prediction intervals of a certain period in the runoff series and provides supporting information for decision-makers in water resources management.

Table 5

Performance index of various approaches for runoff prediction

Watershed bed slopeSVM
SVM-FFA
PSR-SVM-FFA
NSERMSEWINSERMSEWINSERMSEWI
Slope 00 0.661 73.16 0.9412 0.791 69.75 0.9668 0.878 60.38 0.9757 
Slope 10 0.625 73.86 0.9344 0.766 70.58 0.9606 0.836 63.21 0.9731 
Slope 20 0.628 73.79 0.9378 0.768 69.97 0.9611 0.854 62.49 0.9742 
Slope 30 0.667 69.01 0.9516 0.799 68.35 0.9684 0.889 59.23 0.9767 
Slope 40 0.669 69.59 0.9518 0.802 67.74 0.9716 0.904 55.56 0.9812 
Slope 50 0.656 73.27 0.9408 0.787 69.93 0.9632 0.865 61.65 0.9751 
Watershed bed slopeSVM
SVM-FFA
PSR-SVM-FFA
NSERMSEWINSERMSEWINSERMSEWI
Slope 00 0.661 73.16 0.9412 0.791 69.75 0.9668 0.878 60.38 0.9757 
Slope 10 0.625 73.86 0.9344 0.766 70.58 0.9606 0.836 63.21 0.9731 
Slope 20 0.628 73.79 0.9378 0.768 69.97 0.9611 0.854 62.49 0.9742 
Slope 30 0.667 69.01 0.9516 0.799 68.35 0.9684 0.889 59.23 0.9767 
Slope 40 0.669 69.59 0.9518 0.802 67.74 0.9716 0.904 55.56 0.9812 
Slope 50 0.656 73.27 0.9408 0.787 69.93 0.9632 0.865 61.65 0.9751 
Figure 10

Comparison of the performance index for the proposed watershed (a) training and (b) testing phases.

Figure 10

Comparison of the performance index for the proposed watershed (a) training and (b) testing phases.

Close modal

In the past, many investigations on runoff processes were not appropriate and accessible for an experimental set-up. It is understood that the findings of runoff prediction from the experimental study will provide a new horizon of understanding the rainfall–runoff process precisely. The PSR-SVM-FFA model is established to be a suitable model for runoff forecasting in an arid watershed. The research was carried out over various slopes of watershed, which will help non-gauging arid watershed for wide-spread good performance to predict runoff in similar conditions. Also, the present research findings can be used for irrigation and water resources engineering to design a hydraulics structure for agriculture. Basically, this study will be helpful where runoff measurement is impossible or simply for un-gauged catchments. In that situation, if a researcher/scientist knows the watershed/region slope, rainfall amount, and type of soil available there, then, with the help of RS, runoff data are generated, which will be helpful for runoff prediction. In reference to the objective of the present study (which is to present a newly developed method of runoff prediction), it is indicated that at least one case study is needed to verify the applicability of the presented model, and hence, others might apply for other scenarios. Similarly, the proposed approach can be applied to two or more different case studies for observing the validity and performance of models under different circumstances.

The advantages of proposed data-driven models are that all models attained suitable simulation outcomes; yet, their modelling complexities and costs vary. Conventional models necessitate a huge volume of input data; hence, data pre-processing, distribution, and validation processes are restricted by unsolved data assortment. In addition, conventional models utilize formulae for imitating certainty; however, imitation can never precisely match certainty. In general, simplification of real-world presents ambiguity and drops the accuracy of results. Artificial intelligence models have flexible model architectures and controlling input data parameters, which accelerate modelling procedures by reducing modelling uncertainty. Hybrid data-driven models also attained better performances, with much simpler procedures and at a lower cost. In brief, the PSR-SVM-FFA model is simpler and more appropriate. Previous studies have all emphasized the high efficiency and accuracy of the various hybrid models (Van et al. 2020; Han & Morrison 2021; Ouma et al. 2021; Reddy et al. 2021) in runoff prediction. In the current study, it was shown that the proposed PSR-SVM-FFA approach found more efficacy and superiority.

One limitation of this study is the consideration of experimental set-up within the laboratory room (i.e. at constant room temperature). Future research will include testing of the proposed approach on field study (River gauge station). Also, emphasis should be given to evaluating the usage of additional input variables for hydrological predictions produced from weather data (collected climatological data) utilizing sediment load, groundwater level, and flood modelling, as an alternative prediction performance. Additionally, satellite data can be assimilated as an external instructive characteristic to the prediction matrix where more precise prediction accuracy can be attained.

The major benefit of an RS is its compressed structure which fits well in a laboratory room for constructing the economic model. In the present study, the potential of the PSR-SVM-FFA model is compared with the SVM-FFA and standard SVM model for runoff prediction, considering distinguished experimental runoff data emerged from RS. Results obtained from this study specify that the PSR-SVM-FFA method is helpful for modelling rainfall–runoff series that consist of six different slopes of watershed, which gives good prediction performance compared to other proposed models. Outcomes of various performance indices demonstrate that PSR-SVM-FFA has a better ability for non-linear complex mapping. The results indicate a minimal change in the amount of predicted runoff volume and peaks, at different rainfall intensities. This study recommends that the proposed hybrid method can be adopted for modelling other hydrological parameters. Also, this study can be a basis for future studies in different parametric conditions like infiltration and evapotranspiration losses. Application of other robust data-driven models seeking runoff prediction can also be considered for further analysis.

The research has not been supported through any funds.

The authors declare that they have no conflict of interest.

The authors have made a significant contribution to this manuscript, have seen, and approved the final manuscript.

S. S.: Conceptualization, Roles/Writing – original draft, Methodology, Formal analysis, Experiment, Data curation, Formal analysis, Writing – review and editing. D. K. G.: Visualization, Project administration.

Data cannot be made publicly available; readers should contact the corresponding author for details.

Abudi
I.
,
Carmi
G.
&
Berliner
P.
2012
Rainfall simulator for field runoff studies
.
Journal of Hydrology
454–455
,
76
81
.
Akbarzadeh
A.
,
Mehrjardi
R. T.
,
Rouhipour
H.
,
Gorji
M.
&
Rahimi
H. G.
2009
Estimating of soil erosion covered with rolled erosion control systems using rainfall simulator (neuro-fuzzy and artificial neural network approaches)
.
Journal of Applied Sciences Research
5
(
5
),
505
514
.
Aksoy
H.
,
Unal
N. E.
,
Cokgor
S.
,
Gedikli
A.
,
Yoon
J.
,
Koca
K.
,
Inci
S. B.
&
Eris
E.
2012
A rainfall simulator for laboratory-scale assessment of rainfall-runoff-sediment transport processes over a two-dimensional flume
.
Catena
98
,
63
72
.
Behzad
M.
,
Asghari
K.
,
Eazi
M.
&
Palhang
M.
2009
Generalization performance of support vector machines and neural networks in runoff modeling
.
Expert Systems with Applications
36
(
4
),
7624
7629
.
Bozorg-Haddad
O.
,
Aboutalebi
M.
,
Ashofteh
P. S.
&
Loáiciga
H. A.
2018
Real-time reservoir operation using data mining techniques
.
Environmental Monitoring and Assessment
190
(
10
),
1
22
.
Bray
M.
&
Han
D.
2004
Identification of support vector machines for runoff modelling
.
Journal of Hydroinformatics
4
,
265
280
.
Burgan
H. I.
&
Aksoy
H.
2018
Annual flow duration curve model for ungauged basins
.
Hydrology Research
49
(
5
),
1684
1695
.
Fister
W.
,
Iserloh
T.
,
Ries
J. B.
&
Schmidt
R.
2012
A portable wind and rainfall simulator for in situ soil erosion measurements
.
Catena
91
,
72
84
.
Fister
I.
,
Fister
I.
,
Yang
X. S.
&
Brest
J.
2013
A comprehensive review of firefly algorithms
.
Swarm and Evolutionary Computation
13
,
34
46
.
Granata
F.
,
Gargano
R.
&
de Marinis
G.
2016
Support vector regression for rainfall-runoff modeling in urban drainage: a comparison with the EPA's storm water management model
.
Water (Switzerland)
8
,
1
13
.
Han
H.
&
Morrison
R. R.
2021
Data-driven approaches for runoff prediction using distributed data
.
Stochastic Environmental Research and Risk Assessment
2021
,
1
19
.
Iserloh
T.
,
Ries
J. B.
,
Arnáez
J.
,
Boix-Fayos
C.
,
Butzen
V.
,
Cerdà
A.
,
Echeverría
M. T.
,
Fernández-Gálvez
J.
,
Fister
W.
,
Geißler
C.
&
Gómez
J. A.
2013
European small portable rainfall simulators: a comparison of rainfall characteristics
.
Catena
110
,
100
112
.
Jajarmizadeh
M.
,
Lafdani
E. K.
,
Harun
S.
&
Ahmadi
A.
2015
Application of SVM and SWAT models for monthly streamflow prediction, a case study in South of Iran
.
KSCE Journal of Civil Engineering
19
,
345
357
.
Jimmy
S. R.
,
Sahoo
A.
,
Samantaray
S.
&
Ghose
D. K.
2021
Prophecy of Runoff in a River Basin Using Various Neural Networks
. In:
Communication Software and Networks
(S. C. Satapathy, V. Bhateja, M. R. Murty, N. G. Nhu & J. Kotti eds.)
, Vol.
134
.
Springer
,
Singapore
, pp.
709
718
.
Jin
K.
,
Cornelis
W. M.
,
Gabriels
D.
,
Baert
M.
,
Wu
H. J.
,
Schiettecatte
W.
,
Cai
D. X.
,
De Neve
S.
,
Jin
J. Y.
,
Hartmann
R.
&
Hofman
G.
2009
Residue cover and rainfall intensity effects on runoff soil organic carbon losses
.
Catena
78
(
1
),
81
86
.
Lekscha
J.
&
Donner
R. V.
2018
Phase space reconstruction for non-uniformly sampled noisy time series
.
Chaos: An Interdisciplinary Journal of Nonlinear Science
28
(
8
),
085702
.
Liu
Q.
,
Islam
S.
,
Rodriguez-Iturbe
I.
&
Le
Y.
1998
Phase-space analysis of daily streamflow: characterization and prediction
.
Advances in Water Resources
21
,
463
475
.
Mehr
A. D.
,
Nourani
V.
,
Khosrowshahi
V. K.
&
Ghorbani
M. A.
2019
A hybrid support vector regression–firefly model for monthly rainfall forecasting
.
International Journal of Environmental Science & Technology
16
,
335
346
.
Misra
D.
,
Oommen
T.
,
Agarwal
A.
,
Mishra
S. K.
&
Thompson
A. M.
2009
Application and analysis of support vector machine based simulation for runoff and sediment yield
.
Biosystems Engineering
103
(
4
),
527
535
.
Mohanta
N. R.
,
Biswal
P.
,
Kumari
S. S.
,
Samantaray
S.
&
Sahoo
A.
2021
Estimation of Sediment Load Using Adaptive Neuro-Fuzzy Inference System at Indus River Basin, India
. In:
Intelligent Data Engineering and Analytics
, Vol.
1177
(S. C. Satapathy, Y-D, Zhang, V. Bhateja & R. Majhi eds.)
.
Springer
,
Singapore
, pp.
427
434
.
Ouma
Y. O.
,
Cheruyot
R.
&
Wachera
A. N.
2021
Rainfall and runoff time-series trend analysis using LSTM recurrent neural network and wavelet neural network with satellite-based meteorological data: case study of Nzoia hydrologic basin
.
Complex & Intelligent Systems
2021
,
1
24
.
Ouyang
Q.
,
Lu
W.
,
Xin
X.
&
Zhang
Y.
2016
Monthly rainfall forecasting using EEMD-SVR based on phase-space reconstruction
.
Water Resources Management
30
(
7
),
2311
2325
.
Pérez-Latorre
F. J.
,
de Castro
L.
&
Delgado
A.
2010
A comparison of two variable intensity rainfall simulators for runoff studies
.
Soil and Tillage Research
107
,
11
16
.
Reddy
B. S. N.
,
Pramada
S. K.
&
Roshni
T.
2021
Monthly surface runoff prediction using artificial intelligence: a study from a tropical climate river basin
.
Journal of Earth System Science
130
(
1
),
1
15
.
Sahoo
A.
,
Samantaray
S.
&
Ghose
D. K.
2021
Prediction of flood in Barak River using hybrid machine learning approaches: a case study
.
Journal of the Geological Society of India
97
(
2
),
186
198
.
Samantaray
S.
&
Ghose
D. K.
2020
Modelling runoff in a river basin, India: an integration for developing un-gauged catchment
.
International Journal of Hydrology Science and Technology
10
(
3
),
248
266
.
Samantaray
S.
&
Sahoo
A.
2020a
Prediction of runoff using BPNN, FFBPNN, CFBPNN algorithm in arid watershed: a case study
.
International Journal of Knowledge-Based and Intelligent Engineering Systems
24
(
3
),
243
251
.
Samantaray
S.
&
Sahoo
A.
2020b
Estimation of runoff through BPNN and SVM in Agalpur Watershed
. In:
Frontiers in Intelligent Computing: Theory and Applications
, Vol.
1014
(S. C. Satapathy, V. Bhateja, B. L. Nguyen, N. G. Nguyen & D-N. Le eds.)
.
Springer
,
Singapore
, pp.
268
275
.
Samantaray
S.
,
Sahoo
A.
&
Ghose
D. K.
2019
Assessment of runoff via precipitation using neural networks: watershed modelling for developing environment in arid region
.
Pertanika Journal of Science and Technology
27
(
4
),
2245
2263
.
Samantaray
S.
,
Sahoo
A.
,
Mohanta
N. R.
,
Biswal
P.
&
Das
U. K.
2021
Runoff Prediction Using Hybrid Neural Networks in Semi-Arid Watershed, India: A Case Study
. In:
Communication Software and Networks
, Vol.
134
(S. C. Satapathy, V. Bhateja, M. R. Murty, N. G. Nhu & J. Kotti eds.)
.
Springer
,
Singapore
, pp.
729
736
.
Sedighi
F.
,
Vafakhah
M.
&
Javadi
M. R.
2016
Rainfall–runoff modeling using support vector machine in snow-affected watershed
.
Arabian Journal for Science and Engineering
41
,
4065
4076
.
Sharma
N.
,
Zakaullah
M.
,
Tiwari
H.
&
Kumar
D.
2015
Runoff and sediment yield modeling using ANN and support vector machines: a case study from Nepal watershed
.
Modeling Earth Systems and Environment
1
(
3
),
1
8
.
Sivakumar
B.
,
Berndtsson
R.
&
Persson
M.
2001
Monthly runoff prediction using phase space reconstruction
.
Hydrological Sciences Journal
46
,
377
387
.
Sivapragasam
C.
,
Liong
S. Y.
&
Pasha
M. F. K.
2001
Rainfall and runoff forecasting with SSA-SVM approach
.
Journal of Hydroinformatics
3
,
141
152
.
Soleymani
S. A.
,
Goudarzi
S.
,
Anisi
M. H.
,
Hassan
W. H.
,
Idris
M. Y. I.
,
Shamshirband
S.
,
Noor
N. M.
&
Ahmedy
I.
2016
A novel method to water level prediction using RBF and FFA
.
Water Resources Management
30
(
9
),
3265
3283
.
Tao
H.
,
Sulaiman
S. O.
,
Yaseen
Z. M.
,
Asadi
H.
,
Meshram
S. G.
&
Ghorbani
M. A.
2018a
What is the potential of integrating phase space reconstruction with SVM-FFA data-intelligence model? Application of rainfall forecasting over regional scale
.
Water Resources Management
32
(
12
),
3935
3959
.
Tao
H.
,
Diop
L.
,
Bodian
A.
,
Djaman
K.
,
Ndiaye
P. M.
&
Yaseen
Z. M.
2018b
Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: regional case study in Burkina Faso
.
Agricultural Water Management
208
,
140
151
.
Van
S. P.
,
Le
H. M.
,
Thanh
D. V.
,
Dang
T. D.
,
Loc
H. H.
&
Anh
D. T.
2020
Deep learning convolutional neural network in rainfall–runoff modelling
.
Journal of Hydroinformatics
22
(
3
),
541
561
.
Vapnik
V.
1998
Statistical Learning Theory
.
John Wiley & Sons
,
Toronto
.
Wildhaber
Y. S.
,
Bänninger
D.
,
Burri
K.
&
Alewell
C.
2012
Catena Evaluation and application of a portable rainfall simulator on subalpine grassland
.
Catena
91
,
56
62
.
Xu
L.
,
Zhao
J.
,
Li
C.
,
Li
C.
,
Wang
X.
&
Xie
Z.
2020
Simulation and prediction of hydrological processes based on firefly algorithm with deep learning and support vector for regression
.
International Journal of Parallel, Emergent and Distributed Systems
35
(
3
),
288
296
.
Yang
X.-S.
2010
Firefly algorithm, stochastic test functions and design optimisation
.
International Journal of Bio-Inspired Computation
2
,
78
84
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).