Flood is India's most prevalent natural calamity, devastatingly affecting human lives, infrastructure, and agriculture. Predicting floods can help to mitigate the potential damage and conduct timely evacuation drives. This research proposes a deep-learning regression model to forecast flood runoff. Various climatological, hydrological, land, and vegetation-related data have been collected from multiple sources for 18 years (2002–2019) to create a comprehensive dataset for the Godavari River at the Perur water station in India. The relevant attributes identified through feature selection are river water level, precipitation, temperature, surface pressure, evaporation, soil water content, daily runoff, and average river flow. The selected features were fed into various time series prediction models like AutoRegressive Integrated Moving Average (ARIMA), Prophet, Neural Prophet, and Long Short-Term Memory (LSTM). The LSTM model obtained the best results achieving a Root Mean Squared Error (RMSE) value of 0.05, Mean Absolute Error (MAE) value of 0.007, Willmott's Index (WI) of 0.83, Legates-McCabe's Index (LMI) of 0.58, and R2 of 0.67 for a 1-day prediction with a look-back window of 183 days. The model is also trained to predict the flood runoff value for a week ahead. The proposed model can serve as an essential component in flood warning systems.

  • Created a comprehensive dataset of various climatological, hydrological, land, and vegetation-related data.

  • Feature selection is used to identify top features related to the flood runoff.

  • Proposed a multivariate multi-step LSTM model for predicting the flood runoff for a day and a week ahead.

  • Compared the performance of the proposed LSTM model with other state-of-the-art models like ARIMA, Prophet, NeuralProphet.

Environmental disasters have gradually risen to the forefront of every country's economic and development forum due to their enormous impact on a nation. The frequently changing climate and urbanisation propel these. Within the context of natural disasters, the most frequent and common type of disaster is flooding. Floods are regarded as the most costly disaster globally. Floods result in significant human casualties and the destruction of agriculture, infrastructure, property, and public services. Between 1998 and 2017, more than 2 billion people were impacted by floods globally (Floods 2022). India is highly susceptible to flooding. Over 40 million hectares (mha) of the 329 mha total geographical region is prone to flooding. (Floods NDMA 2007). This vulnerability to floods is further exacerbated by rapid urbanisation, population growth, expanding economic and development activity in flood plains, and climate change. The damage caused by floods in a developing country is devastating. Hence, developing a foolproof data-driven model for predicting floods is essential to mitigate the potential damage. Data-driven models primarily analyse historically observed data to create an input–output mapping for the next timestep for which the prediction will be made (Solomatine et al. 2009; Ha et al. 2021).

Flood runoff is an attribute of utmost importance in flood risk management. It is defined as the excess water that flows in the river during any flood-like event. The amount of flood runoff is affected by several factors, like the duration and intensity of precipitation, soil type, size, and river basin drainage. Thus, accurate flood runoff forecasting can help to plan efficiently for flood management and mitigation. Flood event runoff prediction can help to identify vulnerable areas and effectively utilise the resources. It is also a critical component of flood warning systems. It can be used for flood risk assessment, disaster management, and evacuation planning. It enables authorities to take proactive measures that can help to save lives, reduce property damage, and improve emergency response planning. Deep learning-based predictive models can be used to forecast flood runoff values. The accuracy of these models can be enhanced with time through continuous learning. The performance of these models improves with the addition of new data into the model over time, leading to more reliable forecasts in the future.

In the past decade, many machine learning (ML) and statistical models (Bagherzadeh et al. 2021; Sampurno et al. 2022) like Long Short-Term Memory (LSTM), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Convolutional Neural Network (CNN) were proposed for hydrological prediction. It has been demonstrated that ML models perform better in predicting floods than traditional hydrological models (Mosaffa et al. 2022). Ding et al. (2020) considered the LSTM network along with two dynamic attention modules to build the model. Using these modules, spatial–temporal weights can be dynamically adjusted to improve the performance of existing LSTM cells. Ighile et al. (2022) used ML techniques to map Nigeria's flood-prone regions. They used ANN and logistic regression models on the historical flood data along with flood-affecting features to show the flood susceptibility of areas. Yoosefdoost et al. (2022) compared data-mining algorithms like ANN, SVM, and Genetic Expression Programming (GEP) to simulate dam reservoir input runoff in an arid region. Another work (Miau & Hung 2020) used a Conv-GRU model to forecast the level of a river in Taiwan. Kim & Kim (2020) developed an LSTM model for predicting urban runoff according to rainfall input, and the flood hazard was mapped spatially. SWMM was used to simulate rainfall events. Zhu et al. (2020) compared standard models such as SVM, AutoRegressive Integrated Moving Average (ARIMA), and LSTM and found that RMSE is the minimum for LSTM. (Li et al. 2021) attempted to compare LSTM models having different architectures and assess their usability in hydrological applications. They concluded that synced sequence input and output (SSIO) architecture are more accurate than SISO and consume fewer computational resources. Rahimzad et al. (2021) aimed to compare the results of four data-driven techniques for daily streamflow forecasting. The methods used are LSTM, SVM, MultiLayer Perceptron (MLP), and Linear Regression. The results indicated that the LSTM is the best technique to efficiently capture the behaviour of time series data in hydrological modelling/ flood forecasting-related applications. Liu et al. (2020) used the Yangtze River streamflow data from 1952 to 2018 to create a stream flow prediction model using a combination of Encoder-Decoder LSTM and the Empirical Mode Decomposition (EMD) algorithm. A major drawback of this work was that it only considered the streamflow data and no other vital data like precipitation, evapotranspiration, and temperature. Another study of the Yangtze Basin (Jiang et al. 2022) uses a combination of four ML algorithms and weather forecasts to predict the water level, rainfall, and estimation of disaster losses in the Yangtze River basin. The four ML algorithms used were Random Forest (RF), MLPR, CNN, and LSTM. Atashi et al. (2022) have presented a case study of the Red River, where they predicted the water levels. They have compared three models – the classical statistical model ARIMA, the traditional ML model RF and the deep learning model LSTM and have used hourly water level data for the same.

Wu et al. (2020) presented a data-driven forecasting model for water accumulation in urban scenarios. They found peak rainfall, concentration skewness, and rainfall as key determinants of urban floods. Damle & Yalcin (2007) used a time-delayed embedding method for predicting the occurrences of events in the future. A multi-objective formulation is used to model the optimisation formulation and is solved using a genetic algorithm. A hybrid SOM–RNARX model for flood forecasting in the Kemaman River basin was proposed by Chang et al. (2018). Ramos Filho et al. (2021) aimed to develop a method for estimating the rainfall threshold needed to forecast floods and flash floods in Sao Paulo, Brazil. An existing technique to identify the threshold for peak rainfall intensity was improved in this paper. (Elsafi 2014) built an ANN model to predict the flow of the Nile River at Sudan's Dongola station. Based on the flow at upstream sites, the model was constructed to simulate the flow at a particular location in the reach of the river. Panigrahi et al. (2018) used a Cascaded Functional Link Artificial Neural Network (C-FLANN) model for predicting water flow prediction 14 days in advance. Differential Evolution (DE) and Harmony Search (HS) were utilised to model parameters, and important features were chosen using the map-reduce-based ANOVA technique. Keum et al. (2020) developed an urban flood prediction system wherein the runoff values of each representative timestep were initially predicted, followed by flood map creation using SWMM and Lidar. Zhu & Zhang (2022) developed a Flood Disaster Index model to analyse the threat of flooding in the Guanzhong area. The authors used ArcGIS 10.1 software to analyse the flood hazard. The dataset used in this study included land use data, remote sensing data, geographical data, and land use records. Puttinaovarat & Horkaew (2020) presented a flood forecasting approach based on integrating data of various types. The geospatial, hydrological, and meteorological data was fetched from GLOFAS, hourly rainfall predictions were obtained from the TMD big data platform, and crowdsource data from government websites (where people can report for help in comment-like structures) and social media. (Le et al. 2019) suggested the development of an LSTM neural network for predicting floods, taking rainfall and daily discharge data as input features. Time series data of the past 24 years, i.e., from 1961 to 1984, was considered for the dataset. (Furquim et al. 2016) deployed a Wireless Sensor Network (WSN) in São Carlos River, São Paulo State, Brazil. Data like rainfall volume, river pressure, river water level, etc., were collected from these sensors, followed by applying Efficient Recurrent Neural Network (E-RNN) and MLP models.

It is found that existing studies on runoff prediction using deep learning do not incorporate the impact of climate, land, vegetation, and hydrological parameters on floods collectively. Incorporating satellite-based data (Basheer Ahammed & Pandey 2022) with the gauge station data can help better understand the region in which floods are predicted. Moreover, studies exploring flood forecasting for Indian rivers (Barbetta et al. 2016; Paul et al. 2017; Nanditha & Mishra 2021; Vogeti et al. 2022) are less. It has been demonstrated in various studies that the climatological, land, and hydrological parameters have a key role (Stein et al. 2021) in determining the flood profile of a river. Hence, this research proposes a deep learning model trained on a dataset consisting of climatological, hydrological, land components, and vegetation-related historical real-world data parameters like water level, precipitation, evaporation, temperature, pressure, river discharge, etc. for the Godavari River at the Perur water monitoring station.

In this study, a comprehensive dataset has been created from various sources consisting of historical real-world data parameters like water level, precipitation, evaporation, temperature, pressure, river discharge, etc. The relevant features are selected from the complete dataset. The study shows that the LSTM model performs best among ARIMA, Prophet, NeuralProphet and LSTM. The RMSE values and R2 of the models are compared. A multivariate multi-step LSTM model is used to predict the flood runoff for each day over the next week. Root mean squared error (RMSE), Mean Absolute Error (MAE), Willmott's Index (WI) of Agreement (WI), Legates-McCabe's Index (LMI) and R2 metrics are used to evaluate the performance of the proposed LSTM architecture.

Case study area and data

The study area selected for this research is the region around the Perur water monitoring station at the Godavari River in Telangana. This area was chosen because the Godavari basin has:

  • 1.

    Large area – The Godavari River stretches for 1,465 km and is the largest in the Indian peninsular region. The basin covers a territory of 3,12,812 km² (Water Resources Karnataka 2022), and the Perur station monitors the gauge, discharge, sediment, and water quality of the river in the Godavari lower sub-basin.

  • 2.

    Prone to flooding – The Godavari basin receives maximum rainfall during the southwest monsoon. The annual rainfall during this period can reach a maximum of 1,000 mm. The high rainfall in the Godavari basin is the primary cause of flooding. Over the past two decades, the annual runoff of the Godavari River has changed by nearly 113%, and the frequency of floods has risen sharply in recent years (Brakenridge et al. 2010).

  • 3.

    Data availability – Data for the Godavari basin are available on various websites (Government of India – WRIS, DFO, etc.)

A map of the region (India-WRIS Godavari 2014) is depicted in Figure 1.
Figure 1

Godavari Basin [Source: India-WRIS Godavari 2014].

In this research, a comprehensive dataset consisting of various climatological, hydrological, land, and vegetation-related data parameters collected from different sources at a daily frequency have been combined for a duration of 18 years, i.e., from 2002 to 2019 for the Godavari River. Initially, data were collected for the Godavari River from the DFO Flood Observatory, University of Colorado (Brakenridge et al. 2010). The DFO is a comprehensive source for historical river discharge and major flood event runoff data for various rivers across the globe. This data ranged from 1998 to 2023. Flood runoff is inextricably linked with various land and climatological data (Stein et al. 2021). To include these, climate data were extracted for the Perur region from the Indian Meteorological Department (IMD)-gridded climate dataset, which is a collection of gauge stations' data in India. The data were collected using the IMD API (IMDLIB 2020) accessed through Python scripts. Other climatological and land attributes data were sourced from the Copernicus Climate Change Service (C3S) and extracted via Google Earth Engine (Muñoz-Sabater et al. 2021). The data are a collection of satellite images of Earth, which are processed to fetch the relevant land, vegetation, and climate attributes. The ERA5-Land Daily Aggregated Dataset was used to get the historical climate records for the area of interest. The data are available from July 1963 onwards and provides a comprehensive overview of the change in land and climate attributes on a daily cadence. It includes 32 different climate and land attributes, including surface runoff, precipitation, temperature, soil moisture, and land evaporation. Finally, the historical river water dataset from India WRIS (India Water Resources Information System) (India-WRIS 2016) was used to make the dataset more comprehensive. Data on daily river water level, historical flow, and discharge were sourced from the Perur River Point Observatory in Telangana. Thus, a comprehensive historical flood dataset for the Godavari River region around the Perur monitoring station has been created.

Methodology

The methodology shown in Figure 2 was followed in this study. The dataset was created as described in Section 2.1. The data for all attributes were normalised using MinMaxScaler. The data were interpolated to fill in missing values using a rolling mean of the previous eight values, followed by a piecewise cubic interpolation technique to construct new points within the boundaries of a set of known points. Since the number of continuous missing values was small, interpolation preserved the trend in the attributes of the time series. Feature selection was then performed using SelectKBest with F-regression as the scoring function. The selected features and the flood runoff values of previous days were used as inputs to the different models (ARIMA, Prophet, NeuralProphet, and LSTM), and the results of the models were compared.
Figure 2

Methodology.

Feature selection

The dataset contained data spanning 6,222 days (2002–2019). The created dataset has 43 variables, of which some variables had little or no correlation with the target feature. To eliminate these uncorrelated features, selectKbest with f_regression feature selection was used. f_regression, a widely used metric that ranks features highly correlated with the target, was used as the score function to obtain this set of variables. The selectKbest with f_regression first performs a univariate linear regression on every feature. Then, it calculates the F-statistic for every feature, indicating the overall significance of the feature in regression. Finally, it selects the k features with the highest F-statistics. The number of relevant features was identified by varying the feature set size (k) between 10 and 30 and training a vanilla LSTM model on these selected feature sets. The vanilla LSTM model used for this purpose had 1 LSTM layer of 50 units followed by a dense layer. The ReLU activation function was used with the ADAM optimiser function for the LSTM layer. The architecture of this LSTM model is shown in Figure 3.
Figure 3

Multivariate vanilla LSTM model (for feature selection).

Figure 3

Multivariate vanilla LSTM model (for feature selection).

Close modal
The performance of the model is evaluated on different feature sets. The model trained with the top 20 features gave the best results, as shown in Figure 4. The top 20 features identified are river water level, rain, dew point, evaporation from the top of the canopy, potential evaporation sum, skin reservoir content, subsurface runoff sum, surface net solar radiation sum, surface net thermal radiation sum, surface pressure, total evaporation sum, volumetric soil water layer 4, volumetric soil water layer 3, volumetric soil water layer 2, volumetric soil water layer 1, daily average discharge (m3/s), daily runoff (mm), average flow, last year (flow in cumecs), maximum temperature. Besides these 20 features, the previous day's flood_runoff values were also input for the model. It was observed that including previous flood_runoff to predict the current flood_runoff significantly improved the performance.
Figure 4

RMSE vs. feature set size (vanilla LSTM).

Figure 4

RMSE vs. feature set size (vanilla LSTM).

Close modal

One-day runoff prediction using ARIMA, Prophet, NeuralProphet, and LSTM

Different models have been used to forecast the next-day flood runoff. RMSE and R2 were used to compare the models. The first model used was ARIMA (Box et al. 2015). It is a statistical model that predicts future values based on historical trend analysis. It uses lagged moving averages to make predictions. It is used for non-seasonal data series. This did not perform well on this dataset due to seasonality in data. Then, the Prophet model developed by Facebook was used (Taylor & Letham 2017). It is an additive time series forecasting model that can fit non-linear trends of the data. It is robust to missing values and can handle them quite well. The main idea of the Prophet is signal decomposition. This model performed better than ARIMA but did not give satisfactory results. Next, the NeuralProphet model by Facebook (Triebe et al. 2021) was used. The NeuralProphet model was designed to overcome the shortcomings of the Prophet model. In NeuralProphet, an autoregressive model is fit using neural networks to enhance accuracy. This model performed better than the previous two. Finally, the deep learning model LSTM (Gers et al. 2000) was used, and it outperformed other models for one-day prediction on the combined dataset. Therefore, a multivariate multi-step LSTM model is proposed to predict the flood runoff for a week ahead.

Proposed multivariate multi-step LSTM model for a week ahead runoff prediction

For sequential data, LSTM (Gers et al. 2000) is frequently employed because it can detect complex patterns in input data. It can learn the significance of order while predicting the data. LSTM can figure out the long-term dependency in data through the three gates interacting with each other. Gates are made of a sigmoidal neural layer and multiplication operator. The forget gate is the initial gate, and it is required to do away with data that is not useful. Next is an input gate layer that selects what attributes should be updated, and then a tanh layer builds a new candidate vector to be added to the state. The output of the forget gate and fresh candidate vector are used to update the cell state. Finally, the output gate calculates the output based on cell state and tanh function. Stacked LSTM is a model with multiple LSTM layers in which each layer functions independently with its own set of parameters. This structure allows the model to learn abstract and complex patterns in the input sequence. Multivariate LSTM takes in multiple features as input and provides output for the target variable. It can analyse the interactions among input features, leading to better predictions. The multi-step multivariate LSTM model takes in multiple features as input and predicts multiple future steps.

This research proposes a multi-step multivariate LSTM model to predict flood runoff for the next week. The architecture of the model is shown in Figure 5. The input sequences are provided to the model by combining data corresponding to n timesteps, where n is the look-back window. The model consists of two LSTM layers with 100 units each with ReLU activation, followed by a dense layer of seven units with linear activation. The seven neurons in the dense layer correspond to the seven days. The first LSTM layer takes in the input sequence and provides its output to the second LSTM layer. The dense layer takes the output from the second LSTM layer and provides flood runoff values for the next seven days as output. The input shape of this model was (b * n * f), the output shape was (b * 7), where n is the look-back window and, b represents the number of instances of input data taken n at a time, and f is the number of features selected from the dataset. The nth batch can be represented as: bn = [t(n), t(n + 1), t(n+ 2) …….t(n+f)]. Here, bn represents the nth batch, and t(n) represents the input features for the nth timestep.
Figure 5

Multi-step model for next 7 days of prediction.

Figure 5

Multi-step model for next 7 days of prediction.

Close modal

The multi-step LSTM model was trained on varying look-back window sizes. The look-back window changed between 30 days (1 month) to 548 days (1.5 years). The RMSE decreases on increasing the look-back window. However, WI, LMI and R2 are best for a look-back window of 183 days, as discussed in Section 3.2.

The dataset contained data spanning 6,222 days and was divided into train and test in the ratio of 70:30. Thus, the train dataset contained 4,355 days' data, and test data had 1,867 days' data. The model is trained on the train data and then validated using the test data. Experiments were conducted to select the most optimal parameters for the proposed models. The epochs while training the model varied from 50 to 100. It was observed that the model gave optimal results when trained on 100 epochs with a batch size of 64 and Adam optimiser. ReLU and linear functions were used as LSTM and dense layer activations, respectively.

Comparison of various models for 1-day prediction

Several models were used to predict the runoff values – ARIMA, Prophet, NeuralProphet, and LSTM. The performance of these models was compared using root mean square error (RMSE) and R2 metrics. The values of RMSE and R2 are depicted in Table 1. The values show that deep learning models, i.e., NeuralProphet and LSTM, perform better than the traditional statistical model ARIMA and the forecasting model Prophet. This highlights that deep learning-based approaches can predict time series better than statistical methods. Specifically, a lower RMSE indicates that predictions made by deep-learning models were closer to the actual values than statistical models. Among these, LSTM performs the best, showing promising results for forecasting. This is likely because LSTM, a deep learning model, can detect complex patterns in data. Hence, further experiments were conducted on variants of the LSTM model.

Table 1

Model performance (RMSE and R2)

ModelR2RMSE
ARIMA −1.8 1.84 
Prophet 0.1 0.06 
NeuralProphet 0.2 0.05 
LSTM 0.67 0.046 
ModelR2RMSE
ARIMA −1.8 1.84 
Prophet 0.1 0.06 
NeuralProphet 0.2 0.05 
LSTM 0.67 0.046 

Comparison of variants of LSTM

The performance of the LSTM models is evaluated using RMSE and mean average error (MAE), WI of Agreement (WI), LMI, and R2. RMSE measures the mean difference between actual values and values predicted by a model. It is used to evaluate the accuracy of the results obtained through regression. R2 is the statistical value in regression that explains what proportion of the dependent variable can be explained by the independent variable, and it explains the goodness of fit. WI of Agreement (Willmott et al. 2011), and LMI (Legates & McCabe 1999) are indices ranging between 0 and 1 used for hydrological model evaluation. According to the results shown in Table 2, it is concluded that the model with a look-back window of 183 days performs the best with RMSE of 0.05, MAE of 0.007, R2 of 0.67, WI of 0.83, and LMI of 0.584 for one-day predictions. It is also found that the optimal look-back window is within the range of 3–6 months. However, the model with a 548-day look-back window has a lower RMSE, but the R2 value indicates that it fits poorly on data – probably overfitting the data.

Table 2

LSTM model performance for 1-day prediction

Look-back windowLMIR2WIRMSEMAE
30 (1 month) 0.433 0.225  0.447  0.0614  0.0080 
90 (3 months) 0.518 0.633  0.849  0.0455  0.0074 
183 (6 months) 0.584 0.672  0.831  0.0466  0.0068 
365 (1 year) −3.812 0.085  0.271  0.0230  0.0056 
548 (1.5 years) −0.413 0.283  0.693  0.0087  0.0021 
Look-back windowLMIR2WIRMSEMAE
30 (1 month) 0.433 0.225  0.447  0.0614  0.0080 
90 (3 months) 0.518 0.633  0.849  0.0455  0.0074 
183 (6 months) 0.584 0.672  0.831  0.0466  0.0068 
365 (1 year) −3.812 0.085  0.271  0.0230  0.0056 
548 (1.5 years) −0.413 0.283  0.693  0.0087  0.0021 

For multi-day prediction, the experiments were conducted on the identified look-back window sizes of 3 months and 6 months, and the results are shown in Table 3.

Table 3

Multi-step multi variate LSTM model performance for 7-day prediction

Look-back windowForecast dayLMIR2WIRMSEMAE
90 (3 months) Day 1 0.518 0.633 0.849 0.0455 0.0074 
Day 2 0.488 0.533 0.818 0.0477 0.0075 
Day 3 0.463 0.466 0.794 0.0494 0.0076 
Day 4 0.444 0.422 0.772 0.0510 0.0077 
Day 5 0.426 0.388 0.749 0.0521 0.0077 
Day 6 0.411 0.344 0.713 0.0531 0.0078 
Day 7 0.383 0.374 0.741 0.0533 0.0085 
Average 0.448 0.4514 0.777 0.0504 0.0077 
183 (6 months) Day 1 0.584 0.672 0.831 0.0466 0.0068 
Day 2 0.570 0.631 0.827 0.0472 0.0069 
Day 3 0.540 0.592 0.831 0.0479 0.0072 
Day 4 0.420 0.526 0.816 0.0498 0.0088 
Day 5 0.418 0.468 0.778 0.0523 0.0086 
Day 6 0.452 0.381 0.712 0.0540 0.0075 
Day 7 0.395 0.333 0.666 0.0549 0.0081 
Average 0.483 0.515 0.780 0.0505 0.0077 
Look-back windowForecast dayLMIR2WIRMSEMAE
90 (3 months) Day 1 0.518 0.633 0.849 0.0455 0.0074 
Day 2 0.488 0.533 0.818 0.0477 0.0075 
Day 3 0.463 0.466 0.794 0.0494 0.0076 
Day 4 0.444 0.422 0.772 0.0510 0.0077 
Day 5 0.426 0.388 0.749 0.0521 0.0077 
Day 6 0.411 0.344 0.713 0.0531 0.0078 
Day 7 0.383 0.374 0.741 0.0533 0.0085 
Average 0.448 0.4514 0.777 0.0504 0.0077 
183 (6 months) Day 1 0.584 0.672 0.831 0.0466 0.0068 
Day 2 0.570 0.631 0.827 0.0472 0.0069 
Day 3 0.540 0.592 0.831 0.0479 0.0072 
Day 4 0.420 0.526 0.816 0.0498 0.0088 
Day 5 0.418 0.468 0.778 0.0523 0.0086 
Day 6 0.452 0.381 0.712 0.0540 0.0075 
Day 7 0.395 0.333 0.666 0.0549 0.0081 
Average 0.483 0.515 0.780 0.0505 0.0077 

The actual flood runoff values of the test data are plotted with the predicted values of the flood runoff for each day over the next week. They are shown in Figure 6. The graph shows the difference between actual and predicted values of Day 1 is minimal compared to the difference between actual and predicted values of Day 7. It shows that the performance degrades with increasing timesteps.
Figure 6

Actual vs. predicted flood runoff (day-wise).

Figure 6

Actual vs. predicted flood runoff (day-wise).

Close modal

The data used in all the above models only had the 20 attributes selected through feature selection and previous runoff values. This LSTM model showed significant improvement in performance as compared to statistical models. The model with a look-back window of 183 days (approximately 6 months) gave the best 7-day predictions.

Prediction validation using real data

To further evaluate the predictions of the proposed model, the actual flood event that occurred during the period of test data is compared with the predictions of the model during the same period. As visible in Figure 6, there is a peak in the flood runoff prediction during the timesteps 1,308–1,313. These timesteps correspond to 2 August to 7 August 2019 in our dataset. According to the news articles during this period (Smart City to Flood City: Residents Suffer as Heavy Rains Inundate Telangana's Warangal 2019), there was a flooding event in the Godavari River Basin at Perur. This shows that the predictions of the proposed model are valid.

The paper presents a promising way to predict floods by predicting the flood runoff. Different models were employed to accomplish this task: ARIMA, Prophet, NeuralProphet, and LSTM. The selected study area is the Godavari River region at the Perur water gauge station. The step-by-step construction of a comprehensive and interpretable dataset of climatological, hydrological, land, and vegetation-related parameters is also presented. The data were collected for 18 years (2002–2019) from four different sources. To identify the important features affecting flood runoff in a region, feature selection has been performed. The selected features include river water level, precipitation, temperature, surface pressure, evaporation, soil water content, daily runoff, and average river flow. Various experiments show that deep learning models give better results than statistical approaches. Among the LSTM, ARIMA, Prophet, and NeuralProphet models, the LSTM model performs the best. The best one-day prediction achieved on the proposed LSTM model has a RMSE value of 0.05, MAE of 0.007, WI of 0.83, LMI of 0.58, and R2 of 0.67. The identified optimal look-back window is 3–6 months. The deep learning model proposed in this research to predict flood runoff can be a vital component of the flood warning system to help in generating timely warnings, planning evacuation drives, formulating emergency responses, and, hence, mitigating the damage from floods. However, this study does not consider the frequently changing demographics of the flood-prone area, the factor of concretisation around the areas near the river, and the effect of global warming. Though difficult to quantify, these parameters might sometimes have a role in inducing floods. This kind of study will be useful in flood mapping and hot spot analysis with improved spatial resolution of the target area. For future work, the factors related to urbanisation and global warming can be incorporated into the model.

N.G., S.N., R.N., and S.R. performed the methodology, did software analysis, did validation, did formal analysis, investigated the study, and wrote the original draft. K.R.S. conceptualised and performed the methodology, supervised the study, wrote, reviewed, and edited the article.

The authors declare there is no conflict.

Atashi
V.
,
Gorji
H. T.
,
Shahabi
S. M.
,
Kardan
R.
&
Lim
Y. H.
2022
Water level forecasting using deep learning time-series analysis: a case study of Red river of the north
.
Water
14
(
12
),
1971
.
https://doi.org/10.3390/w14121971
.
Bagherzadeh
F.
,
Mehrani
M. J.
,
Basirifard
M.
&
Roostaei
J.
2021
Comparative study on total nitrogen prediction in wastewater treatment plant and effect of various feature selection methods on machine learning algorithms performance
.
Journal of Water Process Engineering
41
,
102033
.
https://doi.org/10.1016/j.jwpe.2021.102033
.
Barbetta
S.
,
Coccia
G.
,
Moramarco
T.
&
Todini
E.
2016
Case study: a real-time flood forecasting system with predictive uncertainty estimation for the Godavari river, India
.
Water
8
,
463
.
https://doi.org/10.3390/w8100463
.
Basheer Ahammed
K. K.
&
Pandey
A. C.
2022
Assessment and prediction of shoreline change using multi-temporal satellite data and geostatistics: a case study on the eastern coast of India
.
Journal of Water and Climate Change
13
(
3
),
1477
1493
.
https://doi.org/10.2166/wcc.2022.270
.
Box
G. E.
,
Jenkins
G. M.
,
Reinsel
G. C.
&
Ljung
G. M.
2015
Time Series Analysis: Forecasting and Control
.
John Wiley & Sons, Hoboken, New Jersey, USA
.
Brakenridge
G. R.
,
Kettner
A. J.
,
Paris
S.
,
Cohen
S.
&
Nghiem
S. V.
2010
River and Reservoir Watch Version 4.5, Satellite-Based River Discharge and Reservoir Area Measurements
.
DFO Flood Observatory, University of Colorado
,
USA
.
Available from: https://floodobservatory.colorado.edu/SiteDisplays/20.htm (accessed 20 February 2023)
.
Chang
L.-C.
,
Amin
M.
,
Yang
S.-N.
&
Chang
F.-J.
2018
Building ANN-Based regional multi-step-ahead flood inundation forecast models
.
Water
10
(
9
),
1283
.
https://doi.org/10.3390/w10091283
.
Damle
C.
&
Yalcin
A.
2007
Flood prediction using time series data mining
.
Journal of Hydrology
333
(
2–4
),
305
316
.
https://doi.org/10.1016/j.jhydrol.2006.09.001
.
Ding
Y.
,
Zhu
Y.
,
Feng
J.
,
Zhang
P.
&
Cheng
Z.
2020
Interpretable spatio-temporal attention LSTM model for flood forecasting
.
Neurocomputing
403
,
348
359
.
https://doi.org/10.1016/j.neucom.2020.04.110
.
Elsafi
S. H.
2014
Artificial Neural Networks (ANNs) for flood forecasting at Dongola Station in the River Nile, Sudan
.
Alexandria Engineering Journal
53
(
3
),
655
662
.
https://doi.org/10.1016/j.aej.2014.06.010
.
Floods NDMA
2007
.
Available from: https://ndma.gov.in/Natural-Hazards/Floods (accessed 20 February 2023)
.
Floods [WWW Document]
2022
.
Floods
.
Available from: https://www.who.int/health-topics/floods#tab=tab_1 (accessed 20 March 2023)
.
Furquim
G.
,
Pessin
G.
,
Faiçal
B. S.
,
Mendiondo
E. M.
&
Ueyama
J.
2016
Improving the accuracy of a flood forecasting model by means of machine learning and chaos theory
.
Neural Computing and Applications
27
(
5
),
1129
1141
.
https://doi.org/10.1007/s00521-015-1930-z
.
Gers
F. A.
,
Schmidhuber
J.
&
Cummins
F.
2000
Learning to forget: continual prediction with LSTM
.
Neural Computation
12
(
10
),
2451
2471
.
https://doi.org/10.1162/089976600300015015
.
Ha
S.
,
Liu
D.
&
Mu
L.
2021
Prediction of Yangtze River streamflow based on deep learning neural network with El Niño–Southern Oscillation
.
Scientific Reports
11
.
https://doi.org/10.1038/s41598-021-90964-3
.
Ighile
E. H.
,
Shirakawa
H.
&
Tanikawa
H.
2022
Application of GIS and machine learning to predict flood areas in Nigeria
.
Sustainability
14
(
9
),
5039
.
https://doi.org/10.3390/su14095039
.
IMDLIB
.
2020
IMDLIB – A Python Library for IMD Gridded Data – IMDLIB Documentation
.
Available from: https://imdlib.readthedocs.io/en/latest/index.html (accessed 20 February 2023)
.
India-WRIS
.
2016
.
Available from: https://indiawris.gov.in/wris/#/ (accessed 20 February 2023)
.
India-WRIS Godavari
.
2014
.
Available from: https://indiawris.gov.in/downloads/Godavari%20Basin.pdf (accessed 4 August 2023)
.
Jiang
Z.
,
Yang
S.
,
Liu
Z.
,
Xu
Y.
, Xiong, Y., Qi, S., Pang, Q., Xu, J., Liu, F. & Xu, T.
2022
Coupling machine learning and weather forecast to predict farmland flood disaster: a case study in Yangtze river basin
.
Environmental Modelling & Software
155
, 105436.
https://doi.org/https://doi.org/10.1016/j.envsoft.2022.105436
.
Keum
H. J.
,
Han
K. Y.
&
Kim
H. I.
2020
Real-time flood disaster prediction system by applying machine learning technique
.
KSCE Journal of Civil Engineering
24
(
9
),
2835
2848
.
https://doi.org/10.1007/s12205-020-1677-7
.
Kim
H. I.
&
Kim
B. H.
2020
Flood hazard rating prediction for urban areas using random forest and LSTM
.
KSCE Journal of Civil Engineering
24
(
12
),
3884
3896
.
https://doi.org/10.1007/s12205-020-0951-z
.
Le
X. H.
,
Ho
H. V.
,
Lee
G.
&
Jung
S.
2019
Application of long short-term memory (LSTM) neural network for flood forecasting
.
Water
11
(
7
),
1387
.
https://doi.org/10.3390/w11071387
.
Legates
D. R.
&
McCabe
G. J.
1999
Evaluating the use of ‘goodness-of-fit’ measures in hydrologic and hydroclimatic model validation
.
Water Resources Research
35
,
233
241
.
https://doi.org/10.1029/1998wr900018
.
Li
W.
,
Kiaghadi
A.
&
Dawson
C.
2021
Exploring the best sequence LSTM modeling architecture for flood prediction
.
Neural Computing and Applications
33
(
11
),
5571
5580
.
https://doi.org/10.1007/s00521-020-05334-3
.
Liu
D.
,
Jiang
W.
,
Mu
L.
&
Wang
S.
2020
Streamflow prediction using deep learning neural network: case study of Yangtze river
.
IEEE Access
8
,
90069
90086
.
https://doi.org/10.1109/ACCESS.2020.2993874
.
Miau
S.
&
Hung
W.-H.
2020
River flooding forecasting and anomaly detection based on deep learning
.
IEEE Access
8
,
198384
198402
.
https://doi.org/10.1109/ACCESS.2020.3034875
.
Mosaffa
H.
,
Sadeghi
M.
,
Mallakpour
I.
,
Naghdyzadegan Jahromi
M.
&
Pourghasemi
H. R.
2022
Application of machine learning algorithms in hydrology
.
Computers in Earth and Environmental Sciences
,
585
591
.
https://doi.org/10.1016/b978-0-323-89861-4.00027-0
.
Muñoz-Sabater
J.
,
Dutra
E.
,
Agustí-Panareda
A.
,
Albergel
C.
,
Arduini
G.
,
Balsamo
G.
,
Boussetta
S.
,
Choulga
M.
,
Harrigan
S.
,
Hersbach
H.
,
Martens
B.
,
Miralles
D. G.
,
Piles
M.
,
Rodríguez-Fernández
N. J.
,
Zsoter
E.
,
Buontempo
C.
&
Thépaut
J. N.
2021
ERA5-Land: a state-of-the-art global reanalysis dataset for land applications
.
Earth System Science Data
13
(
9
),
4349
4383
.
https://doi.org/10.5194/ESSD-13-4349-2021
.
Nanditha
J. S.
&
Mishra
V.
2021
On the need of ensemble flood forecast in India
.
Water Security
12
,
100086
.
https://doi.org/10.1016/j.wasec.2021.100086
.
Panigrahi
B. K.
,
Das
S.
,
Nath
T. K.
&
Senapati
M. R.
2018
An application of data mining techniques for flood forecasting: application in rivers Daya and Bhargavi, India
.
Journal of The Institution of Engineers (India): Series B
99
(
4
),
331
342
.
https://doi.org/10.1007/s40031-018-0333-9
.
Paul
A.
,
Bhowmik
R.
,
Chowdary
V. M.
,
Dutta
D.
,
Sreedhar
U.
&
Sankar
H. R.
2017
Trend analysis of time series rainfall data using robust statistics
.
Journal of Water and Climate Change
8
,
691
700
.
https://doi.org/10.2166/wcc.2017.141
.
Puttinaovarat
S.
&
Horkaew
P.
2020
Flood forecasting system based on integrated big and crowdsource data by using machine learning techniques
.
IEEE Access
8
,
5885
5905
.
https://doi.org/10.1109/ACCESS.2019.2963819
.
Rahimzad
M.
,
Moghaddam Nia
A.
,
Zolfonoon
H.
,
Soltani
J.
,
Danandeh Mehr
A.
&
Kwon
H.-H.
2021
Performance comparison of an LSTM-based deep learning model versus conventional machine learning algorithms for streamflow forecasting
.
Water Resources Management
35
(
12
),
4167
4187
.
https://doi.org/10.1007/s11269-021-02937-w
.
Ramos Filho
G. M.
,
Coelho
V. H. R.
,
Freitas
E. d. S.
,
Xuan
Y.
&
Almeida
C. d. N.
2021
An improved rainfall-threshold approach for robust prediction and warning of flood and flash flood hazards
.
Natural Hazards
105
(
3
),
2409
2429
.
https://doi.org/10.1007/s11069-020-04405-x
.
Sampurno
J.
,
Vallaeys
V.
,
Ardianto
R.
&
Hanert
E.
2022
Integrated hydrodynamic and machine learning models for compound flooding prediction in a data-scarce estuarine delta
.
Nonlinear Processes in Geophysics
29
,
301
315
.
https://doi.org/10.5194/npg-29-301-2022
.
Smart city to flood city: Residents suffer as heavy rains inundate Telangana's Warangal
.
2019
.
Solomatine
D.
,
See
L.
,
Abrahart
R.
,
2009
Data-driven modelling: concepts, approaches and experiences
. In:
Practical Hydroinformatics. Water Science and Technology Library
, Vol.
68
(
Abrahart
R. J.
,
See
L. M.
&
Solomatine
D. P.
, eds).
Springer
,
Berlin, Heidelberg
.
https://doi.org/10.1007/978-3-540-79881-1_2
.
Stein
L.
,
Clark
M. P.
,
Knoben
W. J. M.
,
Pianosi
F.
&
Woods
R. A.
2021
How do climate and catchment attributes influence flood generating processes? A large-sample study for 671 catchments across the contiguous USA
.
Water Resources Research
57
(
4
),
1
10
,
https://doi.org/10.1029/2020WR028300
.
Taylor
S. J.
&
Letham
B.
2017
Forecasting at scale
.
PeerJ Preprints
5
,
e3190v2
.
https://doi.org/10.7287/peerj.preprints.3190v2
.
Triebe
O.
,
Hewamalage
H.
,
Pilyugina
P.
,
Laptev
N.
,
Bergmeir
C.
&
Rajagopal
R.
2021
Neuralprophet: Explainable forecasting at scale. arXiv preprint arXiv:2111.15397
.
Vogeti
R. K.
,
Boindala
S. P.
,
Kumar
D. N.
&
Raju
K. S.
2022
Streamflow forecasting in a climate change perspective using E-FUSE
.
Journal of Water and Climate Change
13
,
3934
3950
.
https://doi.org/10.2166/wcc.2022.251
.
Water Resources Karnataka
.
2022
.
Willmott
C. J.
,
Robeson
S. M.
&
Matsuura
K.
2011
A refined index of model performance
.
International Journal of Climatology
32
,
2088
2094
.
https://doi.org/10.1002/joc.2419
.
Wu
Z.
,
Zhou
Y.
&
Wang
H.
2020
Real-Time prediction of the water accumulation process of urban stormy accumulation points based on deep learning
.
IEEE Access
8
,
151938
151951
.
https://doi.org/10.1109/ACCESS.2020.3017277
.
Yoosefdoost
I.
,
Khashei-Siuki
A.
,
Tabari
H.
&
Mohammadrezapour
O.
2022
Runoff simulation under future climate change conditions: performance comparison of data-mining algorithms and conceptual models
.
Water Resources Management
36
,
1191
1215
.
https://doi.org/10.1007/s11269-022-03068-6
.
Zhu
Z.
&
Zhang
Y.
2022
Flood disaster risk assessment based on random forest algorithm
.
Neural Computing and Applications
34
(
5
),
3443
3455
.
https://doi.org/10.1007/s00521-021-05757-6
.
Zhu
Y.
,
Feng
J.
,
Yan
L.
,
Guo
T.
&
Li
X.
2020
Flood prediction using rainfall-flow pattern in data-Sparse watersheds
.
IEEE Access
8
,
39713
39724
.
https://doi.org/10.1109/ACCESS.2020.2971264
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).