This research evaluates the performance of deep learning (DL) models in predicting rainfall in George Town, Penang, utilizing the open-source NASA POWER meteorological data, which includes variables such as rainfall, dew point, solar radiation, wind speed, relative humidity, and temperature. This study introduces a newly developed hybrid DL based on the integration of a 2D convolutional neural network (CNN2D) with a bidirectional recurrent neural network (BRNN) and a bidirectional gated recurrent unit (BGRU). The proposed models, CNN2D–BGRU and BRNN–BGRU, were compared against standalone models CNN2D, BRNN, and BGRU. The results indicate that the BRNN–BGRU model is the most effective, with a root mean square error (RMSE) value of 2.59, a mean absolute error (MAE) value of 1.97, a Pearson correlation coefficient (PCC) value of 0.79, and a Willmott index (WI) value of 0.88. In a 3-day prediction, the BRNN–BGRU model also performed the best, with a test WI value of 0.83, a PCC value of 0.69, a RMSE value of 3.02, and MAE value of 2.34. The hybrid BRNN–BGRU model consistently excels in predicting multi-step rainfall in tropical regions using the NASA POWER dataset. These findings can contribute to the development of advanced rainfall-predicting systems for more effective management of water resources and flooding in urban areas.

  • Hybrid deep learning models substantially enhance rainfall prediction accuracy in George Town.

  • NASA POWER data effectively mitigate the issue of limited observational data.

  • The bidirectional recurrent neural–bidirectional gated recurrent unit hybrid model demonstrated superior performance for both 1- and 3-days rainfall predictions.

Rainfall is a vital component of the water cycle (Gu et al. 2022) and plays a significant role in predicting natural disasters (Jamei et al. 2023). In recent years, the spatiotemporal distribution of rainfall has undergone substantial changes due to the impacts of climate change. These regional shifts have led to adverse consequences, such as increased variability in heavy rainfall events and prolonged drought periods (Faiz et al. 2018, 2021; Baig et al. 2024). This results in natural disasters such as flash floods, landslides, and crop damage often occurring. For instance, Ghaderpour et al. (2024) found that heavy rain, particularly after a prolonged drought, can trigger landslides. Additionally, such intense rainfall can damage crops, resulting in severe losses for farmers (Fu et al. 2023; Su & Kuo 2023b). Therefore, accurate rainfall prediction is essential for various sectors, including agriculture, hydrology, environmental management, and disaster risk reduction (Latif et al. 2023).

In past research, various methodologies have been adopted for rainfall modeling, with statistical methods being widely used for predicting rainfall (Baig et al. 2024). However, the increasing frequency and intensity of extreme rainfall events have challenged the accuracy of these traditional approaches. Consequently, researchers have increasingly turned to artificial intelligence (AI)-based models, including machine learning (ML) and deep learning (DL) models, to enhance the accuracy and performance of predictions for extreme rainfall patterns under these changing conditions. For example, Baig et al. (2024) utilized a range of models, such as multilayer perceptron (MLP), random forest (RF), eXtreme Gradient Boosting (XGBoost), linear regression (LR), long short-term memory (LSTM), gradient boost, and support vector machine (SVM), along with their ensemble methods, to improve the accuracy of monthly rainfall predictions. Singarasubramanian et al. (2024) applied models like k-nearest neighbors (KNN), Naive Bayes, SVM, LR, classification and regression trees (CART), RF, and linear discriminant analysis to classify rainfall at Sathanur Dam, India, and found the CART model to be most accurate. Gu et al. (2022) employed stacking methods to ensemble models, including artificial neural networks (ANN), XGBoost, KNN, and SVM, to improve monthly rainfall prediction based on data from the China Meteorological Data Service Centre, China Meteorological Administration.

The shift from traditional ML techniques to DL models has improved the ability to analyze rainfall, which has become increasingly complex due to climate change. Previous research shows that DL models outperform ML techniques in rainfall forecasting. For example, Alqahtani (2024) applied the LSTM method to forecast monthly rainfall in Mecca, optimizing model hyper-parameters using a grid search strategy. Similarly, Salaeh et al. (2022) employed various ML and DL models, including MLP, M5, RF, LSTM, and SVM, to predict multi-step monthly rainfall in the Thale Sap Songkhla region. Their findings demonstrated that the LSTM model offered more accurate predictions. Additionally, Endalie et al. (2022) employed an LSTM network to predict rainfall in Oromia, Ethiopia, comparing its performance with ML models such as Decision Tree (DT), MLP, KNN, and SVM. Like Salaeh et al. (2022), they concluded that the LSTM model provided superior prediction accuracy. Thus, DL models have proven to be more effective than conventional ML models in capturing complex patterns in rainfall patterns.

Several strategies can improve the performance of DL for predicting precipitation, including the hybrid model (Sheikh Khozani et al. 2022), feature engineering (AlDahoul et al. 2023), feature selection (Pei et al. 2020), and parameter optimization through metaheuristic algorithms (Khosravi et al. 2024). Khan & Maity (2020) developed a hybrid model combining 1D Convolutional Layer (Conv1D) and MLP, referred to as the hybrid Conv1D–MLP. This model outperformed the compared model, demonstrating an improvement in the accuracy of multi-step predictions. Sheikhi et al. (2023) employed the invasive weed optimization algorithm, firefly optimization algorithm, and genetic particle swarm optimization algorithm (GAPSO) to enhance the performance of ANN and the group method of data handling (GMDH). They also used wavelet transformation, which is used to assess variations across multiple temporal scales, identify specific rainfall events, facilitate data compression, and detect changes and trends in rainfall patterns. The results showed that the GAPSO-wavelet-ANN model achieved the highest accuracy with the lowest root mean square error (RMSE), outperforming all other hybrid models. Despite these advancements, there is limited research into hybrid models that integrate two DL techniques. Hybrid approaches have the potential to optimize the predictive capabilities of individual models. For instance, Chhetri et al. (2020) demonstrated that the Bidirectional Long Short-Term Memory (BLSTM)–gated recurrent unit (GRU) hybrid improved RMSE results by 41.1% compared to the standalone LSTM model for monthly rainfall prediction. Thottungal Harilal et al. (2024) utilized hybrid DL models, specifically the Convolutional Neural Network (CNN)–LSTM and recurrent neural network (RNN)–LSTM hybrids, to predict daily rainfall. Their findings revealed that these hybrid models outperformed the other tested models. Additionally, Jamei et al. (2024b) showed that hybrid models provide greater accuracy than standalone models for predicting river streamflow. The literature review indicates that hybrid DL models offer higher accuracy for predictions compared to standalone models. However, the integration of hybrid models for tropical rainfall predictions remains underexplored.

The application of the NASA POWER dataset in tropical countries has been less explored in the literature (Tan et al. 2023). However, recent studies have highlighted its relevance to hydrological research and climate studies, particularly in regions lacking ground-based weather data. Kadhim Tayyeh & Mohammed (2023) demonstrated the potential of NASA POWER to generate weather datasets in the Euphrates River Basin, Iraq, highlighting its role in increasing the accessibility of climate data. Similarly, Rodrigues & Braga (2021) found that NASA POWER data effectively complemented ground observations of solar radiation, temperature, and humidity in the Alentejo Region, Portugal. Dharmayasa et al. (2022) validated the reliability of NASA POWER for rainfall estimates in Bali, Indonesia, highlighting its strong correlation with observed data, although there were some discrepancies in average rainfall values. Bandira et al. (2023) demonstrated the usefulness of NASA POWER in estimating climate variability in Northern Peninsular Malaysia, particularly for identifying optimal locations for solar farming. Khan et al. (2024) used classification models such as MLP, LR, RF, and decision trees to predict rainfall in Aligarh with NASA POWER data from 2013 to 2022. This approach shows that classification models can provide useful insights in predicting rainfall patterns using NASA POWER. While these studies suggest that the NASA POWER dataset is of high quality, its application in tropical rainfall prediction remains limited.

This study aims to develop both hybrid and standalone models to predict tropical rainfall in Georgetown, Malaysia, utilizing NASA POWER meteorological data. Three standalone models were developed: CNN2D, bidirectional recurrent neural (BRNN), and bidirectional gated recurrent unit (BGRU), along with two new hybrid models, CNN2D–BGRU and BRNN–BGRU, to evaluate their performance in predicting 1- and 3-day future tropical rainfall. Weather predictions are generally more accurate for short time frames, typically ranging from a few hours to about a week (Sarker 2022). In addition, multi-step ahead rainfall prediction is also important for real applications (Zhou et al. 2024). To improve accuracy, the Savitzky–Golay method was used to address outliers in the dataset (Pham et al. 2024), although its use in hybrid DL models for rainfall predictions is less common. The results are useful to improve disaster management strategies by enabling more effective planning and mitigation for extreme rainfall events.

Study area

Located in the northeastern region of Penang, Georgetown is highly susceptible to severe flooding due to heavy and prolonged rains affecting the catchment area (Saleh et al. 2022). For instance, a report from Department of Irrigation and Drainage Penang indicates that the flood in November 2017 was caused by high-intensity rainfall averaging 250 mm (Mazalan 2017). The city is located within the Pinang River basin, which covers only 51 km², as shown in Figure 1. The specific study location is situated at a latitude of 5.4097 and a longitude of 100.3151. According to Gao et al. (2020), Penang experiences a tropical rainforest climate, with an average annual rainfall of 2,477 mm/year. Rainfall varies throughout the year, reaching a low of 60 mm/month in February and a peak of approximately 210 mm/month between August and October. Daytime temperatures typically range from 29 to 35 °C, while nighttime temperatures generally fall between 26 and 29 °C.
Figure 1

The location of the study point in George Town, Penang.

Figure 1

The location of the study point in George Town, Penang.

Close modal

Datasets

The selected study site is located in the George Town area, which is prone to flooding. This research aims to provide preliminary information that can assist local stakeholders to improve disaster management strategies in flood-prone areas. The NASA POWER platform allows users to extract data in multiple formats, point, regional, and global, facilitating efficient data retrieval. For this study, the specific location is at a latitude of 5.4097 and a longitude of 100.3151, with the data covering the period from 1 January 2017 to 31 December 2021. An 80%/20% split of the complete dataset was used for training and validation purposes, respectively.

The NASA POWER dataset includes minimum temperature (Tmin), solar radiation (SR), maximum temperature (Tmax), maximum windspeed (WSmax), average windspeed, minimum windspeed (WSmin), average temperature (AvT), dew point, and relative humidity, which are freely accessible at https://power.larc.nasa.gov/data-access-viewer/. According to Stackhouse (2020), NASA POWER supplies high-resolution rainfall data obtained from NASA's Global Precipitation Measurement (GPM) mission's Integrated Multi-satellite Retrievals for GPM (IMERG), with a resolution of 0.1° latitude by 0.1° longitude. Other meteorological variables are obtained from NASA's GMAO Modern Era Retrospective-Analysis for Research and Applications (MERRA-2) assimilation model and the Goddard Earth Observing System (GEOS) 5.12.4 FP-IT (Bosilovich et al. 2017). For the SR parameter, data are obtained from the NASA GEWEX SRB Integrated Output 4-Product (R4-IP) archive and the NASA CERES SYN1deg and FLASHFlux projects.

This research adopts a comprehensive methodology, comprising three distinct phases, in order to develop an effective process for prediction model development, as shown in Figure 2. The first stage involves data acquisition and preprocessing, as discussed in the Datasets section.
Figure 2

Methodology flow chart for predicting rainfall using hybrid and standalone DL models in George Town, Penang.

Figure 2

Methodology flow chart for predicting rainfall using hybrid and standalone DL models in George Town, Penang.

Close modal

To enhance the quality and performance of the model, the dataset must undergo preprocessing, which includes improving data quality, scaling inputs, and addressing outliers. Preprocessing is crucial for optimizing datasets for model development. In this study, the MinMaxScaler method was employed to normalize the input data features, thereby improving the relevance and reliability of the predictive models. Additionally, the Savitzky–Golay filter was applied for smoothing and noise reduction, as supported by existing literature (Lian et al. 2023; Pham et al. 2024). According to Pham et al. (2024), this filter can improve data accuracy without distorting the signal trend, producing high-quality input data for use in AI model prediction. Additionally, the Savitzky–Golay filter uses local least squares polynomial approximation.

The next phase involves the modeling process, where three DL models, CNN2D, BRNN, and BGRU, were evaluated and utilized for nonlinear rainfall predictions. To improve the performance of the standalone models, the study introduces hybrid models that integrate CNN2D with BGRU and BRNN with BGRU for more sophisticated rainfall forecasting. The objective of developing these hybrid models is to better capture and interpret the complex rainfall patterns present in the dataset. The final phase focuses on evaluation and validation using statistical metrics such as RMSE, mean absolute error (MAE), Pearson correlation coefficient (PCC), and Willmott index (WI) to assess the performance and reliability of the models. This evaluation aims to identify the most effective model configuration for practical urban rainfall prediction.

2D convolutional neural network

In the CNN model, the fundamental architecture consists of an input layer, a convolution layer, a maximum pooling layer, a fully connected layer, and an output layer. The input layer is represented by an m × n matrix, where each element corresponds to a feature value, forming a two-dimensional feature map. The convolution layer, equipped with multiple convolution units, uses backpropagation to optimize its parameters and extract various features from the input layer (Wang et al. 2019). Following the convolution layer, pooling mechanisms are employed to lower the feature dimension, thereby reducing the coefficient number and reducing the risk of overfitting by simplifying the network (Mozo et al. 2018; Shu et al. 2021; Hakim et al. 2022). This aggregation process involves dimensionality reduction before the data are passed through fully connected layers (Lee et al. 2020; Hakim et al. 2022).

CNN, as a specific form of ANN feed, has shown significant effectiveness in the processing and analysis of digital images due to its layered architecture that facilitates comprehensive feature extraction and processing (Kuo 2016; Silva et al. 2023). The pooling layer is typically placed following the convolution layers (Su & Jiang 2023a). The maximum pooling is the most prominent pooling method used (Silva et al. 2023). The maximum pooling layer helps remove weaker features to reduce the number of parameters in the network and prevent overfitting (Silva et al. 2023; Su & Jiang 2023a). CNN also serves as a predictive time series solution, levering convolutional dilatation, which allows the filter to be used for intercellular dilatation calculation (Wibawa et al. 2022).

Bidirectional recurrent neural network

The RNN model was introduced by Elman in 1990. It represents a special class of ANN specifically designed to process sequential and time series data (Elman 1990). Unlike traditional feed-forward networks, RNN features recurrent layers that allow neurons to connect within the same layer and across adjacent layers, allowing the network to retain and utilize information from previous inputs (Sahoo et al. 2023). The internal state, or memory, is essential for predicting future values based on the past data, enabling RNN models to be effective and suitable for time series prediction and sequential data analysis (Khosravi et al. 2023b). RNNs can detect patterns and predict the next event by using historical information. It thus offers a time-efficient approach to analyzing sequential data (Jamei et al. 2024a). The ability of RNNs to recall and incorporate past values in their predictions emphasizes the efficiency of this model in tasks where historical information directly affects future outcomes (Zakhrouf et al. 2021; Tao et al. 2024). Figure 3 illustrates the structure of the BRNN model.
Figure 3

The illustration of the structure: (a) BRNN model and (b) BGRU model.

Figure 3

The illustration of the structure: (a) BRNN model and (b) BGRU model.

Close modal
The BRNN typically exhibits superior performance compared to a unidirectional RNN (Jamei et al. 2024b). This is due to the inclusion of a backward RNN in the BRNN, which enables the extraction of additional features (Jamei et al. 2024b). The BRNN outputs are integrated through a linear transformation, which can be demonstrated by the following equation:
(1)
where represents historical data in the negative time direction at specified time point t, b signifies the bias of the linear transformation, denotes positive historical information at time point t and W represents the weight of the linear transformation.

Bidirectional gated recurrent unit

The GRU model, introduced by Cho et al. (2014), represents an extended variant of RNN designed to overcome some of the weaknesses or limitations of LSTM networks (Chhetri et al. 2020). GRU offers advantages over LSTM models, such as fewer parameters and faster convergence, making it particularly effective for smaller datasets (Chung et al. 2014; Pan et al. 2020). Then, the GRU model offers the advantage as a type of RNN that is very effective in identifying and capturing temporal dependance, even in extended sequences, which is an important aspect of time series prediction (Han et al. 2024). The GRU architecture is a combination of two main components: an update gate and a reset gate. Update gates determine the extent to which information from the previous time step is carried forward to the current step, while reset gates control the degree to which previous information is ignored. These features contribute to the superior performance of GRU in terms of calculation efficiency and parameter updates compared to LSTM models (Guo et al. 2021). The BGRU is a neural network model that includes both unidirectional and asymmetric GRUs (Jamei et al. 2024b), which can be expressed in the following equation:
(2)
where and represent the outputs of GRUs that gather information in disparate ways. The function represents a process such as summation, concatenation, or average. Figure 3 shows the structure of the BGRU model.

Hybrid models

To increase the ability of DL models, this study developed hybrid models, CNN2D–BGRU and BRNN–BGRU, that integrate two different model layers. Each selected hybrid model has its own advantages. For example, Su & Jiang (2023a) utilized the CNN–BGRU hybrid model for tide level prediction, leveraging CNN's effectiveness in extracting valuable features from images or sequences and BGRU's strength in processing time series data. BGRU's robust fitting capabilities allow it to capture nonlinear features, while its bidirectional structure facilitates the modeling of potential relationships among past, present, and future data, thereby enhancing prediction accuracy (Bao et al. 2023). The BRNN framework, an extension of the RNN framework, processes input in both forward and backward directions (Vijay & Aparna 2023), allowing it to capture context from both past and future information. Figure 4 presents the proposed hybrid models, CNN2D–BGRU and BRNN–BGRU, for rainfall prediction in George Town, Penang.
Figure 4

The structure of the hybrid models includes (a) CNN2D–BGRU and (b) BRNN–BGRU.

Figure 4

The structure of the hybrid models includes (a) CNN2D–BGRU and (b) BRNN–BGRU.

Close modal

Model configuration

This work employs CNN2D, BRNN, and BGRU models, as well as the development of a hybrid model, CNN2D–BGRU and BRNN–BGRU, to enhance the performance of the standalone model. In the development of the CNN2D–BGRU model, the CNN2D layer first processes the spatial features from the reshaped input data, while the BGRU layer captures the temporal dependence. The outputs from these two branches are combined and further processed through additional dense layers to refine features and make predictions.

For the BRNN–BGRU model, the second model architecture uses separate BRNN and BGRU branches to analyze temporal patterns from different perspectives. The output from these branches is combined, passed through a dropout layer to reduce overfitting and then processed by a final dense layer before making predictions. The same setup model was used to test the prediction 3 days in advance. This is to assess the model's capacity to make future predictions. Table 1 shows the parameter settings for all of the models, including hybrid models.

Table 1

Model configuration

ModelLayers and parametersFilters/unitsKernel size/pool sizeDense units
CNN2D Conv2D, MaxPooling2D, dense 128, 256 filters Kernel size: (3, 3), pool size: (2, 2) 256 
BRNN Bidirectional LSTM (RNN), dropout, dense 200, 100 units – 64 
BGRU Bidirectional GRU, dropout, ensDe 128, 64 units – 128 
BRNN–BGRU Bidirectional LSTM (RNN), bidirectional GRU, dense LSTM: 200, 100 units, GRU: 128, 64 units – 128 
CNN2D–BGRU Conv2D, MaxPooling2D, bidirectional GRU, Dense Conv2D: 128, 256 filters, GRU: 128, 64 units Kernel size: (3, 3), pools: (2, 2) 128 
ModelLayers and parametersFilters/unitsKernel size/pool sizeDense units
CNN2D Conv2D, MaxPooling2D, dense 128, 256 filters Kernel size: (3, 3), pool size: (2, 2) 256 
BRNN Bidirectional LSTM (RNN), dropout, dense 200, 100 units – 64 
BGRU Bidirectional GRU, dropout, ensDe 128, 64 units – 128 
BRNN–BGRU Bidirectional LSTM (RNN), bidirectional GRU, dense LSTM: 200, 100 units, GRU: 128, 64 units – 128 
CNN2D–BGRU Conv2D, MaxPooling2D, bidirectional GRU, Dense Conv2D: 128, 256 filters, GRU: 128, 64 units Kernel size: (3, 3), pools: (2, 2) 128 

Model evaluation

Four evaluation metrics, such as RMSE, MAE, PCC, and WI, are used in this study, which have been previously applied by researchers to assess the performance of both standalone and hybrid models (Wang et al. 2022). The same evaluation metrics are used to assess the capability of the proposed models for multi-step predictions. These metrics are crucial for measuring the reliability and accuracy of the models, offering a thorough evaluation of their performance. Additionally, Taylor diagrams are utilized to visualize and compare model performance. These diagrams are graphical tools that help assess the accuracy of mathematical models by comparing simulated data with observed data (Sammen et al. 2023). RMSE is commonly employed to evaluate differences between observed and predicted values. The RMSE metric assigns greater weight to extreme errors, whereas the MAE is calculated based on the mean magnitude of the error without consideration of its direction (Khosravi et al. 2023a). Consequently, RMSE and MAE provide a comprehensive and complementary analysis of model error. The WI provides a descriptive index that enables cross-comparison and evaluation of difference models (Piri et al. 2023). PCC represents an established methodology for quantifying the linear degree of correlation between variables (Hu et al. 2020). The following are equations for metric assessment that have been in use in previous studies (Willmott 1981; Ghorbani et al. 2018; Kim et al. 2023):

  • (a) RMSE
    (3)
  • (b) MAE
    (4)
  • (c) PCC
    (5)
  • (d) WI
    (6)
    where and denote the observed value and the predicted values, respectively, N represents the observed data, while and present the values of observation and prediction, respectively.

One-day-ahead prediction

Table 2 displays the performance results of the models for 1-day-ahead prediction. The CNN2D–BGRU model achieved the lowest RMSE value of 2.84, outperforming all other models. This suggests that the CNN2D–BGRU model delivers the most accurate predictions on average for a 1-day period compared to the BRNN–BGRU, BRNN, BGRU, and CNN2D models, which have RMSE values of 3.04, 3.14, 3.15, and 3.34, respectively. The standalone CNN2D model recorded the highest RMSE, indicating lower accuracy. Additionally, the CNN2D–BGRU model excelled in MAE performance with a value of 2.03. The MAE values for the other models are as follows: BRNN–BGRU (2.17), BGRU (2.24), and BRNN (2.27), with the CNN2D model showing a relatively high MAE value of 2.47. These results indicate that the CNN2D model is less effective compared to the others. Overall, the CNN2D–BGRU model demonstrated superior accuracy and consistency, with fewer errors and more reliable predictions across different datasets.

Table 2

The results of the developed standalone and hybrid models for rainfall prediction over the training and testing modeling phases of 1-day step ahead. Bold text indicates statistical significance at the 95% confidence level

ModelRMSEMAEPCCWI
Training 
CNN2D 3.34 2.47 0.73 0.79 
BRNN 3.14 2.27 0.76 0.84 
BGRU 3.15 2.24 0.76 0.83 
CNN2D–BGRU 2.84 2.03 0.82 0.87 
BRNN–BGRU 3.04 2.17 0.78 0.85 
Testing 
CNN2D 2.70 2.24 0.75 0.85 
BRNN 2.72 2.13 0.76 0.86 
BGRU 2.61 2.01 0.78 0.88 
CNN2D–BGRU 2.64 2.10 0.77 0.87 
BRNN–BGRU 2.59 1.97 0.79 0.88 
ModelRMSEMAEPCCWI
Training 
CNN2D 3.34 2.47 0.73 0.79 
BRNN 3.14 2.27 0.76 0.84 
BGRU 3.15 2.24 0.76 0.83 
CNN2D–BGRU 2.84 2.03 0.82 0.87 
BRNN–BGRU 3.04 2.17 0.78 0.85 
Testing 
CNN2D 2.70 2.24 0.75 0.85 
BRNN 2.72 2.13 0.76 0.86 
BGRU 2.61 2.01 0.78 0.88 
CNN2D–BGRU 2.64 2.10 0.77 0.87 
BRNN–BGRU 2.59 1.97 0.79 0.88 

Regarding the PCC results, the CNN2D–BGRU model achieved the highest value of 0.82, outperforming all other models on the training dataset. The BRNN–BGRU model followed with a PCC value of 0.78, while the BRNN, BGRU, and CNN2D models had lower PCC values of 0.76, 0.76, and 0.73, respectively. For the WI metric, the CNN2D–BGRU model also excelled, attaining the highest WI value of 0.87. It outperformed the BRNN–BGRU (0.85), BRNN (0.84), BGRU (0.83), and CNN2D (0.79) models. The CNN2D model performed the worst in terms of both PCC and WI metrics. Overall, the CNN2D–BGRU model demonstrated the best performance for 1-day-ahead prediction in the training dataset, whereas the CNN2D model showed less favorable results.

For the testing dataset, the BRNN–BGRU hybrid model achieved the lowest RMSE value of 2.59, outperforming all other models: BGRU (2.61), CNN2D–BGRU (2.64), CNN2D (2.70), and BRNN (2.72). The BGRU model came in second for performance, demonstrating strong predictive capability. In contrast, the standalone BRNN model had the highest error. The BRNN–BGRU model also excelled in terms of MAE, with a value of 1.97, surpassing the BGRU (2.01), CNN2D–BGRU (2.10), BRNN (2.13), and CNN2D (2.24) models. The CNN2D model showed the poorest performance in this metric, while the BGRU model achieved the second-highest accuracy score.

The BRNN–BGRU model attained the highest PCC score of 0.79 on the test dataset, with the standalone BGRU model following closely with a PCC value of 0.78. The other models recorded scores of 0.77 for CNN2D–BGRU, 0.76 for BRNN, and 0.75 for CNN2D. In terms of the WI metric, the BRNN–BGRU model excelled with a value of 0.88, surpassing the BGRU model (0.88), CNN2D–BGRU (0.87), BRNN (0.86), and CNN2D (0.85). The CNN2D model showed the lowest WI and PCC scores in this test dataset, while the standalone BGRU model achieved the second-highest score, demonstrating its effectiveness in enhancing the performance of both CNN2D and BRNN models.

The hybrid models outperform the standalone models, as shown by their improved performance metrics. During training, the CNN2D–BGRU hybrid model reduces RMSE by 14.88% and MAE by 17.96% compared to the CNN2D model. It also shows a 10.42% increase in PCC and an 8.98% increase in WI. In testing, the BRNN–BGRU model improves RMSE by 5.07% and MAE by 7.73% compared to the BRNN model. Additionally, it shows a 3.44% increase in PCC and a 2.27% increase in WI over the BRNN model. These results indicate that hybrid models enhance accuracy and strengthen overall performance metrics.

Figure 5 illustrates that the BRNN–BGRU model is effective for predicting low to moderate rainfall events. During training, the hybrid model outperformed the standalone models. However, none of the models performed well in predicting peak rainfall intensities. The CNN2D–BGRU and BRNN–BGRU models show a strong correlation with observed peaks, whereas the standalone models tend to slightly underestimate these peaks. In contrast, the CNN2D and BGRU standalone models slightly overestimate the observed values. Figure 6 depicts the residual plot of the models used to forecast rainfall 1-day step ahead prediction.
Figure 5

The line plot of the models used to predict rainfall 1-day step ahead.

Figure 5

The line plot of the models used to predict rainfall 1-day step ahead.

Close modal
Figure 6

The residual plot of the models used to predict rainfall 1-day step ahead.

Figure 6

The residual plot of the models used to predict rainfall 1-day step ahead.

Close modal

Three-day-ahead prediction

Extending predictions to 3 days' results in decreased performance across all models, as shown in Table 3, highlighting the increased complexity and uncertainty associated with longer-term forecasts. The CNN2D–BGRU model leads in RMSE on the training dataset with a score of 3.23, outperforming all other models. This model has consistently demonstrated superior performance in 1-day-ahead predictions with lower RMSE values. The other models reported the following RMSE values: BRNN–BGRU (3.44), BRNN (3.51), BGRU (3.54), and CNN2D (3.56). The CNN2D–BGRU model again outperformed the standalone CNN2D model, which had the highest RMSE. These results indicate that hybrid models can achieve better performance compared to individual models.

Table 3

The results of the developed standalone and hybrid models for rainfall prediction over the training and testing modeling phases of 3-day step ahead. Bold text indicates statistical significance at the 95% confidence level

ModelRMSEMAEPCCWI
Training 
CNN2D 3.56 2.65 0.69 0.75 
BRNN 3.51 2.53 0.69 0.78 
BGRU 3.54 2.53 0.68 0.78 
CNN2D–BGRU 3.23 2.34 0.75 0.82 
BRNN–BGRU 3.44 2.45 0.70 0.79 
Testing 
CNN2D 2.96 2.42 0.69 0.80 
BRNN 3.14 2.48 0.66 0.81 
BGRU 3.06 2.34 0.68 0.82 
CNN2D–BGRU 3.05 2.40 0.68 0.81 
BRNN–BGRU 3.02 2.34 0.69 0.83 
ModelRMSEMAEPCCWI
Training 
CNN2D 3.56 2.65 0.69 0.75 
BRNN 3.51 2.53 0.69 0.78 
BGRU 3.54 2.53 0.68 0.78 
CNN2D–BGRU 3.23 2.34 0.75 0.82 
BRNN–BGRU 3.44 2.45 0.70 0.79 
Testing 
CNN2D 2.96 2.42 0.69 0.80 
BRNN 3.14 2.48 0.66 0.81 
BGRU 3.06 2.34 0.68 0.82 
CNN2D–BGRU 3.05 2.40 0.68 0.81 
BRNN–BGRU 3.02 2.34 0.69 0.83 

The CNN2D–BGRU model achieved the lowest MAE value of 2.34, with the BRNN–BGRU hybrid model closely following at 2.45. The remaining models had MAE values of 2.53 for both BRNN and BGRU and 2.65 for CNN2D. For PCC, the standalone CNN2D–BGRU model led with a value of 0.75, followed by BRNN–BGRU at 0.70, BRNN and CNN2D at 0.69 each, and BGRU at 0.68. In terms of WI, the CNN2D–BGRU model also performed best with a score of 0.82, surpassing BRNN–BGRU (0.79), BRNN and BGRU (0.78 each), and CNN2D (0.75). The CNN2D model had the lowest WI score for the 3-day-ahead predictions, indicating that the CNN2D–BGRU hybrid model significantly enhances the performance of the standalone CNN2D model.

For the testing dataset, the CNN2D model achieved the lowest RMSE value of 2.96, outperforming the other models: BRNN–BGRU (3.02), CNN2D–BGRU (3.05), BGRU (3.06), and BRNN (3.14). In terms of MAE, the BGRU model performed best with a score of 2.34, slightly better than BRNN–BGRU (2.34) and CNN2D–BGRU (2.40), while CNN2D (2.42) and BRNN (2.48) showed higher values. For PCC, the BRNN–BGRU hybrid model led with 0.69, closely followed by CNN2D (0.69), BGRU (0.68), CNN2D–BGRU (0.68), and BRNN (0.66). Regarding the WI metric, the BRNN–BGRU model was the most efficient with a score of 0.83, surpassing BGRU (0.82), CNN2D–BGRU (0.81), BRNN (0.81), and CNN2D (0.80). Overall, the CNN2D model had the lowest performance across all metrics for the 3-day-ahead rainfall prediction.

Similar to the 1-day-ahead prediction, the hybrid models show better performance than the standalone models for 3-day-ahead predictions, though the improvement is slightly reduced. In the training phase, the CNN2D–BGRU model improves RMSE by 9.18% compared to the CNN2D model but shows an 11.55% increase in MAE. It also demonstrates better performance in PCC and WI metrics, with an 8.17 and 8.79% improvement, respectively. For the testing phase, the BRNN–BGRU model improves RMSE by 3.95% and MAE by 5.45% compared to the BRNN model. It also shows a 4.05% improvement in PCC and a 2.66% improvement in WI. These results indicate that hybrid models enhance accuracy and overall performance. Figure 7 illustrates similar trends as the 1-day predictions, showing that the BRNN–BGRU model is effective for low to moderate rainfall but struggles with peak rainfall intensity. The CNN2D model continues to show tendencies for overprediction and underprediction in both training and testing datasets. Figure 8 shows the residual plot of the models used to forecast rainfall 3-day-ahead prediction.
Figure 7

The line plot of the models used to predict rainfall 3-day step ahead.

Figure 7

The line plot of the models used to predict rainfall 3-day step ahead.

Close modal
Figure 8

The residual plot of the models used to predict rainfall 3-day step ahead.

Figure 8

The residual plot of the models used to predict rainfall 3-day step ahead.

Close modal

Taylor diagram

The Taylor diagram combines three metrics, PCC, standard deviation, and RMSE, into a single graphical representation (Khosravi et al. 2018; Sammen et al. 2023). Models positioned closer to the observed data in this diagram demonstrate superior predictive performance (Khosravi et al. 2018). As illustrated in Figure 9, the hybrid BRNN–BGRU model shows the best performance, with the BRNN–BGRU hybrid model being nearest to the observed data points. In addition, both hybrid models, CNN2D–BGRU and BRNN–BGRU, performed better than the standalone models in terms of both correlation and standard deviation fit for 1- and 3-day-ahead rainfall prediction.
Figure 9

Taylor diagram of multi-step rainfall prediction using hybrid and standalone models in the testing phase.

Figure 9

Taylor diagram of multi-step rainfall prediction using hybrid and standalone models in the testing phase.

Close modal

This study revealed that the DL models CNN2D and BRNN had inconsistent and limited performance in predicting multi-step ahead rainfall. For example, CNN2D had the highest RMSE and MAE for both 1- and 3-day predictions, while BRNN showed slightly better results but still fell short compared to the hybrid models. These findings align with Salma & Ashwitha (2022), which found that the CNN–LSTM hybrid model outperformed standalone DL and ML models in rainfall prediction accuracy. This study leveraged the advantages of the GRU model, which excels in capturing temporal dependencies, as noted by Han et al. (2024). The BGRU model, incorporating unidirectional and asymmetric GRU, as discussed by Jamei et al. (2024b), improves the prediction performance of CNN2D and BRNN models.

The BRNN–BGRU model outperforms standalone models by providing enhanced multi-step prediction capabilities and maintaining consistent performance for both 1- and 3-day forecasts. Jamei et al. (2024b) also found that the CNN–BGRU hybrid model, used together with multivariate variational mode decomposition, achieved better results than other standalone models. The results are consistent with Chhetri et al. (2020), where combining two DL models leads to superior performance compared to other models. Similar to the study by Khan & Maity (2020), the combined model between ML and DL got better results than the standalone model tested. RNN models are particularly suitable for time series prediction and sequential data analysis (Khosravi et al. 2023b). This research uses the BRNN model, which provides additional feature extraction capabilities (Jamei et al. 2024b). The combined BRNN–BGRU model is the most effective model by using the strengths of both models for better time series data prediction.

NASA POWER has the potential to address the shortage of meteorological data for rainfall predictions in tropical regions. Oloyede et al. (2023) demonstrated that models can be developed to generate more accurate temperature forecasts by utilizing NASA POWER data as input. Moreover, there is widespread agreement that NASA POWER data are valuable in scenarios where ground-measured data are either unavailable or incomplete. Khan et al. (2024) used the NASA POWER dataset to predict the rainfall classification in India. However, the authors did not mention the quality of the dataset from NASA POWER.

In this study, the utilization of the NASA POWER dataset for rainfall forecasting introduces some uncertainty, primarily due to the lack of local rainfall stations. Tan et al. (2023) showed that while the NASA POWER dataset effectively captures rainfall variability during the 2014–2015 floods in Kelantan, Malaysia, it requires bias correction before application. Future research should prioritize model calibration with ground-based data sets to increase accuracy in predictions. Hyperparameter configuration is currently performed manually. Employing optimization techniques such as grid search, particle swarm optimization (PSO) or genetic algorithms can lead to more efficient models by systematically exploring the hyperparameter space. Hyperparameter optimization, particularly PSO for DL, is crucial since DL models are often sensitive to poor initial weights (Cholissodin & Sutrisno 2020). The model selection is based on the capabilities and advantages identified by previous studies, emphasizing their individual strengths (Su & Jiang 2023a; Vijay & Aparna 2023). Combining the best-performing standalone models with those that are less promising models not only reduces the impact of model uncertainty but also leverages multiple methodologies to improve overall results.

By 2030, climate change is projected to intensify due to rising carbon dioxide levels, resulting in drier summers, wetter winters, and more frequent severe storms (Sarker 2022). Furthermore, the occurrence of typhoons and hurricanes is expected to rise as ocean waters warm. Accurate rainfall prediction can serve as an early warning system for natural disasters such as flash floods, landslides, and droughts (Zhou et al. 2024). The results indicate that the developed model is capable of accurately predicting rainfall and capturing extreme rainfall patterns driven by climate change, utilizing datasets from NASA POWER. This information is especially beneficial for farmers, enabling more efficient use of water resources (Latif et al. 2023). The development of reliable multi-step predicting models can be applied in practical situations to provide early warnings to the public, assist in disaster management, and help authorities mitigate risks to property, crops, and infrastructure (Latif et al. 2023; Zhou et al. 2024).

Future research should consider incorporating optimizers to improve prediction accuracy. Adnan et al. (2023) found that metaheuristic optimizers enhance model performance in river flow prediction. Additionally, integrating multi-head attention could further refine DL models. Given that NASA POWER data often contain outliers and noise, which can affect model accuracy and increase the risk of overfitting, techniques like wavelet transform (Kumar & Kumar 2021), Savitzky–Golay (Pham et al. 2024), and Z-score (Raziei & Miri 2023) are used to eliminate outliers. This study utilized the Savitzky–Golay method to remove or mitigate outliers, demonstrating its effectiveness in enhancing model performance and preventing overfitting. Future research should emphasize the necessity of establishing rainfall stations in the study area. Changes in hydrologic data patterns resulting from climate change can impact the network of hydrometric gauges, indicating that the area may need a dedicated rainfall monitoring station (Singhal et al. 2024).

This study has several limitations, including the reliance on a single time series dataset, which may limit the generalizability of the results to other contexts. Future research should explore different areas to assess the model's predictive capabilities in various geographical conditions. Additionally, bias correction was not applied prior to training the model, which could impact the accuracy of the predictions. Another limitation is the absence of a feature selection process, as the study depended solely on the variables provided by NASA POWER and prior research for variable selection. Addressing these limitations in future studies can improve the reliability of NASA POWER in predicting tropical rainfall.

This study advances the development of enhanced rainfall prediction models that address contemporary climate challenges, especially in tropical countries that lack observations. It presents two hybrid models, CNN2D–BGRU and BRNN–BGRU, which exhibit superior multi-day forecasting capabilities compared to standalone models. This capacity for extended forecasting makes the hybrid models well-suited for real-world applications. Moreover, utilizing NASA POWER datasets for predicting tropical rainfall can offer valuable insights to researchers and stakeholders in tropical regions.

This study successfully develops and evaluates the performance of both standalone and hybrid DL models for rainfall prediction in George Town, Penang. Utilizing the NASA POWER dataset, which includes variables such as rainfall, wind speed, temperature, dew point, relative humidity, and SR, the research involved a thorough preprocessing phase to enhance model performance. The results reveal that the hybrid models BRNN–BGRU and CNN2D–BGRU show improved performance, particularly for 1- and 3-day predictions. Notably, the CNN2D–BGRU model performed best during training, while the BRNN–BGRU model yielded more accurate results on the testing dataset.

Integrating BRNN with BGRU has been shown to improve the accuracy and reliability of short-term rainfall predictions, especially for nonlinear scenarios. However, the study also highlights a decline in accuracy for long-term predictions, which may be due to the increased complexity of assessing additional variables over extended periods. To address these challenges, it is essential to refine models and methodologies. Overall, the research framework utilizing the BRNN–BGRU hybrid model proves effective for both short- and long-term rainfall prediction, providing valuable insights for mitigating the impacts of natural disasters such as floods, droughts, and landslides.

We extend our gratitude to NASA for providing the dataset available on the NASA POWER website.

This research was funded by the Ministry of Higher Education Malaysia under the Trans-Disciplinary Research Grant Scheme (TRGS), grant number TRGS/1/2022/USM/02/3/1.

A.S. and M.L.T. conceptualized the study and were responsible for writing, reviewing and editing the article. Z.M.Y. and Z.F. reviewed and edited the article.

NASA POWER data are available at https://power.larc.nasa.gov/data-access-viewer/

The authors declare there is no conflict.

Adnan
R. M.
,
Mostafa
R. R.
,
Dai
H. L.
,
Heddam
S.
,
Masood
A.
&
Kisi
O.
(
2023
)
Enhancing accuracy of extreme learning machine in predicting river flow using improved reptile search algorithm
,
Stochastic Environmental Research and Risk Assessment
,
37
(
8
),
3063
3083
.
https://doi.org/10.1007/s00477-023-02435-y
.
AlDahoul
N.
,
Momo
M. A.
,
Chong
K. L.
,
Ahmed
A. N.
,
Huang
Y. F.
,
Sherif
M.
&
El-Shafie
A.
(
2023
)
Streamflow classification by employing various machine learning models for peninsular Malaysia
,
Scientific Reports
,
13
(
1
),
14574
.
https://doi.org/10.1038/s41598-023-41735-9
.
Alqahtani
F.
(
2024
)
AI-driven improvement of monthly average rainfall forecasting in Mecca using grid search optimization for LSTM networks
,
Journal of Water and Climate Change
,
15
(
4
),
1439
1458
.
https://doi.org/10.2166/wcc.2024.242
.
Baig
F.
,
Ali
L.
,
Faiz
M. A.
,
Chen
H.
&
Sherif
M.
(
2024
)
How accurate are the machine learning models in improving monthly rainfall prediction in hyper arid environment?
,
Journal of Hydrology
,
633
(
April
),
131040
.
https://doi.org/10.1016/j.jhydrol.2024.131040
.
Bandira
P. N. A.
,
Tan
M. L.
,
Teh
S. Y.
,
Shaharudin
S. M.
,
Samat
N.
&
Mahamud
M. A.
(
2023
)
Assessment of NASA POWER for climate change analysis using the de Martonne Climate Index in Northern Peninsular Malaysia
,
IOP Conference Series: Earth and Environmental Science
,
1238
(
1
),
012029
.
https://doi.org/10.1088/1755-1315/1238/1/012029
.
Bao
K.
,
Bi
J.
,
Ma
R.
,
Sun
Y.
,
Zhang
W.
&
Wang
Y.
(
2023
)
A spatial-reduction attention-based BiGRU network for water level prediction
,
Water (Switzerland)
,
15
(
7
),
1306
.
https://doi.org/10.3390/w15071306
.
Bosilovich
M. G.
,
Robertson
F. R.
,
Takacs
L.
,
Molod
A.
&
Mocko
D.
(
2017
)
Atmospheric water balance and variability in the MERRA-2 reanalysis
,
Journal of Climate
,
30
(
4
),
1177
1196
.
https://doi.org/10.1175/JCLI-D-16-0338.1
.
Chhetri
M.
,
Kumar
S.
,
Roy
P. P.
&
Kim
B. G.
(
2020
)
Deep BLSTM-GRU model for monthly rainfall prediction: A case study of Simtokha, Bhutan
,
Remote Sensing
,
12
(
19
),
3174
.
https://doi.org/10.3390/rs12193174
.
Cho
K.
,
Van Merriënboer
B.
,
Gulcehre
C.
,
Bahdanau
D.
,
Bougares
F.
,
Schwenk
H.
&
Bengio
Y.
(
2014
). '
Learning phrase representations using RNN encoder-decoder for statistical machine translation
',
EMNLP 2014 – 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
, pp.
1724
1734
.
https://doi.org/10.3115/v1/d14-1179
.
Cholissodin
I.
&
Sutrisno
S.
(
2020
)
Prediction of rainfall using improved deep learning with particle swarm optimization
,
Telkomnika (Telecommunication Computing Electronics and Control)
,
18
(
5
),
2498
2504
.
https://doi.org/10.12928/TELKOMNIKA.V18I5.14665
.
Chung
J.
,
Gulcehre
C.
,
Cho
K.
&
Bengio
Y.
(
2014
)
Empirical evaluation of gated recurrent neural networks on sequence modeling
.
arXiv
,
1412.3555
.
Dharmayasa
I. G. N. P.
,
Simatupang
C. A.
&
Sinaga
D. M.
(
2022
)
NASA power's: An alternative rainfall data resources for hydrology research and planning activities in Bali Island, Indonesia
,
Journal of Infrastructure Planning and Engineering (JIPE)
,
1
(
1
),
1
7
.
https://doi.org/10.22225/jipe.1.1.2022.1-7
.
Elman
J. L.
(
1990
)
Finding structure in time
,
Cognitive Science
,
14
(
2
),
179
211
.
https://doi.org/https://doi.org/10.1016/0364-0213(90)90002-E
.
Endalie
D.
,
Haile
G.
&
Taye
W.
(
2022
)
Deep learning model for daily rainfall prediction: Case study of Jimma, Ethiopia
,
Water Supply
,
22
(
3
),
3448
3461
.
https://doi.org/10.2166/WS.2021.391
.
Faiz
M. A.
,
Liu
D.
,
Fu
Q.
,
Sun
Q.
,
Li
M.
,
Baig
F.
,
Li
T.
&
Cui
S.
(
2018
)
How accurate are the performances of gridded precipitation data products over Northeast China?
,
Atmospheric Research
,
211
(
January
),
12
20
.
https://doi.org/10.1016/j.atmosres.2018.05.006
.
Faiz
M. A.
,
Zhang
Y.
,
Baig
F.
,
Wrzesiński
D.
&
Naz
F.
(
2021
)
Identification and inter-comparison of appropriate long-term precipitation datasets using decision tree model and statistical matrix over China
,
International Journal of Climatology
,
41
(
10
),
5003
5021
.
https://doi.org/https://doi.org/10.1002/joc.7113
.
Fu
J.
,
Jian
Y.
,
Wang
X.
,
Li
L.
,
Ciais
P.
,
Zscheischler
J.
,
Wang
Y.
,
Tang
Y.
,
Müller
C.
,
Webber
H.
,
Yang
B.
,
Wu
Y.
,
Wang
Q.
,
Cui
X.
,
Huang
W.
,
Liu
Y.
,
Zhao
P.
,
Piao
S.
&
Zhou
F.
(
2023
)
Extreme rainfall reduces one-twelfth of China's rice yield over the last two decades
,
Nature Food
,
4
(
5
),
416
426
.
https://doi.org/10.1038/s43016-023-00753-6
.
Gao
H.
,
Shan
P.
,
Lea
F.
,
Tay
T.
&
Chin
H.
(
2020
)
Three oversampling methods applied in a comparative landslide spatial research in Penang Island, Malaysia
,
SN Applied Sciences
,
2
(
9
),
1512
.
https://doi.org/10.1007/s42452-020-03307-8
.
Ghaderpour
E.
,
Masciulli
C.
,
Zocchi
M.
,
Bozzano
F.
,
Scarascia Mugnozza
G.
&
Mazzanti
P.
(
2024
)
Estimating reactivation times and velocities of slow-moving landslides via PS-InSAR and their relationship with precipitation in Central Italy
,
Remote Sensing
,
16
(
16
),
3055
.
https://doi.org/10.3390/rs16163055
.
Ghorbani
M. A.
,
Deo
R. C.
,
Yaseen
Z. M.
,
Kashani
M. H.
&
Mohammadi
B.
(
2018
)
Pan evaporation prediction using a hybrid multilayer perceptron-firefly algorithm (MLP-FFA) model: Case study in North Iran
,
Theoretical and Applied Climatology
,
133
(
3
),
1119
1131
.
https://doi.org/10.1007/s00704-017-2244-0
.
Gu
J.
,
Liu
S.
,
Zhou
Z.
,
Chalov
S. R.
&
Zhuang
Q.
(
2022
)
A stacking ensemble learning model for monthly rainfall prediction in the Taihu Basin, China
,
Water (Switzerland)
,
14
(
3
),
1
20
.
https://doi.org/10.3390/w14030492
.
Guo
Y.
,
Xu
Y.-P.
,
Yu
X.
,
Chen
H.
,
Gu
H.
&
Xie
J.
(
2021
)
AI-based techniques for multi-step streamflow forecasts: Application for multi-objective reservoir operation optimization and performance assessment
,
Hydrology and Earth System Sciences Discussions
,
25
(
11
),
5951
5979
.
https://doi.org/10.5194/hess-2020-617
.
Hakim
W. L.
,
Rezaie
F.
,
Nur
A. S.
,
Panahi
M.
,
Khosravi
K.
,
Lee
C.-W.
&
Lee
S.
(
2022
)
Convolutional neural network (CNN) with metaheuristic optimization algorithms for landslide susceptibility mapping in Icheon, South Korea
,
Journal of Environmental Management
,
305
(
March
),
114367
.
https://doi.org/10.1016/j.jenvman.2021.114367
.
Han
H.
,
Neira-Molin
H.
,
Khan
A.
,
Fang
M.
,
Mahmoud
H. A.
,
Awwad
E. M.
,
Ahmed
B.
&
Ghadi
Y. Y.
(
2024
)
Advanced series decomposition with a gated recurrent unit and graph convolutional neural network for non-stationary data patterns
,
Journal of Cloud Computing
,
13
(
1
),
1
19
.
https://doi.org/10.1186/s13677-023-00560-1
.
Hu
X.
,
Zhang
H.
,
Mei
H.
,
Xiao
D.
,
Li
Y.
&
Li
M.
(
2020
)
Landslide susceptibility mapping using the stacking ensemble machine learning method in Lushui, Southwest China
,
Applied Sciences (Switzerland)
,
10
(
11
),
4016
.
https://doi.org/10.3390/app10114016
.
Jamei
M.
,
Ali
M.
,
Malik
A.
,
Karbasi
M.
,
Rai
P.
&
Yaseen
Z. M.
(
2023
)
Development of a TVF-EMD-based multi-decomposition technique integrated with encoder-decoder-bidirectional-LSTM for monthly rainfall forecasting
,
Journal of Hydrology
,
617
(
Part C, February
),
129105
.
https://doi.org/10.1016/j.jhydrol.2023.129105
.
Jamei
M.
,
Ali
M.
,
Bateni
S. M.
,
Jun
C.
,
Karbasi
M.
,
Malik
A.
,
Jamei
M.
&
Mundher Yaseen
Z.
(
2024a
)
Short-term drought Index forecasting for hot and semi-humid climate regions: A novel empirical Fourier decomposition-based ensemble deep-random vector functional link strategy
,
Computers and Electronics in Agriculture
,
217
(
February
),
108609
.
https://doi.org/10.1016/j.compag.2023.108609
.
Jamei
M.
,
Jamei
M.
,
Ali
M.
,
Karbasi
M.
,
Faroouqe
A. A.
,
Malik
A.
,
Cheema
S. J.
,
Esau
T. J.
&
Yaseen
Z. M.
(
2024b
)
Quantitative improvement of streamflow forecasting accuracy in the Atlantic zones of Canada based on hydro-meteorological signals: A multi-level advanced intelligent expert framework
,
Ecological Informatics
,
80
(
May
),
102455
.
https://doi.org/10.1016/j.ecoinf.2023.102455
.
Kadhim Tayyeh
H.
&
Mohammed
R.
(
2023
)
Analysis of NASA POWER reanalysis products to predict temperature and precipitation in Euphrates River basin
,
Journal of Hydrology
,
619
(
April
),
129327
.
https://doi.org/10.1016/j.jhydrol.2023.129327
.
Khan
M. I.
&
Maity
R.
(
2020
)
Hybrid deep learning approach for multi-step-ahead daily rainfall prediction using GCM simulations
,
IEEE Access
,
8
,
52774
52784
.
https://doi.org/10.1109/ACCESS.2020.2980977
.
Khan
M. U. S.
,
Mohammad Saifullah
K.
,
Hussain
A.
&
Mohammad Azamathulla
H.
(
2024
)
Comparative analysis of different rainfall prediction models: A case study of Aligarh City, India
,
Results in Engineering
,
22
(
June
),
102093
.
https://doi.org/10.1016/j.rineng.2024.102093
.
Khosravi
K.
,
Mao
L.
,
Kisi
O.
,
Yaseen
Z. M.
&
Shahid
S.
(
2018
)
Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized Andean catchment in Chile
,
Journal of Hydrology
,
567
(
October
),
165
179
.
https://doi.org/10.1016/j.jhydrol.2018.10.015
.
Khosravi
K.
,
Golkarian
A.
,
Barzegar
R.
,
Aalami
M. T.
,
Heddam
S.
,
Keesstra
S. D.
&
Vicente
M. L.
(
2023a
)
Multi-step ahead soil temperature forecasting at different depths based on meteorological data: Integrating resampling algorithms and machine learning models
,
Pedosphere
,
33
(
3
),
479
495
.
https://doi.org/10.1016/j.pedsph.2022.06.056
.
Khosravi
K.
,
Rezaie
F.
,
Cooper
J. R.
,
Kalantari
Z.
,
Abolfathi
S.
&
Hatamiafkoueieh
J.
(
2023b
)
Soil water erosion susceptibility assessment using deep learning algorithms
,
Journal of Hydrology
,
618
(
March
),
129229
.
https://doi.org/10.1016/j.jhydrol.2023.129229
.
Khosravi
K.
,
Farooque
A. A.
,
Bateni
S. M.
,
Jun
C.
,
Mohammadi
D.
,
Kalantari
Z.
&
Cooper
J. R.
(
2024
)
Fluvial bedload transport modelling: Advanced ensemble tree-based models or optimized deep learning algorithms?
,
Engineering Applications of Computational Fluid Mechanics
,
18
(
1
),
2346221
.
https://doi.org/10.1080/19942060.2024.2346221
.
Kim
S.
,
Seo
Y.
,
Malik
A.
,
Kim
S.
,
Heddam
S.
,
Yaseen
Z. M.
,
Kisi
O.
&
Singh
V. P.
(
2023
)
Quantification of river total phosphorus using integrative artificial intelligence models
,
Ecological Indicators
,
153
(
September
),
110437
.
https://doi.org/10.1016/j.ecolind.2023.110437
.
Kumar
M.
&
Kumar
P.
(
2021
)
Stage-discharge-sediment modelling using support vector machine
.
The Pharma Innovation Journal
.
10
(
1S
),
149
154
.
Kuo
C.-C. J.
(
2016
)
Understanding convolutional neural networks with a mathematical model
,
Journal of Visual Communication and Image Representation
,
41
(
November
),
406
413
.
https://doi.org/https://doi.org/10.1016/j.jvcir.2016.11.003
.
Latif
S. D.
,
Alyaa Binti Hazrin
N.
,
Hoon Koo
C.
,
Lin Ng
J.
,
Chaplot
B.
,
Feng Huang
Y.
,
El-Shafie
A.
&
Najah Ahmed
A.
(
2023
)
Assessing rainfall prediction models: Exploring the advantages of machine learning and remote sensing approaches
,
Alexandria Engineering Journal
,
82
(
November
),
16
25
.
https://doi.org/10.1016/j.aej.2023.09.060
.
Lee
S.
,
Baek
W. K.
,
Jung
H. S.
&
Lee
S.
(
2020
)
Susceptibility mapping on urban landslides using deep learning approaches in Mt, Umyeon
.
Applied Sciences (Switzerland)
,
10
(
22
),
8189
.
https://doi.org/10.3390/app10228189
.
Lian
X.
,
Hu
X.
,
Bian
J.
,
Shi
L.
,
Lin
L.
&
Cui
Y.
(
2023
)
Enhancing streamflow estimation by integrating a data-driven evapotranspiration submodel into process-based hydrological models
,
Journal of Hydrology
,
621
(
June
),
129603
.
https://doi.org/10.1016/j.jhydrol.2023.129603
.
Mazalan
R.
(
2017
)
Pulau Pinang bencana luar jangka. BH Online. 6 Nov. Available online: https://www.bharian.com.my/berita/nasional/2017/11/347296/pulau-pinang-bencana-luar-jangka
.
Mozo
A.
,
Ordozgoiti
B.
&
Gómez-Canaval
S.
(
2018
)
Forecasting short-term data center network traffic load with convolutional neural networks
,
PLoS One
,
13
(
2
),
1
31
.
https://doi.org/10.1371/journal.pone.0191939
.
Oloyede
A.
,
Ozuomba
S.
,
Asuquo
P.
,
Olatomiwa
L.
&
Longe
O. M.
(
2023
)
Data-driven techniques for temperature data prediction: Big data analytics approach
,
Environmental Monitoring and Assessment
,
195
(
2
),
343
.
https://doi.org/10.1007/s10661-023-10961-z
.
Pan
M.
,
Zhou
H.
,
Cao
J.
,
Liu
Y.
,
Hao
J.
,
Li
S.
&
Chen
C. H.
(
2020
)
Water level prediction model based on GRU and CNN
,
IEEE Access
,
8
,
60090
60100
.
https://doi.org/10.1109/ACCESS.2020.2982433
.
Pei
S.
,
Qin
H.
,
Yao
L.
,
Liu
Y.
,
Wang
C.
&
Zhou
J.
(
2020
)
Multi-step ahead short-term load forecasting using hybrid feature selection and improved long short-term memory network
,
Energies
,
13
(
16
),
4121
.
https://doi.org/10.3390/en13164121
.
Pham
B. T.
,
Bui
K. T.
,
Prakash
I.
&
Ly
H.
(
2024
)
Hybrid artificial intelligence models based on adaptive neuro fuzzy inference system and metaheuristic optimization algorithms for prediction of daily rainfall
,
Physics and Chemistry of The Earth
,
134
(
June
),
103563
.
https://doi.org/10.1016/j.pce.2024.103563
.
Piri
J.
,
Abdolahipour
M.
&
Keshtegar
B.
(
2023
)
Advanced machine learning model for prediction of drought indices using hybrid SVR-RSM
,
Water Resources Management
,
37
(
2
),
683
712
.
https://doi.org/10.1007/s11269-022-03395-8
.
Raziei
T.
&
Miri
M.
(
2023
)
An alternative approach for computing the standardized precipitation-evapotranspiration index (SPEI)
,
Water Resources Management
,
37
(
10
),
4123
4141
.
https://doi.org/10.1007/s11269-023-03542-9
.
Rodrigues
G. C.
&
Braga
R. P.
(
2021
)
Evaluation of NASA POWER reanalysis products to estimate daily weather variables in a hot summer Mediterranean climate
,
Agronomy
,
11
(
6
),
1
17
.
https://doi.org/10.3390/agronomy11061207
.
Sahoo
B. B.
,
Sankalp
S.
&
Kisi
O.
(
2023
)
A novel smoothing-based deep learning time-series approach for daily suspended sediment load prediction
,
Water Resources Management
,
37
(
11
),
4271
4292
.
https://doi.org/10.1007/s11269-023-03552-7
.
Salaeh
N.
,
Ditthakit
P.
,
Pinthong
S.
,
Hasan
M. A.
,
Islam
S.
,
Mohammadi
B.
&
Linh
N. T. T.
(
2022
)
Long-short term memory technique for monthly rainfall prediction in Thale Sap Songkhla River Basin, Thailand
,
Symmetry
,
14
(
8
),
1
24
.
https://doi.org/10.3390/sym14081599
.
Saleh
A.
,
Yuzir
A.
,
Sabtu
N.
,
Abujayyab
S. K. M.
,
Mudashiru
R. B.
&
Pham
Q. B.
(
2022
)
Flash flood susceptibility mapping in urban area using genetic algorithm and ensemble method
,
Geocarto International
,
37
(
5
),
10199
10228
.
https://doi.org/10.1080/10106049.2022.2032394
.
Salma
S.
&
Ashwitha
A.
(
2022
)
Hybrid CNN-LSTM model: Rainfall analysis and prediction for Karnataka Region
,
Journal of Theoretical and Applied Information Technology
,
100
(
22
),
6715
6716
.
Sammen
S. S.
,
Kisi
O.
,
Ehteram
M.
,
Shafie
A. E.
,
Ansari
N. A.
,
Ghorbani
M. A.
,
Bhat
S. A.
,
Ahmed
A. N.
&
Shahid
S.
(
2023
)
Rainfall modeling using two different neural networks improved by metaheuristic algorithms
,
Environmental Sciences Europe
,
35
(
1
),
112
.
https://doi.org/10.1186/s12302-023-00818-0
.
Sarker
S.
(
2022
)
Fundamentals of climatology for engineers: Lecture note
,
Eng
,
3
(
4
),
573
595
.
https://doi.org/10.3390/eng3040040
.
Sheikhi
Y.
,
Ashrafi
S. M.
,
Nikoo
M. R.
&
Haghighi
A.
(
2023
)
Enhancing daily rainfall prediction in urban areas: A comparative study of hybrid artificial intelligence models with optimization algorithms
,
Applied Water Science
,
13
(
12
),
1
19
.
https://doi.org/10.1007/s13201-023-02036-8
.
Sheikh Khozani
Z.
,
Barzegari Banadkooki
F.
,
Ehteram
M.
,
Najah Ahmed
A.
&
El-Shafie
A.
(
2022
)
Combining autoregressive integrated moving average with long short-term memory neural network and optimisation algorithms for predicting ground water level
,
Journal of Cleaner Production
,
348
(
May
),
131224
.
https://doi.org/10.1016/j.jclepro.2022.131224
.
Shu
X.
,
Ding
W.
,
Peng
Y.
,
Wang
Z.
,
Wu
J.
&
Li
M.
(
2021
)
Monthly streamflow forecasting using convolutional neural network
,
Water Resources Management
,
35
(
15
),
5089
5104
.
https://doi.org/10.1007/s11269-021-02961-w
.
Silva
R. O.
,
Saporetti
C. M.
,
Yaseen
Z. M.
,
Pereira
E.
&
Goliatt
L.
(
2023
)
An approach for total organic carbon prediction using convolutional neural networks optimized by differential evolution
,
Neural Computing and Applications
,
35
(
28
),
20803
20817
.
https://doi.org/10.1007/s00521-023-08865-7
.
Singarasubramanian
S.
,
Anbumani
M.
&
Kaniyaiah
K.
(
2024
)
Rainfall prediction around Sathanur dam by Naive Bayes classifier, logistic regression models and various classification and regression machine learning techniques
,
Multidisciplinary Science Journal
,
6
(
10
),
2024200
.
https://doi.org/10.31893/multiscience.2024200
.
Singhal
A.
,
Jaseem
M.
,
Divya
,
Sarker
S.
,
Prajapati
P.
,
Singh
A.
&
Jha
S. K.
(
2024
)
Identifying potential locations of hydrologic monitoring stations based on topographical and hydrological information
,
Water Resources Management
,
38
(
1
),
369
384
.
https://doi.org/10.1007/s11269-023-03675-x
.
Stackhouse
P.
(
2020
)
POWER Data Methodology. [online] NASA Official. Available at: https://power.larc.nasa.gov/docs/methodology/ (Accessed: 24 August 2024)
.
Su
Y.
&
Jiang
X.
(
2023a
)
Prediction of tide level based on variable weight combination of LightGBM and CNN-BiGRU model
,
Scientific Reports
,
13
(
1
),
9
.
https://doi.org/10.1038/s41598-022-26213-y
.
Su
Y. C.
&
Kuo
B. J.
(
2023b
)
Risk assessment of rice damage due to heavy rain in Taiwan
,
Agriculture (Switzerland)
,
13
(
3
),
630
.
https://doi.org/10.3390/agriculture13030630
.
Tan
M. L.
,
Armanuos
A. M.
,
Ahmadianfar
I.
,
Demir
V.
,
Heddam
S.
,
Al-Areeq
A. M.
,
Abba
S. I.
,
Halder
B.
,
Cagan Kilinc
H.
&
Yaseen
Z. M.
(
2023
)
Evaluation of NASA POWER and ERA5-Land for estimating tropical precipitation and temperature extremes
,
Journal of Hydrology
,
624
(
September
),
129940
.
https://doi.org/10.1016/j.jhydrol.2023.129940
.
Tao
H.
,
Abba
S. I.
,
Al-Areeq
A. M.
,
Tangang
F.
,
Samantaray
S.
,
Sahoo
A.
,
Siqueira
H. V.
,
Maroufpoor
S.
,
Demir
V.
,
Dhanraj Bokde
N.
,
Goliatt
L.
,
Jamei
M.
,
Ahmadianfar
I.
,
Bhagat
S. K.
,
Halder
B.
,
Guo
T.
,
Helman
D. S.
,
Ali
M.
,
Sattar
S.
,
Al-Khafaji
Z.
,
Shahid
S.
&
Yaseen
Z. M.
(
2024
)
Hybridized artificial intelligence models with nature-inspired algorithms for river flow modeling: A comprehensive review, assessment, and possible future research directions
,
Engineering Applications of Artificial Intelligence
,
129
(
March
),
107559
.
https://doi.org/10.1016/j.engappai.2023.107559
.
Thottungal Harilal
G.
,
Dixit
A.
&
Quattrone
G.
(
2024
)
Establishing hybrid deep learning models for regional daily rainfall time series forecasting in the United Kingdom
,
Engineering Applications of Artificial Intelligence
,
133
(
Part F, July
),
108581
.
https://doi.org/10.1016/j.engappai.2024.108581
.
Vijay
E. V.
&
Aparna
K.
(
2023
)
RNN-BIRNN-LSTM based spectrum sensing for proficient data transmission in cognitive radio
,
e-Prime – Advances in Electrical Engineering, Electronics and Energy
,
6
(
December
),
100378
.
https://doi.org/10.1016/j.prime.2023.100378
.
Wang
Y.
,
Fang
Z.
&
Hong
H.
(
2019
)
Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China
,
Science of the Total Environment
,
666
(
May
),
975
993
.
https://doi.org/10.1016/j.scitotenv.2019.02.263
.
Wang
K.
,
Band
S. S.
,
Ameri
R.
,
Biyari
M.
,
Hai
T.
,
Hsu
C. C.
,
Hadjouni
M.
,
Elmannai
H.
,
Chau
K. W.
&
Mosavi
A.
(
2022
)
Performance improvement of machine learning models via wavelet theory in estimating monthly river streamflow
,
Engineering Applications of Computational Fluid Mechanics
,
16
(
1
),
1833
1848
.
https://doi.org/10.1080/19942060.2022.2119281
.
Wibawa
A. P.
,
Utama
A. B. P.
,
Elmunsyah
H.
,
Pujianto
U.
,
Dwiyanto
F. A.
&
Hernandez
L.
(
2022
)
Time-series analysis with smoothed Convolutional Neural Network
,
Journal of Big Data
,
9
(
1
),
44
.
https://doi.org/10.1186/s40537-022-00599-y
.
Willmott
C. J.
(
1981
)
On the validation of models
,
Physical Geography
,
2
(
2
),
184
194
.
https://doi.org/10.1080/02723646.1981.10642213
.
Zakhrouf
M.
,
Hamid
B.
,
Kim
S.
&
Madani
S.
(
2021
)
Novel insights for streamflow forecasting based on deep learning models combined the evolutionary optimization algorithm
,
Physical Geography
,
44
(
1
),
31
54
.
https://doi.org/10.1080/02723646.2021.1943126
.
Zhou
H.
,
Schertzer
D.
&
Tchiguirinskaia
I.
(
2024
)
Combining recurrent neural networks with variational mode decomposition and multifractals to predict rainfall time series
,
EGUsphere
.
[preprint] https://doi.org/10.5194/egusphere-2023-2710
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC 4.0), which permits copying, adaptation and redistribution for non-commercial purposes, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc/4.0/).