Due to ongoing climate change, accurately predicting rainfall has become increasingly critical. This paper explores an approach utilizing two different machine learning algorithms, including multilayer perceptron neural networks (MPNN) and random forest regressors (RFR), to enhance rainfall forecast accuracy. Historical daily weather data spanning 100 years (1913–2023) from the Agro Climate Research Centre at Tamil Nadu Agricultural University were used. The study focused on global climate drivers like the Southwest Monsoon (SWM) and Northeast Monsoon (NEM) over the Coimbatore region; this region receives more rainfall during NEM. Normalization and scaling techniques addressed missing values, preserving 70–85% of the original data for the training set. Results demonstrated that MPNN outperformed RFR, achieving an accuracy of 85.55% for SWM and NEM, while RFR outperformed MPNN, producing an accuracy of 86.50%. The coefficient of determination (R2) for predicted versus observed values was 0.8 for daily rainfall from 2020 to 2023.

  • In this study, we aimed to assess multilayer perceptron neural networks (MPNN) and random forest regressors. Using expectation-maximization, we reduced the feature set from seven to five dimensions. Normalization helped identify centroids by selecting rows with distinct features and covering 90% of the original dataset for training. Future work will concentrate on minimizing error discrepancies using advanced neural network techniques.

  • A comparison with algorithms like MPNN and RFR is used for model performance and it is better than linear regression or time series models.

  • Expectation maximization in SPSS with machine learning techniques are employed.

  • Actual and predicted values of rainfall over SWM and NEM estimated with MPNN and RFR.

  • Evaluation metrics using R2, RMSE, and MAE were used to validate the accuracy.

Rainfall has a major impact on many different facets of human life and culture. Although difficult, accurate rainfall forecasts are necessary for efficient reservoir operations, flood avoidance, and water resource management. Rainfall also affects urban operations, such as traffic and sewer systems; therefore, accurate data are essential for managing and planning cities (Nayak et al. 2013).

This approach enhances our comprehension of the physical mechanisms underlying hydrological processes (Young & Liu 2015). In recent years, there has been significant interest among various research groups in developing high-resolution gridded rainfall datasets. Artificial neural network (ANN) activity mostly mimics that of the human brain. An ANN does calculations, recognizes patterns, and performs other tasks. Different datasets are used to train the ANN after it has been built (Basha et al. 2020). Another technique in artificial intelligence (AI) is the expectation-maximization (EM) algorithm, a statistical technique for estimating parameters in models with hidden variables by iteratively maximizing the likelihood of the observed data.

Similarly, the EM algorithm produces predictions by refining the model parameters to best fit the observed data. Most research projects are exposed to enormous amounts of data, which require considerable operations and computations (Khalifeloo et al. 2015). Rainfall forecasting has been practiced for many years using conventional procedures that evaluate the relationship between rainfall, other meteorological parameters (such as pressure, temperature, wind speed, and humidity), and geographic coordinates (such as latitude and longitude) using statistical techniques. However, the intricacy of rainfall, including its nonlinearity, makes forecasting difficult (Barrera-Animas et al. 2022). Furthermore, this paper examined and evaluated how the randomness of the database influenced the ANN and random forest regressors' (RFR) capacity to forecast results using statistical analysis and the Monte Carlo method (Pham et al. 2020).

These models are commonly referred to as universal approximators for this reason. Thus, the application of these AI methods is crucial for advanced climate forecasting (Dash et al. 2018). Accurate rainfall forecasting is still a desirable goal, and understanding the intricate physical processes that produce rainfall is still a huge task (Abbot & Marohasy 2012). MPNN models often require more data to train effectively due to their complexity and the number of parameters. RFR models, being ensemble methods that aggregate multiple decision trees, might need less data to achieve good performance (Mohammed & Kora 2023).

Some of the work done by the machine learning (ML) model in rainfall prediction shows that the Ponnaiyar River Basin, which traverses the states of Tamilnadu, Andhra Pradesh, and Karnataka, is the subject of this study because it is crucial to irrigation and industrial development (Samhitha & Srikanth 2017). To predict rainfall at Coonoor one day ahead of time, several ANN-based rainfall forecasting models were created. ANN, which often has many parameters and layers, might benefit from a larger training set to capture complex patterns and prevent overfitting (Brownlee 2018). A time-delay neural network with a restricted number of input predictors produced a high correlation coefficient. Findings demonstrate that the suggested wavelet (Renuga Devi et al. 2017) performance was high in the ANN model.

These days, ML techniques are successfully used for problems involving the identification, clustering, or reduction of the dimensionality of massive collections of especially high-dimensional input data (Geethalakshmi et al. 2022). One can use a multilayer perceptron neural network (MPNN) to model the actions of such nonlinear processes (Nandakumar et al. 2021) that was employed as a suitable forecasting tool to get trustworthy data on inflow into a reservoir. Another study in the Coimbatore region revealed that the average monthly inflow to the Bhavanisagar reservoir, which spans 24 years (1989–2013), was utilized as historical data for training, testing, and validating the model (Suriya et al. 2021).

The primary objective of this study is to evaluate and compare the performance of MPNN and RFR models in accurately predicting seasonal rainfall patterns during the Southwest Monsoon (SWM) and Northeast Monsoon (NEM) periods. This analysis aims to enhance rainfall forecasting accuracy by leveraging historical weather data, ultimately supporting better agricultural planning and disaster management decision-making in semi-arid regions.

The dataset discussed in this paper comprises monthly weather observations spanning a century, from 1913 to 2023. These observations were collected from the Agro Climate Research Centre at Tamil Nadu Agricultural University, Coimbatore-3. Coimbatore is located at 11°N latitude, 77°E longitude, and has an altitude of 427 m above sea level. The district receives rainfall influenced by both the southwest and northeast monsoons. According to Koppen's Climatic Classifications, it features a typical semi-arid climate within the western zone of agro-climate regional planning. Following this period, wind speeds gradually decrease to a low of 3.7 km/h. Overall, the district is characterized by high average temperatures and relatively low humidity.

Data description

Daily rainfall data, maximum temperature, minimum temperature, and humidity for the Coimbatore district, spanning from 1913 to 2023, were collected from the Agro Climate Research Centre at Tamil Nadu Agricultural University, Coimbatore.

Data preprocessing

Choosing an appropriate imputation algorithm is key to accurately replacing missing values. Preprocessing is considered the second most crucial stage after data collection. Its primary aim is to clean raw data sourced from either primary or secondary sources. This step becomes necessary due to various issues present in the data, such as noise, outliers, and missing values (Jafri et al. 2020). Depending on the pattern and nature of the missing data, the appropriate imputation algorithm can vary. In this study, several imputation algorithms were evaluated for their effectiveness in imputing rainfall data. For implementation, the imputation package in Python was utilized. Specifically, the ‘na_interpolation’ function from this package was employed to perform the imputation of the missing rainfall data.
(1)

Normalization

To ensure that input and output data are comparable and to establish correlations between them, normalization is required due to the different units of measurement involved (Abhishek et al. 2012). First, calculate the mean (M) for each parameter (humidity, wind speed, and rainfall) separately:
(2)
Next, determine the standard deviation (SD) for each parameter individually (Zainudin et al. 2016). Once the mean and SD for each parameter are obtained, normalize the values using the following formula:
(3)

Development of ANN

The brain is made up of many neurons that are linked to one another by synapses. Neural networks, also known as natural neural networks, are these networks of neurons. A mathematically simplified representation of a natural neural network is called an ANN. A neuron is represented by a vertex in this directed graph and a synapse by an edge. Since the 1940s, many ANN models have been suggested; however, the most popular ones are the radial basis function network and the MPNN (Lee et al. 1998).

The selection of two specific ML algorithms for rainfall prediction typically depends on several key factors, which means that the selected ML algorithms for rainfall prediction should be accurate, efficient, and robust. They must handle specific data types effectively, such as time-series data, and be computationally efficient for large datasets and real-time predictions. The correlation coefficient (CC) and mean square error are used as performance metrics to evaluate how well ANN models estimate the future (Abinayarajam et al. 2021).

Neural network architecture

The neurons of ANNs are connected by synapses. The values that are fed into this network of synapses and neurons determine the value that it generates. Different neuronal types use different processes to generate their output. Among them are neurons used as input: after receiving the input values, these neurons feed them into a network (Table 1). The divide may vary depending on how much data is provided. A lower percentage (e.g., 85:15) of a big dataset can be put aside for testing because there is sufficient data to train the model successfully. On the other hand, if there is less data, a greater percentage might be set aside for testing to guarantee the accuracy of the test findings (e.g., 70:30). The performance of the model can be more thoroughly assessed by adjusting the split ratios (Poornima et al. 2020). A smaller training set, such as 70%, can help detect overfitting. Additionally, it notes that an input neuron's output equals its input value, while hidden (sigmoid) neurons manage the computations within the network. The computations carried out by the network are handled by the hidden neurons. The output values of a specific neuron are accessible in the output (Dubey 2015).

Table 1

Statistical analyses of data points or total rainfall (in mm) developed by ANN model

YearData points/total rainfall (mm/day)Training (%)Testing (%)
1913–1923 5,660 75 25 
1923–1933 4,240 75 25 
1933–1943 5,320 75 25 
1943–1953 5,500 70 30 
1953–1963 3,600 70 30 
1963–1973 4,200 80 20 
1973–1983 2,400 80 20 
1983–1993 4,300 80 20 
1993–2003 3,820 75 25 
2003–2013 5,695 85 15 
2013–2023 5,243 85 15 
YearData points/total rainfall (mm/day)Training (%)Testing (%)
1913–1923 5,660 75 25 
1923–1933 4,240 75 25 
1933–1943 5,320 75 25 
1943–1953 5,500 70 30 
1953–1963 3,600 70 30 
1963–1973 4,200 80 20 
1973–1983 2,400 80 20 
1983–1993 4,300 80 20 
1993–2003 3,820 75 25 
2003–2013 5,695 85 15 
2013–2023 5,243 85 15 

Note. Data points = total rainfall in mm/day.

Multilayer perceptron neural network

An MPNN comprises layers, which are groups of processing elements. Each layer independently computes the data it receives and passes the results to the next layer. This process continues until a final layer determines the network's output. Each processing element computes based on a weighted sum of its inputs. The input layer is the first layer, the output layer is the last, and the layers between are the hidden layers. These processing elements, akin to neurons, are often called cells, neuromines, or artificial neurons. Occasionally, a threshold function is employed to evaluate the output of a neuron in the output layer (Abhishek et al. 2012).
(4)
(5)
  • are the inputs to the neuron.

  • are the synaptic weights of the neuron corresponding to each input.

  • u represents the linear combination of the input signals, calculated as the sum of the products of inputs and their respective weights, plus a bias term
    (6)
  • b is the bias term.

  • z represents the neuron's output, typically determined by passing the linear combination through an activation function: z = f(u)

Random forest regressor

A group (ensemble) of basic tree predictors make up RFR. When given a set of predictor values, each tree can generate a response. A class membership that links a collection of independent predictor values to one of the categories shown in the dependent variable is how the tree answer appears in classification problems. The most well-liked class is determined by the group of simple trees. The tree response in regression issues is an estimate of the dependent variable based on the predictors. To get an estimate of the dependent variable, their answers are averaged. There are many instances of rainfall (RF) applications in diverse domains (Yu et al. 2017).
(7)
seems (Baladjay et al. 2023) to represent a relationship where is equal to for a given . To understand this, let's break down the elements: This could be a function at time concerning Essentially, the equation states that the value of is determined by the transformation function for the same inputs and . If you need more context or details, please provide additional information about the functions and their specific roles. The RFR model is the most effective model among them all for rainfall prediction (Kokilavani et al. 2020).

Implementation of the EM algorithm in SPSS

Using EM algorithm in SPSS, this paper estimates, using inbuilt MPNN and RFR, the actual and predicted rainfall (mm) values for the SWM monsoon ranging from 1913 to 2023, with nearly 400–850 mm of rainfall estimated per decade. In SPSS, the EM algorithm is commonly used for data imputation and clustering analysis. The EM algorithm operates in two main steps. In this step, the algorithm estimates the missing or hidden data based on the observed data and the current parameter estimates. It computes the expected value of the log-likelihood function, assuming the current parameter estimates are correct. The algorithm updates the parameter estimates by maximizing the expected log-likelihood by using the predicted values computed in the E-step.

This step refines the model parameters to fit the data better. For the NEM, actual and predicted rainfall (mm) values from 1913 to 2023 range from nearly 900 to 1,500 mm of rainfall estimated per decade. The model was determined to be the best fit based on the root mean square error (RMSE) and R2 values. From 1913 to 2023, the RMSE and R2 values using MPNN for the SWM were 0.814 mm and 0.87, respectively. For NEM, the RMSE and R2 values from 1913 to 2023 using RFR were 0.784 mm and 0.89, respectively. The outcome of this paper evaluates that the Coimbatore region receives more rainfall during NEM. In comparison, MPNN outperformed RFR, producing a higher accuracy of 85.55%. For SWM and NEM, RFR produces more accuracy of 86.50%.

The relationship between the predictions shown in actual and predicted values of rainfall (mm) over the decade of SWM and NEM (Figures 1(a)–1(k) and 2(l)–2(v)) using the EM technique was employed. These predictions are closely related to ML predictions, as both approaches aim to model and predict outcomes based on historical data. Comparing EM algorithm predictions with observed values helps evaluate model performance and accuracy, a standard practice in ML for assessing generalization to new data (McLachlan & Krishnan 2007). The EM method can significantly enhance neural network training techniques. Using normalization, 90% of a century-long dataset with unique characteristics was selected, resulting in 313 rows from an original 433 rows, with redundant data removed to improve training quality. The refined training set aligns well with previous findings (Geethalakshmi et al. 2022).
Figure 1

(a)–(k) Comparison of actual and predicted decadal rainfall during the SWM from 1913 to 2023 (mm).

Figure 1

(a)–(k) Comparison of actual and predicted decadal rainfall during the SWM from 1913 to 2023 (mm).

Close modal
Figure 2

(a)–(k) Comparison of actual and predicted decadal rainfall during the NEM from 1913 to 2023 (mm).

Figure 2

(a)–(k) Comparison of actual and predicted decadal rainfall during the NEM from 1913 to 2023 (mm).

Close modal

The performance of the chosen model is displayed in Table 2 by the preprocessing technique used in MPNN and RFR. The variation in training and testing dataset proportions for different algorithms, such as 72.30 and 73.5% for MPNN and 27.70 and 26.5% for RFR, can be attributed to several factors, mainly that different algorithms have different complexities and are significantly different from each other. This can lead to different proportions for training and testing sets. The proportions might be chosen based on empirical results or previous studies that have shown optimal performance for each algorithm with those specific splits (Shalev-Shwartz & Ben-David 2014; Salehin et al. 2020). The models are used to forecast one step ahead of the observed data series after being fitted to actual data. Table 3 shows ANN information.

Table 2

Multilayer perceptron neural network and RFR case summary

Case processing summary
SampleNPercent (1)Percent (2)
Training 313 72.30% 73.50% 
Testing 120 27.70% 26.50% 
Valid 433 100.00% 100.00% 
Excluded   
Total 433   
Case processing summary
SampleNPercent (1)Percent (2)
Training 313 72.30% 73.50% 
Testing 120 27.70% 26.50% 
Valid 433 100.00% 100.00% 
Excluded   
Total 433   
Table 3

Multilayer perceptron neural network information

Network information
Input layer Factors Tmax, Tmin 
Number of units 
Hidden layer(s) Number of hidden layers 
Number of units in hidden layer 1a 
Activation function Hyperbolic tangent 
Output layer Dependent variables Rainfall 
Number of units 
Activation function Softmax 
Error function Cross-entropy 
Excluding the bias unit  
Network information
Input layer Factors Tmax, Tmin 
Number of units 
Hidden layer(s) Number of hidden layers 
Number of units in hidden layer 1a 
Activation function Hyperbolic tangent 
Output layer Dependent variables Rainfall 
Number of units 
Activation function Softmax 
Error function Cross-entropy 
Excluding the bias unit  

aThe number of units in hidden layer 1 was determined based on empirical tuning, optimizing model performance using cross-validation.

Temporal rainfall distribution

According to a temporal analysis (1913 –2023), the rainfall distribution in Coimbatore is not uniform across the period under study. In the specified time scale, the seasonality feature was most noticeable during the pre-monsoon (April and May), the North East Monsoon (October and November), and the South West Monsoon (June–September).

For the decadal average SWM rainfall, the highest actual rainfall (223.09 mm) occurred in 1953–1963, with MPNN predicting 253.09 mm and RFR 220.9 mm. The lowest actual SWM rainfall (159.94 mm) was recorded in 1943–1953, with MPNN predicting 160.16 mm and RFR 124.5 mm.

During the NEM, the highest actual rainfall (448.02 mm) occurred in 2003–2013, with the MPNN model predicting 344.43 mm and the RFR model predicting 463.07 mm. The lowest actual rainfall (306.65 mm) was recorded in 1933–1943, with MPNN predicting 260.18 mm and RFR 356.27 mm depicted in Figure 3(a) and 3(b).
Figure 3

(a), (b) Decadal average of Tamilnadu Agricultural University (TNAU) manual observed data (mm) and MPNN and RFR model predicted rainfall data (mm) for both SWM and NEM.

Figure 3

(a), (b) Decadal average of Tamilnadu Agricultural University (TNAU) manual observed data (mm) and MPNN and RFR model predicted rainfall data (mm) for both SWM and NEM.

Close modal

Selection of excellent algorithm

This paper discovered 85.55 and 86.50% accuracy in our work after employing MPNN and RFR to analyze all of our data. MPNN exhibited more accuracy for SWM, and RFR exhibited more accuracy for NEM. Our projections are based on data from the Coimbatore region. This region receives more rainfall during NEM.

Synaptic weights of the Hyperbolic Activation Function used in MPNN and RFR are represented in Figures 4 and 5. A sigmoid neuron's output is determined by multiplying the total of its inputs by a sigmoid activation function (Table 4). Tables 5 and 6 show the percentage correctness report of observed and predicted values of rainfall by MPNN and RFR model prediction. Table 7 shows a descriptive summary of the output similar to Ridwan et al. (2021). Predicting rainfall using machine learning, deep learning, and time-series models across an altitudinal gradient in the North-Western Himalayas (Wani et al. 2024) are compared with the results showed that in terms of prediction accuracy, deep learning models – particularly bi-directional long short-term memory networks – performed better than conventional ML models, such as ANN (Barrera-Animas et al. 2022). This implies that although ANN–MPNN models are useful, investigating more sophisticated models, such as Random Forest Tree (RFT) or ensemble approaches, may produce superior outcomes in rainfall prediction tasks.
Table 4

Overall model summary of the hidden layer activation function

Model summary
Cross-entropy error 125.448 
Percent incorrect predictions 14.10% 
Stopping rule used 1 consecutive step(s) with no decrease in error 
Training time 00:00.1 
Cross-entropy error 56.811 
Percent incorrect predictions 18.30% 
Dependent variable: VAR00001  
a. Error computations are based on the testing sample 
Model summary
Cross-entropy error 125.448 
Percent incorrect predictions 14.10% 
Stopping rule used 1 consecutive step(s) with no decrease in error 
Training time 00:00.1 
Cross-entropy error 56.811 
Percent incorrect predictions 18.30% 
Dependent variable: VAR00001  
a. Error computations are based on the testing sample 
Table 5

The percentage correctness report of observed and predicted values of rainfall (mm) used in MPNN model prediction

Classification
Predicted
SampleObserved01Percent correct
Training 221 13 94.40(%) 
30 48 61.50(%) 
Overall percent 80.20(%) 19.80(%) 85.55(%) 
Testing 82 93.20(%) 
16 16 50.00(%) 
Overall percent 81.70(%) 18.30(%) 81.70(%) 
Dependent variable: VAR00001 (rainfall) 
Classification
Predicted
SampleObserved01Percent correct
Training 221 13 94.40(%) 
30 48 61.50(%) 
Overall percent 80.20(%) 19.80(%) 85.55(%) 
Testing 82 93.20(%) 
16 16 50.00(%) 
Overall percent 81.70(%) 18.30(%) 81.70(%) 
Dependent variable: VAR00001 (rainfall) 
Table 6

The percentage correctness report of observed and predicted values of rainfall (mm) with RFR model prediction

Classification
Predicted
SampleObserved01Percent correct
Training 321 17 93.45% 
50 68 60.45% 
Overall percent 81.25% 20.90% 86.50% 
Testing 102 16 94.40% 
28 26 55.00% 
Overall percent 83.70% 19.40% 83.70% 
Dependent variable: VAR00001 (rainfall) 
Classification
Predicted
SampleObserved01Percent correct
Training 321 17 93.45% 
50 68 60.45% 
Overall percent 81.25% 20.90% 86.50% 
Testing 102 16 94.40% 
28 26 55.00% 
Overall percent 83.70% 19.40% 83.70% 
Dependent variable: VAR00001 (rainfall) 
Table 7

Descriptive summary of the output

Classification/UnitsSWM (1913–2023)NEM (1913–2023)
Algorithm MPNN RFR 
N (mm) 20 20 
Mean (mm) 329.1667 362.5 
SD (mm) 3.156 2.154 
Standard error mean (mm) 0.881 0.567 
Mean absolute error (mm) 0.751 0.834 
RMSE (mm) 0.814 0.784 
R^2 (mm) 0.87 0.89 
Accuracy (%) 85.55% 86.50% 
Training(%) 72.30 73.5 
Testing(%) 27.70 24.5 
Precision (%) 82% 80% 
Classification/UnitsSWM (1913–2023)NEM (1913–2023)
Algorithm MPNN RFR 
N (mm) 20 20 
Mean (mm) 329.1667 362.5 
SD (mm) 3.156 2.154 
Standard error mean (mm) 0.881 0.567 
Mean absolute error (mm) 0.751 0.834 
RMSE (mm) 0.814 0.784 
R^2 (mm) 0.87 0.89 
Accuracy (%) 85.55% 86.50% 
Training(%) 72.30 73.5 
Testing(%) 27.70 24.5 
Precision (%) 82% 80% 

Note. mm, millimeters; N, No of parameters; RMSE, root mean square error.

Figure 4

Synaptic weights of the hyperbolic activation function with MPNN.

Figure 4

Synaptic weights of the hyperbolic activation function with MPNN.

Close modal
Figure 5

Synaptic weights of the hyperbolic activation function with RFR.

Figure 5

Synaptic weights of the hyperbolic activation function with RFR.

Close modal

This study aimed to assess MPNN and RFR algorithms for forecasting monthly rainfall from 1913 to 2023. Using EM might reduce the feature set from seven to five dimensions. Normalization helped to identify centroids, selecting rows with distinct features, covering 70–85% of the original dataset for training. MPNN exhibited a higher accuracy of 85.55% for SWM compared to RFR. For NEM, RFR showed a higher accuracy of 86.50% than MPNN. The coefficient of determination (R2) between predicted and observed rainfall values from 2020 to 2023 was 0.8, indicating superior algorithm performance. Future work will concentrate on minimizing error discrepancies using advanced neural network techniques and also satellite integration with ML for accurate rainfall predictions.

The opinions and perspectives presented in this research paper/article are those of the authors and may not necessarily represent the views of their respective organizations.

The contents and viewpoints articulated in this study are the author's own and do not necessarily represent the opinions of their affiliated organizations.

The authors confirm their contribution to the paper as follows: study conception and design: Oviyakandasamy, N. Maragatham; data collection: N. Maragatham; analysis and interpretation of results: Oviyakandasamy, E. Somasundaram, R. Ravikumar; draft manuscript preparation: Oviyakandasamy, Balajikannan. All authors reviewed the results and approved the final version of the manuscript.

All relevant data are included in the paper or its Supplementary Information.

The authors declare there is no conflict.

Abbot
J.
&
Marohasy
J.
(
2012
)
Application of artificial neural networks to rainfall forecasting in Queensland, Australia
,
Advances in Atmospheric Sciences
,
29
,
717
730
.
Abhishek
K.
,
Kumar
A.
,
Ranjan
R.
&
Kumar
S.
(
2012
). '
A rainfall prediction model using an artificial neural network
',
2012 IEEE Control and System Graduate Research Colloquium
.
Abinayarajam
D.
,
Patil
S.
,
Dheebhakaran
G.
&
Ramanathan
S.
(
2021
)
Medium range forecast of maximum temperature of Coimbatore using ANN
,
International Journal of Current Microbiology and Applied Sciences
,
10
(
01
),
3015
3019
.
Baladjay
J. M.
,
Riva
N.
,
Santos
L. A.
,
Cortez
D. M.
,
Centeno
C.
&
Sison
A. A. R.
(
2023
)
Performance evaluation of random forest algorithm for automating classification of mathematics question items
,
World Journal of Advanced Research and Reviews
,
18
(
2
),
034
043
.
Barrera-Animas
A. Y.
,
Oyedele
L. O.
,
Bilal
M.
,
Akinosho
T. D.
,
Delgado
J. M. D.
&
Akanbi
L. A.
(
2022
)
Rainfall prediction: a comparative analysis of modern machine learning algorithms for time-series forecasting
,
Machine Learning with Applications
,
7
,
100204
.
Basha
C. Z.
,
Bhavana
N.
,
Bhavya
P.
&
Sowmya
V.
(
2020
). '
Rainfall prediction using machine learning & deep learning techniques
’,
2020 International Conference on Electronics and Sustainable Communication Systems (ICESC)
.
Brownlee
J.
(
2018
)
Deep Learning for Time Series Forecasting: Predict the Future with MLPs, CNNs, and LSTMs in Python
.
Machine Learning Mastery
.
Dash
Y.
,
Mishra
S. K.
&
Panigrahi
B. K.
(
2018
)
Rainfall prediction for the Kerala state of India using artificial intelligence approaches
,
Computers & Electrical Engineering
,
70
,
66
73
.
Dubey
A. D.
(
2015
)
Artificial neural network models for rainfall prediction in Pondicherry
,
International Journal of Computer Applications
,
120
(
3
),
30
35
.
Geethalakshmi
V.
,
Kokilavani
S.
,
Ramanathan
S.
,
Dheebakaran
G.
,
Sathyamoorthy
N.
&
Maragatham
N.
(
2022
)
Monthly rainfall hind-cast using machine learning algorithms for Coimbatore, Tamil Nadu
,
MAUSAM
,
73
(
1
),
19
26
.
Jafri
I. A. M.
,
Noor
N. M.
,
Ul-Saufie
A. Z.
&
Suwardi
A.
(
2020
). '
Prediction of missing data in rainfall dataset by using simple statistical method
',
IOP Conference Series: Earth and Environmental Science
.
Khalifeloo
M. H.
,
Mohammad
M.
&
Heydari
M.
(
2015
)
Application of different statistical methods to recover missing rainfall data in the Klang River catchment
,
Int. J. Innov. Sci. Math
,
3
,
2347
9051
.
Kokilavani
S.
,
Pangayarselvi
R.
,
Ramanathan
S.
,
Dheebakaran
G.
,
Sathyamoorthy
N.
,
Maragatham
N.
&
Gowtham
R.
(
2020
)
SARIMA modeling and forecasting of monthly rainfall patterns for Coimbatore, Tamil Nadu, India
,
Current Journal of Applied Science and Technology
,
39
(
8
),
69
76
.
Lee
S.
,
Cho
S.
&
Wong
P. M.
(
1998
)
Rainfall prediction using artificial neural networks
,
Journal of Geographic Information and Decision Analysis
,
2
(
2
),
233
242
.
McLachlan
G. J.
&
Krishnan
T.
(
2007
)
The EM Algorithm and Extensions
.
Hoboken, NJ
:
John Wiley & Sons
.
Mohammed
A.
&
Kora
R.
(
2023
)
A comprehensive review on ensemble deep learning: opportunities and challenges
,
Journal of King Saud University-Computer and Information Sciences
,
35
(
2
),
757
774
.
Nandakumar
S.
,
Valarmathi
R.
,
Juliet
P. S.
&
Brindha
G.
(
2021
). '
Artificial neural network for rainfall analysis using deep learning techniques
’,
Journal of Physics: Conference Series
.
Nayak
D. R.
,
Mahapatra
A.
&
Mishra
P.
(
2013
)
A survey on rainfall prediction using artificial neural network
,
International Journal of Computer Applications
,
72
(
16
),
32
40
.
Pham
B. T.
,
Le
L. M.
,
Le
T.-T.
,
Bui
K.-T. T.
,
Le
V. M.
,
Ly
H.-B.
&
Prakash
I.
(
2020
)
Development of advanced artificial intelligence models for daily rainfall prediction
,
Atmospheric Research
,
237
,
104845
.
Poornima
S.
,
Devi
S.
,
Oviya
D.
,
Taj
A. S.
&
Elakkiya
G. T.
(
2020
)
Prediction of rainfall using machine learning
,
International Journal of Recent Technology and Engineering (IJRTE)
,
9
(
1
),
1374
1377
.
Renuga Devi
S.
,
Arulmozhivarman
P.
&
Venkatesh
C.
(
2017
)
ANN-based rainfall prediction – a tool for developing a landslide early warning system
. In:
M. Mikoš, Ž. Arbanas, Y. Yin & K. Sassa (eds.)
,
Advancing Culture of Living with Landslides: Volume 3 Advances in Landslide Technology
,
Cham, Switzerland: Springer
.
Ridwan
W. M.
,
Sapitang
M.
,
Aziz
A.
,
Kushiar
K. F.
,
Ahmed
A. N.
&
El-Shafie
A.
(
2021
)
Rainfall forecasting model using machine learning methods: case study Terengganu, Malaysia
,
Ain Shams Engineering Journal
,
12
(
2
),
1651
1663
.
Salehin
I.
,
Talha
I. M.
,
Hasan
M. M.
,
Dip
S. T.
,
Saifuzzaman
M.
&
Moon
N. N.
(
2020
). '
An artificial intelligence-based rainfall prediction using LSTM and neural network
',
2020 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE)
.
Samhitha
S. V.
&
Srikanth
P. G.
(
2017
)
Prediction of rainfall using inverse distance weighting method and artificial neural networks in Ponnaiyar river basin
,
Indian Journal of Scientific Research
,
5
(
11
),
144
149
.
Shalev-Shwartz
S.
&
Ben-David
S.
(
2014
)
Understanding Machine Learning: From Theory to Algorithms
.
New York, NY
:
Cambridge University Press
.
Suriya
S.
,
Saran
K.
,
Chris Anto
L.
,
Anbalagan
C.
&
Vinodh
K.
(
2021
). '
Inflow forecasting of Bhavanisagar reservoir using artificial neural network (ANN): a case study
',
Sustainable Practices and Innovations in Civil Engineering: Select Proceedings of SPICE 2019
.
Wani
O. A.
,
Mahdi
S. S.
,
Yeasin
M.
,
Kumar
S. S.
,
Gagnon
A. S.
,
Danish
F.
,
Al-Ansari
N.
,
El-Hendawy
S.
&
Mattar
M. A.
(
2024
)
Predicting rainfall using machine learning, deep learning, and time series models across an altitudinal gradient in the North-Western Himalayas
,
Scientific Reports
,
14
(
1
),
27876
.
Yu
P.-S.
,
Yang
T.-C.
,
Chen
S.-Y.
,
Kuo
C.-M.
&
Tseng
H.-W.
(
2017
)
Comparison of random forests and support vector machine for real-time radar-derived rainfall forecasting
,
Journal of Hydrology
,
552
,
92
104
.
Zainudin
S.
,
Jasim
D. S.
&
Bakar
A. A.
(
2016
)
Comparative analysis of data mining techniques for Malaysian rainfall prediction
,
International Journal on Advanced Science, Engineering and Information Technology
,
6
(
6
),
1148
1153
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).