Highly accurate rainfall prediction can provide a reliable scientific basis for human production and life. For the characteristics of occasional and sudden changes of rainfall in coastal hilly areas, this article chooses four cities in the eastern Zhejiang province as the object of the study and establishes a rainfall prediction model based on variational mode decomposition (VMD), reptile search algorithm (RSA), and differentiable neural computer (DNC). The VMD algorithm reduces the complexity of the sequence data; RSA is used to find the best-fit function; and DNC combines the advantages of the recurrent neural network and computational processing to improve the problem of memory forgetting of long short-term memory. To verify the prediction accuracy of the model, the prediction results are compared with the other three models, and the results show that the VMD–RSA–DNC model has the best prediction with the maximum and minimum relative errors of 9.62 and 0.17%, respectively, the average root-mean-square error of 5.43, the average mean absolute percentage error of 3.59%, and the average Nash–Sutcliffe efficiency of 0.95 for predicting four cities in the coastal hilly area. This study provides a new reference method for the construction of rainfall prediction models.

  • Optimization of the differentiable neural computer (DNC) controller with reptile search algorithm (RSA) has a solid theoretical basis.

  • The coastal hilly area where plum rains and typhoons exist is selected for the study, and the prediction effect is better.

  • The coupled variational mode decomposition (VMD)–RSA–DNC model has higher prediction accuracy compared with other models.

Rainfall prediction is closely related to human life and socioeconomic aspects, and accurate rainfall prediction can provide a reliable scientific basis for human travel and life safety, but there are many challenges in establishing efficient rainfall prediction models. The commonly used rainfall prediction tools are long in periodicity and too cumbersome, which are not effective in predicting areas with large weather variations and chance. In recent years data-driven machine learning models have become a hot topic in the field of hydrological forecasting. Vidyarthi & Jain (2023) proposed to integrate two machine learning methods, artificial neural network (ANN) and decision tree, for rainfall prediction. Fahad et al. (2022) proposed a depth forecasting model based on an optimized gated recurrent unit. Bajpai et al. (2023) successfully predicted summer monsoon rainfall using 118 years of historical time-series data. Santos et al. (2023) presented a method for forecasting using wavelet neural networks (WNNs) and showed that for long-term forecasting in the Mahanadi River basin, the WNN method outperforms all other applications. Rojas et al. (2023) analyzed the potential of deep learning and used probabilistic ANNs in four observation locations for postprocessing integrated precipitation forecasts, and the results showed the potential of deep learning models in weather forecasting workflows. Xie et al. (2019) proposed to optimize the input parameters of the wavelet neural network using the genetic algorithm to establish a Global Positioning System (GPS) predictable precipitation model based on the genetic wavelet neural network, which better reflected the variation of precipitation characteristics, and the accuracy of the obtained results was significantly improved compared with both the backpropagation neural network and the wavelet neural network. Lv et al. (2020) developed a coupled variational modal decomposition (VMD)–least squares support vector regression machine model and applied it to monthly runoff prediction at several hydrological stations and obtained better prediction results.

From a comprehensive view of domestic and international research, a single model has limited accuracy in predicting rainfall time series, and rainfall prediction models coupled with many different methods are widely used. Recurrent neural networks are widely used to deal with simulation and prediction problems of rainfall time-series data because of their recurrent structure settings, but their utilization of long-lasting information (long memory) is still somewhat limited. Rainfall time series frequently have a certain correlation between before and after, and to effectively extract the feature information within the monthly rainfall time series, some researchers have introduced data preprocessing means such as time-series signal decomposition algorithms into the rainfall time-series prediction process (Wang et al. 2019). Among them, wavelet decomposition requires preset basis functions with poor adaptiveness (Sun et al. 2018), empirical mode decomposition (EMD) has strong adaptiveness but is prone to problems such as modal blending and endpoint effects, and the VMD algorithm can self-adjust the number of modal decompositions to effectively avoid modal confusion and reduce the complexity of sequence data. Long short-term memory (LSTM), a neural network proposed in recent years, effectively solves the problem of long memory loss by changing the neuron structure based on traditional recurrent neural networks, but its processing of long-range sequences relies on a large amount of computational resources and is ineffective. To solve this problem, a variety of neural networks with external storage mechanisms have been designed, and the most famous one is the differentiable neural computer (DNC) hybrid learning neural network proposed by Graves et al., which is based on the neural turing machine to improve the storage management method. The inclusion of improved temporal memory links allows the DNC to jump read or update memory information, thus combining the advantages of recurrent neural networks and computational processing to improve the problem of memory forgetting in LSTM (Graves et al. 2016). There are several hyperparameters that need to be set manually in the recurrent neural network prediction process, and different combinations of hyperparameters affect the final results of the model (Qiu et al. 2020). In contrast, the selection of hyperparameters for the network structure has not been clearly defined in a series of learning models (Aufa et al. 2020). Therefore, many algorithms for optimizing hyperparameters have been proposed, among which RSA is a new nature-inspired meta-heuristic optimizer proposed in 2021, which has the advantages of less adjustable parameters, strong stability of optimization search, and easy programming implementation compared with other intelligent optimization algorithms (Abualigah et al. 2022). Therefore, this article innovatively uses RSA to optimize the controller of DNC on this basis and then predicts the rainfall data after VMD decomposition, so as to establish a VMD–RSA–DNC model and apply it to rainfall prediction in coastal hilly areas, and to explore the application effect of VMD–RSA–DNC in rainfall prediction.

VMD

Variational mode decomposition (VMD) is an adaptive and completely nonrecursive method for modal variation and signal processing (Zuo et al. 2020). The adaptive nature of the EMD method is that it determines the number of modal decompositions of a given sequence according to the actual situation, and the subsequent search and solution process can adaptively match the optimal center frequency and finite bandwidth of each mode. It reduces the nonsmoothness of rainfall time series with high complexity and strong nonlinearity and decomposes a relatively smooth sub-series containing several different frequency scales, which is suitable for nonstationary series.

  • (1)
    Construction of the variational problem: The constraints of the model are to ensure that the sum of bandwidths of all IMFs is minimum and that the sum of individual Intrinsic Modal Function (IMF)s is equal to the input signal. Assuming that the original signal f is decomposed into k components, the decomposition sequence is guaranteed to be a finite bandwidth of modal components with central frequency, while the sum of estimated bandwidths of each mode is minimized, and the constraint is that the sum of all modes is equal to the original signal. The corresponding constrained variational expression is given as follows:
    (1)
    (2)
    where ‘K’ is the number of modes to be decomposed (positive integer), and are the decomposed modal component and the center frequency, respectively. is the Dirac function, is the convolution operator, and is the phase volume representation of the central frequency on the complex plane.
  • (2)
    Solution of the variational model: To find the optimal solution for the aforementioned variational model, the constrained variational model is transformed into an unconstrained variational model by constructing an augmented LaGrange expression using a quadratic penalty factor and a LaGrange multiplier , which is expressed as follows:
    (3)
    where can be used to guarantee the reconfiguration accuracy of the signal to impose a limit on the bandwidth. BY using the alternating direction multiplier method, , , and are updated alternately until the convergence condition is satisfied.

RSA

The RSA is inspired by the social behavior of crocodiles in nature and contains two main mechanisms: encirclement mechanism and hunting mechanism. RSA is designed to simulate the crocodile's four location update strategies: high walking, belly crawling, hunting coordination, and hunting cooperation. Traditional grid search and random search are computationally intensive, while RSA imitates the social behavior of crocodiles in nature. The RSA algorithm has the advantages of less adjustable parameters, strong stability of the optimization search, and easy programming implementation compared with other intelligent optimization algorithms.

The RSA mathematical description is briefly described as follows:

  • (1)
    Initialization: Set the crocodile population size N. The formula for calculating of the initialized crocodile individual location is as follows:
    (4)
    where the ‘j’ denotes the dimensional location of the ‘i’ alligator; N denotes the population size; n denotes the problem dimension; UB and LB denote the upper and lower limits of the search space, respectively; and ‘rand’ denotes a uniformly distributed random number between 0 and 1.
  • (2)
    Surrounding mechanism (exploration phase): RSA implements the transition between the alligator surrounding (exploration) and hunting (exploitation) mechanisms by dividing the total number of iterations ‘T’ into four parts. The encirclement mechanism mainly performs a high walking, belly crawling strategy to explore a wider search area and find a better solution. The encirclement mechanism position update operator is specified as follows:
    (5)

Here, means the ‘j’ position of the ‘i’ alligator in the ‘t + 1’ iteration; denotes the ‘j’ position of the best solution obtained so far; ‘t’ denotes the current number of iterations, ‘T’ denotes the maximum number of iterations; denotes the hunting operator at the ‘j’ position of the ‘i’ alligator, which is described as follows: , where denotes the upper and lower boundaries of the ‘j’ position, respectively. α denotes the sensitive parameter to control the accuracy of hunting mechanism exploration, taking the value 0.1; ε denotes the very small constant; β denotes a sensitive parameter controlling the exploration accuracy of the envelope mechanism, taking the value 0.1; used to narrow the search area; denotes a random number between ; indicates the random position of the ‘i’ alligator; denotes the probability ratio, described as ; denotes a random integer between −1 and 1; and other parameters have the same meaning as mentioned earlier.

  • (3)
    Hunting mechanism (development stage). Crocodiles exploit search during hunting mainly through hunting coordination and hunting cooperation strategies. Unlike the encirclement mechanism, the crocodile hunting mechanism enables them to easily approach the target prey, i.e., the algorithm optimal solution. The hunting mechanism position update operator is described as follows:
    (6)

The meaning of the parameters in the aforementioned formula is the same as earlier.

DNC

DNC hybrid learning neural network is a special recurrent neural network with external memory (not trainable). By combining a trainable neural network controller with a read–write external memory, it can both learn algorithms and parameters like a neural network and process complex data information streams like a computer. After each time step t by the trainable controller based on the information flow at moment t − 1 exchanged with the external memory, the prediction information of both parts is linearly combined to determine the final output prediction. Figure 1 illustrates the overall architecture of the DNC model. The memory information obtained from the input data processing in the controller is written to the memory, and the interaction between the controller and the memory is done through the read and write head (Masoud & Faramarz 2020; Ming et al. 2020).
Figure 1

DNC model structure.

Figure 1

DNC model structure.

Close modal

As shown earlier, the DNC is equivalent to adding external memory to the LSTM, which greatly improves the problem of memory forgetting of the LSTM.

Controller

The controller in the DNC is a variant of the LSTM. Sometimes, the controller obtains an input vector and read R vectors in the memory storage matrix . Thus, the vector is concatenated with the R readout vectors, and the input to the controller at time t is . At moment t, the output value of the hidden layer for an LSTM network with one hidden layer is given as follows:
(7)
At each moment t, the controller computes the output vector and an interaction vector , which is defined as follows:
(8)
The controller passes the information back by creating a loop in the computational graph, which in turn yields . The computational graph is shown in Figure 2.
Figure 2

Computation graph of controller.

Figure 2

Computation graph of controller.

Close modal
Figure 3

The combined VMD–RSA–DNC model prediction flowchart.

Figure 3

The combined VMD–RSA–DNC model prediction flowchart.

Close modal
Finally, the output vector is expressed as follows:
(9)

This design allows the controller to regulate its output decisions by reinforcing the dependence on the memory storage matrix.

Read and write head and memory

The controller η operates on the data in the memory through the read/write head. The read or write positions are determined by the corresponding weights, and the set of allowed weights at N positions is the nonnegative quadrant of the standard simplex form in RN:

(10)
In the read operation, multiple read weights are used to compute the weighted average of the contents, thus defining the read vector as follows:
(11)
In the write operation, the write weight is combined with the erase vector and the write vector to modify the memory storage matrix:
(12)
where ° denotes the Hadamard product and E is the all one matrix of N × M.

The addressing mechanism of the DNC is a combination of several things: content-based addressing and dynamic addressing are used when writing data in memory and content-based addressing and temporal memory links are used to acquire locations when reading data in memory:

  • (1)

    Content-based addressing

The content lookup operation on the storage matrix M is defined as follows:
(13)
Here, vector k is the lookup key; β is the key strength parameter; and is the cosine similarity, which is used as the evaluation function of content similarity:
(14)

defines the normalized probability distribution over the storage locations. is a constraint vector defined as a standard (N − 1)-simple form:

(15)
  • (2)

    Dynamic addressing

Dynamic addressing is achieved through the release list . The release list records the free storage locations in the memory and is updated after the controller makes changes to the data in the memory.

Before writing data, the controller assigns the release gate parameter to the read head i to determine whether the most recently fetched location information is released. is a retention vector indicating the extent to which the location information will not be released:
(16)
where denotes the storage usage vector at moment t, = 0:
(17)

The value of is obtained, which is sorted in the ascending order, and the release list can be updated. records the storage location with the lowest utilization.

  • (3)

    Write weights

Define the allocation weight , which is used to generate the newly allocated write position:
(18)
The controller has the option to write or not write to the newly assigned location. First, define the write content weights :
(19)
where vector is the write key and is the write intensity.
Then, the write weight is defined as follows:
(20)
where denotes allocation gate and denotes write gate.
  • (4)

    Read weight

For read header i, the read content weight is defined as follows:
(21)
where is the read key.
The read head also receives a read mode vector , defining the read weights as follows:
(22)
where is the forward weight and is the reverse weight.

Time-series memory link

The temporal memory link holds information about the order in which the memory locations are written and is denoted as . indicates the degree to which location i is written after location j is written, and each row and column of can be defined with location weights:
(23)
where is the priority weight and denotes the degree to which position i was written last:
(24)
For read head i, define reverse weights and forward weights :
(25)
(26)

Model coupling

The VMD algorithm is based on the strict variational theory, and the number of modal decompositions is selected autonomously to make the decomposed components more regular (Zhang et al. 2023), while the RSA can realize the adaptive selection of the embedding dimension of each component in the prediction. Each IMF obtained from the VMD decomposition is a single harmonic, and the sequence complexity is greatly reduced compared to the original rainfall data, thus allowing effective adaptation to the DNC predictions.

On the basis of this, we combine VMD, RSA, and DNC to build a rainfall time-series prediction model based on VMD–RSA–DNC. VMD decomposes the rainfall data into several components and then uses DNC to predict each component IMF separately; and train and solve the posterior probability distribution of weight parameters of DNC based on variational inference; and use the training sample mean square error (MSE) as the fitness function of the RSA-optimized DNC controller. The crocodile individual position is initialized, the crocodile population fitness value is calculated, the current optimal crocodile individual position is compared and continuously updated, and whether the termination condition is satisfied is judged. If it is satisfied, output (Li & Cui 2022), which is used as the input layer weights and the implied layer bias matrix of the DNC controller for prediction. At the end of the algorithm, the prediction results of each DNC are reconstructed and superimposed to obtain the final prediction results of the rainfall time series based on the VMD–RSA–DNC model. The model coupling process is shown in Figure 3.

Study area and data sources

The four cities in the study area are located in the coastal hilly area of eastern Zhejiang province, with a monsoonal maritime climate on the southern edge of the northern subtropics. The hydrological situation is complex, and the surface water system is not well developed, mostly originating from the hilly hinterland and meandering into the sea in a radial pattern. The water system is influenced by the scale of the island; the flow is short; the catchment area is small, which is affected by heavy rainfall; and the water level rises and falls and hence it is easy to trigger flash floods and other natural disasters. May to June is the rainy season, and the rainfall is more concentrated in this season. July to September is the typhoon season, and typhoons or tropical storms are very frequent. The ensuing heavy rainfall intensity, long duration, is one of the main factors triggering geological disasters. The specific location of the study area is shown in Figure 4.
Figure 4

Location map of the study area.

Figure 4

Location map of the study area.

Close modal
The meteorological data in this article were obtained from the National Meteorological Information Center of the China Meteorological Administration (http://data.cma.cn) for monthly precipitation of meteorological stations in Zhoushan, Ningbo, Taizhou and Wenzhou. The dataset (2000–2020) was used. The study data are shown in Figure 5.
Figure 5

Monthly precipitation data of Zhoushan, Ningbo, Taizhou, and Wenzhou from 2000 to 2020.

Figure 5

Monthly precipitation data of Zhoushan, Ningbo, Taizhou, and Wenzhou from 2000 to 2020.

Close modal

As shown in Figure 5, the suddenness of rainfall in the study area, the short duration, and the large amount of rainfall make it a great challenge to build prediction models. There is often a certain correlation between the rainfall time series before and after, and it is difficult for a single neural network model to achieve a better prediction effect, and it cannot simulate some high-frequency sudden change data well. Therefore, the use of rainfall prediction models coupled with different methods is an important way to improve the prediction accuracy.

Data decomposition

The rainfall time-series data are relatively homogeneous, and the internal frequency information and variation patterns are difficult to be explored directly. The VMD algorithm is used to decompose the time series into several IMFs, thus visualizing the hidden information such as the trend of periodic variation inherent in the time series and increasing the amount of data information for the prediction model. The number of decompositions of IMFs is selected reasonably by comparing the closeness of the central frequencies. Following the steps of the previous VMD decomposition, the rainfall data of four cities, namely, Zhoushan, Ningbo, Taizhou, and Wenzhou, from 2000 to 2020 were decomposed by VMD, and the decomposition results are shown in Figure 6.
Figure 6

VMD of rainfall sequence.

Figure 6

VMD of rainfall sequence.

Close modal

Rainfall forecast

The VMD–RSA–DNC model was used to train the rainfall data of Zhoushan, Ningbo, Taizhou, and Wenzhou cities from 2000 to 2018 and validate the rainfall from 2019 to 2020. The results of the training and validation periods using the training sample MSE as the fitness function of the RSA-optimized DNC controller are shown in Table 2.

Based on the previously described steps, the rainfall of Zhoushan, Ningbo, Taizhou, and Wenzhou for 2000–2020 is predicted, and the prediction j results of each IMF component are shown in Figure 7.
Figure 7

IMF component forecast map: (a) Zhoushan, (b) Ningbo, (c) Taizhou, and (d) Wenzhou.

Figure 7

IMF component forecast map: (a) Zhoushan, (b) Ningbo, (c) Taizhou, and (d) Wenzhou.

Close modal
Figure 7 shows that the overall effect of DNC prediction optimized by RSA for each component is feasible. To see the prediction effect of this model more obviously, the results are presented in Figure 8, and the predicted values are compared with the real values as shown in Table 1.
Table 1

Error index data for 2019–2020

CityYearMonthPredicted value (mm)Measured value (mm)Relative error (%)
Zhoushan 2019 71.00 72.96 −2.69 
157.17 161.97 −2.96 
120.20 113.54 5.87 
72.15 70.34 2.57 
82.85 78.92 4.98 
223.47 207.23 7.84 
184.80 177.89 3.88 
401.64 438.24 −8.35 
209.46 197.40 6.11 
10 271.88 292.21 −6.96 
11 32.10 33.96 −5.48 
12 108.11 113.18 −4.48 
2020 141.75 142.20 −0.32 
102.00 93.99 8.52 
144.42 135.03 6.95 
47.10 45.20 4.20 
132.80 133.75 −0.71 
328.24 317.54 3.37 
167.26 174.33 −4.06 
64.92 61.65 5.30 
200.60 211.67 −5.23 
10 11.55 11.31 2.12 
11 48.07 47.85 0.46 
12 26.00 26.15 −0.57 
Ningbo 2019 90.37 96.11 −5.97 
162.27 174.35 −6.93 
140.92 135.98 3.63 
110.98 108.11 2.65 
87.82 82.30 6.71 
229.43 232.64 −1.38 
239.33 225.48 6.14 
330.65 347.81 −4.93 
165.20 172.64 −4.31 
10 180.45 169.99 6.15 
11 31.06 30.53 1.74 
12 117.72 115.27 2.13 
2020 150.02 150.97 −0.63 
71.18 74.53 −4.49 
143.77 141.07 1.91 
45.57 44.83 1.65 
145.40 150.69 −3.51 
355.57 348.85 1.93 
160.11 154.62 3.55 
144.25 147.91 −2.47 
188.02 182.54 3.00 
10 19.85 19.60 1.28 
11 33.62 32.91 2.16 
12 21.36 21.69 −1.52 
Taizhou 2019 84.70 85.43 −0.85 
151.08 148.82 1.52 
147.96 151.67 −2.45 
187.56 193.27 −2.95 
103.62 109.04 −4.97 
264.36 254.26 3.97 
313.19 321.84 −2.69 
282.20 299.37 −5.74 
91.84 93.79 −2.08 
10 112.83 118.35 −4.66 
11 8.94 9.02 −0.89 
12 73.00 75.41 −3.20 
2020 92.66 89.56 3.46 
70.94 72.82 −2.58 
142.95 153.59 −6.93 
52.33 54.08 −3.24 
194.75 187.25 4.01 
227.53 223.62 1.75 
119.92 113.50 5.66 
160.69 164.86 −2.53 
158.03 144.16 9.62 
10 10.21 10.33 −1.16 
11 14.71 14.11 4.25 
12 25.33 25.96 −2.43 
Wenzhou 2019 54.06 53.97 0.17 
135.20 137.60 −1.74 
150.05 144.77 3.65 
198.82 205.27 −3.14 
111.65 106.48 4.86 
312.51 329.55 −5.17 
249.16 233.83 6.56 
194.86 199.61 −2.38 
77.45 80.44 −3.72 
10 74.99 72.29 3.73 
11 9.51 9.40 1.17 
12 54.83 53.35 2.77 
2020 51.53 50.02 3.02 
77.87 75.96 2.51 
221.13 218.32 1.29 
53.19 52.61 1.10 
136.14 143.36 −5.04 
173.93 173.47 0.27 
128.59 139.06 −7.53 
159.22 153.09 4.00 
197.48 187.23 5.47 
10 7.53 7.95 −5.28 
11 14.03 13.23 6.05 
12 28.96 29.89 −3.11 
CityYearMonthPredicted value (mm)Measured value (mm)Relative error (%)
Zhoushan 2019 71.00 72.96 −2.69 
157.17 161.97 −2.96 
120.20 113.54 5.87 
72.15 70.34 2.57 
82.85 78.92 4.98 
223.47 207.23 7.84 
184.80 177.89 3.88 
401.64 438.24 −8.35 
209.46 197.40 6.11 
10 271.88 292.21 −6.96 
11 32.10 33.96 −5.48 
12 108.11 113.18 −4.48 
2020 141.75 142.20 −0.32 
102.00 93.99 8.52 
144.42 135.03 6.95 
47.10 45.20 4.20 
132.80 133.75 −0.71 
328.24 317.54 3.37 
167.26 174.33 −4.06 
64.92 61.65 5.30 
200.60 211.67 −5.23 
10 11.55 11.31 2.12 
11 48.07 47.85 0.46 
12 26.00 26.15 −0.57 
Ningbo 2019 90.37 96.11 −5.97 
162.27 174.35 −6.93 
140.92 135.98 3.63 
110.98 108.11 2.65 
87.82 82.30 6.71 
229.43 232.64 −1.38 
239.33 225.48 6.14 
330.65 347.81 −4.93 
165.20 172.64 −4.31 
10 180.45 169.99 6.15 
11 31.06 30.53 1.74 
12 117.72 115.27 2.13 
2020 150.02 150.97 −0.63 
71.18 74.53 −4.49 
143.77 141.07 1.91 
45.57 44.83 1.65 
145.40 150.69 −3.51 
355.57 348.85 1.93 
160.11 154.62 3.55 
144.25 147.91 −2.47 
188.02 182.54 3.00 
10 19.85 19.60 1.28 
11 33.62 32.91 2.16 
12 21.36 21.69 −1.52 
Taizhou 2019 84.70 85.43 −0.85 
151.08 148.82 1.52 
147.96 151.67 −2.45 
187.56 193.27 −2.95 
103.62 109.04 −4.97 
264.36 254.26 3.97 
313.19 321.84 −2.69 
282.20 299.37 −5.74 
91.84 93.79 −2.08 
10 112.83 118.35 −4.66 
11 8.94 9.02 −0.89 
12 73.00 75.41 −3.20 
2020 92.66 89.56 3.46 
70.94 72.82 −2.58 
142.95 153.59 −6.93 
52.33 54.08 −3.24 
194.75 187.25 4.01 
227.53 223.62 1.75 
119.92 113.50 5.66 
160.69 164.86 −2.53 
158.03 144.16 9.62 
10 10.21 10.33 −1.16 
11 14.71 14.11 4.25 
12 25.33 25.96 −2.43 
Wenzhou 2019 54.06 53.97 0.17 
135.20 137.60 −1.74 
150.05 144.77 3.65 
198.82 205.27 −3.14 
111.65 106.48 4.86 
312.51 329.55 −5.17 
249.16 233.83 6.56 
194.86 199.61 −2.38 
77.45 80.44 −3.72 
10 74.99 72.29 3.73 
11 9.51 9.40 1.17 
12 54.83 53.35 2.77 
2020 51.53 50.02 3.02 
77.87 75.96 2.51 
221.13 218.32 1.29 
53.19 52.61 1.10 
136.14 143.36 −5.04 
173.93 173.47 0.27 
128.59 139.06 −7.53 
159.22 153.09 4.00 
197.48 187.23 5.47 
10 7.53 7.95 −5.28 
11 14.03 13.23 6.05 
12 28.96 29.89 −3.11 
Figure 8

Model prediction curve.

Figure 8

Model prediction curve.

Close modal

As can be seen from Figure 8 and Table 1, the predicted values of rainfall data for four cities in eastern Zhejiang for 2019–2020 are basically consistent with the true values. Although the error reaches more than 5% in individual months, the occurrence of extreme weather in individual months can cause this phenomenon. From the aforementioned results, it can also be seen that the maximum and minimum relative errors are divided into 9.62 and 0.17%, and all relative errors are less than 20%.

To verify the prediction effect of the models, four models, namely, VMD–RSA–DNC, EMD–RSA–DNC, VMD–DNC, and VMD–RSA–LSTM were selected for the experiments in turn. The number of adaptive decompositions of EMD is 6, and the number of IMF decomposition K of VMD is set to 6 according to the results of central frequency analysis. Figure 9 shows the comparison of the absolute values of prediction errors of each model in four cities.
Figure 9

Model relative error comparison chart.

Figure 9

Model relative error comparison chart.

Close modal

From Figure 9, it can be seen that the VMD–RSA–DNC model is more accurate in predicting the overall trend of monthly rainfall in all four cities, and the prediction at the extreme values of the samples has a significant advantage compared with the other three models.

Under the same conditions, the combined EMD-based model can predict the general trend of the monthly rainfall series, but the overall prediction effect is inferior to that of the VMD-based prediction model, which fully indicates that the VMD decomposition overcomes the problems of endpoint effects and modal component confounding in the EMD method, and reduces the nonsmoothness of time series with high complexity and strong nonlinearity; the DNC combines the advantages of the recurrent neural network and computational processing. The DNC combines the advantages of recurrent neural network and computational processing and breaks through the problem of memory forgetting of LSTM. From the comparison graphs, it can be seen that the predicted values based on VMD–RSA–DNC are closer to the sample real-world values compared with the predicted values of the corresponding VMD–RSA–LSTM-based models; similarly, the prediction effect of the model optimized by RSA is significantly better than that of the VMD–DNC model without parameter optimization:
(27)
(28)
(29)
where is the predicted value at time i; is the measured value at time i; and is the mean value of the measured value.

Table 2 shows the error comparison of the four cities, and it can be seen that the VMD–RSA–DNC model has the best rainfall prediction results for Zhoushan, Ningbo, Taizhou, and Wenzhou, with the average root-mean-square error (RMSE) of 5.43, the average mean absolute percentage error (MAPE) of 3.59%, and the average Nash–Sutcliffe efficiency (NSE) of 0.95, which is significantly better than the other three models and has higher prediction accuracy.

Table 2

Prediction accuracy of VMD–RSA–DNC compared with other models

ModelCityRMSEMAPE (%)NSE
Zhoushan VMD–DNC 21.41 9.76 0.85 
EMD–RSA–DNC 23.10 11.60 0.85 
VMD–RSA–LSTM 19.07 8.86 0.89 
VMD–RSA–DNC 7.87 4.64 0.94 
Ningbo VMD–DNC 18.92 12.12 0.86 
EMD–RSA–DNC 19.73 12.28 0.85 
VMD–RSA–LSTM 18.25 10.08 0.91 
VMD–RSA–DNC 5.25 3.07 0.96 
Taizhou VMD–DNC 17.32 11.04 0.86 
EMD–RSA–DNC 19.15 13.10 0.84 
VMD–RSA–LSTM 14.67 8.22 0.88 
VMD–RSA–DNC 4.98 2.99 0.94 
Wenzhou VMD–DNC 10.88 9.59 0.90 
EMD–RSA–DNC 14.20 11.28 0.87 
VMD–RSA–LSTM 10.61 8.77 0.91 
VMD–RSA–DNC 3.62 3.68 0.96 
Average Value VMD–DNC 17.13 10.63 0.87 
EMD–RSA–DNC 19.05 12.06 0.85 
VMD–RSA–LSTM 15.65 8.98 0.90 
VMD–RSA–DNC 5.43 3.59 0.95 
ModelCityRMSEMAPE (%)NSE
Zhoushan VMD–DNC 21.41 9.76 0.85 
EMD–RSA–DNC 23.10 11.60 0.85 
VMD–RSA–LSTM 19.07 8.86 0.89 
VMD–RSA–DNC 7.87 4.64 0.94 
Ningbo VMD–DNC 18.92 12.12 0.86 
EMD–RSA–DNC 19.73 12.28 0.85 
VMD–RSA–LSTM 18.25 10.08 0.91 
VMD–RSA–DNC 5.25 3.07 0.96 
Taizhou VMD–DNC 17.32 11.04 0.86 
EMD–RSA–DNC 19.15 13.10 0.84 
VMD–RSA–LSTM 14.67 8.22 0.88 
VMD–RSA–DNC 4.98 2.99 0.94 
Wenzhou VMD–DNC 10.88 9.59 0.90 
EMD–RSA–DNC 14.20 11.28 0.87 
VMD–RSA–LSTM 10.61 8.77 0.91 
VMD–RSA–DNC 3.62 3.68 0.96 
Average Value VMD–DNC 17.13 10.63 0.87 
EMD–RSA–DNC 19.05 12.06 0.85 
VMD–RSA–LSTM 15.65 8.98 0.90 
VMD–RSA–DNC 5.43 3.59 0.95 

In summary, the combination of modeling with VMD has greatly improved the learning effect of DNC, while the combination of RSA technology has reduced the time step artificial selection and other aspects in the model prediction process, effectively reducing the uncertainty in the modeling process and making the model achieve good prediction results.

Based on the VMD–RSA–DNC model to carry out the rainfall prediction study of four meteorological stations in eastern Zhejiang Province and by comparing the prediction effects of different models under VMD decomposition and RSA optimization, the following conclusions were obtained:

  • (1)

    For rainfall time series with strong abrupt variability in coastal hilly areas, they are subjected to VMD processing, so that the complex rainfall time series, which are not easy to extract regularity, are decomposed into a number of regular and simple signals, and the more regular and useful information obtained from the decomposition is combined with DNC for prediction, which effectively brings into play the advantage that DNC can better perform a priori learning for long- and short-term memory data. There are several hyperparameters that need to be set manually in the prediction process, and the RSA optimization algorithm is used to solve the optimization problem, which reduces the link of artificial selection of model time step, effectively reduces the uncertainty in the modeling process and obtains a better prediction effect; the DNC selects a variant of LSTM as a trainable neural network controller and combines it with a readable and writeable external memory, so as to achieve the effect of the linear combination of short-term and long-term memory predictions, which is a breakthrough solution to the problem of memory forgetting of LSTM and more accurate prediction of rainfall time series.

  • (2)

    The maximum and minimum relative errors predicted by the VMD–RSA–DNC model are 9.62 and 0.17%, and all relative errors are less than 20%, with an average RMSE of 5.43, an average MAPE of 3.59%, and an average NSE of 0.95, and compared with the three models EMD–RSA–DNC, VMD–DNC, and VMD–RSA–LSTM, the VMD–RSA–DNC has the best prediction effect.

  • (3)

    The prediction effect of this model for the May to June rainy season and the July to September typhoon season in the coastal hilly area needs to be improved. The rainfall prediction data are not only a simple time series but also related to a variety of factors such as the geographical location of the study area. The prediction method proposed in this article is only applied to rainfall prediction in coastal hilly areas, and applying the coupled model to different characteristics of the time series for prediction is the next research direction.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

This work was supported by the Key Scientific Research Project of Colleges and Universities in Henan Province (CN) [grant numbers 17A570004].

All authors contributed to the study conception and design. Writing and editing: Xianqi Zhang and Qiuwen Yin; chart editing: Fang Liu; preliminary data collection: Haiyang Li, Haiyang Chen. All authors read and approved the final manuscript.

Abualigah
L.
,
Abd
E. M.
,
Sumari
P.
,
Geem
Z. W.
&
Gandomi
A. H.
2022
Reptile search algorithm (RSA): a nature-inspired meta-heuristic optimizer
.
Expert Systems with Applications
191
,
116158
.
Aufa
B. Z.
,
Suyanto
S.
&
Arifianto
A.
2020
Hyperparameter setting of LSTM-based language model using grey wolf optimizer
. In
Proceedings of 2020 International Conference on Data Science and Its Applications (ICoDSA)
,
5–6 August 2020, Bandung, Indonesia
, pp.
1
5
.
Bajpai
V.
,
Bansal
A.
&
Dash
S.
2023
A deep and wide neural network to predict summer monsoon rainfall using time series data
.
Concurrency and Computation: Practice and Experience
35
(
8
),
e7626
.
Graves
A.
,
Wayne
G.
,
Reynolds
M.
,
Harley
T.
,
Danihelka
I.
,
Grabska
B. A.
,
Colmenarejo
S. G.
,
Grefenstette
E.
,
Ramalho
T.
,
Agapiou
J.
,
Badia
A. P.
,
Hermann
K. M.
,
Zwols
Y.
,
Ostrovski
G.
,
Cain
A.
,
King
H.
,
Summerfield
C.
,
Blunsom
P.
,
Kavukcuoglu
K.
&
Hassabis
D.
2016
Hybrid computing using a neural network with dynamic external memory
.
Nature
538
,
471
476
.
Li
X. H.
&
Cui
D. W.
2022
Multi-step prediction of hydrological time series based on WPD-RSA-ELM model
.
Water Resources and Hydropower Engineering
53
(
11
),
69
77
.
Lv
H. F.
,
Zhao
X. H.
&
Sang
Y. T.
2020
Research on monthly runoff prediction method based on VMD-LSSVM
.
China Rural Water and Hydropower
8
,
166
170, 176
.
Masoud
S. R.
&
Faramarz
S. E.
2020
EDNC: evolving differentiable neural computers
.
Neurocomputing
412
,
514
542
.
Ming
Y.
,
Pelusi
D.
,
Fang
C. N.
,
Prasad
M.
,
Wang
Y. K.
,
Wu
D.
&
Lin
C. T.
2020
EEG data analysis with stacked differentiable neural computers
.
Neural Computing and Applications
32
,
7611
7621
.
Qiu
R.
,
Wang
Y.
,
Wang
D.
,
Qiu
W.
&
Tao
Y.
2020
Water temperature forecasting based on modified artificial neural network methods: two cases of the Yangtze River
.
Science of the Total Environment
737
,
139729
.
Rojas
A.
,
Wittenbrink
M.
,
Nieters
P.
,
Schaffernicht
E. J.
,
Keller
J. D.
&
Pipa
G.
2023
Postprocessing of NWP precipitation forecasts using deep learning
.
Weather and Forecasting
38
,
487
497
.
Santos
C. A. G.
,
do Nascimento
G. R.
,
de Farias
C. A. S.
,
da Silva
R. M.
&
Mishra
M.
2023
Short- and long-term streamflow forecasting using wavelet neural networks for complex watersheds: a case study in the Mahanadi River, India
.
Ecological Informatics
73
,
101945
.
Sun
N.
,
Zhou
J. Z.
&
Zhu
S.
2018
Application of hybrid models based on wavelet analysis and two different neural networks in prediction of monthly runoff
.
Water Resources and Power
3604
,
14
17 + 32
.
Vidyarthi
V. K.
&
Jain
A.
2023
Advanced rule-based system for rainfall occurrence forecasting by integrating machine learning techniques
.
Journal of Water Resources Planning and Management
149
(
1
),
04022072
.
Xie
S. F.
,
Su
Y. N.
&
Wang
X. Q.
2019
Genetic wavelet neural network-based GPS precipitable water prediction
.
China Science Paper
13
(
09
),
1041
1044
.
Zhang
L.
,
Liu
Z.
,
Li
L.
&
Liu
X.
2023
Research on monthly runoff prediction method based on VMD-PSR-BNN model
.
China Rural Water and Hydropower
4
,
105
113
.
Zuo
G.
,
Luo
J.
,
Wang
N.
,
Lian
Y.
&
He
X.
2020
Decomposition ensemble model based on variational mode decomposition and long short-term memory for streamflow forecasting
.
Journal of Hydrology
585
(
3–4
),
124776
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).