An algorithm, named long short-term memory (LSTM)-logistic chaos mapping chicken swarm algorithm (LCCSA), is proposed for initializing the weights and thresholds of LSTM neural networks using the Logistic chaotic mapping chicken swarm algorithm (CSA). This algorithm aims to improve mid- to long-term runoff sequence prediction in river basins. In this model, the logistic chaotic mapping method is used to initialize the chicken swarm, and LCCSA is employed to pre-train the weights and thresholds of each layer of LSTM 50 times, using the training results' initial weights of LSTM to enhance convergence accuracy and speed. Taking the Manas river and Kuitun river, two typical basins in northern Xinjiang, China, as the research objects, LSTM-LCCSA was used to forecast the mean monthly runoff in the mid- to long-term under different lag time series by using the runoff evolution data within a certain period. The example using the basin located in northern Xinjiang demonstrates the effectiveness, stability, and generality of the LSTM-LCCSA method in mid- to long-term prediction of average monthly runoff, and the prediction accuracy and universality of LSTM-LCCSA are better than other data-driven models.

  • Proposed an LSTM-LCCSA algorithm to address mid- to long-term runoff sequence prediction in river basins.

  • Initialization of the chicken swarm using a logistic chaotic mapping method improved population diversity and avoided CSA falling into a local extreme value.

  • The intercomparison of data-driven models (CM-SVM, GA-SAA, MRA, GA-FFNN) with respect to prediction accuracy and stability.

A runoff sequence is a type of high-dimensional nonlinear system that is influenced by many factors. Since the 20th century, river runoff has changed owing to the influence of urbanization development and climate change, resulting in increased uncertainty in predicting runoff (Chau & Jiang 2002). High-precision river runoff prediction can provide valuable hydrological information for downstream reservoir discharge and flood prediction (He et al. 2011; Valipour 2015; Jiang et al. 2018). In recent years, numerous scholars have used data-driven models to predict mid- to long-term runoff time series, and many notable research results have been obtained (Liu et al. 2019; Ren et al. 2022).

For runoff series prediction, the prediction accuracy declines as the prediction period is extended. Extending the prediction period while improving prediction accuracy is the primary focus of contemporary mid- to long-term runoff prediction research. In a study on extending the prediction period, Abrahart & See (2000) discussed the use of artificial neural networks (ANN) to extend the prediction period very early. As machine learning and deep learning theory have improved, the prediction capability of neural network models in the mid- to long-term prediction of hydrology has improved as well. For prediction using machine learning and deep learning technology, the support vector machine (SVM) has been used by many scholars in flood prediction (Bafitlhile & Li 2019), monthly average runoff, and inflow prediction of dam reservoirs (Kisi 2008; Awchi 2014; Ghorbani et al. 2016; Karami et al. 2017; Babaei et al. 2019). These studies have demonstrated that SVM exhibits strong capabilities regarding runoff prediction. Furthermore, a variety of neural network models have been used to conduct comparative studies of daily runoff prediction (Kisi 2009a, 2009b, 2009c; Nacar et al. 2018; Modaresi et al. 2018), and these studies have demonstrated that various neural network models achieve different prediction results owing to different fitting capabilities. In general, neural network models fit nonlinear problems well. In addition, Bayesian regression (BR) and the adaptive network-based fuzzy inference system (ANFIS) have been used for runoff prediction (Hamaamin et al. 2016), and the prediction results indicate that the performances of BR and ANFIS are satisfactory at the global level; their reported Nash–Sutcliffe efficiencies are 0.99 and 0.97, respectively.

To improve the prediction accuracy after extending the prediction period, Delafrouz et al. (2018) proposed a new spatial reconstruction technique hybrid neural network (GEPNN) to predict the daily runoff in three river basins: the Mississippi, Pearl River, and Muskingum River in the USA. Combining phase space hybrid model reconstruction with a hybrid neural network model, the daily runoff of river measurement stations involving the three rivers was predicted. Hybrid neural networks are considered to have higher predictive power than pure artificial neural networks. For example, a phase space hybrid model based on chaotic dynamics can yield better prediction results than an ANN. Heuristic algorithms have been evaluated to analyze the runoff evolution law of time series with uncertainty in a river basin and provide new ideas for non-stationary processes treatment in hydrological series (Venzon et al. 2018; Wang et al. 2018; Yang et al. 2018). Using rainfall and climate information as input, two stations in the Dongjiang River Basin in Southern China were selected as the research objects, and a wavelet neural network and support vector hybrid model (BWS) were used to predict river runoff (Liu et al. 2015). The results indicate that BWS can provide detailed information regarding prediction uncertainty, which has provided further inspiration to improve the accuracy of extreme runoff evolution prediction. To analyze non-stationary hydrological series, Latifoglu et al. (2015) combined spectral analysis and an ANN to achieve better prediction results than by using a single algorithm. They discussed the importance of hybrid models in predicting non-stationary hydrological series as well. Although the models improved the prediction accuracy to a certain extent, the increasing complexity of human activities and the non-stationary nature and randomness of runoff series have increased. As the forecast cycle gets longer over time in a real system, which increases the need for models that can skilfully adapt to changes in runoff, failure to do so would result in decreased predictive accuracy. Human activity, shifting climate patterns, and changing terrain all directly impact runoff, rendering the forecast yet more complex. Hence, enhancing forecast precision and broadening the prediction period are significant for runoff prediction.

Therefore, fully analyzing the relationship among runoff time, factors of climate, topography, and other factors in the basin, and revealing the evolution rules of a series would be important for the prediction period extending and prediction accuracy increasing. In this paper, we used the logistic chaotic mapping chicken swarm algorithm (CSA) to pre-train the initial weights and thresholds of LSTM and propose the LSTM-logistic chaos mapping chicken swarm algorithm (LCCSA) algorithm. To the best of our knowledge, this is the first study to apply the LSTM-LCCSA algorithm to mid- to long-term prediction of hydrological runoff series. The prediction results on the hydrological sequence of a river basin in northern Xinjiang have been used to verify the effectiveness, stability, and generality of the LSTM-LCCSA algorithm in predicting mid- to long-term runoff series.

Study data

The Manas River and Kuitun River Basins in Northern Xinjiang of China were selected as representative study objects, and their geographical locations are shown in Figure 1. The Manas River is located north of Tianshan Mountain, in the hinterland of Eurasia, between N43°27′–45°21′ and E85°01′–86°32′. It is located in a semi-arid area. The river bottom elevation specifies that the river height varies between 170 and 5,242 m, with a total length of 420 km, annual average temperature of 4.7–5.7 °C, and spans a total area of 3.099104 km2. The Kuitun river basin is located between N43°30′–45°04′ and E83°22′–85°47′. The main concourse has a total length of 360 km and is composed of three rivers. The basin area spans 2.83104 km2, and the annual average temperature is 7 °C. The average temperature is −16 °C in January and 26 °C in July. The spatial and temporal distribution of annual precipitation is inconsistent.
Figure 1

Locations of the study areas (red circles) for the Manas River and Kuitun River. Please refer to the online version of this paper to see this figure in colour: https://dx.doi.org/10.2166/wcc.2023.435.

Figure 1

Locations of the study areas (red circles) for the Manas River and Kuitun River. Please refer to the online version of this paper to see this figure in colour: https://dx.doi.org/10.2166/wcc.2023.435.

Close modal
Based on the analysis of the physical causes of runoff evolution in the northern Xinjiang basin, 21 basic datasets affecting runoff evolution from 1955 to 2011 were obtained. The evolution trends of monthly runoff in the two basins are shown in Figure 2. The physical factors related to runoff evolution include those represented by meteorological data (temperature (X1), humidity (X2), air pressure (X3), flow intensity (X4), wind level (X5), sunshine (X6), radiation (X7), rainfall (X8), and evapotranspiration (X9)), environmental index data (green coverage evolution (X10), air quality change (X11)), urbanization index data (urban and rural consumption change (X12), per capita disposable income change (X13), change in per capita net income (X14), total agricultural machinery power (X15), tertiary industry output value change (X16)), atmospheric circulation data (Arctic Oscillation Index (X17), North Atlantic Oscillation Index (X18), mean grid height in southern China under 500 hPa (X19), mean sea temperature in Japan archipelago and South Pacific (X20)), and runoff (X21). Basic data sources are the National Oceanic and Atmospheric Administration of the United States (https://www.noaa.gov/), National Climate Center of China Meteorological Administration (http://www.cma.gov.cn), urbanization data monitoring and sharing platform (http://www.sdxxczh.cn), environmental protection data center of the People's Republic of China, and the private pm25.in, water conservancy project construction administration of Kenswat station (located in Shihezi City, Xinjiang).
Figure 2

Evolution trend of average daily runoff of (a) Manas river and (b) Kuitun River.

Figure 2

Evolution trend of average daily runoff of (a) Manas river and (b) Kuitun River.

Close modal

Problem discussion

The river runoff is measured at distinct intervals , and n groups of ordered sequences with time t as the parameter are obtained: . Sequences consist of the monthly runoff of Kenswat station from 1955 to 2011, and the monthly high and low extreme runoffs were evaluated. A total of 672 months of data were used for model training. In the interim prediction, the runoff of 12 months was predicted. 80% of the data were used as the training set, 18.2% as the verification set, and 1.8% as the prediction set. In the long-term forecast, we predicted the runoff of 24 months, using 80% of the data as the training set, 16.4% as the verification set and 3.6% as the prediction set. In recent years, owing to the strong influence of climate change and human activities, the trend component and transient component of the runoff time series exhibited strong non-stationarity and randomness. The runoff non-stationary series can be simplified as , where and are functions determined at time t, representing the mathematical expectation and variance of the change in over time, and is a stationary stochastic process. Therefore, the analysis of runoff time series is a process based on extracting a deterministic function and processing a random function . According to the rules of monthly runoff and extreme runoff evolution, the physical causes of runoff series formation are inferred and possible values of future runoff are predicted, especially in the medium and long terms. Because the runoff process is a complex non-stationary random process, it is very important to fit the nonlinear process of the model when making prediction fitting. The general form of a hydrological time series model is , where is the random disturbance term in the model. If the random disturbance term is a white noise, the model is denoted as
If it is not a white noise, a general autoregressive moving average process model is

The main goal is to use a model with strong fitting capability and long short-term memory (LSTM) capability to explore the evolution law of a hydrological sequence and then predict the future runoff.

Strategies for LCCSA

The CSO algorithm is an intelligent algorithm used to solve the NP-complete (NP-C) problem proposed by MENG in 2014 that is inspired by the hierarchy and intelligent behavior of foraging chickens (Meng et al. 2014). The CSO algorithm abstracts the hierarchical order and foraging behavior of groups of chickens and transforms the solution of a global optimization problem into a process involving N individual chicken groups consisting of one rooster and a set of hens and chicks foraging in a space defined by . For each iteration, the individuals with corresponding roles in a chicken group move according to their corresponding speed and position update mode to achieve the goal of moving the entire chicken group closer to the food. In the iterative process, roosters independently move to a position where the fitness value continuously improves, and the position is updated based on Equation (1). In the algorithm, hens can follow the roosters in any subgroup to find food during the foraging period. They can also steal the food found by other chickens. Moreover, the adaptability of hens with higher fitness values is stronger than that of hens with low fitness values. Therefore, in the iterative process, the position update of hens is relatively complex, and the update strategy followed is defined by Equation (2). From the position update strategy, roosters with higher are more likely to find food. In each subpopulation, all chicks follow their mothers to search for food. In the process of evolution, the position of the chicks is constantly updated, and the updating rules are applied using Equation (3). The global optimal solution of the problem is to select the chicken that has the strongest foraging ability and is most likely to find food – that is, the target solution. The search method for the solution under random initialization is shown in Figure 3(a). As the problem size increases, the CSO algorithm is likely to fall into a local extremum because the initial solution set is randomly generated. A good initial solution set selection strategy can effectively prevent the CSO algorithm from falling into a local extremum.

Using chaotic mapping to generate chaotic sequences is one method for selecting the initial solution, which can effectively prevent the CSO algorithm from falling into a local extremum solution. In the case of decreasing population diversity, the ergodicity of the chaotic map is used to generate a Logistic chaotic map sequence, which improves the capability of the algorithm to escape from a local solution. Figure 3(b) shows the process used to search for solutions after Logistic chaotic mapping. The mode of Logistic chaotic mapping is shown in Equation (4), where the form of the system dynamics is different when the value of m is different. To eliminate the influence of the initial value as much as possible, we first let the system iterate t times, and then use a chaotic sequence to generate the -th generation initial population.
Figure 3

CSA and LCCSA initial solution optimization process.

Figure 3

CSA and LCCSA initial solution optimization process.

Close modal
The strategy for generating a uniform initial value is given by Equation (5), and the algorithm is described as Figure 4.
(1)
Figure 4

Chicken swarm algorithm flow chart.

Figure 4

Chicken swarm algorithm flow chart.

Close modal
In Equation (1), represents the random value of the Gaussian distribution satisfying and and represents the fitness value of the rooster.
(2)
In Equation (2), is the hen following a rooster during foraging and is a random individual selected from the roosters or hens, where .
(3)
In Equation (3), is the mother hen of the chick and is the following coefficient, having a random value between [0,2].
(4)
(5)

In Equations (4) and (5), is the chaotic sequence generated by the chaotic mapping, is the variable in space , and are the upper and lower bounds of the problem, respectively.

LSTM dynamic neural network

LSTM is a recursive neural network proposed by Hochreiter & Schmidhuber (1997) . It has the characteristics of LSTM and exhibits various advantages for time series problems. The basic structure of a modern LSTM is shown in Figure 5. An LSTM recursive network is composed of an input layer, LSTM layer, hidden layer, and output layer. Backpropagation through time (BPTT) was adopted for network training (Grau et al. 2013). When a traditional recurrent neural network learns gradients, the gradient disappears or explodes because of the weight association characteristics in the recursive layer, resulting in poor or non-convergence of the entire network, which negatively affects network performance. LSTM adopts constant error carousels (CEC) to combat gradient disappearance (Hochreiter & Schmidhuber 1997), which introduces two problems. One is input weight conflicts, and the other is output weight conflicts. Therefore, adding control gates in the hidden layer neurons is used to handle different gradient signals in LSTM. That is, LSTM uses special and complex hidden neurons to provide improvements over traditional RNNs. The introduction of an input gate, forget gate, and output gate not only effectively remembers short-term signals but also provides a strong memory function for the long-term signals in learning data. Meanwhile, for longer-term useless signals, its gradient information can be effectively controlled by the forget gate. This control method effectively solves the decaying gradient signal problem in RNN because the current decision is not allowed to rely on the earlier gradient signal.
Figure 5

Modern LSTM network structure.

Figure 5

Modern LSTM network structure.

Close modal

Strategies for LSTM-LCCSA

To achieve target accuracy, LSTM neural network training should gradually reduce the error between the network output value and the expectation by routinely adjusting the weights and thresholds between layers. The topological structure, learning rate, and random index of each layer in the LSTM network are all part of the hyperparameter in the neural network. Each group of individuals in a chicken swarm represents a type of hyperparameter configuration of LSTM. During training, the training accuracy of the neural network is measured by the least-square method. The best individual in the solution space searched by CSA becomes the initial parameter value of the network. LSTM-LCCSA undertakes the process of pre-training network weights using LCCSA to solve the problem of setting initial weights in LSTM. The training process is shown in Figure 6. In LSTM-LCCSA, after training the LSTM 50 times with LCCSA, the initial weight values obtained were used as the initial values of the LSTM dynamic network training. This can improve the convergence speed of LSTM and prevent it from falling into a local extreme value during the training process. In deep learning hydrological time series, good initial weights can obtain more profound sequence evolution rules, provide a better mining effect on sequence analysis problems, and achieve more accurate prediction effects on future sequences.
Figure 6

LSTM-LCCSA algorithm flow chart.

Figure 6

LSTM-LCCSA algorithm flow chart.

Close modal

Parameter calibration

To study the evolution of monthly runoff, data were divided according to different lag time series. Two sets of time series, and , were selected by a 5-month sliding window as the basic unit. In this case, and . included 136 group samples, contained 171 group samples, and each group contained 12 months of monthly average data. When the prediction period is mid-term, the prediction period was 12 months, including 135 groups of training samples. When the prediction period was long-term, it was 24 months, including 169 groups of training samples. In the mid- to long-term forecast period, the problems affecting overall prediction accuracy are mainly poor prediction when large evolutions of high and low runoff occur in special months, which lead to an increase of the overall prediction error. It is necessary to study the special runoff series to improve the prediction accuracy. Using the sliding window method, a 5-month window was selected as the basic unit to correspond with and in the strict time scale, and the special runoff series was then divided into two groups: and , where and . is a sample set of special series including 136 groups within a mid-term prediction period and is a sample set including 171 groups within a long-term prediction period. Under the strict time scales of T and S, 135 groups of the mid-term training set and 169 groups of the long-term training set were divided.

To compare the effectiveness of LSTM-LCCSA in hydrological series processing, various current popular prediction algorithms were selected to compare results. Selecting the chaotic mapping support vector machine (CM-SVM) (Tharwat & Hassanien 2018), genetic algorithm-simulated annealing algorithm (GA-SAA) (Zhong 2017), mixture regression algorithm (MRA) (Blanchard 1999), and the genetic algorithm optimization feedforward neural network (GA-FFNN) (Tsai et al. 2006), four data-driven models for sequence prediction were used for comparison. The comparison indexes are as follows: mean value (), mean square deviation (), coefficient of variation (), coefficient of skewness (), square of correlation coefficient (), mean absolute error (), range (), rank comprehensive score (), number of parameter calibrations (), and model training time (). The calculation method for each index is shown in Table 1.

Table 1

Calculation method for each index

    
    
    
    

Data pre-processing

After combining the basic data of impact factors from different sources, many autocorrelations and information redundancy were observed between the factors. This could lead to a decrease in the accuracy of the sequence prediction and reduce the robustness of the model. Some noise sources can make the model extremely sensitive to changes in some variables during the prediction process. Therefore, correlation analysis and factor analysis (Bradford 1976) were applied to sequence samples to remove any autocorrelations among factors in the samples, reduce information redundancy between factor sets, and remove abnormal noise caused by measurement. Data pre-processing is primarily employed to develop a new set of factor patterns.

Through factor analysis pre-processing, the resulting 20-dimensional impact factors are combined into four types of common factor patterns (F1, F2, F3, and F4). The load matrix of the 20 physical factors to the common factor pattern set is rotated by variance maximization, and the results are shown in Table 2. After factor analysis, the original high-dimensional information data are greatly reduced, which in turn reduces the noise caused by collinear redundancy and strong correlations among the original factors. The four common factor pattern sets have certain mathematical statistical characteristics and satisfy the weighted linear combination of physical factors under certain laws, which have strong practical interpretation significance.

Table 2

Load matrix after variance maximization rotation

F1F2F3F4
X1 0.049 0.037 0.0033 −0.0001 
X2 0.012 0.021 −0.0017 0.00011 
X3 0.001 − 0.019 −0.0021 −0.0002 
X4 0.001 0.012 0.0005 −0.0001 
X5 0.0007 0.0004 0.0002 0.00001 
X6 0.015 0.016 −0.0007 0.0001 
X7 0.0003 0.012 0.0021 0.00001 
X8 0.0042 0.072 0.0014 0.00001 
X9 −0.005 −0.001 0.0012 0.09001 
X10 −0.004 −0.0001 0.019 0.00001 
X11 0.015 −0.002 0.078 0.00001 
X12 0.031 −0.0001 0.00002 0.00011 
X13 0.059 0.0002 0.023 0.00024 
X14 0.025 0.0002 0.00001 0.00013 
X15 0.017 0.0002 0.00002 0.0001 
X16 0.043 −0.00016 − 0.016 0.0071 
X17 0.001 −0.0001 0.0001 0.0011 
X18 −0.0002 0.00017 0.00002 0.022 
X19 0.00005 −0.0001 0.00003 0.001 
X20 0.0003 −0.0003 0.00002 0.0013 
X21 0.091 0.056 0.0001 0.00001 
Variance contribution rate 0.426 0.389 0.117 0.065 
Cumulative contribution rate 0.426 0.815 0.932 0.997 
F1F2F3F4
X1 0.049 0.037 0.0033 −0.0001 
X2 0.012 0.021 −0.0017 0.00011 
X3 0.001 − 0.019 −0.0021 −0.0002 
X4 0.001 0.012 0.0005 −0.0001 
X5 0.0007 0.0004 0.0002 0.00001 
X6 0.015 0.016 −0.0007 0.0001 
X7 0.0003 0.012 0.0021 0.00001 
X8 0.0042 0.072 0.0014 0.00001 
X9 −0.005 −0.001 0.0012 0.09001 
X10 −0.004 −0.0001 0.019 0.00001 
X11 0.015 −0.002 0.078 0.00001 
X12 0.031 −0.0001 0.00002 0.00011 
X13 0.059 0.0002 0.023 0.00024 
X14 0.025 0.0002 0.00001 0.00013 
X15 0.017 0.0002 0.00002 0.0001 
X16 0.043 −0.00016 − 0.016 0.0071 
X17 0.001 −0.0001 0.0001 0.0011 
X18 −0.0002 0.00017 0.00002 0.022 
X19 0.00005 −0.0001 0.00003 0.001 
X20 0.0003 −0.0003 0.00002 0.0013 
X21 0.091 0.056 0.0001 0.00001 
Variance contribution rate 0.426 0.389 0.117 0.065 
Cumulative contribution rate 0.426 0.815 0.932 0.997 

Note: Each index in Table 2 is shown as follows: temperature (X1), humidity (X2), air pressure (X3), flow intensity (X4), wind level (X5), sunshine (X6), radiation (X7), rainfall (X8), evapotranspiration (X9), green coverage evolution (X10), air quality change (X11), urban and rural consumption change (X12), per capita disposable income change (X13), change in per capita net income (X14), total agricultural machinery power (X15), tertiary industry output value change (X16), Arctic Oscillation Index (X17), North Atlantic Oscillation Index (X18), mean grid height in southern China under 500 hPa (X19), mean sea temperature in Japan archipelago and South Pacific (X20), runoff (X21). And the bold values in table 2 represent the values with the maximum weight in the dataset.

Four types of common factors explained 99.7% of the original information. Among them, F1 and F2 have the highest variance contribution rate and the strongest explanatory significance, indicating that human activities have a great influence on runoff evolution. The contribution rate of meteorology and atmospheric circulation is second, further indicating that the evolution of runoff time series must consider the strong influence of human activities because human activities will cause climate change, affect local circulation in the basin control area, lead to changes in precipitation, snowmelt, evapotranspiration, and will cause changes in the underlying sequence. Therefore, the new influence mode F = (F1, F2, F3, F4) should focus on the influence of F1 on the evolution of hydrological sequences.

Five models: LSTM-LCCSA, CM-SVM, GA-SA, MRA, and GA-FFNN undergo the parameter calibration method presented in Section 2.3.4 and the new impact factor model obtained is discussed in Section 2.3.5. The five models are tested 20 times each, respectively, and the average value of the 20 tests was obtained as the prediction result for each model.

Figure 7(a) shows the forecast results of monthly runoff when the forecast period is mid-term (the forecast time is 12 months). This part of the forecast represents the trend component of the average runoff series of the corresponding forecast month. Figure 8(a) shows the prediction results for the special runoff evolution years corresponding to the time scale in Figure 7(a). This part of the forecast results represents the transient component of the hydrological series of the corresponding forecast month. Combining the trend component and transient component, we can predict the final forecast trend of the five models in the selected forecast month, as shown in Figure 9(a). In the long forecast period (the average runoff of 24 months is the forecast target), the forecast results of the five models are shown in Figure 7(b). Figure 8(b) shows the prediction results for the special runoff evolution years corresponding to the time scale in Figure 7(b). This part of the forecast represents the transient components of the monthly hydrological series in the long forecast period. After considering the trend components and transient components in Figures 7(b) and 8(b), the overall forecast trends of the five models (Figure 9(b)) are obtained for the long forecast period.
Figure 7

Mid- to long-term forecast results of monthly average runoff for Manas River.

Figure 7

Mid- to long-term forecast results of monthly average runoff for Manas River.

Close modal
Figure 8

Mid- to long-term forecast results of monthly extreme runoff for Manas River.

Figure 8

Mid- to long-term forecast results of monthly extreme runoff for Manas River.

Close modal
Figure 9

Mid- to long-term forecast general trend for Manas River.

Figure 9

Mid- to long-term forecast general trend for Manas River.

Close modal

Analysis of effectiveness

The validity of LSTM-LCCSA was analyzed from two aspects: accuracy and time efficiency. From Figures 79, the differences between the five algorithms and the real measured values regarding the overall prediction trend are small. This indicates that the overall forecast is good. The prediction accuracy of the five algorithms was measured using correlation and mathematical statistics between the predicted value and the real value.

The forecast results in indexes in the mid-term are shown in Table 3. From the R values, LSTM-LCCSA improves by 3, 3, 5, and 4% compared to CM-SVM, GA-SA, and MRA,GA-FFNN, the is reduced by −0.04, 2.19, 1.62, and 0.22, and is reduced by 6.8, 7.5, 10.1, and 7.6, respectively. Compared with the overall mean of 30.89, the LSTM-LCCSA mean value is closer to the real mean. This indicates that LSTM-LCCSA achieves higher prediction accuracy than the other four models in the mid-term period. Using a different model, Zhang et al. (2011) used the differential autoregressive moving average (ARIMA) model to predict annual average runoff, and their SSA-ARIMA model achieved R = 0.814. In comparison, the value achieved by LCCSA-LSTM is 17% higher than SSA-ARIMA. Therefore, the prediction effect of the proposed LCCSA-LSTM is better than ARIMA for processing non-stationary sequences. From the perspective of time efficiency, LSTM-LCCSA requires 45 network parameters to be calibrated, while the CM-SVM, GA-SA, MRA, and GA-FFNN network parameters to be calibrated are 33, 28, 21, and 38, respectively. Compared with the other four algorithms, the complexity of LSTM-LCCSA is not much improved. Under the same hardware and software conditions, from the convergence time point of view, LSTM-LCCSA, CM-SVM, GA-SA, MRA, and GA-FFNN take 1,260, 1,180, 1,066, 1,332, and 1,120 s to run, respectively. The training time of LSTM-LCCSA is acceptable; therefore, LSTM-LCCSA has acceptable efficiency.

Table 3

Short-term forecast results effectiveness indexes for Manas River

Mid-term(s)
LSTM-LCCSA 0.99 0.46 29.62 10.36 45 1,260 
TOSVM 0.96 0.42 28.10 17.16 33 1,180 
GA-SAA 0.96 2.65 27.78 17.84 28 1,066 
MRA 0.94 2.08 27.46 20.48 21 1,332 
GA-FFNN 0.95 0.68 28.03 17.92 38 1,120 
Mid-term(s)
LSTM-LCCSA 0.99 0.46 29.62 10.36 45 1,260 
TOSVM 0.96 0.42 28.10 17.16 33 1,180 
GA-SAA 0.96 2.65 27.78 17.84 28 1,066 
MRA 0.94 2.08 27.46 20.48 21 1,332 
GA-FFNN 0.95 0.68 28.03 17.92 38 1,120 

The forecast result indicators for long-term prediction are shown in Table 4. Compared with CM-SVM, GA-SA, MRA, and GA-FFNN, LSTM-LCCSA increased by 5, 7, 6, and 2% in , the decreased by 2.99, 2.09, −0.55, and 2.71, decreased by 5.55, 5.40, 11.25, and 6.07, respectively, and the mean value was closer to the real mean value of 32.93.

Table 4

Long-term forecast results effectiveness indexes for Manas River

Long-term(s)
LSTM-LCCSA 0.98 1.97 28.98 13.07 45 1,220 
TOSVM 0.93 4.96 24.50 18.62 33 1,010 
GA-SAA 0.91 4.06 24.67 18.47 28 1,165 
MRA 0.92 1.42 25.02 24.32 21 1,310 
GA-FFNN 0.96 4.68 25.26 19.14 38 1,072 
Long-term(s)
LSTM-LCCSA 0.98 1.97 28.98 13.07 45 1,220 
TOSVM 0.93 4.96 24.50 18.62 33 1,010 
GA-SAA 0.91 4.06 24.67 18.47 28 1,165 
MRA 0.92 1.42 25.02 24.32 21 1,310 
GA-FFNN 0.96 4.68 25.26 19.14 38 1,072 

The reason LSTM-LCCSA exhibits greater accuracy than and comparable complexity to the other four algorithms is that chaotic mapping produces more uniform effective initial values, improving CSA's capability to escape a local extremum solution. Concurrently, compared with ordinary RNN, LSTM solves the problem of input weight conflicts and output weight conflicts by using control gates. The network can not only remember the short-term learning signals but can also learn the longer signals. Any early useless signals can be controlled using the forget gate, so that later useless signals cannot disturb the current sequence judgment. Therefore, LSTM-LCCSA has high accuracy and strong prediction validity in time series prediction problems.

Analysis of stability

In actual predictions, runoff evolution will be affected by many uncertain factors and measurement errors, which will increase uncertainty in hydrological series prediction. Therefore, the prediction stability of the model must be analyzed. The forecast stability indexes in the mid-term are shown in Table 5. The relative dispersion degree of the prediction sequence and the measured sequence can reflect the variation of runoff, and the dispersion between the prediction sequence and the measured sequence can be observed.

Table 5

Mid-term forecast results in stability indexes for Manas River

Mid-term
LSTM-LCCSA 0.34 2.12 86.40 88 
TOSVM 0.61 4.31 80.14 83 
GA-SAA 0.64 3.90 79.03 79 
MRA 0.74 2.31 77.13 77 
GA-FFNN 0.63 3.70 77.65 82 
Mid-term
LSTM-LCCSA 0.34 2.12 86.40 88 
TOSVM 0.61 4.31 80.14 83 
GA-SAA 0.64 3.90 79.03 79 
MRA 0.74 2.31 77.13 77 
GA-FFNN 0.63 3.70 77.65 82 

Regarding Cv, LSTM-LCCSA decreased by 0.27, 0.30, 0.40, and 0.29 compared with CM-SVM, GA-SA, MRA, and GA-FFNN, respectively. This result indicates that the sequence predicted by LSTM-LCCSA is closer to and is concentrated around the actual measured sequence. The value of CS can reflect the degree of symmetry of the time series on both sides of the mean, and it is an important parameter for measuring the degree of symmetry of the sequence. If CS = 0, the probability of the sequence being symmetrical on both sides of the mean value is very high, and the stability of the sequence is stronger. CS > 0 indicates positive sequence bias, while CS < 0 indicates negative sequence bias. Compared with the other four models, the CS value of LSTM-LCCSA is closer to 0, which indicates that the symmetry of the prediction sequence on both sides of the mean value is more concentrated and the sequence stability is better. The larger range of the predicted value indicates the weaker sensitivity of the prediction model. To further explore the stability of the model, we use the rank scoring method, where the higher rank score indicates the better prediction effect of the model. LSTM-LCCSA scored higher than CM-SVM, GA-SA, MRA, and GA-FFNN in the RS by 5, 9, 11, and 6 points, respectively.

According to the analysis of the various indexes, the prediction stability of the LSTM-LCCSA model is acceptable when the prediction period is mid-term. The stability indexes under long-term prediction are shown in Table 6. Compared with CM-SVM, GA-SA, MRA, and GA-FFNN, the of LSTM-LCCSA is lower by 0.34, 0.36, 0.42, and 0.38, is closer to 0, and the comprehensive rank score is higher by 2, 8, 6, and 3, respectively. Therefore, the prediction using the LSTM-LCCSA model is considered to be stable and reliable in the long-term.

Table 6

Long-term forecast results stability indexes for Manas River

Long-term
LSTM-LCCSA 0.40 4.21 84.68 82 
TOSVM 0.74 6.51 81.2 80 
GA-SAA 0.76 5.82 82.71 74 
MRA 0.82 4.68 81.4 76 
GA-FFNN 0.78 5.78 81.79 79 
Long-term
LSTM-LCCSA 0.40 4.21 84.68 82 
TOSVM 0.74 6.51 81.2 80 
GA-SAA 0.76 5.82 82.71 74 
MRA 0.82 4.68 81.4 76 
GA-FFNN 0.78 5.78 81.79 79 

Analysis of generality

To further analyze the generality of LSTM-LCCSA under mid- to long-term forecasting, the monthly average runoff data from 1974 to 2017 and the pre-processed physical factor data of Kuitun River Basin, another typical basin in northern Xinjiang, were used for algorithm training. The prediction is carried out by the same lag timing division method as discussed in Section 3.1., and the forecast results of the Kuitun River Basin are obtained under different prediction periods.

The monthly runoff forecast results obtained for the mid-term forecast period are shown in Figure 10(a), and the forecast results in special runoff evolution years are shown in Figure 11(a). After integrating the two figures, the overall trend of the mid-term forecast of the Kuitun River Basin is shown in Figure 12(a). It shows that the predicted results for each month fall within the coverage rate of 90% confidence, which indicates that the prediction results in the mid-term prediction period are acceptable for predicting the overall trend. Table 7 shows the prediction validity and stability indexes in the mid-term. It can be seen from the index results that the prediction of LSTM-LCCSA for the Kuitun River Basin is feasible in the mid-term, which indicates that the model has strong generality, as it has been successfully applied to two river basins in Xinjiang.
Table 7

Mid- to long-term forecast results indexes for Kuitun River

(s)
Mid-term 0.98 1.32 27.35 13.95 1,033 0.34 2.11 65.7 84 
Long-term 0.93 3.72 26.27 17.17 1,202 0.59 5.31 68.3 79 
(s)
Mid-term 0.98 1.32 27.35 13.95 1,033 0.34 2.11 65.7 84 
Long-term 0.93 3.72 26.27 17.17 1,202 0.59 5.31 68.3 79 
Figure 10

Mid- to long-term forecast results of monthly average runoff for Kuitun River.

Figure 10

Mid- to long-term forecast results of monthly average runoff for Kuitun River.

Close modal
Figure 11

Mid- to long-term forecast results of monthly extreme runoff for Kuitun River.

Figure 11

Mid- to long-term forecast results of monthly extreme runoff for Kuitun River.

Close modal
Figure 12

Mid- to long-term forecast general trend for Kuitun River.

Figure 12

Mid- to long-term forecast general trend for Kuitun River.

Close modal

In the long-term, the monthly runoff of the Kuitun River Basin is also predicted. Figure 10(b) shows the forecast results of monthly runoff. Under the same time scale, Figure 11(b) shows the forecast results of the special runoff series. After integrating the two figures, the overall forecast trend of the Kuitun River Basin in the long-term is shown in Figure 12(b). It can be seen that the forecast results of almost every month fall within the 85% confidence coverage. From the overall trend results in the long-term, the forecast results are acceptable. Table 7 shows the validity and stability indexes of the prediction series in the long-term prediction period. From the index results, the prediction effect of LSTM-LCCSA is suitable in the long-term, further indicating that the prediction universality of the model is strong.

Therefore, after this comprehensive analysis, it was concluded that LSTM-LCCSA has strong universality in the mid- to long-term runoff time series forecast of the river basins in northern Xinjiang and therefore can be applied to the runoff forecast of other typical basins in northern Xinjiang.

In summary, a hybrid LSTM-LCCSA algorithm is proposed to solve the mid- to long-term runoff time series prediction problem. Compared with CM-SVM, GA-SA, MRA and GA-FFNN, LSTM-LCCSA exhibits higher accuracy and stability in predicting mid- to long-term runoff time series. The predictive stability of LSTM-LCCSA was tested by Manas river data and the generality was verified by Kuitun River. The LSTM-LCCSA hybrid algorithm is proposed to solve the mid- to long-term runoff time series prediction problem in the northern basin of Xinjiang, China. The number of parameter calibration frequency of LSTM-LCCSA model is higher than that of the other four models. In the future, we will investigate the method of improving parameter calibration, and continue to apply LCCSA-LSTM to the prediction of watershed runoff series. In the future, LCCSA-LSTM will continue to be applied to the prediction of runoff series in the basin, the further exploration of the evolution pattern and main influencing factors of runoff in months will be studied to improve the practicability of LCCSA-LSTM.

The financial support from the Special application science and technology project of the 7th division in Xinjiang Bingtuan (2021A03008) and the Special application science and technology project of the 1st division in Xinjiang Bingtuan (2022A007) are gratefully acknowledged.

W.Y.Y. and C.X.M. conceptualized the whole article, developed the methodology, and bought the software; X.G.G. and X.T.F. rendered support in data curation and wrote the original draft preparation; X.T.F. and C.X.M. visualized the process, investigated the project; J.F.L. supervised the article; J.F.L. and W.Y.Q. bought the software, validated the article; J.F.L. and W.Y.Q. wrote the review and edited the article.

All relevant data are available from an online repository or repositories.

The authors declare there is no conflict.

Blanchard
G.
1999
The ‘progressive mixture’ estimator for regression trees
.
Annales de L'Institut Henri Poincare (B) Probability and Statistics
35
(
6
),
793
820
.
Ghorbani
M. A.
,
Zadeh
H. A.
,
Isazadeh
M.
&
Terzi
O.
2016
A comparative study of artificial neural network (MLP, RBF) and support vector machine models for river flow prediction
.
Environmental Earth Sciences
75
(
6
),
1
14
.
Grau
I.
,
Nápoles
G.
,
Bonet
I.
&
García
M. M.
2013
Backpropagation through time algorithm for training recurrent neural networks using variable length instances (Algoritmo de retropropagación a través de tiempo para el aprendizaje de redes neuronales recurrentes usando instancias de longitud variable)
.
Computación Y Sistemas
17
(
1
),
15
24
.
Hamaamin
Y. A.
,
Nejadhashemi
A. P.
,
Zhang
Z.
,
Giri
S.
&
Woznicki
S. A.
2016
Bayesian regression and neuro-fuzzy methods reliability assessment for estimating streamflow
.
Water
8
(
7
),
287
.
He
J.
,
Valeo
C.
,
Chu
A.
&
Neumann
N. F.
2011
Prediction of event-based storm water runoff quantity and quality by ANNs developed using PMI-based input selection
.
Journal of Hydrology
400
(
1–2
),
10
23
.
Hochreiter
S.
&
Schmidhuber
J.
1997
Long short-term memory
.
Neural Computation
9
(
8
),
1735
1780
.
Latifoglu
L.
,
Kisi
O.
&
Latifoglu
F.
2015
Importance of hybrid models for forecasting of hydrological variable
.
Neural Computing & Applications
26
(
7
),
1669
1680
.
Liu
Y.
,
Ye
L.
,
Qin
H.
,
Ouyang
S.
,
Zhang
Z.
&
Zhou
J.
2019
Middle and long-term runoff probabilistic forecasting based on Gaussian mixture regression
.
Water Resources Management
33
,
1785
1799
.
Meng
X.
,
Liu
Y.
,
Gao
X.
,
Zhang
H.
,
2014
A New bio-inspired algorithm: chicken swarm optimization
. In:
Advances in Swarm Intelligence, Pt1
(
Tan
Y.
,
Shi
Y.
&
Coello
C. A. C.
, eds). Lecture Notes in Computer Science, Vol. 8794. Springer, Cham. pp.
86
94
.
Nacar
S.
,
Hinis
M. A.
&
Kankal
M.
2018
Forecasting daily streamflow discharges using various neural network models and training algorithms
.
Ksce Journal of Civil Engineering
22
(
9
),
3676
3685
.
Ren
Y.
,
Zeng
S.
,
Liu
J.
,
Tang
Z.
,
Hua
X.
,
Li
Z.
,
Song
J.
&
Xia
J.
2022
Mid- to long-term runoff prediction based on deep learning at different time scales in the Upper Yangtze River Basin
.
Water
14
(
11
),
1692
.
https://doi.org/10.3390/w14111692
.
Tharwat
A.
&
Hassanien
A. E.
2018
Chaotic antlion algorithm for parameter optimization of support vector machine
.
Applied Intelligence
48
(
3
),
670
686
.
Tsai
J. T.
,
Chou
J. H.
&
Liu
T. K.
2006
Tuning the structure and parameters of a neural network by using hybrid Taguchi-genetic algorithm
.
IEEE Transactions on Neural Networks
17
(
1
),
69
80
.
Wang
X.
,
Yang
K.
&
Yang
L.
2018
Application research of inner-plant economical operation by multi-colony ant optimization
.
Water Resources Management
32
(
13
),
4275
4295
.
Zhang
Q.
,
Wang
B.-D.
,
He
B.
,
Peng
Y.
&
Ren
M.-L.
2011
Singular spectrum analysis and ARIMA hybrid model for annual runoff forecasting
.
Water Resources Management
25
(
11
),
2683
2703
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).