Water demand prediction is crucial for effective planning and management of water supply systems to handle the problem of water scarcity. Taking into account the uncertainties and imprecisions within the framework of water demand forecasting, the uncertain time series prediction method is introduced for water demand prediction. Uncertain time series is a sequence of imprecisely observed values that are characterized by uncertain variables and the corresponding uncertain autoregressive model (UAR) is employed to describe it for predicting future values. The main contributions of this paper are shown as follows. Firstly, by defining the auto-similarity of uncertain time series, the identification algorithm of UAR model order is proposed. Secondly, a new parameter estimation method based on the uncertain programming is developed. Thirdly, the imprecisely observed values are assumed as the linear uncertain variables and a ratio-based method is presented for constructing the uncertain time series. Finally, the proposed methodologies are applied to model and forecast Beijing's water demand under different confidence levels and compared with the traditional time series, i.e. ARIMA method. The experimental results are evaluated on the basis of performance criteria, which shows that the proposed method outperforms over the ARIMA method for water demand prediction.

  • Considering the uncertainty of water demand, the uncertain time series method for demand estimation of water resources is presented.

  • The auto-similarity of uncertain time series is defined, and the identification algorithm of uncertain autoregressive model order is proposed.

  • An uncertain programming approach to estimate the parameters of model is proposed.

  • The construction of liner uncertain time series is investigated due to the interval-valued data frequently encountered in real life.

Water is an indispensable natural resource on our planet, which plays an important role in man's life and activity. Apart from drinking and personal hygiene, water is still a necessary resource for agricultural and industrial production, economic and ecological development (Deng et al. 2015; Liu et al. 2015). However, due to climate change, socio-economic development and population growth, water consumption (especially for freshwater) is growing rapidly, and water supply is facing many challenges (Choksi et al. 2015), especially the problem of water scarcity (Frederick 1997; Pahl-Wostl 2007; Arnell & Lloyd-Hughes 2014; Wang et al. 2015). This has led to the need for effective planning, managing and operating of finite water resources (Oduro-Kwarteng et al. 2009; Wang et al. 2018). Therefore, water demand forecasting is a fundamental phase for optimal allocation of water resources and aims to provide the simulated view of future demand, which can assist decision makers in devising appropriate management schemes to relieve the conflict between growing demand and limited supply of water resources. To this end, many researchers have proposed different methods to model and forecast water demand.

Time series analysis is one of the commonly used methods for water demand prediction. It was proposed by Box and Jenkins who considered the dependence among the data. The model, namely, ARIMA, is regarded as a classical forecasting technique, describing a predicted value as a linear function of previous data and random errors and including a cyclical or seasonal component. For example, Maidment et al. (1985) applied the time series model of daily municipal water use as a function of rainfall and air temperature for short-term forecasting of daily water use in Austin, Texas. Smith (1988) developed an autoregressive process with randomly varying mean to forecast the daily municipal water use, which captured the seasonality and day-week effects in the model through the unit demand function. Aly & Wanakule (2004) utilized a deterministic smoothing algorithm that considered level, trend and seasonality components of time series to estimate monthly water use. Zhai et al. (2012) employed the time series forecasting method for predicting the future needs of water in Beijing by analyzing the driving mechanism of changes of water consumption and water consumed structure.

In a time series modelling application, the determination of the model order is the fundamental step towards describing any dynamic process and has been of considerable interest for a long time. Primarily, the determination method of the order of time series model is based on the properties of sample autocorrelation coefficient and partial autocorrelation coefficient. Following that, several order selection approaches based on information theoretic criteria, such as Akaike's information criterion (AIC) (Akaike 1974), Akaike's final prediction error (FPE) (Akaike 1970), minimum description length (MDL) (Rissanen 1978; Liang et al. 1993) and so on, have been developed. Another common method, namely, the linear algebraic method based upon the determinant and rank testing algorithms, was proposed in (Fuchs 1987) and Sadabadi et al. (2007). Apart from two methods above, many other methods like bayesian information criterion (BIC) (Schwarz 1978), edge detection-based approach (Al-Smadi & Al-Zaben 2005), optimal instrumental variable (IV) algorithm (Sadabadi et al. 2009) and so on were investigated to estimate the order of the time series model. Another estimation problem that has also been considerably investigated is the aspect of coefficients determination of time series model. Commonly used methods for estimation of unknown coefficients are least-squares (LS) estimator and maximum likelihood estimator (MLE) methods.

Based on the mentioned methods of model order identification and parameters estimation, the time series models can be formulated to forecast water demand. It is worth noting that the aforementioned models provided a single valued forecast of water demand, disregarding the uncertainty inherent in some situations where the influential factors that affect water demand are uncertain, which leads to the uncertainty of water demand. This would limit the usefulness of these deterministic models. One classical way to handle the uncertainty is to use a probabilistic model (Almutaz et al. 2012; Haque et al. 2014) based on the Monte Carlo Stimulations (MCS) to obtain the distribution of water demand and provide an estimate of the overall uncertainty in the predictions connected to uncertainty of influential factors.

Unfortunately, the distribution function obtained in most practical problems is not close enough to the actual frequency, especially in the case of emergencies and lack of history data. In addition, the water demand data possess uncertain characteristics caused by inaccuracies in measurements that need to be given by experts. This motivates us to apply a new mathematical tool to deal with a range of uncertainties inherent in certain water demand data. Recently, uncertainty theory was proposed by Liu (2009) in 2007, which is an effective way to solve previous problem for imprecisely observed values. Based on the uncertainty theory, many researchers have done a lot of work including the determination of uncertain distribution (Wang et al. 2012a; Wang & Peng 2014), hypothesis test (Guo et al. 2017; Ye & Liu 2021), and uncertain regression analysis (Wang et al., 2012b; Lio & Liu 2018; Yao & Liu 2018; Ye & Liu 2020). Furthermore, the concept of uncertain time series was firstly proposed by Yang & Liu (2019) based on uncertain theory in 2019. Like the traditional time series analysis, there may be more than one approach to model time series. However, in their study, to describe uncertain time series, the UAR model was employed to predict the future values based on previously imprecisely observed values that are characterized in terms of uncertain variables. Based on the imprecisely observed values, Yang & Liu (2019) presented the least-squares method to estimate the coefficients of the UAR model for predicting the carbon emission.

However, there are still many important issues that have not been touched. Firstly, the identification of UAR model order is one of these, because it is the first step in estimating the model parameters. In the work of Yang & Liu (2019), the 2-order UAR model was directly employed to forecast the future values. This method is too subjective and lacks a certain theoretical foundation, which may reduce the prediction accuracy of the model. So, in this paper, by defining the auto-similarity of uncertain time series, an algorithm for determining the optimal order of autoregressive model is designed. Secondly, its novel parameter estimation approach is developed based on uncertain programming. Within the proposed method, the original problem including uncertain measure is transformed to the equivalent crisp mathematical programming. Thirdly, in our daily life, most information is uncertain in nature. For example, water demand naturally takes different values with minimum water demand and maximum water demand, which are inherently imprecisely observed values at times t, , respectively, so the linear uncertain variables are selected for this purpose. Hence, it is a critical issue for us to determine the lower and upper bounds for the actual data belonging to a range. That is, how to construct an uncertain time series based on observed historical point data. Referring to the work of Huarng (2006), we introduced a novel ratio-based approach to determine the effective uncertain time series. Furthermore, the proposed uncertain time series forecasting approach is used to predict the urban water demand. As the second-largest city of China, Beijing's rapid development has attracted many immigrants in recent years and water consumption is growing rapidly, which led to the increasingly sharp conflict between demand and supply of water resources. This situation has become an important constraint on the sustainable development of Beijing. Therefore, the water demand prediction of Beijing is a fundamental stage for water resources planning and utilization, which contributes to harmonious development between the socio-economy and resources environment in Beijing. To further verify the accuracy of the proposed methodologies, traditional time series method is selected as a competitor. The results are judged on the basis of presented criteria, i.e. the prediction reliability and accuracy compared to ARIMA method.

The organization of this paper is as follows. Section 2 briefly presents some fundamental concepts properties and theorems in uncertainty theory. Section 3 introduces the forecasting procedure of uncertain time series analysis. Section 4 provides an experimental analysis to validate the effectiveness of the proposed method and access its performance by comparing with the conventional time series (ARIMA) method. Finally, some conclusions are drawn.

In this section, we will present some fundamental definitions and theorems on uncertainty theory.

Definition 1. (Liu 2007) Letbe a nonempty set, andbe aσ-algebra over. Each elementis called an event. A numberindicates the belief degree thatwill occur. Thenis called an uncertain measure if it satisfies the following axioms:

Axiom 1:(Normality Axiom)for the nonempty set.

Axiom 2:(Duality Axiom)for any event.

Axiom 3:(Subadditivity Axiom) For every countable sequence of events, we have
(1)

In this case, the tripletis called an uncertainty space.

Then the product uncertain measure on the product σ-algebra was defined by Liu (2009), producing the fourth axiom of uncertain measure.

Axiom 4:(Product Axiom) Letbe uncertainty spaces for. The product uncertain measureis an uncertain measure satisfying
(2)
whereare arbitrarily chosen events fromforrespectively.

The concept of uncertain variable was introduced by Liu as a measurable function from an uncertainty space to the set of real numbers.

Definition 2. (Liu 2007) An uncertain variable is a measure functionfrom an uncertain spaceto the set of real number. That is, for any Borel set, the set
(3)
is an event.
Definition 3. (Liu 2007) Letbe uncertain variables, and letfbe a real-valued measurable function. Thenis an uncertain variable defined by
(4)
Definition 4. (Liu 2007) The uncertainty distributionof an uncertain variableis
(5)
Definition 5. (Liu 2007) An uncertain variable is called linear if it has a linear uncertainty distribution
(6)
denoted bywhereandbare real numbers with.
Definition 6. (Liu 2010a) An uncertainty distributionis said to be regular if it is continuous and strictly increasing function with respect toxat which, and
(7)

Definition 7. (Liu 2010a) Letbe an uncertain variable with regular uncertainty distribution. Then the inverse functionis called the inverse uncertainty distribution of.

Example 1. The inverse uncertainty distribution of linear uncertain variable is
(8)
Definition 8. (Liu 2007) Letbe an uncertain variable. Then the expected value ofis defined by
(9)
provided that at least one of the two integral is finite.
Theorem 1. (Liu 2010a) Letbe an uncertain variable with regular uncertainty distribution. Then
(10)
Theorem 2. (Liu 2010a) Letandbe independent uncertain variables with finite expected values. Then for any real numberaand, we have
(11)
Theorem 3. (Liu 2010b) Letbe independent uncertain variables with regular uncertainty distributions, respectively. Ifis strictly increasing with respect toand strictly decreasing with respect to, then
(12)
has an inverse uncertainty distribution
(13)
Theorem 4. (Liu 2010b) Letbe independent uncertain variables with regular uncertainty distributions, respectively. If constraint functionis strictly increasing with respect toand strictly decreasing with respect to, then
(14)
holds if and only if
(15)

Uncertain time series was proposed by Yang & Liu (2019) in 2019 so as to predict the future values based on previously imprecisely observed values that are described by uncertain variables. The basic definition of uncertain time series is as follows.

Definition 10. (Yang & Liu 2019) An uncertain time series is a sequence of imprecisely observed values that are characterized in terms of uncertain variables. Mathematically, an uncertain time series is represented by
(16)
whereare imprecisely observed values (i.e. uncertain variables) at times,respectively.
After giving the uncertain time series, it is necessary to formulate function relations between the observations at time t and those at previous times to describe uncertain time series. Generally, the relationship between uncertain variables can be expressed by the following function
(17)
According to the research (Yang & Liu 2019), the method for modelling uncertain time series is the autoregressive model,
(18)
where are unknown parameters, is an uncertain variable, and p is called the order of the autoregressive model. In order to recognize a good UAR model as a forecast tool for the given data, the following three problems need to be solved which include: the determination of order of UAR model, parameter estimation and the construction of uncertain time series. In the following subsections, all the detailed procedure of using our methodology to make predictions is presented, which can be divided into four stages, clearly differentiated in Figure 1.
Figure 1

The detailed four stages of the proposed uncertain time series method.

Figure 1

The detailed four stages of the proposed uncertain time series method.

Close modal

The determination of the order of UAR model

During the modeling process of uncertain time series, a fundamental phase is the identification of order of UAR model. Generally, we make predictions about the future to make strategies, which should not only get information from the data before but also get information from the near past, although they may not have the same effect strength. Therefore, it is crucial to find an appropriate order to determine the lagged variables existing in the model and establish the truly effective model. If the model order is not recognized efficiently, the accuracy of the predictions produced by the model will be compromised. Just like traditional time series analysis, correlation is a very important concept used in analyzing data in the time series. We often identify the model based on the trailing or truncating properties of the autocorrelation coefficient and the partial correlation coefficient. However, for the imprecisely observed values represented by uncertain variables, the traditional statistical method above is problematic. In this subsection, we introduce an order determination method based on the notion of similarity, which is generated from the distance measure defined by Li & Liu (2015). It can be represented by
(19)
where and are uncertain variables. It is easy to understand that the distance between uncertain variables essentially reflects their difference. The greater the distance is, the smaller the similarity is, and vice versa. Here, we just set out one, i.e. , as the application for the following definitions.

Basic definitions

Based on the above distance measure, the average distance between two uncertain variables in the uncertain time series is defined as below.

Definition 9.Average distance

Letbe an uncertain time series andbe imprecisely observed values characterized in terms of independent uncertain variables with regular uncertain distributionsrespectively,denotes the experimental order. Then the average distance between uncertain variablesandis defined as
(20)

To further provide ease of use, the study applies the following definition to make some adjustments.

Definition 10.Auto-similarity of uncertain time series

Letbe the average distance between uncertain variablesandof uncertain time series,mdenotes the experimental order. Then the auto-similarity of uncertain time series is defined as
(21)

It is clear that . Definition 10 shows that the greater is, the higher the similarity between the uncertain variables and is.

Model order selection algorithm

According to the above definitions, an algorithm for determining the appropriate order of UAR model is presented as follows.

Step 1. We set as an alternative order.

Step 2. Determine the confidence level .

Step 3. If , then select the order to add the set of alternative order numbers. That is, enter into Step 4.

If and , then choose the order as the optimal order.

If and , then into Step 4.

Step 4. Set and return to Step 2.

Step 5. Optimal order obtained from Step 1 to Step 4 is regarded as the order of UAR model.

Step 6. If we cannot find the effective order until the predetermined experimental order m is reached, then let .

We want to note again that the proposed algorithm is also presented in Figure 2.

Figure 2

Flowchart of the algorithm for determining the model order.

Figure 2

Flowchart of the algorithm for determining the model order.

Close modal

A new parameter estimation method based on uncertain programming

Once the order of the model is determined, we need to estimate the parameters of the UAR model to make predictions. Based on the imprecisely observed values, Yang & Liu (2019) investigated the least squares approach to estimate coefficients of the UAR model. Different from the previous research, in this subsection, we will propose a new method of parameter estimation based on uncertain programming, which can give more flexibility to the UAR model to make predictions. In general, we hope that the given parameters should make the differences between the predicted values and observed values as small as possible. In uncertain time series model, let be imprecisely observed values characterized in terms of independent uncertain variables with regular uncertain distributions , respectively. Then, the estimation of unknown parameters in the UAR model
(22)
can solve the following programming model
(23)
where is the target variable and is a given level by the domain experts according to their experience knowledge. In order to obtain the optimal solution, we need to transform it into an equivalent deterministic model. The following theorem will address this problem.
Theorem 5.The model (23) is equivalent to the crisp mathematical programming
(24)
where
(25)
for

Proof: It follows from Theorem 4 immediately.

The construction of uncertain time series

Uncertain time series was proposed so as to deal with such forecasting problems where the historical data are not crisp numbers but are imprecisely observed values. Besides, in practical cases, most traditional point data possess uncertainty characteristics due to the measurement errors. For instance, the water demand variables that naturally take a finite set of numerical values varying between a lower and upper bound are regarded as interval-valued variables, which stand for the inaccuracies in measurements. For doing so, each interval-valued variable is assumed as the linear uncertain variable. However, for the same uncertain time series model, the difference of constructed intervals by adopting different ways can result in different forecasting performance. So, how to use an efficient way to choose effective length of interval is especially critical to improve uncertain time series forecasting performance. A key point in determining the proper length of interval is that they should not too large or small. When an effective length of interval is too wide, the prediction results will be meaningless in the uncertain time series. If the length is too small, the uncertain time series will become very close to the traditional time series and the result is not intended. On the other hand, many traditional time series have the momentum to vibrate in a certain period of time. Therefore, in the process of constructing uncertain time series, we should consider the trend information of data of time series itself, which makes the determined interval series more reasonable and really reflects the variation tendency of data of time series. By following the two requirements, in this subsection, we propose a new ratio-based approach to determine the length of interval to obtain the high forecasting accuracy. The step of the algorithm of the method presented can be given as follows:

Step 1. Take the first order of differences between any two consecutive observations for any and , .

Step 2. Calculate relative differences for all t, .

Step 3. Determine the lower and upper bounds of the initial value.

Let
(26)
and
(27)

Step 4. Determine the lower and upper bounds on the interval series.

Let
(28)
and
(29)

be the lower and upper bounds for observation at time t, respectively.

Step 5. Construct the uncertain time series. Following steps above, the interval-valued variables, , are obtained. We can consider each interval variable at time t as the linear uncertain variable with linear uncertain distribution . Then, an uncertain time series can be represented as
(30)

This section presents the application of the proposed methods for water demand forecast in Beijing. In Section 4.1, the location and dataset used in model development are given. Section 4.2 provides the implementations of the proposed uncertain time series model. For the purpose of comparison, the ARIMA model is selected to contrast the forecasting performance, and the classical measure methods are adopted to evaluate the forecast accuracy of the models in Section 4.3.

Location and dataset

In this work, the study area is located in Beijing. As the capital of China, Beijing is China's political, cultural, and international communication center, located at the interlaced terrace of North China Plain and Mongolian Plateau. In Beijing, drinking water is mainly supplied by the Yongdinghe and Chaobaihe rivers. As a result of China's rapid development and dense population, Beijing's water demand consumption is increasing rapidly and Beijing is experiencing a shortage of water resources. According to the Beijing Water Authority (BWA), the annual water resources per capita is less than 300 m3, which is only 12.5% of the national average and far below the internationally recognized minimum standard of 1000 m3 per year. This situation of increasing water demands and limited water resource supplies has also become the vital restrictive factor affecting the socio-economic development and environmental health of Beijing for a long time into the future. Therefore, it is particularly important to apply the proposed method to forecast the water demand in Beijing, which contributes to the water resources planning and management in the near future.

All the research dataset of uncertain time series were obtained from Beijing Water Resources Bulletin during the time period between 1988 and 2016. A total of 29 data points were collected and are shown in Table 1. In order to illustrate the effectiveness of the proposed uncertain time series method, the data from 1988 to 2013 are used as an estimation sample to determine the coefficients of the estimation model, while the rest of data are reserved as the hold-out sample, used to test the model and access the performance of prediction.

Table 1

Imprecisely observed data where represents linear uncertain variable

Observation TimeTotal water demand First order differenceRelative difference
1988 424   406 442  
1989 446 22 0.05189 427 465  
1990 411 −35 −0.0785 394 428  
1991 423 12 0.0292 405 441  
1992 464 41 0.0969 444 484  
1993 452 −12 −0.0259 433 471  
1994 459 0.0155 440 478  
1995 449 −10 −0.0218 430 468  
1996 400 −49 −0.1091 383 417  
10 1997 403 0.0075 386 420  
11 1998 404 0.0025 387 421  
12 1999 417 13 0.0322 399 435  
13 2000 400 −17 −0.0408 383 417  
14 2001 389 −11 −0.0275 372 406  
15 2002 346 −43 −0.1105 331 361  
16 2003 358 12 0.0347 343 373  
17 2004 346 −12 −0.0335 331 361  
18 2005 345 −1 −0.0029 330 360  
19 2006 343 −2 −0.0058 328 358  
20 2007 348 0.0146 333 363  
21 2008 351 0.0086 336 366  
22 2009 355 0.0114 340 370  
23 2010 352 −3 −0.0085 337 367  
24 2011 360 0.0227 345 375  
25 2012 359 −1 −0.0028 344 374  
26 2013 364 0.01393 349 379  
Observation TimeTotal water demand First order differenceRelative difference
1988 424   406 442  
1989 446 22 0.05189 427 465  
1990 411 −35 −0.0785 394 428  
1991 423 12 0.0292 405 441  
1992 464 41 0.0969 444 484  
1993 452 −12 −0.0259 433 471  
1994 459 0.0155 440 478  
1995 449 −10 −0.0218 430 468  
1996 400 −49 −0.1091 383 417  
10 1997 403 0.0075 386 420  
11 1998 404 0.0025 387 421  
12 1999 417 13 0.0322 399 435  
13 2000 400 −17 −0.0408 383 417  
14 2001 389 −11 −0.0275 372 406  
15 2002 346 −43 −0.1105 331 361  
16 2003 358 12 0.0347 343 373  
17 2004 346 −12 −0.0335 331 361  
18 2005 345 −1 −0.0029 330 360  
19 2006 343 −2 −0.0058 328 358  
20 2007 348 0.0146 333 363  
21 2008 351 0.0086 336 366  
22 2009 355 0.0114 340 370  
23 2010 352 −3 −0.0085 337 367  
24 2011 360 0.0227 345 375  
25 2012 359 −1 −0.0028 344 374  
26 2013 364 0.01393 349 379  

Methodologies implementations

Construction of uncertain time series

As mentioned previously, water demand data are not crisp numbers but imprecisely observed values, so the data pre-processing is essential. For the purpose of implementation, we utilize the linear uncertain variables to describe the water demand observations and construct the uncertain time series by following the algorithm in the previous subsection:

  • (1)

    Take the first order of differences between any two consecutive observations, which are listed in the fourth column of Table 1.

  • (2)

    Calculate the relative differences between any two consecutive observations. All the relative differences are listed in the fifth column of Table 1.

  • (3)

    Determine the lower and upper bounds of the interval-valued series. Firstly, the initial interval is calculated as follows.

By
(31)
and
(32)
we have
(33)
Then, it is obvious that . We consider the initial interval-valued variable as the linear uncertain variable with linear uncertain distribution . Secondly, by following Step 3 in Section 3.3, the corresponding interval series can be constructed as shown in Table 1. Thus, we can obtain an uncertain time series that are characterized in terms of linear uncertain variables, i.e.
(34)

The determination of the model order

In the model order selection phase, different experimental orders are examined based on the definition of auto-similarity of uncertain time series and the best one among them is selected. Generally, we set the maximum lagging order . The details of the calculation can be found in the following paragraphs.

Firstly, we assume that linear uncertain variables are independent. According to Definition 9, the average distance of the first order lag of all uncertain variables is calculated as follows
(35)
Similarly, we have,
(36)
(37)
(38)
and
(39)
Then, , . It follows from Definition 10 that
(40)

By using the algorithm in Section 3.1, we can find the appropriate order.

Let . Because and . At the same time , thus the order of UAR model is .

The parameter estimation for the UAR model

According to Section 4.2.2, we obtain the 2-order UAR model
(41)
By using Theorem 5, we obtain the following mathematical programming model to estimate the unknown coefficients
(42)
Let , . Then we obtain the optimal solution
(43)
and the corresponding autoregressive model
(44)

According to different confidence levels (i.e. 0.75, 0.85, 0.90), different regression models can be established to compare with the existing traditional time series model.

Comparison with the existing method

In this subsection, we apply the proposed UAR model to forecast the water demand of Beijing from 2014 to 2016. Following the above forecasting model (Equation (44)), the forecasting result is an uncertain variable. However, in most cases, the results we require are often crisp values, so the research uses the expected value of the uncertain variable as the predicted value. The prediction results under different confidence levels (i.e. 0.75, 0.85, 0.90) are presented in Table 2. In order to further verify the effectiveness of the proposed methodologies, the traditional ARIMA time series method is selected as a competitor to contrast the forecasting performance. According to the historical data of water demand from 1988 to 2013, the result of ARIMA model is represented as follows
(45)
Table 2

Predicted results of Beijing's total water demand

TimeActual valuesARIMA modelUAR model
2014 375 361.59 375.64 383.74 385.74 
2015 382 359.17 385.57 400.56 404.32 
2016 388 356.75 398.53 416.54 422.04 
TimeActual valuesARIMA modelUAR model
2014 375 361.59 375.64 383.74 385.74 
2015 382 359.17 385.57 400.56 404.32 
2016 388 356.75 398.53 416.54 422.04 
The performances of the models are evaluated based on the classical measure methods, i.e. average relative error (ARE) and total absolute error (AE). Obviously, lower ARE and AE values lead to better performance. The definitions of all these performance criteria are represented by
(46)
and
(47)
where s is the total number of data needed to predict, and denote the predicted value and the actual observed value, respectively.

Therefore, the prediction performance are shown in Table 3 and Figure 3. From the experimental results obtained, it can be concluded that the proposed uncertain time series forecasting method has better forecasting performance than the ARIMA method under the considered levels.

Table 3

Comparison of above models

TimeAbsolute error
Relative error
ARIMA modelUAR model
ARIMA modelUAR model
2014 13.41 0.64 8.74 10.74 0.0358 0.0017 0.0233 0.0286 
2015 22.83 3.57 18.56 22.32 0.0598 0.0093 0.0486 0.0584 
2016 31.25 10.53 28.54 34.04 0.0805 0.0271 0.0736 0.0877 
Total predicted error 67.49 14.74 55.84 67.09     
Maximum relative error     0.0805 0.0271 0.0736 0.0877 
Average relative error     0.0587 0.0127 0.0485 0.0583 
TimeAbsolute error
Relative error
ARIMA modelUAR model
ARIMA modelUAR model
2014 13.41 0.64 8.74 10.74 0.0358 0.0017 0.0233 0.0286 
2015 22.83 3.57 18.56 22.32 0.0598 0.0093 0.0486 0.0584 
2016 31.25 10.53 28.54 34.04 0.0805 0.0271 0.0736 0.0877 
Total predicted error 67.49 14.74 55.84 67.09     
Maximum relative error     0.0805 0.0271 0.0736 0.0877 
Average relative error     0.0587 0.0127 0.0485 0.0583 
Figure 3

Comparisons of prediction accuracy for water demand prediction.

Figure 3

Comparisons of prediction accuracy for water demand prediction.

Close modal

As far as the comparison between the proposed method under the 0.75 and 0.85 confidence levels and the ARIMA method is concerned, the former outperforms the latter in all cases. The total prediction error is reduced by 78.15 and 17.26% respectively, and the average relative error by 78.36 and 17.38% respectively. In addition, for the prediction error of each observation, the proposed method under the 0.75 and 0.85 levels is smaller than the ARIMA method. Especially for the associated 0.75 confidence level, the improvements of forecasting performance are more obvious.

When considering the comparison between the proposed method under the 0.90 confidence level and the ARIMA method in all cases, we can see that the maximum prediction error of the proposed method is a little higher than the ARIMA method. Overall, the former almost wins. The total prediction error is reduced by 0.59% and the average relative error by 0.68%. This reduction is crucial in the planning and management of water supply systems.

Apart from the statistical criteria discussed above, we implement the forecasting trend to evaluate the performance of the above methods. According to the current trend, Beijing's total water demand was increasing from 2014 to 2016. However, the prediction trend of the ARIMA method was declining. This is not in accordance with the reality of Beijing's total water demand during 2014 and 2016. So, these predicted results may not provide support to the water resource management in the near future. It is worthy to note that the predicted results of the proposed method could reflect the realistic water demand trend in the short term and help the decision makers to devise reasonable management schemes.

In this study we presented a modified time series method for demand estimation of water resources in Beijing. Considering the uncertainty of water demand in real life, we attempted to combine the uncertainty theory with a time series model, called uncertain time series, to handle the above problems. In the presented method, we employed the UAR model to describe uncertain time series for predicting future values. First, the auto-similarity of uncertain time series, as a principle of justifiable recognition, is defined and the identification algorithm of determining the optimal model order is proposed, which enables the estimation of the correct parameters of the model. Second, we propose an uncertain programming approach for estimating the parameters of the model. Then, the imprecisely observed values are assumed as the linear uncertain variables and a ratio-based method is presented for constructing the uncertain time series. Finally, we tested the performance of the proposed model and the traditional time series model (ARIMA) based on the statistical criteria. The results demonstrated that the proposed model provided much better accuracy over the traditional model mentioned above for water demand predictions. The possible reason is that the traditional model cannot effectively handle the imprecisely observed values, this allows the possibility of the loss of effective information, which leads to the reduction of prediction accuracy.

Although the proposed UAR model has greatly improved the traditional water demand time series method, there are still some limitations which need to be improved, such as the determination of the model order and the construction of the uncertain time series. In the future, we will further study the algorithm optimization of model order, and provide better solutions for the UAR model applications which improve the accuracy of forecast. On the other hand, we only investigate the construction of linear uncertain time series due to the interval-valued data that are encountered frequently in multiple situations. Further study may attempt to construct the normal uncertain time series to expand the application field of the model.

This work was supported by the National Natural Science Foundation of China (No.61873084) and the Foundation of Hebei Education Department (No. ZD2017016).

All relevant data are included in the paper or its Supplementary Information.

Akaike
H.
1970
Statistical predictor identification
.
Annals of Institute of Statistical Mathematics
22
,
203
217
.
Akaike
H.
1974
A new look at the statistical model identification
.
IEEE Transactions on Automatic Control
AC-19
(
6
),
716
723
.
Al-Smadi
A.
&
Al-Zaben
A.
2005
ARMA model order determination using edge detection: a new perspective
.
Circuits, Systems and Signal Processing
24
(
6
),
723
732
.
Aly
A.
&
Wanakule
N.
2004
Short-term forecasting for urban water consumption
.
Water Resources Planning and Management
130
,
405
410
.
Choksi
K. N.
,
Sheth
M. A.
&
Mehta
D.
2015
To assess the performance of Sewage Treatment Plant: a case study of Surat city
.
International Journal of Engineering and Technology (IRJET)
2
(
08
),
1071
1075
.
Deng
W.
,
Wang
G.
&
Zhang
X.
2015
A novel hybrid water quality time series prediction method based on cloud model and fuzzy forecasting
.
Chemometrics and Intelligent Laboratory Systems
149
,
39
49
.
Fuchs
J. J.
1987
On estimating the order of an ARMA process
.
Automatica
23
(
6
),
779
782
.
Guo
H.
,
Wang
X.
&
Gao
Z.
2017
Uncertain hypothesis testing for two experts empirical data
.
Mathematical and Computer Modelling
55
,
1478
1482
.
Haque
M. M.
,
Rahman
A.
,
Hagare
D.
&
Kibria
G.
2014
Probabilistic water demand forecasting using projected climatic data for Blue Mountains water supply system in Australia
.
Water Resources Management
294
,
53
59
.
Li
X.
&
Liu
Y.
2015
Distance and similarity measures between uncertain variables
.
Journal of Intelligent and Fuzzy Systems
28
(
5
),
2073
2081
.
Liang
G.
,
Wilkes
D. M.
&
Cadzow
J. A.
1993
ARMA model order estimation based on the eigenvalues of the covariance matrix
.
IEEE Transactions on Signal Processing
41
(
10
),
3003
3009
.
Lio
W.
&
Liu
B.
2018
Residual and confidence interval for uncertain regression model with imprecise observations
.
Journal of Intelligent and Fuzzy Systems
35
(
2
),
2573
2583
.
Liu
B.
2007
Uncertain Theory
, 2nd edn.
Springer-Verlag
,
Berlin
.
Liu
B.
2009
Some research problems in uncertain theory
.
Journal of Uncertain Systems.
3
(
1
),
3
10
.
Liu
B.
2010a
Uncertain Theory: A Branch of Mathematics for Modeling Human Uncertainty
.
Springer-Verlag
,
Berlin
.
Liu
B.
2010b
Uncertain risk analysis and uncertain reliability analysis
.
Journal of Uncertain Systems
4
(
3
),
163
170
.
Maidment
D. R.
,
Miaou
S. P.
&
Crawford
M. M.
1985
Transfer function models of daily urban water use
.
Water Resources Research
21
(
4
),
425
432
.
Oduro-Kwarteng
S.
,
Nyarko
K.
,
Odai
S.
&
Aboagye-Sarfo
P.
2009
Water conservation potential in educational institutions in developing countries: case study of a university campus in Ghana
.
Journal of Urban Water
6
(
6
),
449
455
.
Rissanen
J.
1978
Modeling by shortest data description
.
Automatica
14
,
465
471
.
Sadabadi
M. S.
,
Shafiee
M.
&
Karrari
M.
2007
Determination of the two-dimensional ARMA model order using rank test based approach
.
Proceedings of 16th IEEE International Conference on Control Applications
,
1156
1160
Sadabadi
M. S.
,
Shafiee
M.
&
Karrari
M.
2009
Two-dimensional ARMA model order determination
.
ISA Transactions
48
,
247
253
.
Schwarz
G.
1978
Estimating the dimension of a model
.
Annals of Statistics
6
,
461
464
.
Smith
J. A.
1988
A model of daily municipal water use for short-term forecasting
.
Water Resources Research
24
(
2
),
153
164
.
Wang
X.
&
Peng
Z.
2014
Method of moments for estimating uncertainty distribution
.
Journal of Uncertainty Analysis and Applications
2
(
1
),
1
10
.
Wang
X.
,
Gao
Z.
&
Guo
H.
2012a
Delphi method for estimating uncertainty distributions
.
Information: An International Interdisciplinary Journal
15
(
2
),
449
460
.
Wang
X.
,
Gao
Z.
&
Guo
H.
2012b
Uncertain linear regression model and its application
.
Journal of Intelligent Manufacturing
28
(
3
),
559
564
.
Wang
X.
,
Zhang
J.
,
Shahid
S.
,
He
R.
,
Xia
X.
&
Mou
X.
2015
Potential impact of climate change on future water demand in Yulin city
.
Northwest China. Mitigation and Adaptation Strategies for Global Change
20
(
1
),
1
19
.
Wang
X.
,
Zhang
J.
,
Shahid
S.
,
Bi
S.
,
Elmahdi
A.
,
Liao
C.
&
Li
Y.
2018
Forecasting industrial water demand in Huaihe River Basin due to environmental changes
.
Mitigation and Adaptation Strategies for Global Change
23
,
469
483
.
Yang
X.
&
Liu
B.
2019
Uncertain time series analysis with imprecise observations
.
Fuzzy Optimization and Decision Making.
18
(
3
),
263
278
.
Yao
K.
&
Liu
B.
2018
Uncertain regression analysis: an approach for imprecise observation
.
Soft Computing
22
(
17
),
5573
5578
.
Ye
T.
&
Liu
B.
2021
Uncertain hypothesis test with application to uncertain regression analysis
. Fuzzy Optimization and Decision Making. https://doi.org/10.1007/s10700-021-09365-w.
Ye
T.
&
Liu
Y.
2020
Multivariate uncertain regression model with imprecise observations
.
Journal of Ambient Intelligence and Humanized Computing
.
https://doi.org/10.1007/s12652-020-01763-z
.
Zhai
Y.
,
Wang
J.
,
Teng
Y.
&
Zuo
R.
2012
Water demand forecasting of Beijing using the time series forecasting method
.
Geographical Sciences
22
(
5
),
919
932
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).