With the rapid development of urbanization and the continuous improvement of living standards, China's domestic water consumption shows a growing trend. However, in some arid and water deficient areas, the shortage of water resources is a crucial factor affecting regional economic development and population growth. Therefore, it is essential to reliably predict the future water consumption data of a region. Aiming at the problems of poor prediction accuracy and overfitting of non-growth series in traditional grey prediction, this paper uses residual grey model combined with Markov chain correction to predict domestic water consumption. Based on the traditional grey theory prediction, the residual grey prediction model is established. Combined with the Markov state transition matrix, the grey prediction value is modified, and the model is applied to the prediction of domestic water consumption in Shaanxi Province from 2003 to 2019. The fitting results show that the accuracy grade of the improved residual grey prediction model is “good”. This shows that the dynamic unbiased grey Markov model can eliminate the inherent error of the traditional grey GM (1,1) model, improve the prediction accuracy, have better reliability, and can provide a new method for water consumption prediction.

  • The prediction model of water resources is established.

  • The method of combining grey model with Markov model is put forward.

  • The modified method has a good prediction effect and application value.

Water is a basic natural and strategic resource, the material basis for the survival of human society, and an important element for the sustainable development of the national economy (Horne 2013). With the rapid growth of the world's economic level and the acceleration of urbanization process, water consumption presents a growing trend. The consumption uses of water resources are mainly divided into agricultural water, industrial water, domestic water, ecological water, and so on. The amount of domestic water directly affects the quality of life of local residents. The predicted value of domestic water consumption can be used as an important reference index for water supply decision-making and water conservancy construction investment, and the basis for formulating water supply and drainage planning and national economic planning. It plays a vital role in water resources planning and scheduling management, and is of great significance for China's modernization (Qin et al. 2013).

Although China is rich in total water resources, the per capita annual sustainable freshwater supply is only 2,000 m3, less than one-fourth of the average world level of 8,300 m3 (Manju & Sagar 2017). At present, one-quarter of the world's 400 million people facing severe water shortage live in China. The area of Shaanxi Province in China accounts for 2.14% of the whole country, while the total amount of water resources only accounts for 1.26% of the whole country's water resources. The per capita available water volume is only 736 m3, less than half of the national per capita level, and is below the international minimum water demand standard, making Shaanxi Province an extremely water deficient province in China. Moreover, the spatial and temporal distribution of water resources in Shaanxi Province is seriously uneven, with 70% concentrated in southern Shaanxi, and the water resources in Guanzhong and Northern Shaanxi are very scarce. With the rapid development of the social economy, the water consumption of Shaanxi Province is increasing. The development potential of water resources in some areas has reached the upper limit, and the contradiction between supply and demand of water resources has become increasingly prominent. As the core node of the construction of China's Silk Road Economic Belt and an important hub of Eurasia, Shaanxi Province straddles the Yellow River and Yangtze river basins, and is a key area in China's ecological security strategic pattern. It is very important to predict the water consumption accurately and reasonably plan and utilize the water resources in this area. At the same time, the region is also one of the most sensitive regions to climate change. The shortage of water resources is the core problem that restricts the region's economic and social development and ecological environment security. With the development of social economy and the improvement of people's living standards, the contradiction of water use will be further intensified, and domestic water, as an essential part of water use, needs to be given priority. Domestic water consumption prediction also plays a vital role in the unified management and allocation of water sources, the determination of the construction sequence and scale of water supply projects, and the implementation of drinking water safety measures for humans and livestock.

At present, domestic water consumption is affected by many factors, such as population, water price, economic development, environmental conditions, climate change and so on, which are nonlinear function. Many scholars have done a lot of research on the prediction of regional domestic water consumption, and the research methods are mainly divided into three categories: (a) time series method; (b) structural analysis method; and (c) system method.

The time series method does not consider the factors that affect the change of water consumption, but recognizes the change of water consumption as a ‘black box” system, which mainly depends on historical data and data model to predict. The standard methods of time series analysis are: (a) moving average method (Banihabib & Mousavi-Mirkalaei 2019); (b) Markov method (Gong et al. 2019); and (c) exponential smoothing method (Su et al. 2018). As the time series prediction is based on the accuracy of the prediction model, the model's improper use usually leads to a significant deviation (Firat et al. 2010).

The structural analysis method starts with studying the relationship between water consumption and influencing factors, analyzes the changes of different factors affecting water consumption, and establishes the relationship model between water consumption and influencing factors (Kanakoudis & Gonelas 2014, 2015). This method includes the regression analysis method (Chen et al. 2020; Zhang et al. 2021) and the index analysis method (Tian et al. 2020), which belongs to the category of cycle forecasting method and has a good effect on long-term forecasting. However, it is difficult to discuss the relationship between multi-factor and non-linearity in the system for structural analysis to find the law from the digital trend. The prediction methods are mostly simplified, such as simplifying the model to linear, reducing the number of variables, etc. Therefore, the simulation results are rough and the scope of application is narrow.

For this reason, some system research methods based on nonlinear prediction model are produced, such as neural network model (Walker et al. 2015; Zubaidi et al. 2018), system dynamics method (Lin et al. 2019), random forest algorithm (Brentan et al. 2018; Villarin & Rodriguez-Galiano 2019), support vector machine algorithm (Bai et al. 2015) and grey model method (Cui et al. 2013). These models reflect the nonlinear relationship in the process of domestic water consumption prediction to a certain extent. However, they are still deterministic models of point prediction, so it is difficult to predict the future fluctuation range of water consumption accurately. In the actual water use process, there are various uncertain factors (such as economic development, urban population, climate change, urban water use technology, etc.) affecting the prediction, so the impact of data fluctuation on the simulation data should be considered in the prediction. In order to evaluate the prediction effect of the model after considering the uncertainty factors, Duerr et al. (2018) compared the stochastic process model, machine learning model and other models, and concluded that the stochastic process model has a better prediction effect. Although some progress has been made in predicting urban domestic water consumption considering uncertainty, it is still necessary to explore some methods with sufficient information in the probability of prediction value (Ren et al. 2020). Correspondingly, interval prediction establishes a prediction interval under a certain confidence level, which reflects the uncertainty of domestic water consumption and helps decision makers make decisions according to the actual situation (Jiang et al. 2019). Ji et al. (2018, 2019) proposed an algorithm on parallel biological computing to solve water resources allocation and prediction. Deng et al. (2020a, 2020b) used the improved parallel intelligent algorithm to solve a series of complex problems, and achieved good results. The above algorithm provides a good reference for the prediction and optimization of water resources (Wang et al. 2021a, 2021b). However, the occurrence of water resources is affected by many factors, so we should consider the applicability of different calculation models, so as to find a better algorithm with better efficiency and prediction effect (Li et al. 2021). Therefore, it is necessary to study this issue further.

The Grey system is a new theory and technology for control and prediction, which was born in the second half of the 20th century. It has been widely used in agriculture, social economy and other fields. Grey system theory mainly takes “small sample” and “poor information” uncertain systems as the research object, extracts valuable information through the generation and development of some general information, and realizes the correct grasp and description of system operation behaviour and evolution law. Grey prediction predicts the development and change of system behaviour eigenvalues, the prediction of the system with both general and uncertain information; that is, the prediction of the grey process related to time series that change in a particular range. Although the phenomenon shown in the grey process is random and disordered, it is ordered and bounded after all, so the data set has potential rules. It is a theoretical method used to study the uncertainty problems with fewer data and less information, especially GM (1,1) as its most common model.

Traditional grey GM (1,1) model

Through the accumulation of the original data generation, the traditional grey model, using the least squares method to fit its accumulation generation, and then carry on the progressive reduction processing, get the original series of prediction value. GM (1,1) model requires that the original data must be equal interval time series.
The original sequence can be accumulated once to get the accumulated generating sequence :
(1)
The first order linear differential equation can be established by formula (1):
(2)
According to the principle of least squares method, the parameter vector of GM (1,1) model is as follows:
(3)
Among them:
After is obtained, it is substituted into the differential equation to obtain the cumulative prediction sequence , where is the initial value:
(4)
After calculating the predicted value of the model according to formula (4), the predicted value can be restored by subtraction,
(5)

In the above modeling process, a is called the development coefficient of the grey system, reflecting the development trend of ; u is called grey action, reflecting the change relationship between data.

Unbiased grey GM (1,1) model

If the original data series () presents an exponential trend, a new series is generated by an accumulation:
(6)
According to the traditional GM (1,1) method, new model parameters can be obtained:
(7)
The parameters and of traditional GM (1,1) model can be obtained from formula (7), and then the parameters b and A of the unbiased grey GM (1,1) model expressed by them are as follows:
(8)
Then the fitting model of the original data series of unbiased grey GM (1,1) model is as follows:
(9)

Compared with the traditional GM (1,1) model, the unbiased GM (1,1) model basically eliminates the inherent deviation of the traditional GM (1,1) model, and its application scope is wider than the traditional GM (1,1) model. In addition, the unbiased GM (1,1) model does not need to be reduced, which simplifies the modeling steps and improves the calculation speed of the model.

Sliding unbiased grey GM (1,1) model

In order to reduce the impact of large fluctuation of individual data on the prediction effect, the original data can be processed by a weighted moving average, and then the unbiased grey model is established, which is derived into the sliding unbiased grey model.

Suppose that the original sequence is:
(10)
A new sequence is obtained by weighted average:
(11)
Among them:
Then according to the calculation process of the unbiased grey model, the original data series fitting model is generated:
(12)

Markov model

Because the traditional grey model is mainly suitable for system objects with short prediction time, fewer data and slight fluctuation, the prediction accuracy is low for the data series with significant random fluctuation. The Markov model is a mathematical method to predict the future state of things by studying the initial probability of different states and the transition probability between states. When the prediction sequence has the characteristics of the Markov chain stationary process and other mean value, the transition probability can reflect the influence degree of random factors, which makes up for the limitations of grey prediction. In this way, the Markov model, based on stochastic process theory, provides a new prediction solution for the series, with a larger fluctuation range.

The Markov model can be expressed as follows:
(13)
where: is the state probability vector at the initial time; is the state probability vector after n times; is the one-step state transition probability matrix. The specific steps of the Markov model are as follows:
  • (1)

    The deviation between the predicted data of the original model and the actual data is calculated. According to the mean square deviation grouping method of deviation samples, the classification standard is established and the state space of the Markov chain is determined. Suppose we get m possible states, namely ,,,, then the i-th state interval , where and represent the upper and lower bounds of states respectively.

  • (2)

    The Mahalanobis property is tested. According to probability theory, statistic obeys distribution with degree of freedom . If the statistics think that the sequence has Markov property, the Markov model can be used to modify the prediction of residual sequence.

  • (3)
    The autocorrelation coefficient of each order of the index value of the sequence is calculated:
    (14)
  • Formulas (15 and 16) are normalized to obtain the Markov chain weight:
    (15)
  • (4)

    The construction of a state transition matrix. The possibility of one-step transition from state to the next state is state transition probability . The probability of state transition is usually calculated by the principle that frequency is approximately equal to probability, i.e. : where is the total number of occurrences of state i and is the number of times that state transfers to state . According to the state value of the data, the transition probability matrix of the k-step is calculated: .

  • (5)

    Based on the state transition probability matrix, the weighted sum of the prediction probabilities in the same state is taken as the prediction probability of the current state. The state with the maximum probability is selected as the final state interval.

  • (6)

    The Markov prediction is calculated. The original prediction value is , and the change interval of residual prediction value is . The middle value of this interval can be used as the final prediction result at the time; that is, the final prediction value of the model is .

Model validation

The methods to test the accuracy of the model generally include mean relative error test, residual test, correlation test and posterior variance test. In this paper, the average relative error test and posterior variance test are used to test the prediction results of the research model. The calculation formula of relative error is as follows:
(16)
The calculation formula of average relative error is as follows:
(17)

Posteriori test is based on the statistical situation between the predicted value and the actual value of the model, which is transplanted from the probability prediction method. Based on the residual value, according to the absolute value of the residual error in each period, the probability of the occurrence of the point with smaller residual error and the size of the index related to the prediction error variance are investigated; that is, the mean square error ratio C and the small error probability P are jointly evaluated and calculated. The process is as follows:

  • (1)

    Calculate the mean and variance of the original sequence:

  • If the original sequence is and the predicted value sequence is , then
    (18)
    (19)
  • (2)
    The mean and variance of residual error were calculated:
    (20)
    (21)
    (22)
  • (3)
    The posterior variance ratio C and the minimum probability error are calculated:
    (23)
    (24)

The smaller the index C is, the better. It shows that although the historical data is very discrete, the residual value obtained by the model is not very discrete; that is, the closer the model is to the actual value. The larger the index P is, the better. It shows that there are more points where the difference between the residual and the average residual value is less than the given value; that is, the more uniform the distribution of the predicted value is. According to C and P, the accuracy of the prediction model can be comprehensively evaluated, as shown in Table 1.

Table 1

Grade evaluation of grey GM (1,1) prediction model

Model levelLevel oneLevel twoLevel threeLevel four
Small error probability     
Posterior error ratio     
Forecast level good qualified barely unqualified 
Model levelLevel oneLevel twoLevel threeLevel four
Small error probability     
Posterior error ratio     
Forecast level good qualified barely unqualified 
Table 2

Annual water consumption of residents in Shaanxi Province from 2003 to 2019

YearWater consumptionYearWater consumptionYearWater consumptionYearWater consumption
2003 11.28 2008 14.0 2012 14.75 2016 16.4 
2004 12.6 2009 14.8 2013 15.12 2017 17.0 
2005 12.97 2010 14.83 2014 15.4 2018 17.4 
2006 13.27 2011 14.83 2015 16.1 2019 18.1 
2007 13.55       
YearWater consumptionYearWater consumptionYearWater consumptionYearWater consumption
2003 11.28 2008 14.0 2012 14.75 2016 16.4 
2004 12.6 2009 14.8 2013 15.12 2017 17.0 
2005 12.97 2010 14.83 2014 15.4 2018 17.4 
2006 13.27 2011 14.83 2015 16.1 2019 18.1 
2007 13.55       

unit: 100million cu.m.

Research ideas

Therefore, this study combines the grey prediction model and Markov model to improve the prediction accuracy of GM (1,1) model and reduce the requirements of the model on the stability of the sequence. The algorithm steps can be described as follows:

  • (1)

    The sliding unbiased GM (1,1) model of annual water consumption series in Shaanxi Province is constructed by using the grey modeling method. The prediction series of the original series are obtained by using the model.

  • (2)

    The residual sequence of the model is obtained from the difference between the absolute practical water quantity and the prediction sequence. It is divided into different state intervals according to the mean and standard deviation to obtain the state sequence.

  • (3)

    The state sequence is transformed into a probability transition matrix, the autocorrelation coefficient of the sequence is calculated, and then the weight sequence is obtained. According to the transition probability matrix and weight coefficient sequence of each order, the influence probability of the state over the years on the state of the forecast year is determined.

  • (4)

    According to the state transition probability and weight coefficient, the water consumption in the N + 1 year is calculated and predicted. The water consumption data is added to the original data series. The data of the first year in the series is removed to form a grey model with the same metabolic calculation dimension.

  • (5)

    Repeat steps (1) to (4) to obtain the annual water consumption forecast data of the following year.

In conclusion, the flow chart of annual domestic water consumption prediction algorithm based on a sliding unbiased grey-Markov model is shown in Figure 1.

Figure 1

Flow chart of grey-Markov model prediction.

Figure 1

Flow chart of grey-Markov model prediction.

Close modal

Study area and data introduction

Shaanxi Province is located in the hinterland of Northwest China, across the Yellow River and the Yangtze River. It is located between 105°29’ ∼ 111°15'E and 31°42’ ∼ 39°35'N, covering a total area of 205,800 square kilometers. It governs ten cities and one administrative region. The province is long in the South and narrow in the East and the West. There are more than ten secondary and tertiary rivers in the area. The annual average temperature is 7–16 °C, and the annual average precipitation is 686.41 mm. The rainfall is mainly concentrated from July to September, showing the distribution characteristics of more in the South and less in the north. The annual average evaporation is about 1,608 mm, belonging to the continental monsoon climate. The inner drainage system is mainly distributed in the sandy grassland area of Northern Shaanxi, accounting for 2.3% of the province's total area, and the outer drainage system accounts for 97.7% of the total area of the province. Using the data of annual water consumption of residents in Shaanxi Province from 2003 to 2019 (shown in Table 2), the average annual water consumption is 1.485 billion cubic metres. The data of this paper comes from the China National Statistical Yearbook, in which the annual water consumption data from 2003 to 2016 are the original calculation data of the model, and the water consumption data from 2017 to 2019 are the prediction and analysis data of the model. The data analysis software is Matlab R2013b.

Comparison of simulation results of three grey simulation methods

  • (1)
    In this study, using the annual water consumption data of Shaanxi Province from 2003 to 2019, the cumulative generation sequence is established according to formula (1). In order to obtain the first order linear differential equation of Equation (2), the development coefficient and grey action of GM (1,1) model are obtained by using the least squares principle; according to formula (4), the accumulated prediction sequence is obtained, and then the prediction model of GM (1,1) is obtained by subtraction.
    (25)
  • (2)
    It is observed that the annual water consumption data of residents in Shaanxi Province shows an exponential trend, so a new cumulative sequence is generated according to formula (6). Then, the parameters and are obtained by using the traditional GM (1,1) model; Using the above two parameters' data and calculating the parameters and according to formula (8), the unbiased GM (1,1) model is obtained by substituting them into formula (9):
    (26)
  • (3)
    Through formula (11), a new series is obtained by weighted moving average processing of annual water consumption data of residents in Shaanxi Province; similarly, according to the processing method of unbiased grey GM (1,1) model in step (2), a sliding unbiased grey GM (1,1) model is obtained:
    (27)

The comparison between the predicted data and the real values calculated by the functional equation in 2017–2019 is shown in Table 3.

Table 3

Simulation results of three grey models

YearActual annual water consumptionTraditional grey model
Unbiased grey model
Sliding unbiased grey model
Predicted valueRelative errorPredicted valueRelative errorPredicted valueRelative error
2017 17.0 16.6502 2.06% 16.6856  1.85% 16.6882  1.83% 
2018 17.4 16.9877 2.37% 17.0238  2.16% 17.0327  2.11% 
2019 18.1 17.3320 4.24% 17.3689 4.04% 17.3843 3.95% 
Average relative error  2.89% 2.68% 2.63% 
YearActual annual water consumptionTraditional grey model
Unbiased grey model
Sliding unbiased grey model
Predicted valueRelative errorPredicted valueRelative errorPredicted valueRelative error
2017 17.0 16.6502 2.06% 16.6856  1.85% 16.6882  1.83% 
2018 17.4 16.9877 2.37% 17.0238  2.16% 17.0327  2.11% 
2019 18.1 17.3320 4.24% 17.3689 4.04% 17.3843 3.95% 
Average relative error  2.89% 2.68% 2.63% 

unit: 100million cu.m.

At the same time, the simulation and prediction results of the three grey simulation methods are compared with the actual annual water consumption (Figure 2).

Figure 2

Comparison of different grey models and actual annual water consumption.

Figure 2

Comparison of different grey models and actual annual water consumption.

Close modal

It can be seen from Figure 2 and Table 3 that the accuracy of the three grey simulation methods is equivalent, the simulation effect of the improved grey simulation method is improved to a certain extent, and the performance of the sliding unbiased grey model is better. Although the two improved grey methods make the simulation effect of the model improved to a certain extent, they still can not overcome the discomfort of the grey model for nonlinear data. In order to further improve the simulation accuracy, the sliding unbiased grey model is modified by Markov, and the sliding unbiased grey-Markov model is established.

Modify the sliding unbiased grey model with the Markov model

In order to further reduce the calculation error of the model, the Markov model is used to modify the prediction sequence data of the sliding unbiased grey model. Firstly, the error state is divided. According to the prediction data and actual water consumption of the sliding unbiased grey model, the relative error of different years is calculated, as shown in Table 4.

Table 4

Prediction sequence and relative error of sliding unbiased grey model for annual domestic water consumption in Shaanxi Province from 2003 to 2014

YearWater consumptionPredicted valueRelative errorYearWater consumptionPredicted valueRelative error
2003 11.28 11.6100 −2.93% 2010 14.83 14.4640 2.47% 
2004 12.6 12.7951 −1.55% 2011 14.83 14.7626 0.45% 
2005 12.97 13.0592 −0.69% 2012 14.75 15.0674 −2.15% 
2006 13.27 13.3288 −0.44% 2013 15.12 15.3784 −1.71% 
2007 13.55 13.6040 −0.40% 2014 15.4 15.6959 −1.92% 
2008 14.0 13.8848 0.82% 2015 16.1 16.0199 0.50% 
2009 14.8 14.1714 4.25% 2016 16.4 16.3506 0.30% 
YearWater consumptionPredicted valueRelative errorYearWater consumptionPredicted valueRelative error
2003 11.28 11.6100 −2.93% 2010 14.83 14.4640 2.47% 
2004 12.6 12.7951 −1.55% 2011 14.83 14.7626 0.45% 
2005 12.97 13.0592 −0.69% 2012 14.75 15.0674 −2.15% 
2006 13.27 13.3288 −0.44% 2013 15.12 15.3784 −1.71% 
2007 13.55 13.6040 −0.40% 2014 15.4 15.6959 −1.92% 
2008 14.0 13.8848 0.82% 2015 16.1 16.0199 0.50% 
2009 14.8 14.1714 4.25% 2016 16.4 16.3506 0.30% 

According to the relative error between the simulated data of the sliding unbiased grey model from 2003 to 2016 and the actual water consumption, taking , , and as the segmentation points, it is divided into five different state intervals, namely state 1: ; state 2: ; state 3: ; state 4: ; state 5: .

According to the relative error of the state sequence, the state transition frequency matrix is obtained, and the element represents the frequency (i.e. times) of the state sequence from state i to state j. The first transition probability matrix of the simulated data of the sliding unbiased grey model is obtained from the interval, where is the probability of the occurrence of a certain state transition frequency, and m is the number of state intervals.

From step 2, the statistical value of was 26.48, and given the significance level , the statistical value of was obtained. Because of , the residual sequence satisfies the Markov property.

The autocorrelation coefficients and weights of each order of residual sequence are calculated according to formulas (14), (16) and (15), (17), as shown in Table 5.

Table 5
     
0.575838 0.097742 −0.30407 −0.51637 −0.49718 
     
0.289191 0.049087 0.152705 0.259326 0.249691 
     
0.575838 0.097742 −0.30407 −0.51637 −0.49718 
     
0.289191 0.049087 0.152705 0.259326 0.249691 
It is known that the one-step transition probability matrix is and the n-step transition probability matrix is . By matrix product calculation, the transition probability matrix of each step is ,,,.

According to the historical data, combined with the transfer probability matrix, the annual domestic water consumption from 2016 to 2019 is predicted.

According to Table 6, the maximum weighted sum is 0.4405, and the corresponding state is 5; that is, the state of the 2017 residual sequence is 5, and the interval is . Take an interval length as the prediction value, that is, , and add it to the original prediction sequence, and the prediction value after Markov correction is 16.7318.

Table 6

Residual prediction of domestic water consumption in Shaanxi Province in 2017

YearStateStepWeight coefficientState 1State 2State 3State 4State 5
2016 0.2892 0.0000 0.0000 0.0000 0.0000 1.0000 
2015 0.0491 0.1250 0.2500 0.2500 0.0833 0.2917 
2014 0.1527 0.1944 0.1667 0.3889 0.0556 0.1944 
2013 0.2593 0.1944 0.1667 0.3889 0.0556 0.1944 
2012 0.2497 0.1597 0.2176 0.3194 0.0756 0.2276 
Weighted summation    0.1261 0.1353 0.2523 0.0459 0.4405 
YearStateStepWeight coefficientState 1State 2State 3State 4State 5
2016 0.2892 0.0000 0.0000 0.0000 0.0000 1.0000 
2015 0.0491 0.1250 0.2500 0.2500 0.0833 0.2917 
2014 0.1527 0.1944 0.1667 0.3889 0.0556 0.1944 
2013 0.2593 0.1944 0.1667 0.3889 0.0556 0.1944 
2012 0.2497 0.1597 0.2176 0.3194 0.0756 0.2276 
Weighted summation    0.1261 0.1353 0.2523 0.0459 0.4405 

The model adopts the equal dimension information processing method, removes the data of the previous 2003, adds the newly predicted data of 2016, and calculates the data of the following year. The simulation data obtained are compared with the original data (Table 7). It can be seen from the above analysis results that the prediction accuracy of the dynamic unbiased grey-Markov model is high.

Table 7

Simulation results of three grey models, unit: 100 million cu.m

YearActual annual water consumptionUnbiased grey model
Moving unbiased GM (1,1) - Markov model
Accuracy improvement
Predicted valueRelative errorPredicted valueRelative error
2017 17.0 16.6882 1.83% 16.7100 1.71% 6.56% 
2018 17.4 17.0327 2.11% 17.0545 1.99% 5.69% 
2019 18.1 17.3843 3.95% 17.4062 3.83% 3.04% 
YearActual annual water consumptionUnbiased grey model
Moving unbiased GM (1,1) - Markov model
Accuracy improvement
Predicted valueRelative errorPredicted valueRelative error
2017 17.0 16.6882 1.83% 16.7100 1.71% 6.56% 
2018 17.4 17.0327 2.11% 17.0545 1.99% 5.69% 
2019 18.1 17.3843 3.95% 17.4062 3.83% 3.04% 

Forecast of future domestic water consumption

Using the state transition probability matrix and state division table, the annual domestic water consumption data of Shaanxi Province from 2020 to 2025 can be predicted as shown in Table 8.

Table 8

Predicted annual domestic water consumption of Shaanxi Province in 2020–2025

Year202020212022202320242025
Estimate 17.765 18.1313 18.5052 18.8868 19.2762 19.6737 
Year202020212022202320242025
Estimate 17.765 18.1313 18.5052 18.8868 19.2762 19.6737 

It can also be seen from Table 8 that the domestic water consumption in Shaanxi Province will maintain a sustained growth from 2020 to 2025, with an average annual growth of 2.06%. Therefore, Shaanxi provincial government should continue to strengthen the construction of a water-saving society. This paper puts forward the following suggestions on domestic water policy:

  • (1)

    Improve the supervision mechanism of domestic water and establish a water resources management system in line with the actual situation of Shaanxi Province.

  • (2)

    The government should establish a complete and unified water price system, give full play to the leverage of water price, and limit the behaviour of taking water at will and wasting water resource.

  • (3)

    It is necessary to promote scientific and technological innovation, improve the recycling technology of water resources in the whole province, and improve the urban domestic water recovery rate.

  • (4)

    Strengthen water-saving education and publicity, and raise the awareness of the whole people's water-saving.

The paper focuses on the estimation of the domestic water consumption in Shaanxi Province of China. In this study, taking the domestic water series data of Shaanxi Province from 2003 to 2019 as the research object, different grey models are established for modeling and comparative analysis, and the sliding unbiased grey model is modified by Markov to improve the accuracy of the model simulation data. Markov prediction has the property of ‘no aftereffect’; that is, when the state of the system at a particular time is known, the state of the system in the future is only related to the current time and has nothing to do with the past time. In short-term and medium-term forecasting of water consumption, the positive and negative of the residual sign is consistent with the hypothesis, which reflects the randomness and no aftereffect of the residual sign. Therefore, it is reasonable to use Markov to predict the sign of residual in this study. Especially when the water consumption of that year has nonlinear fluctuation, the improved grey Markov model can better fit the original data series, and the accuracy is improved more obviously, which also reflects the necessity of Markov correction of the grey model. In addition, in the process of building a grey-Markov model, the method of equal dimension information is used to update the original data in real time, so as to improve the accuracy of the model. Nevertheless, from the simulation results, the simulation accuracy of some years is still not high, GM (1,1), and unbiased grey model method simulation accuracy is limited (Rathnayaka et al. 2016). Next, although the model needs less information, the operation steps are relatively cumbersome, and it is not suitable for long-term domestic water consumption prediction. On the other hand, the partition of the Markov model is the key to determine the simulation effect of the model. In theory, the more detailed the partition is, the better. However, in practice, it is found that too dense a partition will make it difficult to select new data intervals, which increases the error. Therefore, reasonable division and judgment of interval is a direction worthy of attention and improvement in the future.

This paper was supported by the Open Research Fund of State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research (grant No. IWHR-SKL-201905).

All relevant data are included in the paper or its Supplementary Information.

Bai
Y.
,
Wang
P.
,
Li
C.
,
Xie
J.
&
Wang
Y.
2015
Dynamic forecast of daily urban water consumption using a variable-structure support vector regression model
.
Journal of Water Resources Planning and Management
141
(
3
),
4014058
.
Brentan
B. M.
,
Meirelles
G. L.
,
Manzi
D.
&
Luvizotto
E.
2018
Water demand time series generation for distribution network modeling and water demand forecasting
.
Urban Water Journal
15
(
2
),
150
158
.
Chen
M.
,
Luo
Y.
,
Shen
Y.
,
Han
Z.
&
Cui
Y.
2020
Driving force analysis of irrigation water consumption using principal component regression analysis
.
Agricultural Water Management
234
,
106089
.
Cui
J.
,
Liu
S.
,
Zeng
B.
&
Xie
N.
2013
A novel grey forecasting model and its optimization
.
Applied Mathematical Modelling
37
(
6
),
4399
4406
.
Deng
W.
,
Liu
H.
,
Xu
J.
,
Zhao
H.
&
Song
Y.
2020a
An improved quantum-inspired differential evolution algorithm for deep belief network
.
IEEE Transactions on Instrumentation and Measurement
69
(
10
),
7319
7327
.
Deng
W.
,
Xu
J.
,
Song
Y.
&
Zhao
H.
2020b
An effective improved co-evolution ant colony optimisation algorithm with multi-strategies and its application
.
International Journal of Bio-Inspired Computation
16
(
3
),
158
170
.
Duerr
I.
,
Merrill
H. R.
,
Wang
C.
,
Bai
R.
,
Boyer
M.
,
Dukes
M. D.
&
Bliznyuk
N.
2018
Forecasting urban household water demand with statistical and machine learning methods using large space-time data: a comparative study
.
Environmental Modelling and Software
102
,
29
38
.
Firat
M.
,
Turan
M. E.
&
Yurdusev
M. A.
2010
Comparative analysis of neural network techniques for predicting water consumption time series
.
Journal of Hydrology
384
(
1-2
),
46
51
.
Gong
X.
,
Cardenas-Barrera
J. L.
,
Castillo-Guerra
E.
,
Cao
B.
,
Saleh
S. A.
&
Chang
L.
2019
Bottom-up load forecasting with Markov-based error reduction method for aggregated domestic electric water heaters
.
IEEE Transactions on Industry Applications
55
(
6
),
6401
6413
.
Horne
J.
2013
Economic approaches to water management in Australia
.
International Journal of Water Resources Development
29
(
4
),
526
543
.
Ji
Z.
,
Wang
Z.
,
Bao
X.
,
Wang
X.
&
Wu
T.
2018
Research on water resources optimal scheduling problem based on parallel biological computing
.
Desalination and Water Treatment
111
,
88
93
.
Rathnayaka
R. M. K. T.
,
Seneviratna
D. M. K. N.
,
Jianguo
W.
&
Arumawadu
H. I.
2016
An unbiased GM (1,1)-based new hybrid approach for time series forecasting
.
Grey Systems: Theory and Application
6
(
3
),
322
340
.
Ren
X.
,
Wang
X.
,
Wang
Z.
&
Wu
T.
2020
Parallel DNA algorithms of generalized traveling salesman problem-based bioinspired computing model
.
International Journal of Computational Intelligence Systems
14
(
1
),
228
237
.
Villarin
M. C.
&
Rodriguez-Galiano
V. F.
2019
Machine learning for modeling water demand
.
Journal of Water Resources Planning and Management
145
(
5
),
4019017
.
Walker
D. J.
,
Creaco
E.
,
Vamvakeridou-Lyroudia
L. S.
,
Farmani
R.
,
Kapelan
Z.
&
Savić
D.
2015
Forecasting domestic water consumption from smart meter readings using statistical methods and artificial neural networks
.
Procedia Engineering
119
,
1419
1428
.
Wang
Z.
,
Bao
X.
&
Wu
T.
2021a
A parallel bioinspired algorithm for Chinese postman problem based on molecular computing
.
Computational Intelligence and Neuroscience
2021
,
1
13
.
Wang
Z.
,
Wang
D.
,
Bao
X.
&
Wu
T.
2021b
A parallel biological computing algorithm to solve the vertex coloring problem with polynomial time complexity
.
Journal of Intelligent and Fuzzy Systems
3
,
1
11
.
Zubaidi
S. L.
,
Dooley
J.
,
Alkhaddar
R. M.
,
Abdellatif
M.
,
Al-Bugharbee
H.
&
Ortega-Martorell
S.
2018
A novel approach for predicting monthly water demand by combining singular spectrum analysis with neural networks
.
Journal of Hydrology
561
,
136
145
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY-NC-ND 4.0), which permits copying and redistribution for non-commercial purposes with no derivatives, provided the original work is properly cited (http://creativecommons.org/licenses/by-nc-nd/4.0/)