Real-time detection of water level outliers is critical for real-time regulation of gates or pump stations in open-channel water transfer projects. However, this remains a challenging task because of the lack of definition of water level outliers and the imbalance of flow monitoring data. In this study, we define the water level outliers and then propose a highly accurate outlier index for real-time detection of water level outliers based on the water level-flow relationship, and the thresholds for water level outliers are determined based on the order of magnitude of flow and water level differences. A case study is performed with the South-to-North Water Diversion Project of China. A random noise is added to 15 randomly selected non-adjacent monitoring datasets to verify the accuracy of the index, and the noise is increased from 4 to 9 cm at a step of 1 cm. The results show that a total of 159 outliers are detected out of 180 outliers with an accuracy rate of 88.3%.

  • An outlier index is proposed based on the water level-flow relationship.

  • The definition of water level outliers in open-channel water transfer projects is proposed.

  • A case study shows that the proposed method can effectively identify outliers in real time.

Cross-basin water transfer projects are intended to alleviate water shortage and optimize water allocation, and such projects are complex systems characterized by long-distance water transmission with the help of various hydraulic structures such as gates and pump stations. In order to solve the problem of slope damage caused by water pressure difference, the water levels in front of pump stations or gates should be kept as stable as possible (Clemmens et al. 2001). Therefore, there is a need for accurate real-time monitoring of water levels in front of pump stations or gates. As the presence of outliers can considerably reduce the quality of the datasets, effective methods are needed to detect these outliers in real time (Boiten 2008; Herschy 2014). Many hydrodynamic models and statistical methods are proposed to offer a potential solution but are not particularly useful in a real-time setting, especially for long-distance open-channel water transfer projects.

The one-dimensional hydrodynamic model based on Saint-Venant equations (Yi et al. 2017; Zhu et al. 2021) can detect outliers in water level datasets by simulating water movement. The Saint-Venant equations consist of a continuity equation and a motion equation, where the continuity equation requires the balance between the inflow and outflow of the same channel. However, this is difficult to achieve for long-distance open-channel water transfer projects whose channels are often tens of kilometers long because of abnormal operation of equipment, monitoring errors, rainfall, leakage, and even flow inversion (when the water level is stable, the downstream flow monitoring data points are larger than the upstream ones for a long time), which makes the equation set unsolvable. For this reason, the model is not applicable to real-time detection of outliers in water level datasets.

Statistically, the distribution-based 3-sigma method (Le et al. 2013; Hwang et al. 2016, 2019; Chen & Liao 2020) and the quantile-based box-plot method (Hubert & Vandervieren 2008; Zhao & Yang 2019) proposed by Tukey (1977) have been widely used to detect outliers in water level datasets. Nevertheless, these two methods do not apply to open-channel water transfer projects because of the masking effect (Rousseeuw & Hubert 2011) and the swamping effect (Rousseeuw & Hubert 2011). The masking effect means that the fitted model cannot detect the deviating observations, and the swamping effect means that some data points are incorrectly identified as outliers due to the presence of another good subset. It is apparent that the former effect may lead to false-negative misdiagnosis (missing of outliers), while the latter effect may lead to false-positive misdiagnosis (the outliers identified are actually not outliers). Such misdiagnoses are attributed to the neglect of the hydraulic correlation between water level changes and flow changes in statistical methods. Because of changes in the water level of open-channel water transfer projects (i.e., continuous short- or long-term changes in adjacent water levels in response to changes in operational requirement or scheduling objective), it is difficult to determine the length and method parameters (e.g., sigma estimation) for detection of outliers. Hence, statistical methods are not applicable to real-time detection of outliers in water level datasets of water transfer projects.

The inconsistency of the definition of water level outliers and the imbalance of flow in water level datasets make real-time detection of water level outliers extremely difficult. Outliers can be defined in different ways (Hawkins 1980; Barnett & Lewis 1994; Chandola et al. 2009; Rousseeuw & Hubert 2011; Liu et al. 2012), and no consensus is reached for open-channel water transfer projects due to the complexity of water level changes. Very often, datasets are reviewed manually at regular intervals, which is labor-intensive and inefficient and easily causes false or incomplete identification. It is clear that changes in the water levels of a given channel are affected not only by changes in inflow but also by changes in outflow. However, the monitoring of the inflow and outflow of the channel is subject to a variety of uncertainties, resulting in long-term water imbalance in the datasets. Therefore, the key for real-time detection of water level outliers is to process the flow datasets under the influence of uncertainties in an easier and quicker way. Based on the water level-flow relationship, we propose for the first time a novel high-accuracy outlier index for the real-time detection of outliers in water level datasets, which contributes to improving the quality of the water level-flow datasets and provides high-precision data for hydrodynamic simulation, emergency warning, and data mining. Our major contributions are threefold:

  • 1.

    An outlier index is proposed based on the water level-flow relationship and the thresholds for water level outliers are determined based on the order of magnitude of flow and water level differences.

  • 2.

    The definition of water level outliers for open-channel water transfer projects is proposed.

  • 3.

    A case study is presented to demonstrate the effectiveness of the outlier index in the real-time identification of outliers for open-channel water transfer projects.

The paper is organized as follows. Section 2 describes the outlier index and its definition and derivation process; Section 3 presents the study area and the method for outlier detection; Section 4 describes and analyzes the results; Section 5 presents the conclusions.

Definition of water level outliers

How the water level outliers are defined is closely related to the accuracy of the detection method. As water levels are generally measured using a water level ruler with a deviation of about 3–5 cm due to the influence of various factors, the measured water levels with a deviation of more than 3 cm from the reading of the ruler are defined as an outlier in this study. However, the manual reading of the water ruler is difficult to obtain, and the simulated value of the water level from the one-dimensional hydrodynamic model (see Section 3.2) is chosen to replace the reading of the water ruler in this paper.

Classification based on the order of magnitude

The first-order difference (FOD), a special case of the generalized difference method, is used in this study to calculate the changes in the monitoring data at adjacent moments in order to convert a non-smooth dataset into a smooth one (Thanh et al. 2022). This method is computationally fast and does not require high computer performance, which ensures real-time (minute level) detection of water level outliers. The FOD value can visually indicate the direction and magnitude of the change of adjacent data points (Shao et al. 2020). The calculation formula is as follows:
(1)
where is the FOD value, V is the type of raw datasets, and n is the sequence.

The water level ruler is generally accurate to the centimeter level. To make the analysis more practical, the raw datasets of water level and flow are processed before the first-order differential calculation, and the values of water level and flow are rounded to two decimals (0.01 m) and one decimal (0.1 m3/s), respectively. After FOD calculation, and are classified by the order of magnitude, where Z is the water level data point, m; and Q is the flow data point, m3/s. The classification results are shown in Table 1.

Table 1

Classification of FOD results of water level and flow

CategoryABCDE
  [0.1, 1) [1, 10) [10, 100) [100, 1,000) 
 [0.01, 0.1) [0.1, 1) [1, 10) [10, 100) [100, 1,000) 
Orders of magnitude − 2 − 1 
Number 0.01 0.1 10 100 
Scientific notation 10−2 10−1 100 101 102 
CategoryABCDE
  [0.1, 1) [1, 10) [10, 100) [100, 1,000) 
 [0.01, 0.1) [0.1, 1) [1, 10) [10, 100) [100, 1,000) 
Orders of magnitude − 2 − 1 
Number 0.01 0.1 10 100 
Scientific notation 10−2 10−1 100 101 102 

Derivation of the outlier index

Each channel can be viewed as a small reservoir consisting of an upstream sluice gate (or pump), a downstream sluice gate (or pump), and other hydraulic structures (Figure 1), and its storage capacity follows the water balance principle:
(2)
where V is the storage of channel i − 1, m3; t is the time, s; is the inflow at gate i − 1 (or pump i − 1), m3/s; is the inflow of channel i − 1 from other sources, m3/s; is the outflow at gate i (or pump i), m3/s; is the outflow through all outlets, m3/s and is the loss of channel i − 1, m3/s.
Figure 1

Schematic of channel structures and hydraulic elements. (1) Upstream and downstream gates. (2) Upstream pump station and downstream gate. (3) Upstream gate and downstream pump station. (4) Upstream and downstream pump stations.

Figure 1

Schematic of channel structures and hydraulic elements. (1) Upstream and downstream gates. (2) Upstream pump station and downstream gate. (3) Upstream gate and downstream pump station. (4) Upstream and downstream pump stations.

Close modal
Assume that channel i − 1 is an ideal body with the same rectangular section. Under ideal conditions (i.e., no head loss and accurate monitoring), the water level change is . Thus,
(3)
where is the water surface area at time t, m2, and other symbols are the same as above.
In reality, most open channels are prismatic shaped and have irregular sections of different shapes and sizes, and the water surface is sloped due to head loss. Thus, it is difficult to accurately determine the water surface area before and after the water level change. and are usually not available because they are difficult to monitor. Nevertheless, according to the characteristics of the order of magnitude of the data, the right part of Equation (3) can still be used to detect water level outliers. For channel i − 1, when is not equal to 0, the ratio () of the expected change in water volume to the change in measured water level is obtained:
(4)
where is the outlier index, m2/s; and other symbols are the same as above.
For an open-channel water transfer project, there is an error for the flow difference in Equation (4) due to the unavailability of and . Even under stable water level conditions, the flow difference fluctuates by an order of magnitude of −1 to 1. To reduce the fluctuation, the difference between inflow and outflow at the current moment can be used as the basis to calculate the flow difference at the next moment. Thus, the calculation of the outlier index should begin when the water level is stable. Assume that the flow difference at the current moment () and the inflow and outflow of the open channel at the next moment are changed by a, b, and c, respectively, then
(5)
Hence, the outlier index is calculated as follows:
(6)

The range of is determined depending on the water transfer project. For open-channel water transfer projects, the flow difference is usually several orders of magnitude higher than that of the water level difference. In general, belongs to class A or B. If belongs to class A, then belongs to class B or C. In this case, the corresponding range of is 10–90 or 100–900, respectively; similarly, if belongs to class B, then belongs to class C or D, and the corresponding range of is 10–99 or 100–999, respectively. When is outside the above range, there will be an outlier in , but which one, or , is the outlier needs to be further determined.

The low-frequency (e.g., hourly) flow data need to be homogenized. For low-frequency flow datasets, the flow data points at adjacent moments may fluctuate substantially due to various causes such as equipment errors and gate or pump regulation. Under stable water level conditions, there would be a small change (<0.05 cm) in the water level at adjacent moments, and then the fluctuation range of the low-frequency flow data at adjacent moments has a large impact on the value. Therefore, the data at adjacent moments can be averaged to reduce the fluctuation and improve the identification of outliers.

Study area

China is rich in freshwater resources, but the per capita freshwater availability is low (Wang et al. 2023) due to the large population and uneven distribution of freshwater resources between north and south (Jing et al. 2019; Li et al. 2023). The Middle Route of the South-to-North Water Diversion Project (Figure 2) is constructed to divert water from Danjiangkou Reservoir to the north in order to alleviate the water shortage in northern China. The operation and scheduling of this project are perhaps the most difficult in the world because (1) there are more than 60 sluice gates (Ren et al. 2020) along the 1,277 km-long main route (Cheng et al. 2023) with strong coupling between cascaded channels; (2) the regulation capacity of the project is limited because of no water storage facilities along the route; and (3) the control requirements are exceptionally high. For instance, the variation of the water level should not exceed 15 cm within an hour and 30 cm within 24 h (Zhou et al. 2022). Thus, real-time detection of outliers is of great significance for the successful water transfer of the project.
Figure 2

The map of the Middle Route of the South-to-North Water Diversion Project.

Figure 2

The map of the Middle Route of the South-to-North Water Diversion Project.

Close modal

This project has been put into operation since 12 December 2014, and now a large amount of water level datasets is available under different climate and operating conditions. Due to security and privacy concerns, we do not have access to minute-level monitoring data. In this study, 15,570 rows of data are collected for each gate for the period 2017–2021 at 2-h time intervals.

The water levels before gate 4, FOD values, and flow differences in channel 3 in 2018 are shown in Figure 3, and the hydrodynamic features of gate 4 with outlets are shown in Table 2.
Table 2

The recorded hydrodynamic data of gate 4 in 2018

No.Date and timeWater level before the gate (m)Gate-hole 1Gate-hole 2Flow (m3/s)Outlet 1 (m3/s)Outlet 2 (m3/s)
2018-01-01 00:00:00 144.775 1,050 1,080 128.0543 
2018-01-01 02:00:00 144.7781 1,050 1,080 125.9459 
……… … … … … … … 
2018-07-01 02:00:00 144.7558 1,720 1,720 182.1611 
……… … … … … … … 
2018-12-25 14:00:00 144.6307 1,280 1,280 143.2842 
2018-12-25 16:00:00 144.6273 1,280 1,280 142.3666 
No.Date and timeWater level before the gate (m)Gate-hole 1Gate-hole 2Flow (m3/s)Outlet 1 (m3/s)Outlet 2 (m3/s)
2018-01-01 00:00:00 144.775 1,050 1,080 128.0543 
2018-01-01 02:00:00 144.7781 1,050 1,080 125.9459 
……… … … … … … … 
2018-07-01 02:00:00 144.7558 1,720 1,720 182.1611 
……… … … … … … … 
2018-12-25 14:00:00 144.6307 1,280 1,280 143.2842 
2018-12-25 16:00:00 144.6273 1,280 1,280 142.3666 

1Null values in the raw datasets are filled with zero.

No.: gates are sequentially numbered from upstream to downstream. Date and time: the date and time of each data record with a time interval of 2 h. Water level before the gate: the values are obtained automatically from the monitoring equipment and are the average of all the gate holes. There are 2 gate holes in gate 4. Outliers in the water level datasets before the gate would be detected. Gate-hole 1, Gate-hole 2: the opening sizes of each gate hole are obtained automatically from the monitoring equipment. Flow: the sum of the flow through each gate hole. The monitoring equipment is usually deployed in front of the gate. Outlet 1, Outlet 2: Outlets are located in channel 3 and the total value indicates the outflow of channel 3.

Figure 3

The water levels before gate 4 (a), FOD values (b), and flow differences in channel 3 (c) in 2018.

Figure 3

The water levels before gate 4 (a), FOD values (b), and flow differences in channel 3 (c) in 2018.

Close modal

Figure 3(a) shows that the overall variation of the water level (144.4–144.9 m) before the gate in 2018 is within 0.5 m. It is also noted that the fluctuation of the dataset becomes smoother after FOD calculation (b). In most instances, the changes in water level are mostly less than 0.05 m and symmetrical about 0, and the differences between inflow and outflow of channel 3 are between 0 and 10. This indicates that evaporation and leakage loss during water transfer should not be ignored and that the outlier index formula is reasonable. The 2-h frequency flow data need to be homogenized. In this study, the average of the flow at two adjacent moments is taken as the flow at the latter moment.

Method

In this section, the simulated results of the 1D hydrodynamic model are used as water ruler data to find the true values of the water level, and a random noise greater than 3 cm is added to some true values. Finally, the outlier index is used for the detection of outliers.

Saint-Venant equations are often used to describe the 1-D flow in open channels:
(7)
(8)
where B is the channel cross-sectional width, m; Z is the water level, m; t is the time, s; Q is the discharge, m3/s; x is the distance along the channel, m; q is the lateral inflow, m3/s; is the momentum correction coefficient; A is the wetted cross-sectional area, m2; g is the gravitational acceleration, m/s2; and is the friction slope, which can be calculated from the following equation:
(9)
where n is the Manning's roughness coefficient, and R is the hydraulic radius of the channel section, m.

In the hydrodynamic model, the water level results of gate 49 for 7 consecutive days (total 85 data points) are used as the upstream boundary condition, and the flow results of gate 50 for the same time series are used as the downstream boundary condition. The model output is the water level data in front of gate 50. The Preissmann four-point implicit difference scheme is used to discretize Equations (7) and (8). If the deviation between simulated and gauged data at the corresponding moment is within 3 cm, the 85 monitored data points are considered to be true values.

A random noise greater than 3 cm is added to 15 randomly selected non-adjacent data to construct the water level dataset with outliers. At each time, the noise of the same size is added to 15 data and the size is gradually increased from 4 to 9 cm at a step of 1 cm. After that, the outlier index is calculated.

The gauged data of channel 49 from 0:00 on 1 October 2017 to 0:00 on 8 October 2017 is selected for simulation, and the roughness of the channel is 0.0165. The simulated water levels before gate 50 and the differences between simulated and gauged water levels are shown in Figure 4. In order to more accurately describe the deviation between the simulated values of the 1D hydrodynamic model and the gauged values, three decimals (0.001 m) are retained for both values.
Figure 4

Simulated and gauged water levels before gate 50 and the differences between them.

Figure 4

Simulated and gauged water levels before gate 50 and the differences between them.

Close modal
Figure 5 shows the differences in the inflow and outflow of channel 49 before and after averaging.
Figure 5

The differences of inflow and outflow of channel 49 before and after averaging.

Figure 5

The differences of inflow and outflow of channel 49 before and after averaging.

Close modal

The 15 randomly selected non-adjacent water levels and other data are shown in Table 3. The addition of a noise will change the associated with the water level point by the same amount and in the opposite direction. The partial outlier indexes after adding noise are shown in Table 4.

Table 3

Fifteen water level points and other data

No.Serial No. (m3/s) (m) (m2/s) (m3/s) (m) (m2/s)
− 1.6 − 0.01 160 0.3 0.01 30 
12 0.1 − 0.01 − 10 0.2 
18 − 0.3 3.1 
23 − 0.3 0.8 0.01 80 
37 0.5 − 3.1 − 0.01 310 
44 − 0.2 − 0.1 0.01 − 10 
48 0.1 0.5 
51 − 0.1 0.01 − 10 − 0.3 
57 0.4 − 0.01 − 40 − 0.2 
10 61 0.01 0.3 
11 65 − 1 
12 70 − 0.3 0.01 − 30 − 0.3 
13 73 0.5 − 0.01 − 50 − 0.6 
14 77 3.6 − 0.1 
15 81 1.2 − 0.1 
No.Serial No. (m3/s) (m) (m2/s) (m3/s) (m) (m2/s)
− 1.6 − 0.01 160 0.3 0.01 30 
12 0.1 − 0.01 − 10 0.2 
18 − 0.3 3.1 
23 − 0.3 0.8 0.01 80 
37 0.5 − 3.1 − 0.01 310 
44 − 0.2 − 0.1 0.01 − 10 
48 0.1 0.5 
51 − 0.1 0.01 − 10 − 0.3 
57 0.4 − 0.01 − 40 − 0.2 
10 61 0.01 0.3 
11 65 − 1 
12 70 − 0.3 0.01 − 30 − 0.3 
13 73 0.5 − 0.01 − 50 − 0.6 
14 77 3.6 − 0.1 
15 81 1.2 − 0.1 
Table 4

Water level points with different noises and outlier indexes

No.Serial No. = 0.04 m
= −0.04 m
= 0.06 m
= −0.07 m
= 0.09 m
(m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s)
− 53 − 10 32 − 32 − 6 20 − 20 − 4 
12 − 5 − 2 − 3 − 1 − 2 
18 − 8 − 78 78 − 5 − 52 44 − 3 − 34 
23 − 8 − 27 16 − 5 − 16 10 − 3 − 10 
37 13 62 − 13 − 103 44 − 7 − 52 31 
44 − 5 − 2 − 3 − 1 − 2 
48 − 13 − 3 13 − 8 − 1 − 6 
51 − 2 − 8 − 1 − 4 − 1 
57 13 − 8 − 5 − 5 − 3 
10 61 − 8 − 5 − 3 
11 65 − 25 25 − 17 14 − 11 
12 70 − 6 10 − 8 − 4 − 4 − 3 
13 73 17 15 − 10 − 15 10 10 − 6 − 9 
14 77 90 − 90 − 3 60 − 51 − 1 40 
15 81 30 − 30 − 3 20 − 17 − 1 13 
No.Serial No. = 0.04 m
= −0.04 m
= 0.06 m
= −0.07 m
= 0.09 m
(m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s) (m2/s)
− 53 − 10 32 − 32 − 6 20 − 20 − 4 
12 − 5 − 2 − 3 − 1 − 2 
18 − 8 − 78 78 − 5 − 52 44 − 3 − 34 
23 − 8 − 27 16 − 5 − 16 10 − 3 − 10 
37 13 62 − 13 − 103 44 − 7 − 52 31 
44 − 5 − 2 − 3 − 1 − 2 
48 − 13 − 3 13 − 8 − 1 − 6 
51 − 2 − 8 − 1 − 4 − 1 
57 13 − 8 − 5 − 5 − 3 
10 61 − 8 − 5 − 3 
11 65 − 25 25 − 17 14 − 11 
12 70 − 6 10 − 8 − 4 − 4 − 3 
13 73 17 15 − 10 − 15 10 10 − 6 − 9 
14 77 90 − 90 − 3 60 − 51 − 1 40 
15 81 30 − 30 − 3 20 − 17 − 1 13 

Figure 4 shows that the gauged water level is very stable over the 7 days, and more than half of the data have an FOD value of 0. However, the simulated water level is much less stable, but all the deviations from the corresponding gauged data are within 0.03 m. It can be considered that all the gauged datasets are the true values of the water level. The deviation of the water level at adjacent moments is 0 or 0.01, which means that is very sensitive to the change of . Thus, it is necessary to homogenize Q. Figure 5 shows that when the water level is in a steady state, the fluctuation of the raw flow difference is relatively large, while the mean-shifting treatment can still retain the overall variation trend while reducing the fluctuation of the flow difference. Most of the flow difference data lie between −1 and 2, and only three data points (19th, 36th, and 77th) lie between 6 and 8, which are averaged into six points that lie between 3 and 5.

In Table 3, there are 13 numbers with values less than or equal to 0.3, which means that is more likely to be lower than 10 if the value with the noise is greater than 0.03. Therefore, small amplitude outliers can be easily detected in most data. However, the data like the 5th and 11th data (both values are relatively large, or one value is 0 and the other value is relatively large) are not sensitive to small amplitude outliers. The manual diagnosis is needed to determine whether it is a flow error or a water level error when the outlier index is greater than 90 or 990, or or is 0.

Based on Table 1 and the discussion at the end of Section 2.3, the outliers resulting from the addition of different random noises can be identified according to the outlier index or , as shown in Table 4. As the noise is increased from = 0.04 m (−0.04 m) to = 0.09 m (−0.09 m) at a step of 1 cm, a total of 180 outliers are involved. Of these 180 outliers, 159 are detected, 11 (12), 12 (13), 13 (14), 14 (14), 14 (14), and 14 (14), respectively. Since the 2-h dataset is not sufficient enough to characterize the regulation process of gates, and similar to the abnormal 11th data point in Figure 4, the manual diagnosis could not be performed and it is temporarily considered as a missed diagnosis. According to Equation (6) and Table 4, when is unchanged, the larger the deviation of the abnormal data, the closer the outlier index is to 0, and thus the random noise in Table 4 is representative. The total detection rate of outliers reaches 88.3%, and it increases with the increase of noise.

However, it should be noted that the outlier index may not identify outliers in any one of the following four situations:

  • (a)

    , and ;

  • (b)

    , and ;

  • (c)

    , and ;

  • (d)

    , and .

This study may contribute to the detection of water level outliers for open-channel water transfer projects. To our knowledge, this is the first study on the real-time detection of water level outliers for open-channel water transfer projects. The main findings are: (1) an outlier index is proposed based on water balance; (2) the water level outlier is defined for water transfer projects; and (3) a case study with the Middle Route of the South-to-North Water Diversion Project shows that of the 180 outliers with the variation greater than or equal to 0.04 m, 159 outliers can be detected with an accuracy rate of 88.3%.

The limitations of this study warrant further investigation. First, some well-established indicators are excluded due to the lack of high-frequency data. Second, an outlier can be detected in , but which one, or , is the outlier needs to be further studied. Finally, the outlier index still has units since water wave motion is not considered, and a dimensionless outlier index should be derived on the basis of momentum conservation in the future.

L.Z., X.L., and H.W. conceptualized the whole article; L.Z. and Z.Z. developed the methodology; Y.Q., L.Z., Z.H., and Z.Z. validated the article; L.Z. and Y.Q. rendered support in data curation; L.Z. wrote the original draft; L.Z. wrote the review and edited the article. All authors have read and agreed to the published version of the manuscript.

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data cannot be made publicly available; readers should contact the corresponding author for details.

The authors declare there is no conflict.

Barnett
V.
&
Lewis
T.
1994
Outliers in Statistical Data
.
Wiley
,
New York
.
Boiten
W.
2008
Hydrometry: Ihe Delft Lecture Note Series
.
Taylor &Francis
,
Leiden
,
The Netherlands
.
Chandola
V.
,
Banerjee
A.
&
Kumar
V.
2009
Anomaly detection: a survey
.
Acm Computing Surveys
41
,
1
58
.
Chen
J.
&
Liao
J.
2020
Monitoring lake level changes in China using multi-altimeter data (2016–2019)
.
Journal of Hydrology
590
,
125544
.
Clemmens
A. J.
,
Wahl
T. L.
,
Bos
M. G.
&
Replogle
J. A.
2001
Water Measurement with Flumes and Weirs
.
International Institute for Land Reclamation and Improvement
,
Wageningen
,
The Netherlands
.
Hawkins
D.
1980
Identification of Outliers
.
Chapman And Hall
,
London
.
Herschy
R. W.
2014
Streamflow Measurement
.
CRC Press
,
Boca Raton, FL
,
USA
.
Hubert
M.
&
Vandervieren
E.
2008
An adjusted boxplot for skewed distributions
.
Computational Statistics & Data Analysis
52
,
5186
5201
.
Hwang
C.
,
Cheng
Y.-S.
,
Yang
W.-H.
,
Zhang
G.
,
Huang
Y.-R.
,
Shen
W.-B.
&
Pan
Y.
2019
Lake level changes in the Tibetan plateau from Cryosat-2, Saral, Icesat, and Jason-2 Altimeters
.
Terrestrial, Atmospheric and Oceanic Sciences
30
,
33
50
.
Jing
L.
,
Zhenxin
B.
,
Cuishan
L.
,
Guoqing
W.
,
Yue
L.
,
Jie
W.
&
Xiaoxiang
G.
2019
Change law and cause analysis of water resources and water consumption in China in past 20 years
.
Journal of Hydro-Science and Engineering
31
,
31
41
.
Le
G.
,
Jingjuan
L.
&
Guozhuang
S.
2013
Monitoring lake-level changes in the Qinghai–Tibetan plateau using radar altimeter data (2002–2012)
.
Journal of Applied Remote Sensing
7
(
1
),
073470
.
Liu
F. T.
,
Ting
K. M.
&
Zhou
Z.-H.
2012
Isolation-based anomaly detection
.
Acm Transactions on Knowledge Discovery from Data
6
,
1
39
.
Rousseeuw
P. J.
&
Hubert
M.
2011
Robust statistics for outlier detection
.
Wires Data Mining and Knowledge Discovery
1
,
73
79
.
Shao
P.
,
Ye
F.
,
Liu
Z.
,
Wang
X.
,
Lu
M.
&
Mao
Y.
,
2020
Improving iForest for hydrological time series anomaly detection
. In:
Algorithms and Architectures for Parallel Processing
(
Qiu
M.
, ed.).
Springer International Publishing
,
Cham
, pp.
170
183
.
Thanh
H. V.
,
Binh
D. V.
,
Kantoush
S. A.
,
Nourani
V.
,
Saber
M.
,
Lee
K.-K.
&
Sumi
T.
2022
Reconstructing daily discharge in a megadelta using machine learning techniques. 58, E2021wr031048
.
Tukey
J. W.
1977
Exploratory Data Analysis
.
Addison-Wesley
,
Reading, MA
.
Wang
Q.
,
Zheng
G.
,
Li
J.
,
Huang
K.
,
Yu
Y.
&
Qu
S.
2023
Imbalance in the city-level crop water footprint aggravated regional inequality in China
.
Science of the Total Environment
867
,
161577
.
Yi
Y.
,
Tang
C.
,
Yang
Z.
,
Zhang
S.
&
Zhang
C.
2017
A one-dimensional hydrodynamic and water quality model for a water transfer project with multihydraulic structures
.
Mathematical Problems in Engineering
2017
,
2656191
.
Zhao
C.
&
Yang
J.
2019
A robust skewed boxplot for detecting outliers in rainfall observations in real-time flood forecasting
.
Advances in Meteorology
2019
,
1
7
.
Zhou
L.
,
Zhang
Z.
,
Zhang
W.
,
An
K.
,
Lei
X.
&
He
M.
2022
Real-time water level prediction in open channel water transfer projects based on time series similarity
.
Water
14
,
2070
.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits copying, adaptation and redistribution, provided the original work is properly cited (http://creativecommons.org/licenses/by/4.0/).