Next Article in Journal
Optimal Selection of Pumps As Turbines in Water Distribution Networks
Previous Article in Journal
Preliminary Results from Unsaturated Zone Studies in Unconfined Unconsolidated Coastal Aquifers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Hybrid Fuzzy—Probabilistic Analysis and Classification of the Hydrological Drought †

by
Mike Spiliotis
*,
Christoforos Papadopoulos
,
Panagiotis Angelidis
and
Basil Papadopoulos
Department of Civil Engineering, School of Engineering, Democritus University of Thrace, Kimmeria Campus, 67100 Xanthi, Greece
*
Author to whom correspondence should be addressed.
Presented at the 3rd EWaS International Conference on “Insights on the Water-Energy-Food Nexus”, Lefkada Island, Greece, 27–30 June 2018.
Proceedings 2018, 2(11), 643; https://doi.org/10.3390/proceedings2110643
Published: 31 July 2018
(This article belongs to the Proceedings of EWaS3 2018)

Abstract

:
The consideration of a theoretical probability distribution regarding the annual cumulative discharge will provide a significant opportunity to characterize the intensity of the hydrological drought. However, the matching between the observed probabilities and the adopted theoretical probability distribution can not be identical. Hence, in this work this matching is achieved by using a fuzzy regression based methodology and the attributes of the log-normal distribution. Finally, an ascending procedure to classify the intensity of hydrological drought is proposed and it is applied in case of the Evros River.

1. Introduction

Drought must be considered as a relative, rather than absolute condition. It occurs in both high and low rainfall [1]. In other words, drought occurs when the water availability is below the canonical values which very often are described by the mean value and the standard deviation. Several types of droughts exist, while in this work the hydrological drought is studied.
The Standardized Precipitation Index, known as SPI, seems to be the most widely-used compared with the existing simple indices to classify the drought events [2]. In brief, the computation of the SPI involves fitting a gamma probability density function and thereafter the cumulative probability distribution is transformed into the standard normal distribution to yield the SPI [2]. Hence, starting initially from a probabilistic approach, many standardized indices to drought (SPI [2], RDI [3], SDI [4] etc.) conclude to an index which in fact, it is the standardized normal variable Z.
In this work, the examined hydrological variable is the annual cumulative discharge. Firstly, a hybrid fuzzy probabilistic approach is proposed in order to improve the couple between the observed probabilities and the adopted theoretical probability distribution. Secondly, based on the widely-used standardized normal thresholds to drought, the corresponding (fuzzy) annual cumulative discharge thresholds are determined. Thirdly, the (crisp) observed cumulative annual discharge is compared with these fuzzy thresholds in order to classify the drought.

2. Proposed Methodology

2.1. Fundamentals of Fuzzy Sets and Logic

A fuzzy set A on a universe set X is a mapping A : X [ 0 , 1 ] , assigning to each element x X a degree of membership 0 A ( x ) 1 . The membership function A(x) is also defined as μ A ( x ) .
If A is a fuzzy set, by α-cuts a ( 0 , 1 ] we define the crisp sets [5] with the following property:
A [ α ] = { x X : A ( x ) α }
The 0-cut can be defined as follows:
A [ 0 ] = { x X : A ( x ) > 0 }
It is worth noting that by using the α-cut concept we can move from the fuzzy sets to the conventional crisp mathematical methodologies.
A special kind of fuzzy sets is the fuzzy numbers. In this work, fuzzy symmetrical triangular numbers are used which are special kinds of fuzzy numbers. The fuzzy symmetrical triangular numbers have the following membership function:
μ A ( x ) = { 1 | x a | w , i f a x x a + w 0 , o t h e r w i s e
in which a is the centre and w the spreads of the fuzzy number (Figure 1).

2.2. Utilization of the Observed Probabilities by Using Fuzzy Regression in Case of Log Normal Distribution

Let an historical sample. The rank order method involves ordering the data from the largest hydrological value to the smallest hydrological value, assigning a rank of 1 to the largest value and a rank of N to the smallest value. Based on the Weibull 1939 empirical distribution to compute the plotting position probabilities, the cumulative exceedance probability can be assessed as follow:
P ( Q q ) = m Ν + 1
Therefore, the cumulative probability of non exceedance probability can be determined as follow [6]:
P ( Q < q ) = 1 m Ν + 1
Let now the log-normal theoretical probability distribution. To simplify the procedure, it is well-known that in case that the log-normal distribution is well-fitted to the observations, then the normal distribution with log-transformed data can be used instead of the lognormal distribution; which simply means that log-transformed data are implemented instead of the raw data and hence, the new auxiliary variable y, is normally distributed:
ln q T j =   y T j
Subsequently, based on the standardized normal variable Z of the normal distribution it holds:
ln q T j = y T j = y ¯ + s y Ζ Τ , j = λ + ζ Ζ Τ , j
In which as λ, ζ state the mean value and the standard deviation of the log transformed variable y correspondingly.
The critical points of the proposed methodology are the following next:
  • Based on the observed probabilities, the standardized normal variable Z can be determined for each data.
  • Based on Equation (7), a fuzzy linear regression model is implemented in order to determine a fuzzy relationship between the natural log of the cumulative discharge and the normalized variable Z (which corresponds to a probability). In addition, based on the fuzzy regression procedure, a fuzzy estimation regarding the mean value and the standard deviation of the log-transformed sample is achieved simultaneously [7]:
  y ˜ T j = y ¯ ˜ + s ˜ y Ζ Τ , j = λ ˜ + ζ ˜ Ζ Τ , j
It should be clarified that since fuzzy symmetrical triangular numbers are selected as fuzzy coefficients hence, the mean value and the standard deviation are estimated as fuzzy symmetrical triangular numbers.
The uncertainty of the matching between the observed probabilities and the adopted theoretical probability distribution can be treated by using the fuzzy regression of Tanaka (1987) and hence, all the observed data will be included in the produced fuzzy band [8].
  • The suitability of the proposed model can be estimated based on the magnitude of the fuzziness and furthermore by using mathematical distance norms to deal with the comparison between the unbiased estimators and the fuzzy estimation of the mean value and the standard deviation.
Considering the model of fuzzy regression itself let us provide some details. Although the standardized normal variable Z (independent variable) and the value of random variable (dependent variable) of the historical sample takes only crisp values, the fuzziness arises from the inclusion constraints that is, from the requirement that all the data must be included in the produced fuzzy band. In other words this means that the fuzziness is generated from the (expected) no identical matching between the theoretical probability distribution (in this article the log normal distribution) and the observed probabilities.
According to the extension principle, in case of fuzzy symmetrical triangular numbers as coefficients, the function y ˜ j will be also a fuzzy triangular number with the following centre (ya,j) and width (wyj) [5,9]:
Centre : y a j = λ a + ζ a Ζ Τ , j
Width : w y j = w λ + w ζ | Ζ Τ , j |  
In which as y a j , λ a , ζ a state the central values of the corresponding variables and as w y j , w λ , w ζ the corresponding spreads are meant.
The concept of inclusion is used to express the inclusion constraints. Thus, the inclusion of a fuzzy set A to the fuzzy set B with the associated degree 0 h 1 is defined as follow:
A [ h ] B [ h ]
A physical interpretation of the level h is that an observation yj is contained in the support interval of the corresponding fuzzy estimate, which has a degree of membership greater than hj. The degree of fit of the estimated model to the entire data set is defined as the minimum of all these hj, which is denoted as h [10].
The produced fuzzy band will contain all the observed data:
y j [ y h , j L , y h , j R ]
By taking into account the fuzzy arithmetic, for a selected level h, the inclusion constraints, in case that the decision variables (fuzzy coefficients) are selected to be fuzzy symmetrical triangular numbers, are equivalent to (e.g., [9,11,12]):
{ Ι n c l u s i o n C o n s t r a int s ( λ a + ζ a Ζ Τ , j ) ( 1 h ) ( w λ + w ζ | Ζ Τ , j | ) = y h , j L ln q j ( λ a + ζ a Ζ Τ , j ) + ( 1 h ) ( w λ + w ζ | Ζ Τ , j | ) = y h , j R ln q j w λ , w ζ 0
Since the fuzzy regression model of Tanaka (1987) is transformed to a constrained optimization problem, the assessment of the suitability of the model is based on the produced fuzzy band. Therefore, a significant small fuzzy band indicates a proper approach. Thus, Tanaka (1987) suggested the minimization of the sum of the produced fuzzy semi-spreads for all the data:
min J ( = j = 1 M w y j = { M w λ + w ζ j = 1 M | Ζ Τ , j | } )   ( Objective   function )
where M is the number of the observed data.
Based on the inclusion constraints, the produced fuzzy band must include all the observed data and this is one of the main advantages of the implementation of the fuzzy regression.
Two criteria are proposed in order to check the suitability of the achieved solution. The first one is sum of the produced fuzzy semi-spreads, J which indicates the fuzziness of the model. The second criterion of suitability, F, which examines how close is the central values of the mean value and the standard deviation to the unbiased (usual statistical) estimation of the same variables for the ln- transformed sample, λ ^ , ζ ^ :
F = ( λ a λ ^ ) 2 + ( ζ a ζ ^ ) 2

2.3. Categorization to Hydrogical Drough Based on the Return Period

As aforementioned, the thresholds of Z are used to define the thresholds of several drought categories (Table 1). In this article, based on the produced fuzzy curve (Equation (8)), the corresponding cumulative annual discharges, which can be seen as thresholds, are determined. It should be clarified that since the log–normal distribution is used (instead of the Gamma distribution), the normalized variable Z corresponds to a log-transformed cumulative discharge. The thresholds of Table 1 correspond to a fuzzy log-transformed cumulative discharge (based on Equation (8)) which can be compared with the current real log-transformed value of the cumulative discharge. Therefore, in this article the (fuzzy) thresholds of the annual cumulative discharge, ln V ˜ k (considering the k threshold) are compared with the (crisp) observed annual cumulative discharge. Even if, there are many measures to compare fuzzy numbers, there are not all of them suitable to compare a fuzzy number with a crisp number. To address this problematic the reliability measure of Ganoulis, 2004 is adopted.
Let a system which has a resistance R ˜ and a load L ˜ as fuzzy numbers. A reliability measure or a safety margin of the system may be defined as being the difference between load and resistance. This is also a fuzzy number given by [13].
M ˜ = R ˜ L ˜
Hence, Ganoulis, 2004 has proposed a fuzzy measure of risk, r , which is defined as the region of the fuzzy safety margin, where values of M ˜ are negative. Mathematically, this may be expressed as follows:
r = m 0 μ M ˜ ( m ) d m μ Μ ˜ ( m ) d m
Let us return to the examined problem. Hence, the authors propose a measure G i , k , to indicate the degree according to which the examined hydrological year, i, has a cumulative annual discharge, ln V i greater than the examined fuzzy threshold of drought k, ln V ˜ k . It may be considered (Figure 2):
G i , k = ln V k ln V i μ t h r e s h , k ( ln V ) d ( ln V ) μ t h r e s h , k ( ln V ) d ( ln V ) = 1 ln V k ln V i μ t h r e s h , k ( ln V ) d ( ln V ) μ t h r e s h , k ( ln V ) d ( ln V ) , S i , k = ln V k ln V i μ t h r e s h , k ( ln V ) d ( ln V ) μ t h r e s h , k ( ln V ) d ( ln V ) ,
By the same way, a degree, S i , k , according to which the examined hydrological year, i, has a cumulative annual discharge smaller than the examined fuzzy threshold of drought k can be considered. In Figure 2 the grey hatched area (which is marked with (1)) denotes the numerator whilst the dominator is equal to the total area. For instance, regarding the hydrological year 1985–1986, G85–86,Z =−1 =1 and S85–86,Z = 0 = 0.8487>0.5.

3. Implementation of the Proposed Methodology: Annual Cumulative Streamflow Time Sequence

The case under investigationis the northern region of Prefecture Evros (Figure 3). The annual cumulative streamflow, which is derived from the monthly discharges of Evros River at Pythio’sbridge, is studied. Evros River (Maritsa or Meric) is one of the largest river of Balcan Peninsula in terms of length, since it crosses the Bulgarian, Greek and Turkish borders. It rises in the Rila Mountains in Western Bulgaria and has its outlet in the Aegean Sea. It serves as a natural borderline between Greece and Turkey [14]. Its total watershed area is equal to 53,000 Km2, while the 6% of this is in Greek territory.

Methodology Steps

The proposed methodology is implemented using the following steps:
  • Based on the monthly discharges, the annual cumulative volumes of streamflow are calculated and then they are transformed to logarithmic values.
  • Based on the cumulative empirical (observed) distribution, the standardized normal variable Z is calculated for each examined hydrological year.
  • The fuzzy linear regression model of Tanaka (1987) [9] is applied (Figure 4) between the standardized normal random variable Z and the log-transformed annual cumulative discharge. The produced fuzzy coefficients constitute fuzzy symmetrical triangular numbers. Its values can be seen as a fuzzy assessment of the mean value and the standard deviation. The fuzzy relation that was determined is (Figure 4):
    y T j = ( 22.72 ,   0.081 ) L + ( 0.51 ,   0.172 ) L Z T j
    in which the brackets state the fuzzy symmetrical triangular numbers (which are a special case of the L-fuzzy numbers) and within the bracket the first term symbolizes the central value and the second term the spreads of the aforementioned fuzzy numbers.
4.
The suitability is checked according to the value of the objective function, J and the measure according to Equation (14). The objective function equal to J =4.08 and as it can be seen from Figure 4 the uncertainty is not no functional. The unbiased estimation of the mean value and the standard deviation for the logs transformed sample are λ ^ = 22.80 , ζ ^ = 0.44 and therefore, the centers of the fuzzy coefficients are close to the unbiased estimations.
F = ( λ a λ ^ ) 2 + ( ζ a ζ ^ ) 2 = ( 22.72 22.80 ) 2 + ( 0.51 0.44 ) 2 = 0.106
5.
Based on drought classification (Table 1) the ln of the annual cumulative discharge which correspond to the normalised variable Z equal to −2, −1.5, −1 and 0 (Table 1) are calculated based on Equation (18). Therefore, according to the corresponding values of Z, the fuzzy thresholds of drought are determined based on the produced fuzzy curve.
6.
The observed cumulative annual discharge is compared with the aforementioned thresholds to drought. From a mathematical point of view the observed cumulative annual discharges are crisp numbers whilst the thresholds are fuzzy numbers. The comparison is started from the lowest to the upper values, that is, by following an ascending procedure.
There also some cases where the comparison between the fuzzy threshold and the crisp annual cumulative discharge is not precise. Hence the following criterion is adopted in this analysis:
ln V i   overcomes ln V ˜ k   if   it   holds :   G i , k = ln V i l n V k μ t h r e s h , k ( ln V ) d ( ln V ) μ t h r e s h , k ( ln V ) d ( ln V ) > 0.5
In Table 2, the results of the above implementation steps are presented. It is observable that that because of the overlapping of the fuzzy lnVk values (Figure 5), the frontiers between the categories are overlapped to some degree. In any case it seems more reasonable to adopt fuzzy thresholds between the categories to drought, compare with crisp, as the conventional methodology do. It is worth noting that although there are some cases where the comparison between the fuzzy threshold and the crisp annual cumulative discharge is not precise, in most of them, the Gi,k index has values which are discernible different from 0.5.

4. Concluding Remarks

The uncertainty of the coupling between the observed probabilities and the adopted theoretical probability distribution can be treated by using the fuzzy regression model of Tanaka (1987), where all the observed data are included in the produced fuzzy band. By using the log-normal probability distribution together with the fuzzy regression, an estimation of the mean value and the standard deviation can be achieved simultaneously. As it is seen from the case study, two criterion of suitability are established in order to check the suitability of the adopted theoretical probability density with fuzzy numbers as parameters. The first criterion of suitability is based on the width of the produced fuzzy band and the second is based on the distance between the unbiased estimation of the mean value and the standard deviation with the central values of the estimated fuzzy quantities.
Hence, by using the standardized indices, Z, to categorize the drought, the corresponding thresholds to drought can be determined as fuzzy numbers. The proposed methodology is successfully applied in case of the Evros river regarding the cumulative annual discharge. The observed cumulative annual discharge is compared with the aforementioned thresholds to drought. This comparison is taken place by using the proposed measure of comparison between fuzzy number and crisp number, which exploits all the information of the membership function. Finally, an efficient classification to drought is achieved following the proposed methodology although the fuzziness is taken into account.

References

  1. Wilhite, D.; Sivakumar, M.; Pulwarty, R. Managing drought risk in a changing climate: The role of national drought policy. Weather Clim. Extrem. 2014, 3, 4–13. [Google Scholar] [CrossRef]
  2. Angelidis, P.; Maris, F.; Kotsovinos, N.; Hrissanthou, V. Computation of Drought Index SPI with Alternative Distribution Functions. Water Resour. Manag. 2012, 26, 2453–2473. [Google Scholar] [CrossRef]
  3. Tsakiris, G.; Pangalou, D.; Vangelis, H. Regional drought assessment based on the Reconnaissance Drought Index (RDI). Water Resour. Manag. 2007, 21, 821–833. [Google Scholar] [CrossRef]
  4. Nalbantis, I.; Tsakiris, G. Assessment of Hydrological Drought Revisited. Water Resour. Manag. 2009, 23, 881–897. [Google Scholar] [CrossRef]
  5. Spiliotis, M.; Hrissanthou, V. Fuzzy and crisp regression analysis between sediment transport rates and stream discharge in the case of two basins in northeastern Greece. In Regression Analysis: Introduction, Applications and Theory; Nova Science Publishers: New York, NY, USA, 2018; in press. [Google Scholar]
  6. Chow, V.; Maidment, D.; Mays, L. Applied Hydrology, Hill International ed.; McGraw: Singapore; New York, NY, USA, 1988; pp. 394–398. [Google Scholar]
  7. Spiliotis, M.; Papadopoulos, B. A hybrid fuzzy probabilistic assessment of the extreme hydrological events. In Proceedings of the 15th International Conference of Numerical Analysis and Applied Mathematics (ICNAAM 2017, Thessaloniki, Greece, 25–30 September 2017. [Google Scholar]
  8. Spiliotis, M.; Angelidis, P.; Papadopoulos, B. A Hybrid Fuzzy Regression-Based Methodology for Normal Distribution (Case Study: Cumulative Annual Precipitation). In Proceedings of the 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), Rhodes, Greece, May 2018; Iliadis, L., Maglogiannis, I., Plagianakos, V., Eds.; Springer International Publishing: Berlin, Germany, 2018; pp. 568–579. [Google Scholar]
  9. Tanaka, H. Fuzzy data analysis by possibilistic linear models. Fuzzy Sets Syst. 1987, 24, 363–375. [Google Scholar] [CrossRef]
  10. Moskowitz, H.; Kim, K. On assessing the h value in fuzzy linear Regression. Fuzzy Sets Syst. 1993, 58, 303–327. [Google Scholar] [CrossRef]
  11. Tsakiris, G.; Tigkas, D.; Spiliotis, M. Assessment of interconnection between two adjacent watersheds using deterministic and fuzzy approaches. Eur. Water 2006, 15, 15–22. [Google Scholar]
  12. Tzimopoulos, C.; Papadopoulos, K.; Papadopoulos, B. Fuzzy Regression with Applications in Hydrology. Int. J. Eng. Innov. Technol. (IJEIT) 2016, 5, 69–75. [Google Scholar]
  13. Ganoulis, J. Integrated Risk Analysis for Sustainable Water Resources Management. In Comparative Risk Assessment and Environmental Decision Making; Linkov, I., Ramadan, A.B., Eds.; Kluwer Academic: Norwell, MA, USA, 2004; Volume 38, pp. 275–286. [Google Scholar]
  14. Angelidis, P.; Kotsikas, M.; Kotsovinos, N. Management of Upstream Dams and Flood Protection of the Transboundary River Evros/Maritza. Water Resour. Manag. 2010, 24, 2467–2484. [Google Scholar] [CrossRef]
Figure 1. Fuzzy triangular symmetrical number.
Figure 1. Fuzzy triangular symmetrical number.
Proceedings 02 00643 g001
Figure 2. Measure value resulting from the ratio of the hatched area to the total area is less than 0.50 for the year 1985–1986.
Figure 2. Measure value resulting from the ratio of the hatched area to the total area is less than 0.50 for the year 1985–1986.
Proceedings 02 00643 g002
Figure 3. Transboundary Evros river and its tributaries. The examined data are derived from Pythio’s bridge (41°21′43.51″ N 26°37′51.67″ E) (from Angelidis et al., 2010).
Figure 3. Transboundary Evros river and its tributaries. The examined data are derived from Pythio’s bridge (41°21′43.51″ N 26°37′51.67″ E) (from Angelidis et al., 2010).
Proceedings 02 00643 g003
Figure 4. Observed data, fuzzy and conventional (crisp) regression between the annual cumulative logarithmic streamflow values and the standardised random variable Z.
Figure 4. Observed data, fuzzy and conventional (crisp) regression between the annual cumulative logarithmic streamflow values and the standardised random variable Z.
Proceedings 02 00643 g004
Figure 5. Fuzzy thresholds of drought regarding the values of Z (presented in Table 1) and the (crisp) annual cumulative discharge for the hydrological year, 94–95 and 93–94.
Figure 5. Fuzzy thresholds of drought regarding the values of Z (presented in Table 1) and the (crisp) annual cumulative discharge for the hydrological year, 94–95 and 93–94.
Proceedings 02 00643 g005
Table 1. Classification of hydrological drought based on the random variable Z.
Table 1. Classification of hydrological drought based on the random variable Z.
CategoryDescriptionCriterion
0Non-droughtZ ≥ 0.0
1Mild drought−1.0 ≤ Z<0.0
2Moderate drought−1.5 ≤ Z<−1.0
3Severe drought−2.0 ≤ Z<−1.5
4Extreme droughtZ<−2.0
Table 2. Drought classification based on the fuzzy measure Gi,k and Si,k.
Table 2. Drought classification based on the fuzzy measure Gi,k and Si,k.
Comparison of Crisp and Fuzzy Streamflow Values
Hydrological YearLnVi(m3)Greater than the Lower Thresh. of the CategorySmaller than the Upper Thresh. of the CategoryDrought Categories
1985–198622.682654210.8787mild drought
1986–198722.711803410.5860mild drought
1987–198822.73913310.71881mildly wet
1988–198922.568250811mild drought
1989–199022.39410920.96081mild drought
1990–199122.705966010.6472mild drought
1991–199222.543461111mild drought
1992–199322.42116630.98551mild drought
1993–199422.14990700.90750.7112moderate drought
1994–199522.842477311mildly wet
1995–199623.177792510.6780mildly wet
1997–199823.52556780.62160.8750severely wet
1998–199923.39539910.94220.7262moderately wet
1999–200022.645600410.9950mild drought
2000–200122.35915260.91181mild drought
2001–200222.43344850.99281mild drought
2003–200422.828770811mildly wet
2004–200523.60167020.78870.7688severely wet
2005–200623.66188080.88960.6638severely wet
2006–200722.654972410.9780mild drought
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Spiliotis, M.; Papadopoulos, C.; Angelidis, P.; Papadopoulos, B. Hybrid Fuzzy—Probabilistic Analysis and Classification of the Hydrological Drought. Proceedings 2018, 2, 643. https://doi.org/10.3390/proceedings2110643

AMA Style

Spiliotis M, Papadopoulos C, Angelidis P, Papadopoulos B. Hybrid Fuzzy—Probabilistic Analysis and Classification of the Hydrological Drought. Proceedings. 2018; 2(11):643. https://doi.org/10.3390/proceedings2110643

Chicago/Turabian Style

Spiliotis, Mike, Christoforos Papadopoulos, Panagiotis Angelidis, and Basil Papadopoulos. 2018. "Hybrid Fuzzy—Probabilistic Analysis and Classification of the Hydrological Drought" Proceedings 2, no. 11: 643. https://doi.org/10.3390/proceedings2110643

Article Metrics

Back to TopTop