Study area and data acquisition
Pune district, situated in the western part of the Maharashtra State of India. It covers an area of about 15642 km
2 having 5.1% of total geographical area of the State. The exact geographical location lies between 17°54’N to 19°24’N latitude and 73°29’E to 75°10’E longitude. For administrative convenience, it is divided into 14
tehsils namely
Pune City, Haveli,
Khed,
Ambegaon,
Junnar,
Shirur,
Daund,
Indapur,
Baramati,
Purandhar,
Bhor,
Welhe,
Mulsi and
Maval (Vadgaon) (
CGWB, 2009).
Pune district experiences tropical monsoon and therefore, shows a significant seasonal variation in temperature as well as rainfall behaviour. Most of this rainfall is due to the South-West monsoon and about 87% of rain occurs during the monsoon months. The monsoon arrives in the month of June, with the maximum intensity of rainfall during the month of July, followed by August. The average annual rainfall of Pune district is 115 cm.
Tehsils falling in the highest rainfall intensity zone are
Welhe, Mulshi and
Maval (Vadgaon).
Tehsils falling in the moderate rainfall intensity zone are
Bhor, Ambegaon, Junnar, Khed, Haveli, Pune city and
Purandhar.
Tehsils with lowest rainfall intensity, the dry and semi-arid zone are
Shirur, Daund, Indapur and
Baramati (
MPCB, 2006).
For this study, data of daily rainfall depth (mm) from 1958 to 2004 (
i.e. 47 years) of 13 meteorological stations of Pune district and temperature data (1969-2008) of Pune was acquired from India Meteorological Department, Pune. The average annual rainfall depths (
i.e. from 1958 to 2004) of the 13 meteorological stations along with their geographical locations are given in Table 1.
Development of an interface for Climate Change Trend Analysis (CCTA)
MATLAB is a high level languages and interactive environment for numerical computation, visualization and programming. The language, tools and built-in math function enables to explore multiple approaches for faster solutions than spreadsheet and traditional programming languages.
Climate Change Trend Analysis (CCTA) Interface was developed in MATLAB® environment to analyze the trends in various meteorological variables
e.g. rainfall, temperature, solar radiation etc. using trend tests. The CCTA interface incorporates two trend tests
viz. MK test and MMK test besides Sen’s slope estimator. It also gives the statistical summary (
i.e. Maximum, Minimum, Mean, Standard deviation and variance) of the input data along with the histogram which shows the distribution of data.
CCTA Interface has input and output panels
i.e. ‘Basic Information’ and ‘Results and Inference’. In the basic information panel, user interactive input such as station name, longitude and latitude in radians, altitude in meter unit is provided. User can select one of the four trend tests for trend analysis by using ‘select trend test’ button besides the option for inputting the level of significance. For providing the meteorological data to the interface, there are two options, user can generate input file or load input file. The MS-Excel (*.xls) data input file is generated by clicking on ‘generate input file’ button. An input file named “Input.xls” will be generated in ‘My Documents’ folder. User has to insert data below the appropriate header column in appropriate units. After loading the data successfully to the interface, basic information panel gives the statistical summary and histogram of the data as shown in Fig 1.
The selected trend test can be run using the ‘RUN’ button, and the outputs of the test such as scatter plot of the data with trend line, probability value at which null hypothesis is accepted/rejected and Sen’s slope estimator will be displayed in ‘Results and Inference’ panel (Fig 1). After trend analysis, the outputs will be saved to “Output.xls” file in ‘My Documents’ folder of the computer.
Non-parametric methods used in interface
The Mann Kendall and Modified Mann Kendall tests besides the Sen’s slope estimator for determination of trend and slope magnitude of the meteorological variables were incorporate in the developed interface for trend detection of meteorological variables.
Mann Kendall test
The Mann-Kendall test is a non-parametric test for detecting trend in time series data. The Mann-Kendall test is simple and robust and can cope with missing values and values below a detection limit. The test was first created by Mann (1945) and Kendall (1975) and covariances between Mann-Kendall statistics were developed by Dietz and Kileen (1981). The test can be used to analyse trends of rainfall, streamûow and water quality data
(Yue et al., 2002; Burns et al., 2007; De
Luis et al., 2000; Ndiritu, 2005; Mazvimavi and Wolski, 2006). The Mann–Kendall-statistic S is given as:
(1)
where
The variance of S denoted by ( 𝛔
s2 ) is computed as:
(2)
Where n is the number of data points, q is the number of tied groups in the data set and
tj is the number of data points in jth tied group.
Then S and 𝛔
s2 were used to compute the test statistics Zs as:
A positive value of S indicates that there is an increasing trend and a negative value indicates a decreasing trend. The null hypothesis H
0 that there is no trend in the data is either accepted or rejected depending if the computed Z
S statistics is less than or more than the critical value of Z-statistics obtained from the normal distribution table at 5% significance level.
Modified Mann Kendall (MMK) test
The basic assumption of the original Mann-Kendall test is that the data need to be independent and randomly ordered. However, in many real situations the observed data are auto-correlated. The existence of positive autocorrelation in the data increases the probability of detecting trends when actually none exist and vice versa. Although this is a well-known fact that autocorrelation in the data is often ignored. Hamed and Rao (1998) have discussed the effect of autocorrelation on the variance of the Mann-Kendall trend test statistic. A theoretical relationship was derived to calculate the variance of the Mann-Kendall test statistic for auto-correlated data. Based on the modified value of the variance of the Mann Kendall trend test statistic, a modified non-parametric trend test which is suitable for auto-correlated data was incorporated. The empirical formula for calculating the variance of S in the case of auto-correlated data is given by equation (4).
(4)
where, represented a correction due to the autocorrelation
in the data. The correction of autocorrelation is given by equation (5).
(5)
Where n is the number of observations and
rs(i) is the autocorrelation function of the ranks of the observations. The advantage of the approximation in equation (5) is that by using the ranks of the observations, the variance of S can be evaluated by equations (4) and (5) without the need for either the normalized data or their autocorrelation function.
Sen’s slope estimator
If a linear trend is present in a time series, then the true slope (
i.e. change in data per unit time) can be estimated by using a simple nonparametric procedure developed by Sen (1968). Though Mann-Kendall statistics is effective and used to evaluate a significant increase or decrease in parameter under consideration, it does not estimate a trend slope. Therefore, the nonparametric Sen’s method was used to estimate the slope of an existing trend. The Sen’s slope estimator is widely used due to its simplicity in computation, analytical estimates of confidence intervals and robustness to outliers which are the advantages over the general slope estimation. This approach involves computing slopes for all the pairs of time points and then using the median of these slopes as an estimate of the overall slope. Sen’s method proceeds by calculating the slope of the line using all data pairs, as shown in the following equation:
(6)
where,
j> k. If there are
n values
xj in the time series, we get as many as
N = ((
n + 1)/2) slope estimate
Qi. Sen’s estimator of slope is simply given by the median of these
N values of
Qi’s.
if
N is odd (7)
if
N is even. (8)
Sen’s estimator is computed as Q
med=Q(N+1)/2 if N appears odd and it is considered as Q
med=[Q
N/2+Q
(N+2)/2]/2 if N appears even. At the end, Q
med is computed by a two sided test at 100 (1-α) % confidence interval and then a true slope can be obtained by the non-parametric test. Positive value of Q indicates an upward or increasing trend and a negative value of Q shows a downward or decreasing trend in the time series.