The study was carried out in Jammu region of UT J and K (India). Data used in this study were collected on 50 permanent sample plots of 0.25 ha in size.In order to achieve stipulated objectives, Height diameter data on 500 trees from Jammu forest division was utilized in this study. The forest composition of Chir in terms of area in Jammu forest division is given in Table 1.
In view of the above various height and diameter statistical models were used to study the tree height and diameter relationships of chir trees in each division. The functional form of the models used in this study is as follows:
Where,
H
i = i
th observation of the response variable tree height (m).
D
i = i
th observation of the predictor variable diameter at breast height (cm).
b = A vector of model parameters.
e
i = Unexplained error, that it is assumed to be independent and normally distributed with mean zero and a constant variance.
A constant, 1.3 (which is bh in models below) is added to the model to avoid prediction of a Hi less than 1.3 m when D
i approaches zero.
Based on the above functional form eleven height diameter models were used in this study, whose description is given in Table 2.
Then adequacy of the fitted models was tested by using different selection criteria like AIC, BIC, RMSE, (Adj. R
2),
etc. which is given below:
Adjusted R2 (Adj. R2)
Where,
R
2 = Coefficient of determination.
n = Number ofobservation and
k is number of parameters.
Root mean square error (RMSE)
Where,
n = Total number of observations.
y
i = Actual observation.
= Predicted value.
= Mean of observed value.
Mean absolute error (MAE)
Where,
n = Total number of observations.
y
i = Actual observation.
= Predicted value
= Mean of observed value,
Raghav et al., (2022).
Akaike information criterion (AIC)
AIC = n × In(RMSE) + 2k
Where,
n = Number of observations.
k = Number of parameters.
Bayesian information criterion (BIC)
BIC = -2In(l) + In(n) × k
Where,
l = Maximum likelihood value of the model.
k = Number of degrees of freedom (or independently adjusted parameters) in the model.
n = Number of observations (total sample size).
Residual standard error (RSE)
Where
y = The observed value.
= The predicted value.
df = The degrees of freedom.
Further, the assumption of normality of errors of fitted models was accessed by Shapiro Wilk test of significance. Then to check the predictive performance of the fitted models cross validation method was used and data was randomly partitioned into training and testing sets, where 80 per cent f data was used for training and the remaining 20 per cent of the data for testing. Statistical analysis of above models was carried out in R studio (version 3.5.1, 2018) utilising its various libraries. Further the function geom_line and ggtitle of R studio was used for creating these GG-plots.