Indian Journal of Animal Research

  • Chief EditorK.M.L. Pathak

  • Print ISSN 0367-6722

  • Online ISSN 0976-0555

  • NAAS Rating 6.50

  • SJR 0.263

  • Impact Factor 0.4 (2024)

Frequency :
Monthly (January, February, March, April, May, June, July, August, September, October, November and December)
Indexing Services :
Science Citation Index Expanded, BIOSIS Preview, ISI Citation Index, Biological Abstracts, Scopus, AGRICOLA, Google Scholar, CrossRef, CAB Abstracting Journals, Chemical Abstracts, Indian Science Abstracts, EBSCO Indexing Services, Index Copernicus
Indian Journal of Animal Research, volume 55 issue 4 (april 2021) : 486-490

Prediction of First Lactation 305-day Milk Yield Based on Bimonthly Test Day Milk Yield Records in Murrah Buffaloes

Ekta Rana1,*, Ashok Kumar Gupta1, Avtar Singh1, Anand Prakash Ruhil2, Ravinder Malhotra2, Saleem Yousuf1, Gedam Ete1
1Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal-132 001, Haryana, India.
2Dairy Economics, Statistics and Management Division, ICAR-National Dairy Research Institute, Karnal-132 001, Haryana, India.
Cite article:- Rana Ekta, Gupta Kumar Ashok, Singh Avtar, Ruhil Prakash Anand, Malhotra Ravinder, Yousuf Saleem, Ete Gedam (2020). Prediction of First Lactation 305-day Milk Yield Based on Bimonthly Test Day Milk Yield Records in Murrah Buffaloes . Indian Journal of Animal Research. 55(4): 486-490. doi: 10.18805/ijar.B-3963.
The present study was conducted on 2100 first lactation bimonthly test day milk yield (BTDY) records of 350 Murrah buffaloes calved in between 1993 and 2017 at ICAR-NDRI, Karnal. A total of 6 BTDY records were taken from each animal at an interval of 60 days, from 6th day to 305th day of lactation. The prediction of First Lactation 305-Day Milk Yield (FL305DMY) was done by utilizing five conventional and machine learning methods viz., Centering Date Method (CDM), Test Interval Method (TIM), Ratio Method (RM), Multiple Linear Regression (MLR) and Artificial Neural Network (ANN). Error in prediction was estimated by absolute error, percentage absolute error, average error, percentage average error, Root Mean Square Error (RMSE) and percentage RMSE. MLR was found to be the best method with the least error in prediction (5.71% RMSE), followed by ANN (5.77% RMSE). The accuracy (R2) of MLR equation including all 6 BTDY records was 91.86%. The best MLR equation for an early prediction of FL305DMY included 3 BTDY records viz., BTDY-2 (65th day), BTDY-3 (125th day) and BTDY-4 (185th day) with 85.29% R2. The study compared the conventional and computational methods for prediction of first lactation milk yield which could be used for early selection of the animals.
The breeding programme in developing countries like India is mainly planned and evaluated based on the lactational 305-day milk yield of the animal which requires daily recording of the data. On contrary, daily milk recording is a tedious, time-taking, expensive and practically near-to-impossible task under field conditions. Studies conducted in the past revealed that the test day milk yield records could be an alternative to the daily milk yield recordings because there is a high genetic association between test-day records and complete milk production record (Rahayu et al., 2018; Singh et al., 2015; Torshizi and Mashhadi, 2015; Kokate et al., 2014). Though the accuracy of prediction of lactational milk yield is less for bimonthly milk yield records as compared to monthly milk yield records (Rahayu et al., 2018; Sahoo et al., 2015; Tailor and Singh, 2014; Chakraborty et al., 2010), yet bimonthly records could be used for economic prediction of 305-day milk yield under field conditions as data recording is further reduced.
 
Over the years, various conventional methods are in vogue for predicting the first lactation 305-day milk yield based on test-day milk yield records. Some of those methods are Test Interval Method (TIM), Centering Date Method (CDM), Ratio method (RM) and Multiple Linear Regression (MLR). Recently, a computational machine learning model which mimics the structure and functions of the biological neural network called Artificial Neural Network (ANN) has been introduced in animal breeding for the prediction of 305-day lactation yield (Singh et al., 2015; Murphy et al., 2014;  Sharma et al., 2013; Gorgulu, 2012; Gandhi et al., 2009; Grzesiak et al., 2003). The literature on prediction of lactational 305-day milk yield of buffaloes based on bimonthly is scanty till date, therefore, to bridge the knowledge gap, the present investigation was undertaken to evaluate the Murrah buffaloes based on bimonthly test day milk yield records and to find the best early test-day milk yields combination to predict the first lactation 305-day milk yield in order to make an early selection of the animals for breeding programme.
The study had been conducted on 2100 first lactation bimonthly test day milk yield (BTDY) records of 350 Murrah buffaloes calved in between 1993 and 2017 at ICAR-NDRI, Karnal, India. The records with lactation length less than 100 days, lactation yield less than 900 kg, culled in the middle of lactation, abortion, still-birth or any other pathological causes were considered as abnormal and thus, such records were not included for the prediction of the lactation yield. The outliers beyond three-standard deviation on both the tail ends of normal distribution were also excluded from the data. A total of 6 BTDY records were taken from each animal at an interval of 60 days. First BTDY was recorded on 6th day (i.e. BTDY-1), BTDY-2 on 65th day, BTDY-3 on 125th day, BTDY-4 on 185th day, BTDY-5 on 245th day and the last BTDY (i.e. BTDY-6) was taken on 305th day.

The prediction of First Lactation 305-Day Milk Yield (FL305DMY) was performed utilizing the following four conventional methods:
 
Centering date method (CDM)
In the case of intervening intervals, production credits (Likhi et al., 1995; O’ Connor and Lipton, 1960) were calculated by multiplying yield on intervening sample days with fixed sampling interval:
 
                                         PN = (LI) (Pn)
 
In first and last sample day yield production credits (PN) were calculated as follows:
 
                                   PN = (DIM + ½ LI) Pn
Where,
DIM = days from the first day of lactation to the first sample day in case of first sample day yield  and days between last sample day and terminal day of lactation in case of last sample day yield
LI = Sampling interval
Pn  = Production on nth sample day
Production credits of all intervals were summed up to estimate lactation milk yield.
 
Test interval method (TIM)
 
TIM is also based on the calculation of prediction credit for all test-day yields and then summing up in order to know predicted milk yield (Likhi et al., 1995; Everett and Carter, 1968; Sargent et al., 1968). Basic formulae for all calculations were the same as in case of CDM, with sampling interval as follows:
For nth   intervening interval:
 
                                LI = ½ (DIMn+1 - DIMn-1)
Where,
DIMn+1 and DIMn-1 were the days in milk up to and including the proceeding (n+1)th  and preceding (n-1)th sample days, respectively.
For first and last sampling intervals, sampling interval was the length of the first or last test period as the case may be.
 
Ratio method (RM)
 
It predicts the 305-day lactation yield by multiplying each test day with the ratio factor (Dass and Sadana, 2003).
 
                                                    = R Xi
Where,
R = Y / X = Ratio of average 305-day milk yield to average test day milk yield.                  
   = Estimated 305-day milk yield of the ith animal.
Xi     =   Test day milk yield of ith animal.
Y    =   Average 305-day milk yield.
X    =   Average test day milk yield.
 
Multiple linear regression (MLR)
 
MLR was used to develop prediction equations by estimating the regression coefficients for the test-day milk yield records in different combination. The software used for MLR was SAS Enterprise Guide 4.3, 2003. Stepwise backward multiple linear regression analysis was used to estimate 305-day milk yield (Singh et al., 2015; Kokate et al., 2014; Chakraborty et al., 2010).
 
                                              
Where,
    =  Estimated first lactation 305-day or less milk yield of the ith animal.
xi =   Test-day record of ith animal.
a =   Intercept.
bi =   Regression coefficient of first lactation 305-day or less milk yield on test- day records. The accuracy of fitting the regression models was calculated by using the following formula:       
 
 
                       
The comparison of the above mentioned conventional methods was made with newly evolved computational machine learning method that mimics human brain (neural network) called Artificial Neural Network (ANN).
 
Artificial neural network
 
ANN model is basically an intelligent data processing system which learns the predictive ability automatically from the data set presented while training the network. Such neural network consists of input layer, hidden layer(s) and an output layer. Each layer has a specific role in the execution of the neural network. In back propagation technique, input vector and the corresponding target vectors are used to train a network until it can approximate a prediction function (Ruhil et al., 2011; Gandhi et al., 2010).

A multilayer feed forward neural network with back propagation of error learning mechanism was developed using Weka software version 3.8.0 to predict the first lactation 305-day or less milk yield (FL305DMY). The network was trained and simulated using cross-validation of 10 folds, up to 2500 epochs or till the algorithms truly converged. Network parameters such as learning rate (0.3), momentum (0.5) and validation set size (0) were used as the default setting of the algorithms. Most of the time, it was observed that algorithms were truly converged which means that performance/error goal was achieved.
 
Estimation of error in prediction
 
The error in the prediction of first lactation 305-day milk yield was estimated as a deviation of predicted milk yield from the actual milk yield:
                                                 
Where,
E= Error in prediction.
      = Predicted 305-day milk yield of the ith animal.
Yi = Actual 305-day milk yield of the ith animal.
 
Absolute error
 
Absolute error was estimated without considering the positive or negative signs as follows:
                                                          = | Ei |
 
Percentage absolute error
= (Absolute error / Mean of actual 305-day milk yield) × 100
 
Average error
 
                          = Sum of error / No. of observations
 
Percentage average error
= (Average error / Mean of actual 305-day milk yield) × 100
 
Root mean square error (RMSE)
 
 
 
Where,
     = Predicted 305-day milk yield of the ith animal.
Yi = Actual 305-day milk yield of the ith animal.
n = Number of observations.
 
Percentage RMSE 
  = (RMSE / Mean of actual 305-day milk yield) × 100
The first lactation 305-day milk yield of Murrah buffalo was predicted based on bimonthly test day milk yield records utilizing conventional and machine learning methods. The error was calculated as difference of observed value from predicted value, thus, the negative error indicated that the prediction method had under-estimated the milk yield and positive value indicated over-estimation. The efficacy of the prediction was also judged by absolute error, percentage absolute error, average error, percentage average error, root mean square error (RMSE) and percentage RMSE which has been presented in Table 1.

Table 1: Estimated errors in prediction of FL305DMY based on bimonthly test-day milk yield records utilizing different methods.



The deviation of predicted first lactation 305-day milk yield from actual 305-day milk yield utilizing CDM method was found to be 103.30 kg as absolute error (4.89%), -58.47 kg as average error (-2.77%) and 135.77 kg as RMSE (6.43%). Based on TIM, the deviation of predicted first lactation 305-day milk yield from actual 305-day milk yield was found to be 113.80 kg as absolute error (5.39%), 60.50 kg as average error (2.86%) and 146.13 kg as RMSE (6.92%). The deviation of predicted first lactation 305-day milk yield from actual 305-day milk yield utilizing Ratio method was found to be 104.96 kg as absolute error (4.97%), 2.11 kg as average error (0.10%) and 137.46 kg as RMSE (6.51%). The deviation of predicted first lactation 305-day milk yield from actual 305-day milk yield utilizing MLR method was found to be 93.49 kg as absolute error (4.43%), 0 kg as average error (0%) and 120.59 kg as RMSE (5.71%). The deviation of predicted first lactation 305-day milk yield from actual 305-day milk yield utilizing ANN method was found to be 93.99 kg as absolute error (4.45%), -0.58 kg as average error (-0.03%) and 121.82 kg as RMSE (5.77%). A perusal of Table 1 showed that the errors were least for MLR, followed by ANN, CDM, TIM and Ratio method. On contrary, Gandhi et al., (2010) reported ANN as best method for prediction as compared to MLR. Everett and Carter (1968) reported that TIM method overestimates actual first lactation 305-day milk yield by 91.4 kg of milk for bimonthly records. They also reported that estimation of actual 305-day production by CDM was more accurate than TIM.
 
he results clearly indicated that the multiple linear regression was the best amongst all the tested methods, therefore an attempt was then made to predict the first lactation 305-day milk yield based on early records. To select the best early BTDYs records amongst all six BTDYs records for prediction at an early stage, simple linear regression analysis with only one independent variable (BTDY) at one time was performed and the estimated intercept along with regression coefficients were estimated (Table 2). A perusal of the results showed gradual increase in the regression coefficient till BTDY-4, followed by gradual decline. Kokate et al., (2014) reported increase in regression coefficient till BTDY-3. The accuracy for prediction for different BTDYs varied from 19% to 62%. The accuracy was found to be higher in middle of the bimonthly records which was in agreement with Kokate et al., (2014).

Table 2: Estimated intercepts (a), regression coefficient (bi), adjusted coefficient of determination (R2) and root mean square errors (RMSE) for prediction of 305-day milk yield using bimonthly test day milk yield records.



The regression equation utilizing all the six bimonthly test day milk yield records was best with 121.8125 RMSE and 91.86% accuracy. But, in order to get early prediction, bimonthly test day yields were analysed by stepwise backward elimination regression method in SAS enterprise guide 4.3. For early prediction, with two variables (i.e. BTDY-2 and BTDY-4) the accuracy was 81.47%. On addition of one more variable the accuracy had increased to 85.29%. Further addition of bimonthly records were not leading to an increase in accuracy significantly. So, the equation with three bimonthly test day variables i.e. BTDY-2, BTDY-3 and BTDY-4 was found to be the best among early combinations with 163.7376 RMSE and 85.29% accuracy. The predicted equation showed more accuracy than reported by Kokate et al., (2014) in which the prediction equation include BTDY-2, BTDY-3 and BTDY-5 with 83% accuracy. The best prediction equations along with the estimated intercept values, regression coefficients, coefficients of determination (R2), AIC, BIC and RMSE values has been presented in Table 3.

Table 3: Best equations and their accuracy for prediction of first lactation 305-day milk yield by stepwise backward regression method.

The study revealed that the first lactation 305-day milk yield was observed to be best estimated by MLR, followed by ANN and CDM. Early prediction of FL305DMY using BTDY-2 (65th day), BTDY-3 (125th day) and BTDY-4 (185th day) gave higher accuracy (85.29%), which was nearer to the one wherein all the test day records were utilized. Evaluation of the sires and dam based on early test day yields would eventually result in reduced cost incurred on milk records, reduced generation interval and increased response to selection.
The authors are thankful to the Director, ICAR-NDRI, Karnal and Head of Animal Genetics and Breeding section, ICAR-NDRI, Karnal for providing necessary facilities for conducting this study. The help rendered by Livestock Record Unit of ICAR-NDRI, Karnal is deeply acknowledged for providing the required data for the analysis.

  1. Chakraborty, D., Dhaka, S. S., Pander, B. L., Yadav, A. S., Singh, S. and Malik, P. K. (2010). Prediction of lactation milk yield from test day records in Murrah buffaloes. Indian Journal of Animal Sciences. 80(3): 244-245.

  2. Dass, G. and Sadana, D. K. (2003). Predictability of lactation milk yield based on test day values in Murrah buffaloes. Indian Journal of Animal Research. 37(2): 136-138.

  3. Everett, R. W. and Carter, H. W. (1968). Accuracy of test interval method of calculating dairy herd improvement association records. Journal of Dairy Science. 51(12): 1936-1941.

  4. Gandhi, R.S., Raja, T.V., Ruhil, A.P. and Kumar, A. (2009). Prediction of lifetime milk production using artificial neural network in Sahiwal cattle. Indian Journal of Animal Sciences. 79 (10): 1038-1040.

  5. Gandhi, R.S., Raja, T.V., Ruhil, A.P. and Kumar, A. (2010). Artificial neural network versus multiple regression analysis for prediction of lifetime milk production in Sahiwal cattle. Journal of Applied Animal Research. 38(2): 233-237.

  6. Gorgulu, O. (2012). Prediction of 305-day milk yield in Brown Swiss cattle using artificial neural networks. South African Journal of Animal Science. 42(3): 280-287. 

  7. Grzesiak, W., Lacroix, R., Wójcik, J. and Blaszczyk, P. (2003). A comparison of neural network and multiple regression predictions for 305-day lactation yield using partial lactation records. Canadian journal of animal science. 83(2): 307-310.

  8. Kokate, L. S., Singh, A., Banu, R., Gandhi, R. S., Chakravarty, A. K., Gupta, A. K. and Sachdeva, G. K. (2014). Prediction of 305-day lactation milk yield based on bimonthly test day values in Karan Fries cattle. Indian Journal of Animal Research. 48(2): 103-105.

  9. Likhi, A.K., Khanna, A.S. and Jaiswal, U.C. (1995). Evaluation of centering dates test interval and sample-day production methods of computing lactation yield from AM and PM sample day records. Indian Journal of Animal Sciences. 65(11): 1241-1246.

  10. Murphy, M. D., O’Mahony, M. J., Shalloo, L., French, P. and Upton, J. (2014). Comparison of modelling techniques for milk-    production forecasting. Journal of Dairy Science. 97(6): 3352-3363.

  11. O’ Connor, L.K. and Lipton, S. (1960). The effect of various sampling intervals on the estimation of lactation milk yield and composition. Journal of Dairy Science. 27(3): 389-398.

  12. Rahayu, A.P., Hartatik, T., Purnomoadi, A. and Kurnianto, E. (2018). Estimation of 305 Day Milk Yield from Cumulative Monthly and Bimonthly Test Day Records in Indonesian Holstein Cattle. In: IOP Conference Series: Earth and Environmental Science. 119(1): 012028.

  13. Ruhil, A.P., Gandhi, R.S., Monalisa, D., Behra, K. and Raja, T.V. (2011). Prediction of lactation yield based on partial lactation records using artificial neural networks. In: Proc. 5th National Conference on Computing for Nation Develop- -ment, INDIACom-2011.

  14. Sahoo, S.K., Singh, A., Gupta, A.K., Chakravarty, A.K., Ambhore, G.S. and Dash, S.K. (2015). Comparative evaluation of different lactation curve functions for prediction of bi-monthly test day milk yields in murrah buffaloes. Animal Science Reporter. 9(3): 89-94.

  15. Sargent, F.D., Lytton, V.H. and Wall, O.G. (1968). Test interval method of calculating dairy herd improvement association records. Journal of Dairy Science. 51(1): 170-179.

  16. Sharma, A.K., Jain, D.K., Chakravarty, A.K., Malhotra, R. and Ruhil, A.P. (2013). Predicting economic traits in Murrah buffaloes with connectionist models. Journal of the Indian Society of Agricultural Statistics. 67(1): 1-11.

  17. Singh, M., Singh, A., Gupta, A.K., Dash, S.K., Gupta, A., Sahoo, S.K., Das, S. and Shivahre, P.R. (2015). Comparative evaluation of different lactation curve models in prediction of monthly test-day milk yields in Murrah buffaloes. Journal of Animal Research. 5(1): 189-193.

  18. Tailor, S.P. and Singh, S. (2014). Observed and predicted first lactation milk yield in Surti buffaloes. The Indian Journal of Animal Sciences. 84(7): 775-778.

  19. Torshizi, M.E. and Mashhadi, M.H. (2015). Evaluation of various approaches in prediction of daily and lactation yields of milk and fat using statistical models in Iranian primiparous Holstein dairy cows. Iranian Journal of Applied Animal Science. 5(1): 81-87.

Editorial Board

View all (0)