Indian Journal of Animal Research

  • Chief EditorK.M.L. Pathak

  • Print ISSN 0367-6722

  • Online ISSN 0976-0555

  • NAAS Rating 6.40

  • SJR 0.263

  • Impact Factor 0.4 (2024)

Frequency :
Monthly (January, February, March, April, May, June, July, August, September, October, November and December)
Indexing Services :
Science Citation Index Expanded, BIOSIS Preview, ISI Citation Index, Biological Abstracts, Scopus, AGRICOLA, Google Scholar, CrossRef, CAB Abstracting Journals, Chemical Abstracts, Indian Science Abstracts, EBSCO Indexing Services, Index Copernicus

Principal Component Analysis of Economic Traits in HD-K75 Pig, A Crossbred Pig Variety of India

Arundhati Phookan1,*, Olympica Sarma2, Dimpi Khanikar3, T.C. Tolenkhomba4, Nipu Deka1, N.H. Mohan5, V.K. Gupta5
1ICAR-All India Coordinated Research Project on Pig, Assam Agricultural University, Khanapara-781 022, Guwahati, Assam, India.
2Department of Animal Genetics and Breeding, College of Veterinary and Animal Science, G.B. Pant University of Agriculture and Technology, Pantnagar-263 145, Uttarakhand, India.
3Department of Animal Genetics and Breeding, College of Veterinary Science, Assam Agricultural University, Khanapara-781 022, Guwahati, Assam, India.
4College of Veterinary Science and Animal Husbandry, Central Agricultural University, Selesih, Aizawl-796 014, Mizoram, India.
5ICAR-National Research Centre on Pig, Rani, Guwahati-781 015, Assam, India.

Background: Principal component analysis (PCA) is a biometrical technique or a dimensionality reduction method where large number of measurements could be replaced by fewer measurements without significant loss of information. PCA is a multivariate methodology and helps to eliminate redundant traits and transforms original group of variables into another group known as principal components. It can be used with success when characteristics are correlated. PCA can help in accurate selection of superior animals.

Methods: The present study was carried out in HD-K75 pigs maintained at ICAR-All India Coordinated Research on Pig, AAU, Khanapara, Guwahati, Assam, India. A total of 13 economic traits viz. age at sexual maturity (ASM), age at first fertile service (AFFS), age at first farrowing (AFF), farrowing interval (FI), litter size at birth (LSB), litter weight at birth (LWB), litter size at weaning (LSW), litter weight at weaning (LWW), teats number (TN), body weight at Birth (BWB), body weight at weaning (BWW), body weight at 5 month (BM5) and body weight at 8 month (BM8) were considered. Data were collected from a total of 164 sows. Data belonged to 8 years from 2015 to 2023. The principal component analysis was performed using the factor module in SPSS 24.

Result: Factor analysis with varimax rotation uncovered four principal components, collectively explaining 84.18% of the total variance. The first, second, third and fourth principal component accounted for 29.71%, 29.33%, 12.683% and 12.452% of the variance. High component loading was found for LSW and LWW in first component, LSB for second component, BW5 for third component and BW8 for fourth component respectively. The communality values ranged from 0.980 (ASM and AFFS) to 0.608 (BWW). These findings indicate that PCA can serve as a valuable tool in breeding programmes, allowing for a significant reduction in the number of economic traits to be used in selection procedure.

As the swine industry continues to explore pork quality traits alongside growth, feed efficiency and leanness, it is essential to understand the genetic relationships underpinning these characteristics. With the growing emphasis on multiple desirable traits, animal breeders must develop effective strategies for making direct genetic improvements for each trait and evaluate alternative selection indices incorporating various phenotypic measurements (Green et al., 2024). Compared to other livestock, pigs contribute quicker economic return to farmers. They are among the most prolific and rapidly growing animals, capable of efficiently converting food waste into valuable products. Pig production is relatively cost-effective due to rapid growth rates, high production potential, excellent carcass yield efficiencyand strong adaptability to diverse environmental conditions (Panda et al. 2020).Rapid advancements in research have significantly increased the volume of large data sets, presenting challenges in timely and accurate data interpretation. To address these challenges, various techniques have been developed, among which Principal Component Analysis (PCA) stands out as a commonly used method (Jolliffe and Cadima, 2016). PCA is a multivariate statistical technique used for dimensionality reduction in data analysis. It transforms a set of correlated variables into a smaller set of uncorrelated variables called principal components, capturing the most significant features of the data while reducing the redundancy of the datasets. PCA can be used for data simplification, reduction, classification, variable selectionand more (Wold et al. 1987). Its primary goal is to extract critical information from data and represent it through a series of summary indices known as principal components (PC1, PC2, PC3, etc.). The principal component that accounts for the highest variance in the data is designated as the first principal component (PC1). The second principal component (PC2) represents the next most significant source of variation. Together, these two principal components form a plane that can be visualized graphically. In PCA, the extent of variance explained by each principal component is typically quantified and visualized using a scree plot. PCA involves the computation of eigen values and eigen vectors of the data’s covariance matrix. The eigen values quantify the variance accounted by each principal component, whereas the eigenvectors indicate the direction of each principal component within the dataset. Therefore, PCA is a powerful tool for simplifying and analyzing complex datasets, making it indispensable in various fields.

Though the state of Assam alone has a pig population of 2,099,000 (Twentieth Livestock Census, 2019), there lies a gap between the production and requirement of meat. The annual pork consumption by 30% of total population of Assam (i.e. 1.03 crore) is 11.07 crore kg. On the other hand, the annual pork production of the State is 1.87 crore kg. This clearly demonstrates that there is a shortfall of 9.20 crore kg of pork meat or 15,57,146 pigs, annually. (https://www.sentinelassam.com, Dated 6th August, 2021). This underlying gap may be due to the fact that the production performance of the pigs which are being reared is not getting momentum due to lack of quality germplasm, scientific selection and breeding, superior breeding stock, genotypic and phenotypic data, genetic testing, scientific management of pigs, etc. To address this, the initial and foremost step is farming of quality and genetically superior animals ensuring enhanced performance efficiency. To meet up this, a new variety of pig called HD-K75 (75% Hampshire and 25% Desi), has been developed by ICAR-All Indian Coordinated Research project (AICRP)  on Pig, Assam Agricultural University, College of Veterinary Science, Khanapara, Guwahati, Assam, India, which has been introduced in the villages helping rural pig farmers of Assam as well as nearby northeastern states. These animals have already proved to be well adapted to the prevailing conditions of Assam and gaining popularity too. The animals are maintained in the nucleus herd that is AICRP on Pig, Khanapara, Assam and it is being transmitted to the commercial herd or farmers field. Hence, prioritization is needed for precise and early selection of superior animals for production of best progenies to be transferred to field level. This can be achieved by implying different biometrical concepts, newer statistical techniques, reproductive biotechnologies particularly marker assisted selection (MAS), etc. It is envisaged that the Principal Component Analysis approach can be adopted at farm level for accurate selection of best performing animals. This will enable dissemination of superior quality germplasm (HD-K75) from farm to field level. Keeping in view of this the present study has been carried out for the Principal Component Analysis of Economic Traits in HD-K75 Pig maintained under ICAR-AICRP on Pig, AAU, Khanapara, Assam, India.
Description of data

For the present investigation, data reflecting the performance of HD-K75 pigs maintained under ICAR-AICRP on pigs, AAU, Khanapara were utilized. Data were collected from 164 sows of the farm. The data belonged from 16th generation to 19th generation (four generations) from the year 2015 to 2023. Each generation comprised of three crops or farrowing which occurs in duration of 2 years.

Data collection

For the present investigation, the data of following traits collected.



Statistical analysis

The phenotypic correlations between economic traits were computed by using Pearson’s correlation method (Snedecor and Cochran, 1967). Traits exhibiting high correlation were subsequently analyzed using multivariate principal component analysis. The aim of principal component analysis is to capture the maximum proportion of variance from the original set of variables while minimizing the number of composite variables. This approach assumes that the communalities (common variance) represent a small fraction of the total variance. Varimax rotation was applied to the principal components to transform them into a simpler structure. To assess whether the dataset, which included 1,163 animals and thirteen traits, was suitable for factor analysis, Bartlett’s test (1950) was initially performed, in accordance with the recommendation by Maxwell (1959). The dataset’s validity was further confirmed at a 1% significance level using the Kaiser-Meyer-Olkin (KMO) test for sampling adequacy. To determine the number of factors, the criterion established by the Kaiser rule (Johnson and Wichern, 1982) was used, retaining only factors with eigen values greater than 1. The suitability of the common factor model was evaluated using Kaiser’s measure of sampling adequacy (MSA), with an MSA below 0.5 considered as unacceptable. Principal component analysis, as described by Everitt et al. (2001), is a method for converting variables in a multivariate dataset (X1, X2, …, Xn) into a set of uncorrelated variables (Y1, Y2, …, Yn). These new variables explain a reduced proportion of the total variance present in the original variables specified as:

        Y= a11X1+a12X2+…+ a1nXn
        Y2 = a21X1+a22X2+…+ a2nXn
        Yn = an1X1+an2X2+…+ anXn

The principal components Y1, Y2, ..., Yn capture reduced portions of the overall variance observed in the original variables X1, X2, ..., Xn. To enhance the interpretability of the extracted principal components, orthogonal rotation was applied, which aimed to maximize variance in the linear transformation of the factor pattern matrix. The principal component analysis was performed using the factor module in SPSS 24.
Phenotypic correlation

In the present study the magnitude of correlation coefficient ranges from  -0.658 to 1.000 (Table 1). Highly significant positive correlation found between reproductive traits viz. ASM with AFFS and AFF and between AFF and AFFS. Similar finding of strong positive correlation between AFS and AFF was exhibited in Large White pig (Yu et al. 2022). Highly significant negative correlation was reported between FI with ASM, AFFS and AFF. Significant effect of BWB with FI; BW5 with LSW and LWW; and BW8 with TN was also observed. Highly significant correlation between litter traits such as LSB with LWB, LSW and LWW and between LWB with LSW and LWW were observed in the present study. Similar observations of high and positive correlation between the litter traits in Hampshire×Indigenous pigs was reported by Phookan et al. (2013). Also, Nowak et al. 2020 reported strong positive correlations between the litter-related traits that was between litter size and the number of piglets born alive (0.90), the number and the percentage of piglets born alive (0.88), the numbers of piglets born alive and weaned (0.78) and litter size and the number of weaned piglets (0.68) in Polish Large White, Polish Landraceand Yorkshire, Duroc, Berkshire and Hampshire animals kept on farms located in Poland and in the United States. Further, comparable observation of high and positive phenotypic correlations among the litter traits was revealed by Panda et al, 2020 in crossbred pigs and Bayan, 2022 in HD-K 75 pigs. Strong positive genetic correlations between litter size from farrowing to weaning were found in black pigs of Taiwan (Lee et al, 2022). High and positive phenotypic correlation obtained in the present study suggested that piglets having larger litter size at birth is suggestive of larger litter size at weaning, higher litter weight at birth and higher litter weight at weaning. This justify scope of genetic improvement of litter traits by favoring the selection for larger litter size at weaning and higher litter weight at weaning based on the superiority of litter size and weight of the piglets at birth. However, Pandey and Singh, (2010) observed that litter size at birth had negative correlation with litter size at weaning, litter weight at birth and weaning in Landrace, Desi and their cross-bred pigs. Also, the phenotypic correlation of litter size at weaning with litter weight at birth and at weaning and farrowing interval had negative and non-significant effect. They found litter weight at birth had negative phenotypic correlation with litter weight at weaning.

Principal component analysis

Principal component analysis was applied to thirteen different economic traits in HD-K75 pig. The Kaiser-Meyer-Olkin method yielded a measurement of sampling adequacy of 0.659 (Table:  2) whereby eigen values more than 1 were considered. This value assesses the adequacy of the data for each factor in providing reliable results for PCA. The measure of sampling adequacy below 0.5 is considered to be inadequate (Khargharia et al., 2015). The varimax rotation method was employed to maximize the sum of loading squares (Fernandez, 2002). Bartlett’s test of sphericity was applied to assess the significance of the correlation matrix. The resulting chi-square value was highly significant (P<0.01), measuring 442.809 (Table 2). The sum of squared loadings were extracted by PCA, variation explained by each component (Table 3) and Eigen values are given in Fig 1. Four, out of the thirteen components were selected using the Kaiser Rule Criterion (Johnson and Wichern, 1982) to determine the significant number of components. The cumulative variance of 84.18% (Table 3) was accounted by four principal components (PC1, PC2, PC3 and PC4), each with eigen values more than 1. The variance of each trait was as per explained by PCA (Mavule et al., 2013). The component plot in rotated space depicts the distribution of the eight components, as illustrated in Fig 2.

In the present study, the first principal component (PC1) explained for 29.71% of the total variance (Table 3). PC1 represented by a very high component loading for LSW and LWW in HD-K75 pig. The second principal component (PC2) explained for 29.33 % of total variance. PC2 showed high loadings on LSB. The third principal component (PC3) explained 12.68% of total variance which described high loads on BW5. The fourth principal component (PC4) explained for 12.45% of total variance which described high loads on BW8. Table 4 represents the coefficient of principal component analysis of rotated component matrix. The different weights were assigned by PC1, PC2, PC3 and PC4 showing different component loadings in HD-K75 pig. PC1 represented highest variance. Yakubu et al. (2009) in Fulani cattle, Pundir et al. (2011) in Kankrej cows, Yunusa et al. (2013) in Balami and Uda sheep, Tolenkhomba et al. (2013) in Mizo local pigs, Khargharia et al. (2015) in Assam hill goat and Okoro et al. (2015) in crossbred pigs of Nigeria also reported that the first factor explained maximum variation. Okoro et al. (2015) in crossbred pigs of Nigeria extracted two factors of 6 body confirmation traits at pre-weaning stage which accounted for 91.63% of total-variation. They however extracted only one factor at post-weaning stage explaining 73.63% of the total variation. Tolenkhomba et al. (2013) reported a single factor of 5 body measurements which was extracted by at birth, 4 week weaning, 14 week and 18 week which accounted 60.41, 69.79, 67.49, 81.85 and 87.73% of total variance in Mizo local pigs. Kramarenko et al., (2018) observed three principal components (PC) which accounted for near 80% of the dependency structure.PC1 accounted for 33.6% of the total variance and was influenced by total number of  piglets born (TNB) and number of  piglets born alive (NBA); PC2 accounted for 27.1% of the total variance and linked to NBA (positive), number of stillborn piglets (NSB) and frequency of stillborn piglets (FSB) (negative) and PC3 accounted for 18.7% of the total variance and contrasted sows having large number of weaned piglets and low pre-weaning mortality in piglets with sows having small number of weaned piglets and high pre-weaning mortality in piglets. Panda et al. (2020) could extract 3 PCs at 4 week, 2 PCs at 6 week and 2 PCs at 8 week in crossbred pigs which had more than 1 eigen values and explained 74%, 74% and 75% of total variance. PC1 was represented by heart girth and body length, PC2 by height at shoulder and height at foreleg and PC3 by snout circumference. However, Panda et al. (2020) in another study on liter size of crossbred pigs reported a single PC which was extracted with an Eigen value greater than 1. It accounted for 82.24% of total variance. PC1 was characterized by litter size at 4, 6 and 8 week. This factor seemed to be representing the overall performance of the litter.

The communality values ranged from 0.980 (ASM and AFFS) to 0.608 (BWW) across all economic traits (Table 5). The trait like BWW had lower communality indicating that this trait is less effective in explaining the performance in HD-K75 pig and traits including ASM, AFFS, AFF, LSB, LSW, LWW showed high communalities indicating that these traits will be effective in breeding programmes for selection of HD-K75 pig.
The study suggests that principal components provide an effective approach for selecting animals by considering a set of inter-correlated variables. A total of four PCs have been extracted from a total of 13 economic traits considered. These components determine the underlying source of shared variability in explaining the performance in HD-K75 pigs. PC1 had the largest share of overall variance (29.71%) which has high loading on LSW and LWW followed by PC2, PC3 (29.33%, 12.68% and 12.45%) of the total variance. PC2 showed high loadings on LSB. PC3 described high loads on BW5. PC4 explained high loads on BW8. The percentage of cumulative variance accounted for 84.18%. Thus, the four main components extracted could be used inbreeding programmes with reduction in the number of economic traits to be recorded for explanation of maximum variability for prediction of performance in HD-K75 pigs. This can lead the selection procedure to be more accurate and also time and labour saving.
The authors declare that there are no conflict of interest.

  1. Bartlett, M.S. (1950) Tests of significance in factor analysis. British Journal of Psychology. 3: 77-85.

  2. Bayan, Jyotishree. (2022) Genetic studies on the performance of HD K75 pigs. PhD thesis submitted to Assam Agricultural University, Jorhat, Assam.

  3. Everitt, B.S., Landau, S., Leese, M. (2001). Cluster Analysis. 4th edn. Arnold Publisher, London.

  4. Fernandez, G. (2002). Data Mining using SAS Application. USA: Chapman and Hall, CRC press.

  5. Green, H.E., Oliveira, H.R.D., Alvarenga, A.B., Scramlin Zuelly, S., Grossi, D., Schinckel, A.P., Brito, L.F. (2024). Genomic background of biotypes related to growth, carcass and meat quality traits in Duroc pigs based on principal component analysis. Journal of Animal Breeding and Genetics. 141(2):163-178. https://www.sentinelassam. com/topheadlines/assam-piggery-mission-piggery-farmers-submit-report-to-cm-himanta-biswa-sarma 549525 dated 6th August, (2021).

  6. Johnson, R.A. and Wichern, D.W. (1982). Applied Multivariate Statistical Analysis. USA: Prentice-Hall Inc.

  7. Jolliffe, I.T. and Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical transactions of the royal society A: Mathematical, Physical and Engineering Sciences. 374(2065): 20150202.

  8. Khargharia, G., Kadirvel, G., Kumar, S., Doley, S., Bharti, P. K., Das, Mukut. (2015). Principal component analysis of morphological traits of Assam Hill goat in Eastern Himalayan. Indian Journal of Animal and Plant Science. 25(5): 1251- 1258.

  9. Kramarenko, S.S., Lugovoy, S.I., Lykhach, A.V., Kramarenko, A.S., Lykhach, V. (2018). A comparative study of the reproductive traits and clustering analysis among different pig breeds. Scientific Messenger of Lviv National University of Veterinary Medicine and Biotechnologies. 20(84): 21-26.

  10. Mavule, B.S., Muchenje, V., Bezuidenhout ., Kunene, N.W. (2013). Morphological structure of Zulu sheep based on principal component analysis of body measurements. Small Ruminant Research. 111: 23-30.

  11. Maxwell, A.F. (1959). Statistical methods in factor analysis. Psychological Bulletin. 56(1): 228-235.

  12. Nowak, B., Mucha, A., Kruszyñski, W., Moska, M. (2020). Phenotypic correlations between reproductive characteristics related to litter and reproductive cycle length in sows. Czech Journal of Animal Science. 65(06): 205-212 https://doi.org/10.17221/108/2020-CJAS.

  13. Okoro, V.M., Ogundu, U.E., Okani, M., Oziri, I., Eneowo, O., Olisenekwu, O.T., Kadurumba, O., Ogbuewu, I.P., Onyemauwa, S., Ukwu, HO., Ibe, S.N. ( 2015). Principal component analysis of conformation and blood marker traits at pre-and postweaning stages of growth in F2 crossbred Nigerian indigenous × Landrace pigs. Animal Biotechnology. 26(4): 243-50.

  14. Panda, S., Gaur, G.K., Jena, D., Chhotaray, Supriya., Tarang, M., Wara, A.B. (2020) Principal component analysis of litter traits in crossbred piglets. The Pharma Innovation Journal 9(1): 31-33.

  15. Panda, S., Gaur, G.K., Sahoo, N.R., Bharti, P.K., Kar, J. (2020) Principal component analysis of morphometric and growth traits in crossbred piglets. Indian Journal of Animal Sciences. 90(8): 1168-1171. https://doi.org/10.56093/ijans.v90i8.109303.

  16. Pandey A., Singh S.K. (2010) Genetic and phenotypic correlation among body weight at various ages and among the reproductive traits in landrace desi and their cross-bred pigs. Veterinary Science Research. 1(1): 0976-9978.

  17. Phookan, A., Roy, T.C., Goswami, R.N., Kalita, D., Laskar, S., Roychoudhury, R. (2013). Effect of some non-genetic factors on litter traits in crossbred pigs. Indian Veterinary Journal. 90(5): 133-134.

  18. Pundir, R.K., Singh, P.K., Singh, K.P., Dangi, P.S. (2011). Factor analysis of biometric traits of Kankrej cows to explain body conformation. Asian Australasian Journal of Animal Science. 24(4): 449-456.

  19. Snedecor, G.W., Cochran, W.G. (1967). Statistical Methods. Oxford and IBH Publishing Co., New Delhi, India.

  20. Tolenkhomba, T.C., Saikia, P., Hmar, L., Prava, M., Singh, N.S. (2013) Principal component analysis of body measurements in Mizo local pigs. Indian Journal of Veterinary Research 22(1): 26-31.

  21. Twentieth Livestock Census Report, (2019). Ministry of Agriculture, Department of Animal Husbandry, Dairying and Fisheries, Krishi Bhawan, New Delhi.

  22. Wold, S., Esbensen, K., Geladi, P. (1987). Principal component analysis. Chemometrics and Intelligent Laboratory Systems. 2(1-3): 37-52.

  23. Yakubu, A., Ogah, D. M., Idahor, K.O. (2009). Principal component analysis of the morphostructural indices of White Fulani cattle. Trakia Journal of Sciences. 7: 67-73.

  24. Yu, G., Wang, C., Wang, Y. (2022)  Genetic parameter analysis of reproductive traits in Large White pigs. Animal Bioscience. 35(11): 1649-1655 https://doi.org/10.5713/ab.22.0119 pISSN 2765-0189 eISSN 2765-0235.

  25. Yunusa, A.J., Salako, A.E., Oladejo, O.A. (2013). Principal component analysis of the morphostructure of Uda and Balami sheep of Nigeria. International Research Journal of Agricultural Science. 1(3): 45-51.

Editorial Board

View all (0)