Indian Journal of Animal Research

  • Chief EditorK.M.L. Pathak

  • Print ISSN 0367-6722

  • Online ISSN 0976-0555

  • NAAS Rating 6.50

  • SJR 0.263

  • Impact Factor 0.4 (2024)

Frequency :
Monthly (January, February, March, April, May, June, July, August, September, October, November and December)
Indexing Services :
Science Citation Index Expanded, BIOSIS Preview, ISI Citation Index, Biological Abstracts, Scopus, AGRICOLA, Google Scholar, CrossRef, CAB Abstracting Journals, Chemical Abstracts, Indian Science Abstracts, EBSCO Indexing Services, Index Copernicus

Domestication and Genetic Diversity Analysis of Several Sheep Breeds in China

Bujun Mei1,2,*, Zhihua Wang3, Yanping Liu2, Rong Liu4
1Laboratory of Small Ruminant Genetics, Breeding and Reproduction, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan430070, People’s Republic of China.
2Department of Agriculture, Hetao College, Bayannur, 015000, People’s Republic of China.
3Department of Civil Engineering, Hetao College, Bayannur, 015000, People’s Republic of China.
4Inner Mongolia Autonomous Region Farming and Pasture Science and Technology Extension Station, Huhhot, 010000, People’s Republic of China.
Sheep is the main livestock species in northern China. Under diverse climatic conditions, a gene pool of local sheep resources is now forming. There are many sheep breeds in northern China, including local breeds and hybrid breeds. However, little is known about the genetic background and genetic variation of local sheep populations in northern China. Therefore, our aims is to provide a characterization of the population structure and genetic relationship within sheep breeds in northern China, and to study its genetic relationship with other sheep breeds. We collected blood samples from 390 sheep for genetic diversity analysis using Reduced-Representation Genome Sequencing (RRGS). This study used bioinformatics methods to analyze linkage disequilibrium, population structure, evolutionary tree, selective sweep etc. The mixed model revealed in this study may be formed in the process of sheep breeding, partially overlapping with historical and archaeological findings. New information on the domestication and genetic variety of sheep is provided by this study, which will help sheep in the future be improved through genomics.
 
This study uses whole genome sequencing (WGS) to re-sequence different individuals or tissues of sheep with known genomic information to discover differences between individuals. Through this method, genome-wide SNP and InDel molecular markers were developed, and SNP/InDel markers were used for population genetics research.
 
Animals and genome sequencing
 
Two sheep populations-NKY and THT - were analyzed in this study. A total of 5 mL jugular vein blood was collected from 390 individuals (185 NKY and 205 THT population) in 2022 at Bayannaoer Academy of Agriculture and Animal Husbandry Sciences, Inner Mongolia, China. After the sample ‘s genomic DNA was qualified, the DNA sequence was fragmented by ultrasound to form a random fragment.
 
Bioinformatic analysis workflow
 
After the Illumina NovaSeqTM sequencing data (Raw Data) is offline, the quality control of the offline data is performed, and the low-quality data is filtered to obtain high-quality data (Clean Data). Clean Data was aligned to the reference genome sequence using BWA software(Li and Durbin 2009)to obtain the sequence location attribution (i.e., BAM file). BAM files were corrected using GATK ‘s Best Practices process (McKenna et al., 2010), and SNP and Small InDel markers were detected. The functional annotation information of SNP and InDel was obtained by using SNPEff software (Cingolani et al., 2012) and gene prediction information of reference genome. Based on the obtained SNP and InDel molecular markers, genetic diversity, population structure, linkage disequilibrium and selective sweep were further studied (see Table 1 for bioinformatics analysis tools).

Table 1: Bioinformatics analysis tools.



Linkage disequilibrium
 
In a population, the frequency of simultaneous inheritance of two genes at different loci is significantly higher than the expected random frequency, which is called linkage disequilibrium. The minimum genetic unit of species can be obtained by linkage disequilibrium analysis.
               
The linkage of SNP in all samples was analyzed by SNP combination on the same chromosome. Natural populations are represented by a linkage disequilibrium coefficient (r2). The closer r2 is to 1, the stronger the linkage. The distance between SNPs in the genome was fitted to r2. Generally, the closer the distance between SNPs, the larger the r2, and the farther the distance between SNPs, the smaller the r2. Generally, the distance between the corresponding sites when r2 decays to half is used as the value of LD-decay. The longer the LD-decay, the slower the LD decay, the greater the probability of linkage between the SNP of the representative species. The shorter the LD-decay, the faster the LD decay, the smaller the probability of linkage between the SNP of the representative species. It is generally believed that the species with fast LD decay are relatively primitive. The LD-decay of each subpopulation is shown in Fig 1.
 

Fig 1: LD of the two populations.


               
Genome-wide linkage disequilibrium (LD) research showed that the NKY and THT populations had nearly the same level of LD and decay, with a lower R2 correlation coefficient becoming stable at a distance of about 100 kb.
 
Population structure
 
Principal component analysis (PCA) is a pure mathematical operation method, which can select a small number of important variables through linear transformation of multiple related variables. Based on SNP, principal component analysis (PCA) was performed by GCTA software to obtain the principal component clustering of 36 samples. Through PCA analysis, we can know which samples are relatively close and which samples are relatively distant, which can assist evolutionary analysis. The results of the principal component analysis that was carried out on the genomic relatedness matrix between individuals are illustrated in Fig 2.
 

Fig 2: PCA clustering diagram of samples.


               
The population structure of the samples was analyzed. The number of clusters (K value) was assumed to be 1-20. The optimal number of clusters was determined to be 14 according to the K value corresponding to the lowest point of CV error (Cross validation error). It reflects that all our samples may come from 14 original ancestors. The population structure with K value of 14 is shown in Fig 3.
 

Fig 3: Population genetic structure based on whole genomes.


 
               
Through whole genome resequencing, it can be determined that there are regions with higher FST values and lower ð/è values on the genome. The q chromosome results are shown in Fig 4.
 

Fig 4: Selective sweep Analysis (Manhattan plot).


               
FST is an important index to measure the degree of genetic differentiation between populations. If an allele in a population experiences adaptive selection due to its high fitness to a specific environment, the increase of its frequency will increase the level of population differentiation, which is reflected in the higher FST value in F statistics. The π/θ represents the nucleotide polymorphism within the population, and the selective sweep will cause the selected genomic region to show that the polymorphism within the population is significantly lower than the average level of the genome, which corresponds to a lower π/θ value.
               
The evolutionary tree is used to represent the evolutionary relationship between species. According to the genetic relationship between various organisms, various organisms are placed on a branched tree-like figure to concisely represent the evolutionary history and genetic relationship of organisms. The phylogenetic tree is shown in Fig 5.
 

Fig 5: Phylogenetic tree.


               
Before human-mediated specialization for wool and milk began some 4000-5000 years ago, sheep were initially raised for access to meat. One of the earliest morphological modifications that likely occurred alongside domestication and is now a trait shared by many modern breeds is the removal of horns, which has recently been demonstrated to be one of those regions of the genome that contain strong evidence for accelerated change in response to artificial selection (Mohamadipoor Saadatabadi et al., 2021). Additionally, sheep’s other genomic areas that are being selected for comprise genes that regulate body size, reproduction and color. The separation of animals into breeds, followed by the discovery of better rams and their disproportionate genetic contribution through artificial insemination, has increased the rate of genetic gain for productive attributes during the previous few hundred years. Numerous genetic exchanges have taken place during the evolution of contemporary breeds, as evidenced by the high haplotype sharing and relatively short divergence times across breeds (Missohou et al., 2022). A population size of more than 300 has been maintained by around 75% of contemporary sheep breeds. There are between 850 and 1409 different breeds. But several sheep breeds, such as the Shetland, Soay, and Herdwick breeds, that were first chosen for their superior performance in a particular, sometimes remote, geographic location are now regarded as uncommon. It has become difficult to maintain genetic variety by preserving these historic breeds as generalist-type sheep have mostly taken over intensive sheep farming (Alberto et al., 2018).
Domestication is complex and entails multiple and parallel events and population admixtures. There are many breeds of sheep in China that have complex genetic exchanges. Native sheep breeds are abundant in China and some of them have unique characteristics including procreation, good lamb pelts, numerous horns, fat tails, and great stress tolerance. The focus of the research has been on all of these. However, there is still debate regarding the origins of sheep breeds and there have been few thorough investigations on the genetic diversity of local sheep in China. Their population dynamics and genetic variety were subsequently discovered, which was beneficial for our comprehension and preservation of these genetic resources. The current research, which for the first time examined the genome-wide diversity and population structure of sheep breeds with a Northern Chinese origin, advanced knowledge of the intricate genetic relationships and anticipated shared ancestry of the many sheep breeds in China.
We are thankful to Bayannaoer Academy of Agriculture and Animal Husbandry for their assistance in the sampling.
This work was supported by Inner Mongolia Autonomous Region Science and Technology Plan Project(No. 2020GG0201), Natural Science Foundation of China (No. 31760660), Natural Science Foundation of Inner Mongolia Autonomous Region (No. 2019MS03092).We are thankful to Bayannaoer Academy of Agriculture and Animal Husbandry for their assistance in the sampling.
The authors declare that there are no competing interests associated with the manuscript.
BJ designed the study and analyzed the RNA-seq data and drafted the manuscript. ZH, YP and RL joined the research and improved the manuscript.

  1. Alberto, F.J., Boyer, F., Orozco-ter, W.P., Streeter, I., Servin, B., de Villemereuil, P., Benjelloun, B.,  Librado, P.,  Biscarini, L.F. et al. (2018). Convergent genomic signatures of domestication in sheep and goats. Nat Commun, 9: 813.

  2. doi: 10.1038/s41467-018-03206-y.

  3. Alexander, D.H., Novembre, J. and Lange, K. (2009). Fast model- based estimation of ancestry in unrelated individuals. Genome Res. 19: 1655-64.

  4. Cingolani, E., Yee, K., Shehata, M., Chugh, S.S., Marban, E. and  Cho, H.C. (2012). Biological pacemaker created by percutaneous gene delivery via venous catheters in a porcine model of complete heart block. Heart Rhythm. 9: 1310-8.

  5. Kalds, P., Luo, Q., Sun, K., Zhou, S., Chen,  Y. and  Wang,  X. (2021). Trends towards revealing the genetic architecture of sheep tail patterning: Promising genes and Investigatory  Pathways. Anim Genet. 52: 799-812.

  6. Li, H. and  Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 25: 1754-60.

  7. McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K. A. et al. (2010). The genome analysis toolkit: A map reduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20: 1297-303.

  8. Missohou, A., Kabore, B., Flori, L., Ayssiwede,  S.B., Hornick, J.L., Raes, M. and Cabaraux, J.F. (2022). Analysis of the genetic diversity and population structure of four senegalese  sheep breeds using medium-density single-nucleotide polymorphisms. Animals (Basel). 12(12), 1512; https:// doi.org/10.3390/ani12121512.

  9. Mohamadipoor, S.L., Mohammadabadi, M., Ghanatsaman, Z.A.,  Babenko, O., Stavetska, R., Kalashnik, O., Kucher, D. et al. (2021). Signature selection analysis reveals candidate genes associated with production traits in Iranian sheep breeds. BMC Vet Res. 17: 369.

  10. Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 22: 2688-90.

  11. Taylor, W.T.T., Pruvost, M., Posth, C., Rendu, W., Krajcarz, M.T.,  Abdykanova, A., Brancaleoni, G., Spengler, R., Hermes, S.T. et al.  (2021). Evidence for early dispersal of domestic sheep into Central Asia. Nat Hum Behav. 5: 1169-79.

  12. Tsartsianidou, V., Kapsona, V.V., Sanchez-Molano, E.,  Basdagianni, Z., Carabano, M.J., Chatziplis, D., Arsenos, G., Triantafyllidis,  A. and  Banos, G. (2021). Understanding the seasonality of performance resilience to climate volatility in mediterranean  dairy sheep. Sci Rep. 11: 1889. doi.org/10.1038/s41598- 021-81461-8.

  13. Tsartsianidou, V., Sanchez-Molano, E.,  Kapsona, V.V.,  Basdagianni,  Z., Chatziplis, D., Arsenos, G., Triantafyllidis, A. and  Banos, G. (2021). A comprehensive genome-wide scan detects genomic regions related to local adaptation and climate resilience in Mediterranean domestic sheep. Genet Sel Evol. 53: 90. doi.org/10.1186/s12711-021- 00682-7.

  14. Weihua, C., Guochao, N., Hao, L., Wang, J. (2022). Integrative transcriptomic and proteomic analysis of ovaries at different physiological periods in dolang sheep. Indian Journal of Animal Research. 56(12): 1492-1498.

Editorial Board

View all (0)