Whole genome sequencing
For each sample, the depth of coverage for mitochondrial genome ranged from 145.82 to 1652.72 x with an average of 802.59 x when the paired-end reads generated were aligned to the
Bos indicus reference genome sequence (GCF-000247795.1-
Bos-indicus-1.0). The mitochondrial genome sequences ranged from 16,338 to 16,340 bp with 13 protein-coding genes, two ribosomal RNA (12S and 16S rRNA) genes, 22 transfer RNA (tRNA) genes and one control region of 913 bp (D-loop) in cattle breeds of Tamil Nadu as found in other mammals.
Variant calling
A total of 273 variants observed across 13 protein coding regions from the 15 pooled samples sequenced; of which, 268 were bi-allelic SNPs, three multi-allelic SNPs and two InDels. Pooling of the samples strenghtened the richness of variant calling in the present study as also supported by
Gautier et al., (2013). Of the biallelic SNPs, 256 SNPs were observed to be transitions while the remaining 12 were transversions. Out of total variants detected, 157 variants were found to be synonymus mutations with very low effect; whereas, 30 variants were observed to be non synonymous (missense) mutations Eighty six variants (84 SNPs and 2 InDels) were detected on the upstream region and have a modifier effect on the protein synthesis. List of variants observed in all the 13 protein coding regions were presented in Table 1. Highest number of variants was found in Pulikulam cattle across the coding regions. ND1 gene was observed to have highest number of variants (71) across 15 samples but is restricted to 45 loci indicating its polymorphism as 1.57 per locus. Lowest number of variants (5) were observed in ATP8 (five loci) and ND4L (four loci) genes. Information pertaining to number of polymorphic loci pertaining to 13 protein coding genes in five cattle breeds of Tamil Nadu is shown in Fig 1.
Genetic structure of indigenous cattle breeds
The amount of genetic variation within a population provides an insight into the demographic structure and evolutionary history of a population. The polymorphism and genetic diversity indices of five cattle breeds were detected using DnaSP v 6.12.03 software for the 15 mitochondrial DNA sequences and are presented in Table 2. The polymorphism, measured in terms of nucleotide diversity (p) is the average number of nucleotide differences between two sequences. Moderate amount of polymorphism was observed in Alambadi, Bargur, Kangayam, Pulikulam and Umblachery cattle breeds with average difference in nucleotides between the two sequences within the populations as 0.11, 0.12, 0.14, 0.76 and 0.12 per cent respectively. A total of 390 sites were found to be polymorphic of which, 46, 58, 43, 205 and 38 were observed in Alambadi, Bargur, Kangayam, Pulikulam and Umblachery cattle breeds respectively. Of these 390 sites, 304 sites were found as segregating. Number of polymorphic sites in the present study are more (390) than that revealed by
Chung (2013) in Korean cattle (286-288). A total of 22 regions in Alambadi, Kangayam and Umblachery; 23 regions in Pulikulam and 25 regions in Bargur cattle are found to be conserved significantly (p<0.05).
Genetic diversity has a vital role in the survivability and adaptation of the populations. The average number of nucleotide differences between the two sequences (p) within the populations and the number of segregating sites revealed high haplotype diversity. All the 15 sequences of five cattle breeds were categorised into 15 different haplotypes with haplotype diversity (HD) of 1.0 indicating high degree of haplotype diversity. Haplotype diversity of the present study was more than that reported by
Chung (2013) in Korean cattle (0.052 to 0.20) and
Petretto et al., (2022) in Sardinian local cattle stock (0.879). The genetic differences were observed between populations of the same breed, indicating that the identified haplotypes may be used to characterize group specificities of each cattle breed (bulls, moderate yielders, low yielders).
Genetic differentiation
DnaSP was used to compute nucleotide test statistics such as Ks, Kst , Snn and haplotype statistics such as Hs and Hst to describe the genetic divergence within the populations. All the 15 samples were separated into 15 haplogroups. The Hs, Hst, Ks, Kst, Z and Snn values observed in the present study were 1.00, 0.00, 45.00, -0.032, 57.20 and 0.033 respectively. As per overall genetic differentiation metrics, it was observed that no genetic differentiation among the
Bos indicus cattle breeds of Tamil Nadu under study. High gene flow between the types, which could be attributable to introgression during breeding programmes and subsequent selection, could explain the reason for no genetic distinction.
Gene flow
Gene flow is a crucial technique for spreading genetic variation between populations. The overall F
ST value was 0.4932 indicating genetic differentiation among the populations analysed when 24 sequences from
Bos indicus and
Bos taurus cattle were included together. This indicates that these populations are genetically distinct. On the contrary, the overall F
ST was -0.04435 with a net migration rate of 11.77 when only
Bos indicus sequences of Tamil Nadu were included (three each from Alambadi, Bargur, Kangayam, Pulikulam and Umblachery) indicating high rate of intermixing among the five populations and the effective migrants were also similarly high (11.77), suggesting high gene flow between them. Gene flow at high rates can minimise genetic differentiation between the two populations, resulting in increased homogeneity. Because the majority of cattle breeding in India have not been systematically evolved, gene movement between populations is generally expected. F
ST value observed in the present study (value) was lower than that reported by
Petretto et al., (2022) in Sardinian local cattle (0.056) with high amount of genetic differentiation.
Neutral evolution
The test of neutral evolution analyzed based on the total number of mutations and segregating sites across all 15 sequences of
Bos indicus breeds, revealed statistically significant (P<0.05) negative values for Tajimas’s D (-2.095), Fu and Li’s D (-2.974) and Fu and Li’s F (-3.150) test statistics. A significant negative Tajima’s D (P<0.05) for all the
Bos indicus cattle indicated an excess of low frequency polymorphisms than expected, indicating existence of purifying selection and population expansion among the genetic groups under study. Similar findings were reported by
Petretto et al., (2022) in Sardinian local cattle stock.
Phylogenetic analysis
Maximum likelihood based phylogenetic analysis was performed with 15 complete mitogenomes of five breeds combined with eight taurine sequences and one Nellore cattle reference mitogenome sequence and is shown in Fig 2. HKY+G+I (Hasegawa-Kishino-Yano + Gamma distribution + Evolutionarily invariable sites) model was found to have lowest AICC and BIC values and was selected as the best model to explain the divergence of sequences. The phylogenetic tree divided all the 24 sequences into two distinct haplogroups:
Bos indicus and
Bos taurus (Fig 1). It divided
Bos indicus clade into two subclades. Interestingly, Zwerg Zebu, an European dwarf zebu cattle breed was placed in the
Bos indicus group. Similar findings were observed by
Pramod et al., (2018) where, Zwerg Zebu cattle was placed in the
Bos indicus lineage. It was also found that one sequence from Pulikulam breed of cattle (PCG2) was unambiguously associated with the
Bos taurus haplogroup, but representing an unknown divergent mitochondrial sub-haplogroup.
Multi-dimensional scaling
Multi dimensional scaling clearly segregated 24 sequences into two different clusters as
Bos indicus and
Bos taurus cattle. These results were in accordance with the phylogenetic analysis. The Zwerg Zebu cattle was placed in
Bos indicus cluster whereas one sequence from Pulikulam cattle to be joined to
Bos taurus cluster. The MDS plot of 24 mitochondrial sequences belonging to
Bos indicus and
Bos taurus cattle is displayed in Fig 3.
Analysis of molecular variance (AMOVA)
AMOVA analysis indicated there was no differentiation of the five cattle breeds of Tamil Nadu (between-population component of variation is zero) although all the diversity was gathered at the within-breed level (Table 3).