Rapidly evolving gene families
In 16 species, orthogroups number of genes was 324580 and percentage of genes in orthogroups was 97.8%; number of orthogroups was 19498; number of species-specific orthogroups was 752; number of genes in species-specific orthogroups was 4002; mean orthogroup size was 16.3; median orthogroup size was 16; number of single-copy orthogroups was 8809. In YFP, 501 gene families showed rapidly expansion and 220 gene families showed rapidly contraction, which was consistent with the number of expanding and contracting gene families as shown in Fig 1. For YFP, 860 genes were obtained; 270 genes were lost; 18777 genes were no change; in general, YFP showed genes expansion.
Functional enrichment analysis of 1158 genes (genes of rapidly expansion 501 gene families) was executed. Only clusters associated with reproductive activities enrichment with p<0.05 were listed (Fig 2). The most significant GO terms mainly consisted of biological processes which were involved in germ-line stem cell population maintenance, regulation of androgen receptor signaling pathway and male germ-line stem cell population maintenance. This could be a genetic adaptation for YFP to remain fertile when it was old. According to the conversion of body length and age, the females of YFP in Poyang Lake may still be able to reproduce at the age of 18-19. YFP, meanwhile, had 220 gene families showed rapidly contraction.
Functional enrichment analysis of 726 genes (220 gene families of YFP rapidly contraction and the gene set was the corresponding
Mus musculus genes) was executed. Only clusters associated with reproductive activities enrichment with p<0.05 were listed (Fig 3). The most significant GO terms mainly consisted of biological processes which were involved in positive regulation of estrogen secretion, mating and estrogen metabolic process.
Ireland et al., 1984 believed that estrogen was mainly secreted by dominant follicles on the ovary
(Ireland et al., 1984) and the conception rate of female animals might be affected by the concentration of estrogen secreted by dominant follicles
(Kiewisz et al., 2011). This might be related to YFP’s polygyny mating system
(Hao et al., 2006), the fact that no YFP gave birth to more than 1 baby per birth (
Li, 2017) and the pregnancy rate of YFP in Poyang Lake was as high as 70% (
Mei, 2013).
Positive selection genes and protein functional differentiation
According to the branch site model detection, GDF9 and BMP15 genes exhibited purifying selection. For FSHR gene, 11 positive selection sites with a posterior probability greater than 0.5 with experience were found. For FSHβ gene, YFP,
Phocoena sinus and
Bos taurus all had two homologous genes. Therefore, FSHβ genes could form 8 sequence combinations. No positive selection sites were found in one combination and positive selection sites were found in the other seven combinations. Only FSHβ gene needs to be further expanded for protein functional differentiation test, seeing Materials and Methods for detailed description. For B FSHβ gene set, no functional differentiation was found between the clusters. For A FSHβ gene set, there were 1-4 functional differentiation sites in cetaceans, as shown in Table 2. Further analysis revealed that the L at position 40 is a specific site for the YFP, Phocoena sinus and baiji, as shown in Fig 4-5. The FSHβ protein sequence of YFP and
Phocoena sinus had fragment insertions, as shown in Fig 6. YFP,
Phocoena sinus and
Bos taurus all had two homologous genes for FSHβ, but only FSHβ protein sequence of YFP and
Phocoena sinus had fragment insertions. Further analysis revealed that the L at position 40 was a specific site for the YFP,
Phocoena sinus and baiji.
The relationship between genes of GDF9, BMP15, FSHβ, FSHR and ovulation number is the main object of research on multiple fertility traits in cattle and sheep. Similar to previous studies examining positive selection, the majority of genes investigated in this study were under purifying selection, which is not surprising given the evolutionary constraint on protein coding genes. However, signs of significant positive selection were detected in 2 genes (FSHR and FSHβ). This suggests that FSHR and FSHβ evolved fast along all YFP lineages examined and their positive selection might be associated with fecundity traits. The mutation rate of FSHR gene exon 10 was significantly different between twin and single bovine cattle
(Lei et al., 2004). It was inferred that FSHR gene exon 10 was related with the major gene that controls the high prolificacy of goat and the individuals with FSHR gene mutation showed high lambing number, indicating that the high expression of FSHβ in the ovary promoted the high ovulation number, which was the result of the mutual regulation between FSHR and FSHβ gene (
Ji, 2007).