Overview of transcriptome sequencing data
Total transcriptome sequencing data was 41.28 Gb. After filtering low-quality reads, 39.63 Gb of data was obtained, with an average of 6.605 Gb per sample. The mean Q20 and Q30 for all samples was 97.415% and 92.99%. GC content of high quality reads in all samples was higher than 45.89%. All sequencing information is shown in Table 2. We mapped clean reads to the reference genome, the total mapped reads of samples ranged from 88.23%~91.55%, the unique mapped reads was 84.75%~87.61% and the multiple mapped reads were less than 4.37%. These results suggest that transcriptome sequencing data can be used for further analysis.
Identification of mRNA in ussuri raccoon dog ovary tissue
We identified 31,149 mRNA transcripts in 6 ovarian samples, including 29,143 protein-coding mRNAs and 1,986 new mRNAs. Only 408 genes (1.40%) were highly expressed in all samples, the FPKM value exceeds 60. More than 44% of the genes were not expressed in all six samples and the FPKM value was less than 1. There were 13,886 genes (FPKM>1) co-expressed in two groups (Fig 1A). The distribution of gene number in different FPKM intervals of each sample is shown in (Fig 1B).
Analysis of DEGs in the ussuri raccoon dog ovary
In the comparison of HF and LF group, totally 612 DEGs were identified, including 277 upregulated and 335 downregulated DEGs, respectively (Fig 2A). We performed a clustered heatmap analysis and the results. In addition, the DEGs were divided into several clusters according to the log
2 (fpkm+1) level and the genes in the same cluster showed similar expression pattern under different treatment conditions. Among the significantly up-regulated genes in the HF group, BMPR2, SMAD9, SOX5, BMP5 and RIF1 were found to be related to reproduction.
Functional analysis of the DEGs
GO and KEGG functional enrichment analysis of DEG sets were conducted and the enriched GO terms were classified into three categories. A total of 536 GO terms were enriched, of which 267, 64 and 205 terms were enriched in BP, CC and MF terms, respectively. The top 10 GO terms of each category are displayed in Fig 2B. KEGG analysis results revealed that 20 pathways were found to be enriched (p-value <0.05), including Wnt signaling pathway and osteoclast differentiation (Fig 2C). In addition, KEGG enrichment analysis also found that the Circadian entrainment pathway associated with reproduction was enriched.
Novel gene identification
The unannotated transcription regions were analyzed and the mapped reads were assembled to identify the novel isoforms of knowned or novel genes. A total of 1,986 novel genes were discovered in this study. We found that all the novel genes have more than one transcript. Studies have shown that protein-coding genes are usually multi-exon (
Abdel-Ghany et al., 2016;
Pan et al., 2008). The novel genes were annotated with public databases sequentially and the results showed that 208 (10.47%), 40 (2.01%) and 75 (3.78%) of them could be found in the Swiss-Prot, GO and KEGG databases, respectively. A total of 1,751 (88.17%) isoforms were not found in any of the protein databases. The results suggested that the proportion of protein-coding transcripts in the 1,986 novel isoforms was relatively low.
Alternative splicing events
A total of 19,200 AS events were obtained by alternative splicing analysis and two of the five basic types were covered (Fig 3A). Among the basic AS types, 17,457 SE events and 1,743 MXE events were identified respectively. In the comparison of the two groups, 247 AS events were recognized as DAS events (FDR £0.05 ), including 143 MXE and 94 SE events which in turn corresponding to 135 and 94 genes, respectively. Notably, there are 9 genes that have both identified DAS types simultaneously (Table 3).
Integrated analysis of DAS events and DEGs
To further evaluate the effect of DAS genes in HF and LF groups, we found an intersection between the DEG list and the DAS gene list. Among the DEGs, 8 are DAS genes, of which 5 are upregulated and 3 are downregulated in HF compared to LF (Fig 3B, C).
Of all the up-regulated DAS-DE genes in HF group, we found the ABCA6 gene, which is an important lipid transporter, ABCA6 expression was detected in human placenta and increased with the progression of pregnancy
(Imperio et al., 2019). In our results, ABCA6 was significantly overexpressed in HF groups (P<0.05) and the MXE events were identified. Considering its critical role in the regulation of immunological responses, steroidogenesis and placental barrier function and integrity, the difference in expression and transcription may be the fundamental for a more productive pregnancy. In addition, we also discovered another gene that regulates lipid metabolism, GPAM, which is an enzyme that catalyzes the synthesis of triglycerides
(Igal et al., 2001). Lipid metabolism plays an important role during early pregnancy. Lipid metabolism in the ovary and uterus affects the microenvironment in which oocytes and embryos develop and is critically important for blastocyst development potential (
Lain and Catalano, 2007;
Ye et al., 2021). Maternal lipid metabolism before and during early pregnancy has profound impacts on oocyte quality and embryo survival, as well as the ovarian and uterine micro-environments, which determines the subsequent growth and development trajectories of offspring. In our analysis, GPAM was significantly up-regulated in HF groups (P<0.01) and the SE events occurred. The findings may suggest that GPAM plays an important role in the high reproduction pregnancy of raccoon dogs.
In this study, RNA-seq and transcriptome analysis with reference genome were performed systematically on ovulatory ovarian samples from raccoon dogs with high and low fecundity. The results of the base quality and composition analysis indicating the high-quality of libraries and sequencing.
The DEGs were enriched in the Circadian entrainment pathway which related to the reproduction process. The circadian entrainment pathway has been reported to be closely related to the secretion of reproductive hormones
(Alvord et al., 2022). A study also found that fertility in middle-aged mice could be improved or reduced according to differences in the light-dark cycle (
Takasu et al., 2015). Six genes involved in this pathway were significantly differentially expressed in our results. NOS1, PLCB1, FOS and GNAO1 genes were up-regulated, meanwhile, PRKCB and GNG7 genes were down-regulated in HF group (Fig 4). GNRH (Gonadotropin-releasing hormone) is the master regulator of fertility and reproduction. The release pattern of GnRH by the hypothalamus includes both pulses and surges. Virginia Delli
et al review present the idea of a Kisspeptin-nNOS-GnRH network that is responsible for generating the “GnRH pulse” and “GNRH surge”. Nitric oxide synthase I (NOS1) gene mediates the “OFF” signal on GnRH secretion while the kisspeptin a product of KISS1 gene provides the “ON” signal, promoting GnRH release
(Delli et al., 2021). Furthermore, PLCB1 was also a critical downstream gene of the KISS1/GPR54 signaling pathway
(Zhu et al., 2022). The differential expression of these key genes in the Circadian entrainment pathway may be the important factor for improving the reproductive performance of ussuri raccoon dog.
Among the genes that were significantly up-regulated in the HF group, we also found several other genes related to reproduction. BMPR2 is a member of the bone morphogenetic protein family. Studies have found that BMPR2 plays biological function through BMP/SMAD signaling pathway (
Canty-Laird et al., 2010;
Tan et al., 2017). This pathway is widely believed to be closely related to mammalian ovarian granulosa cell proliferation, reproductive hormone synthesis and secretion, oocyte maturation and ovulation and other physiological activities (
Lain and Catalano, 2007). BMPR2 mainly regulates the ovulation activity of animals by affecting the synthesis and secretion of reproductive hormones such as estradiol (E2), follicle-stimulating hormone (FSH) and luteinizing hormone (LH)
(Regan et al., 2017; Sirard, 2016). We also found SMAD9 and BMP5 genes belonging to the BMP/SMAD pathway among the elevated genes in HF group. The high expression of these genes may play an important regulatory role in follicular growth and multiple pregnancy of raccoon dogs. SOX
5 plays an important role in germ cell development and gender differentiation decisions,
Manfred Schartl et al., (2018) found SOX5 is an evolutionarily conserved regulator of germ-cell number in medaka fish.
In the integrated analysis of DAS events and DEGs, we found key genes related to lipid metabolism and transport, suggesting that these genes could be used as crucial markers to identify high-fertility traits in raccoon dogs. ABCA6 and GPAM genes, involved in lipid transport and metabolic regulation, are significantly highly expressed in the ovarian tissues of high-fecundity raccoon dogs. Their increased expression and the occurrence of variable AS events may be the fundamental for more fetal pregnancies in raccoon dogs. Furthermore, the GPAM gene shows a missense variant (c.234T>A, p.Phe78Leu) of amino acids in all three samples of the HF group according to SNP calling. Further molecular biological experiments are needed to explore the possible effect on the function of GPAM with the protein structural changes.