Phenotypes and active ingredients analysis during harvesting
Understanding the formation of secondary metabolites is critical to the quality and marketability of
Astragalus, especially the production of astragaloside IV and calycosin 7-O-β-D-glucopyranoside, two important metabolites that are major components in the formation of herb quality (Fig 1B,C,D). During the growth of
Astragalus, secondary metabolite synthesis-related genes and metabolites showed up-regulated or down-regulated conditions with the extension of harvesting time. It was suggested that the accumulation of astragaloside IV and calycosin 7-O-β-D-glucopyranoside during ripening was likely driven by ACAT, HMGCS, HMGCR, MVK, MVAK2, MVD and other genes
(Wang et al., 2023; Li, 2020;
Zhang, 2020). The production of isofavone, especially calycosin 7-O-β-D-glucopyranoside, appeared to be strongly dependent on formonetin and calycosin metabolites
(Zhang et al., 2022). In this study, the content of secondary metabolite astragaloside IV was significantly higher in C and D periods and calycosin 7-O-β-D-glucopyranoside content was significantly higher in D period.
Growth and development indicators and two important secondary metabolites of
Astragalus roots at different harvesting periods were analysed according to the group’s previous research. Indicators related to herb yield and calycosin 7-O-β-D-glucopyranoside content were highest in period D (Fig 1A,B). Astragaloside IV stayed high level in Period C root and identified as the major contributors to
Astragalus efficacy
(Wu et al., 2020; Li et al., 2024). Besides, higher calycosin 7-O-β-D-glucopyranoside content in herbs after harvesting in periods B and D. Analyses of the contents of the two important active ingredients and the accumulation pattern of the herbs showed that the highest yield, astragaloside IV content and calycosin 7-O-β-D-glucopyranoside content of the herbs after harvesting in period D was the best harvesting period. If batch extraction of astragaloside IV or calycosin 7-O-β-D-glucopyranoside is required for subsequent studies, the herbs could be harvested in stages according to the harvesting strategy (Fig 1C,D).
WGCNA of metabolomic data
The WGCNA was performed to investigate the coexpression networks, in which all co-expressed metabolites were connected to each other with varying correlation strengths. Metabolites were partitioned into 9 co-expression modules (Fig 2A,B). Period D were positively correlated with metabolites expression in the ‘yellow’ module and negatively correlated with metabolites expression in the ‘turquoise’ and ‘turquoise’ modules, with a coefficient of 0.91 and -0.82, respectively (Fig 2A). A total of 637 metabolites were identified based on the correlation (r > 0.8) of metabolites with period D in the tow modules.
Functional analysis of metabolites in correlated modules
KEGG compounds were classified for the two modules screened in relation to the
Astragalus roots harvesting period D. Classification of KEGG analysis revealed that metabolites could be summarized in three main functional categories, including lipids, peptides and carbohydrates process (Figure 3A/B). We performed pathway enrichment analyses on the relevant modules and selected significantly enriched metabolic pathways. KEGG enrichment revealed that the modules related to growth and development were mainly enriched in Phenylpropanoid biosynthesis, Flavonoid biosynthesis, Nucleotide metabolism, Stilbenoid, diarylheptanoid and gingerol biosynthesis, Glycerophospholipid metabolism, alpha-Linolenic acid metabolism, Plant hormone signal transduction, Linoleic acid metabolism and Arachidonic acid metabolism (Figure 3C/D). We speculated that these pathways may play an important role in regulating the root growth and development and secondary metabolic processes of
Astragalus.
Regulation of growth and development and two major active components by key metabolites
Metabolites networks were visualised and metabolite connectivity analysed for metabolites in modules related to period D in
Astragalus. To determine the Hub metabolites related to period D, we classified the ten metabolites with the highest kME values in each module as hub metabolites and utilized the hub metabolites and their interacting indicators of phenotype and active ingredient content to map the metabolites co-expression network (Fig 4). 19 highly connected metabolites screened by WGCNA analysis using period D tissue-specific modules (yellow turquoise) for multifactorial correlation were highly correlated with indicators of growth development and active ingredient content of
Astragalus (Fig 4A,B).
WGCNA of RNA-seq data
The WGCNA was performed to investigate the coexpression networks, in which all co-expressed genes were connected to each other with varying correlation strengths. Genes were partitioned into 36 co-expression modules (Figure 5A/B). Growth and development indicators were positively correlated with gene expression in the ‘blue’ module and negatively correlated with gene expression in the ‘brown’ and ‘turquoise’ modules. The content of CI was negatively correlated with gene expression in the ‘grey60’ module, with a coefficient of -0.76. The content of CC was positively correlated with gene expression in the ‘pink’ module and negatively correlated in the ‘brown’ and ‘yellow’ modules, with a coefficient of 0.71, -0.86 and -0.78, respectively (Fig 5A). A total of 14701 genes were identified based on the correlation (r > 0.7) of genes with growth and development and active ingredient content in the six modules. A total of 14,701 genes were identified based on the correlation between genes and growth and development in the 3 modules. A total of 5096 genes were identified based on the correlation between genes and active ingredient content in the 4 modules.
Functional analysis of genes in correlated modules
GO annotation of three types of modules related to root growth and development, astragaloside IV content and calycosin 7-O-β-D-glucopyranoside content of screened
Astragalus roots, respectively. GO annotation analysis revealed that genes could be summarized in three main functional categories, including cellular component, molecular function and biological process. Five groups, including macromolecule metabolic process, integral component of membrane, intracellular organelle, cellular macromolecule metabolic process and organonitrogen compound metabolic process, were the main classifications for more than 50% of the genes in the GO annotations related to growth and development (Fig 6A). Six groups, including macromolecule metabolic process, integral component of membrane, intracellular organelle, intracellular membrane-bounded organelle, organonitrogen compound metabolic process and cellular macromolecule metabolic process, were the main classifications for more than 50% of the genes in the GO annotations related to calycosin 7-O-β-D-glucopyranoside content (Fig 6B). Molecular function is the main classifications for more than 50% of the genes in the GO annotations related to Astragaloside IV content (Fig 6C).
We performed pathway enrichment analysis on growth and development and active ingredient-related modules and selected the top 20 metabolic pathways from each comparison. KEGG enrichment revealed that the modules related to growth and development were mainly enriched in plant hormone signal transduction, Glycosyl phosphatidylinositol (GPI)-anchor biosynthesis, Homologous recombination, Folate biosynthesis, Terpenoid backbone biosynthesis and Aminoacyl-tRNA biosynthesis (Fig 6D). The modules related to the content of calycosin 7-O-β-D-glucopyranoside were mainly enriched in Plant hormone signal transduction, Terpenoid backbone biosynthesis, Non-homologous end-joining, Phenylalanine, tyrosine and tryptophan biosynthesis and Plant-pathogen interaction (Fig 6E). The modules related to astragaloside IV content were mainly enriched in RNA degradation, Linoleic acid metabolism and N-Glycan biosynthesis (Fig 6F). Thus, we speculated that these pathways may play an important role in regulating the growth and development process of
Astragalus.
Analysis of hub genes interaction network in the module
Mining and identification of key genes regulating root growth and development and analysing the regulatory pathways of plant growth and development are important elements of biological breeding of
Astragalus. Members of the major plant transcription factor families, including the bHLH and AP2/ERF families, have been validated for their roles in the regulation of plant growth and development (
Yue, 2022). However, genes and metabolites related to the regulation of root growth and development in
Astragalus less well studied. Metabolomic and transcriptomic WGCNA analyses in this study indicated that 19 metabolites (Fig 4) and 8 annotated key genes (
CNX2,
talA,
ABCF2,
HIBCH,
crtZ,
AXY4,
TTC5 and
UNKL) are essential for the regulation of growth development of
Astragalus roots.
In this study, gene networks were visualised and gene connectivity analysed for genes in modules related to growth and development, calycosin 7-O-β-D-glucopyranoside content and astragaloside IV content in
Astragalus. To determine the Hub genes related to growth development and secondary metabolism, we classified the ten genes with the highest kME values in each module as hub genes and utilized the hub genes and their interacting genes to map the gene co-expression network (Fig 7). 22 highly connectivity genes were screened by WGCNA analysis using 8 annotated genes (
CNX2,
talA,
ABCF2,
HIBCH,
crtZ,
AXY4,
TTC5,
UNKL) in the tissue-specific modules (blue brown turquoise) related to growth and development (Fig 7A). 6 annotated genes (
ETR,
IPT,
AP3B,
talB,
AXY4,
CYP707A) in the tissue-specific modules (brown yellow pink) related to calycosin 7-O-β-D-glucopyranoside content were used to screen 24 candidate genes with high connectivity (Fig 7B). 8 high connectivity candidates were screened using 2 annotated genes (
copA,
JAR) in the tissue-specific module related to astragaloside IV content (Fig 7C). 16 annotated high connectivity candidates and 54 related high connectivity candidates were screened in the seven modules.
Validation genes by qRT-PCR analysis
To evaluate the reliability of the gene expression profile of
Astragalus at different harvesting periods, qRT-PCR was used to verify the gene expression level. We selected six genes: TRINITY_DN34490_c0_g3, TRINITY_DN184_c0_g2, TRINITY_DN6890_c0_g1, TRINITY_DN31161_c0_g1, TRINITY_DN3009_c1_g1 and TRINITY_DN1975_c0_g1 to be upregulated or down regulated under different harvesting periods. We found that the expression levels of these 6 genes in
Astragalus herbs roots at different times were consistent with the transcriptome sequencing results, which indicated that our RNA-seq data were reliable (Fig 8).