Isolation of DREB2C gene
The putative
DREB2C gene from cultivar L14 was isolated by PCR amplification. The analysis revealed that this sequence was 2.0 kb long and contained two exons (nt 1-23 and 554-2009) and an intron (nt 24-553) encoding a polypeptide chain of 492 amino acids with a molecular weight (MW) of 55.31 kDa (GenBank: OQ685297.1). Our putative DREB2C sequence was longer and 84.8% similar to that of DREB2C from the Shitouqi peanut cultivar (GenBank: QHO54253.1, unpublished). Based on the BLASTN results in the Peanutbase database (
https://peanutbase.org), the putative
DREB2C gene has a high degree of similarity to its corresponding gene in some species of the
Arachis genus, including
A. hypogaea (99.3%),
Arachis ipaensis (99.3%),
Arachis duranensis (98.4%) and
Arachis cardenasii (98.6%). Besides, the BLAST results in the Peanutbase database indicated
DREB2C was present on chromosomes 2 (SE8WTS.1) and 12 (KD4831.1) of
A. hypogaea. Whereas this gene only occurred on chromosome 2 of
A. ipaensis (Araip.3JJ8N.1),
A. duranensis (Aradu.GB4U4.1) and
A. cardenasii (araca.K10017.gnm1.chr02).
Characterization of DREB2C protein
The results of the structural and functional domain analysis of the protein revealed that the isolated sequence was identified as
DREB2 gene with a conserved domain (AP2/ERF) from amino acids 203 to 580 and a DNA binding site with the following sequence: YRGVRQRIWGKWVAEIREP ISNHESKKNAKRLWLGTF. These findings suggested that
DREB2 gene encodes a nuclear protein containing one conserved AP2/ERF DBD and that DREB2 protein belongs to the DREB TF subfamily. According to InterProScan analysis, putative DREB2 protein is a member of the dehydration-responsive element-binding protein 2C family (PANTHER ID: PTHR31241:SF62). Plant Transcription Factor Database examination of the amino acid sequence of DREB2 also confirmed that it is the DREB2C protein (TF ID: AT2G40340.1). From the above bioinformatics analysis results, the putative
DREB2 gene was named
AhL14_ DREB2C.
Data from the DeepTMHMM server showed that the AhL14_DREB2C protein lacks transmembrane domains and signal peptides. The presence of a nuclear localization signal at the N-terminus of the polypeptide chain in the
AhL14_DREB2C gene with a probability of over 0.9516 suggests that this gene is involved in the regulation of transcription. Biological functional analysis of AhL14_ DREB2C based on annotation application through Gene Ontology (GO) revealed that AhL14_DREB2C is involved in biological processes (the regulation of DNA template transcription: GO:0006355) and molecular function (the activity of DNA-binding transcription factor: GO:0003700; and DNA binding: GO:0003677).
Also, the findings showed that the AhL14_DREB2C protein contains high-hydrophilic domains such as amino acids 45-46, 48-50, 109-118, 154-160, 239-241, 246-248, 270-272, 335-340, 376-378, 384-385, 455-458 and 483-488. The protein-protein interaction prediction revealed that AhL14_DREB2C interacted with some drought stress inducers such as ABF2 (abscisic acid responsive elements-binding protein 2), HSFA3 (heat shock transcription factor a3), ABF3 (abscisic acid responsive elements-binding factor 3), ABF4 (abscisic acid responsive elements-binding factor 4) and HSF1 (heat stress transcription factor A-1a) (Fig 1).
3D modeling of AhL4_DREB2C protein structure
The 3D protein structure of AhL14_DREB2C was predicted by PHYRE2, SWISS-MODEL, I-TASSER and AlphaFold-2. As analyzed by PHYRE2, AhL14_DREB2C was a DNA-binding protein and this program only modeled 17% of the AhL14_DREB2C sequence with 100% confidence using the single highest scoring template (PDB no. 5WX9). Whereas, I-TASSER modeled the protein structure with 26% coverage based on the best template of the ethylene-responsive transcription factor ERF096 of
Arabidopsis thaliana (PDB no. 5WX9), which has the highest alignment score (Z-score: 4.43). The SWISS-MODEL was only modelled at DNA binding domains (amino acids 74-140) based on the template of ethylene-responsive element binding factor 1 (PDB no. 1gcc). The AlphaFold, which assesses the per-residue (amino acid) confidence score (pLDDT, from 0-100), found that 26% of the residues had a confidence of more than 70, whereas low-confidence regions (50-70), found at the N and T-termini, account for 19.3% of the residues.
The DNA binding domain of
AhL14_DREB2C with the GCC-box DNA motif was generated based on the ERF096 template of
A. thaliana (PDB No.: 5WX9) (Fig 2). Additionally, we found an SNP marker (AX-147214668, underlined) in the DNA sequence of
AhL14_DREB2C (TGCTCAAATCAC ATGATGATGTTAAACTTTTGGTCA/GATTCTTCGATTGG TACTTCTATATTCGCTACA).
Phylogenetics
Phylogenetic analysis showed that AhL14_DREB2C belongs to the same group as the corresponding genes of
A. hypogaea and
A. duranensis species (Fig 3). In particular, its deduced amino acid sequence shared a high degree of similarity to AhDREB2C of
A. hypogaea (98.2% for Peanutbase ID: Arahy.KD4831; 97% for Peanutbase ID: Arahy.SE8WTS.1; and 84.8% for GenBank: QHO54253.1) and of
A. duranensis (97% for Peanutbase ID: Aradu. GB4U4.1). These results suggest that the function of DREB2Cs in the aforementioned group may be relatively comparable.
Expression of AhL14_DREB2C gene
The transcriptional expression of the
AhL14_DREB2C gene was found in different tissues and different growth and development periods (Fig 4). Observations revealed that roots and leaves at the flowering stage had higher transcription levels than other tissues at the same stage (Fig 4A) and under similar growing conditions, the
AhL14_DREB2C gene was more significantly expressed during the flowering stage than the 4-leaf stage (Fig 4B). Among the tested peanut cultivars, L14, L20, SeGL and LDH09 exhibited higher expression levels of the
AhL14_DREB2C gene in roots compared to others (Fig 4C). Moreover, RT-PCR amplification indicated that hybrid cultivars (Table 1) had higher transcription levels than native cultivars.
Polymorphism in SSR-linked drought tolerance markers
Eleven cultivars of peanuts commonly cultivated in Vietnam were employed in the present study to assess genetic diversity based SSR markers linked to drought tolerance-related traits (Table 5). The major allele frequency is the proportion of the most common allele in the population. The mean major allele frequency for all markers was 0.79, indicating that the majority of individuals in the population may carry the same allele for each marker. Besides, the mean allele number and gene diversity for all markers were 3 and 0.33, respectively, suggesting that each marker was reasonably informative and had a moderate degree of polymorphism. Further, the mean heterozygosity across all markers was 0.41, which is also of a moderate degree and suggests that the population is not highly inbred. The PIC value is a measure of the informativeness of a marker for genetic analysis. The mean PIC value for all markers was 0.30, indicating that the markers are moderately informative. Among studied SSR markers, IPAHM689, pPGSseq19H3 and PGSseq9G5 had the highest polymorphic values (Fig 5). The SSR markers’ polymorphic information content value ranged from 0.15 to 0.54. The result indicated genetic diversity analysis based on the SSR-linked drought tolerance markers in peanut varieties in Vietnam showed moderate polymorphism. The phylogenetic tree shows a similarity range of 73.6% to 94.7% among different peanut varieties in each group (Fig 6). The investigated peanut varieties were divided into two groups at various similar levels. Varieties L20 and LDH01 belong to group I. Group II is divided into 2 sub-clusters including CucHT, LyTN, L27, SenNA, SeGL, SeDR and seTN varieties. The other sub-cluster contains the varieties L14 and LDH09.
Peanuts (an allotetraploid species: 2n = 4x = 40) are an essential food source that provides nutritious oils and proteins and is farmed in tropical and subtropical semi-arid locations across the world
(Zhao et al., 2018). Peanuts, like many other plants, are subjected to a variety of abiotic stressors, including drought, heat, cold and salinization, which significantly affect their growth, development and yield
(Agarwal et al., 2017).
Plant stress tolerance is dependent on regulatory genes that promote the overexpression of stress-tolerance genes, which improves the ability of plants to adapt to different environmental stresses
(Zhang et al., 2018; Mizoi et al., 2012). The DREB2C protein, a member of the AP2/ERF superfamily of plant transcription factors, has been demonstrated to play a significant role in the regulation of abiotic stress responses as well as plant growth and development. Several studies have been conducted on the identification of the DREB2C gene
(Lee et al., 2009) and its expression in plants subjected to various abiotic stress conditions, such as drought
(Han et al., 2022; Ariharasutharsan et al., 2022), heat
(Lee et al., 2010; Chen et al., 2010; Lim et al., 2007), salt stress
(Song et al., 2014) and cold stress
(Lim et al., 2007; Lee et al., 2010). Here, we isolated the AhL14_DREB2C gene, which contains an intron and has a CDS length of 1479 bp and an AP2/ERF domain, which implies that AhL14_DREB2C maybe has an important role in the ABA signaling system and the response of plants to drought as other DREBs.
Besides, the expression of DREB2C in the flowering stage was differently induced in various organs, such as the root, stem and leaf. where the gene expression was higher in root tissue compared to other tissues. The expression of DREB2C in root tissues also showed differences between flowering and earlier stages, suggesting that perhaps the presence of DREB2C would support plant growth at the water-demanding stage. Moreover, the expression of DREB2C in root tissues differs among peanut cultivars cultivated in Vietnam. The peanut varieties with good drought tolerance in previous studies, such as L14, L20 and LDH09, showed higher DREB2C expression than the local varieties, except for the SeGL variety. This suggests that the DREB2C gene can be used as a molecular marker to aid in the screening of drought-tolerant peanut varieties.
In addition to identifying genes associated with drought tolerance, the use of gene-linked microsatellite markers is an important research tool to assist molecular breeding in order to generate better drought-resistant cultivars. In this study, SSRs associated with drought tolerance traits such as transpiration, transpiration efficiency, specific leaf area and SPAD chlorophyll meter reading were used to analyze genetic diversity among peanut varieties commonly grown in Vietnam. The results of genetic diversity analysis using SSR markers showed that 11 types of peanuts were separated into many sub-branches with various degrees of similarity. The findings of the SSR analysis revealed that the peanut varieties used in the study had a medium level of diversity; this conclusion is similar to other previous studies
(Cuc et al., 2013; Ren et al., 2014). The phylogenetic tree based on SSR-related drought tolerance marker results may suggest that cross-varieties between different groups can increase the variability among the cultivated varieties for selecting the dominant varieties with improved drought tolerance.