The PEBP gene family plays an important role in floral transformation, floral morphogenesis and species development in plants. There is a significant difference in the number of members of this gene family between monocotyledonous and dicotyledonous plants, with a relatively higher number of PEBP genes in monocotyledonous plants
(Dong et al., 2023). This study searched for 9 members of the PEBP gene family in alfalfa through the local database Blast comparison, which resulted in more members of dicotyledonous plants such as
Vitis vinifera L.,
Cucumis sativus and
Populus trichocarpa and much less of monocotyledonous plants such as
Musa acuminata,
Zea mays L. and
Sorghum bicolor (Liu et al., 2016).
Identification of PEBP gene family in alfalfa
To identify all PEBP gene family members in alfalfa, we conducted a profile search with the PBP domain (PF01161) against the alfalfa proteome by using HMMER. A total of 12 protein candidate sequences were obtained with an E-value threshold of 1E-3. In the meantime, we retrieved all six PEBP protein sequences, FT, TSF, TFL1, ATC, BFT and MFT, from Arabidopsis. A BLASTP search using Arabidopsis PEBP sequences as a query was performed against alfalfa proteome with a minimum identity of 75% and E-value of 1E-5. BLASTP search resulted in 17 protein candidates. Finally, we obtained a consensus list of ten proteins from the HMMER and BLASTP search. Redundant sequences were removed using CD-HIT which resulted in a total of nine PEBP candidates in alfalfa. All nine candidate proteins harboured PBP domains and were retained for further analyses (Table 1). Since three out of the nine alfalfa PEBP proteins were annotated as MsFTa1 (MsG07800401 29.01.T01), MsFTa2 (MsG078004013 0.01.T01) and MsFTc (MsG07800 40133.01.T01)
(Shen et al., 2020), here we kept the same annotation for these three PEBP members.
We then analyzed the physical and chemical properties including length, molecular weight, isoelectric point, instability index and grand average of hydropathicity (GRAVY) value of alfalfa PEBP proteins along with Arabidopsis PEBP members using ProtParam
(Gasteiger et al., 2005). Alfalfa PEBP proteins had a length range of 106 to 210 amino acids (aa) compared to around 175 aa in Arabidopsis PEBP proteins while molecular weight ranged from 12.15 to 24.15 kD compared to 19-20 kD in arabidopsis (Table 1). The larger molecular weight range in alfalfa might be due to the length of two PEBP proteins, MsG0780035 995.01.T01 (106 aa) and MsG018000336 1.01.T01 (210 aa). Isoelectric point ranged from 7.69 to 9.64 with one exception of MsG0680031714.01. T01 (4.76). Instability analysis indicated that three out of nine alfalfa PEBP proteins are stable (instability index less than 40) while there is only one stable protein in Arabidopsis (AtATC). GRAVY values showed that all PEBP proteins are hydrophilic (with negative values) except MsG0480024000.01.T01 which is hydrophobic (with positive value). Furthermore, protein subcellular localization showed that all PEBP members are located in the cytoplasm.
Phylogenetic and chromosomal location analyses
PEBP family members are classified into three groups, namely FT-like, MFT-like and TFL1-like
(Carmona et al., 2007; Hedman et al., 2009; Wang et al., 2015). All nine alfalfa PEBP proteins along with Arabidopsis PEBP homologues were aligned and used to construct a phylogenetic tree. Phylogenetic analysis showed that alfalfa PEBP proteins were classified into TFL1-like, MFT-like and FT-like groups (Fig 1). The largest group is FT-like which contains five alfalfa PEBP proteins, followed by three proteins in the TFL1-like group, while the MFT-like group contains only one alfalfa PEBP protein (Fig 1).
To determine the locations of alfalfa PEBP proteins on the alfalfa chromosomes, gene location coordinates were obtained and used to pin the PEBP members onto alfalfa chromosomes. It showed that nine PEBP members are distributed on five of the eight alfalfa chromosomes (Fig 2). Chromosome 7 has the largest number of PEBP members (four members), followed by Chromosome 6 which has two members, while Chromosomes 1, 2 and 4 have one member each (Fig 2). Gene labels with different colors represent different PEBP groups. Chromosome 7 has a cluster of three FT homologues as reported by Shen
et al. (2020). Moreover, this region also echoes the syntenic region in its closest relative
Medicago truncatula which also harbors three FT homologues
(Shen et al., 2020).
Gene structure analysis and conserved motifs of alfalfa and arabidopsis PEBP genes
Genomic and coding sequences of alfalfa and Arabidopsis were obtained and used to illustrate the exon-intron gene structures with the Gene Structure Display Server (GSDS) 2.0
(Hu et al., 2015). Exons, introns and, upstream/downstream regions were highlighted (Fig 3).
A total of 15 alfalfa and Arabidopsis PEBP proteins were submitted to the MEME suite for identification of conserved motifs. A maximum number of 10 motifs were allowed to be discovered (Fig 4). All other parameters were in default. Then we obtained a consensus list of ten proteins from HMMER and BLASTP search. Redundant sequences were removed using CD-HIT resulted in nine PEBP candidates in alfalfa.
Multiple sequence alignment of alfalfa and arabidopsis PEBP protein sequences expression
The same residues were shaded in red while similar residues were shaded in Dodger blue with lowercases. Residues identical to consensus were shaded in Chatham blue. Exon boundaries are indicated using arrows (Fig 5). Two critical residues
(Ahn et al., 2014) distinguish FT and TFL1 are indicated with purple dots. Four segments within the fourth exon are underlined and labelled as A, B, C and D.
Cis-regulatory elements in alfalfa PEBP promoters
Cis-regulatory elements were classified into five categories, namely Light (light responsiveness), Circadian, Hormone, Stress and defense and Meristem expression (Table 2). Among the five categories, the category Light has the largest number of
cis-regulatory elements (122 elements), followed by Stress and defense (66 elements) and Hormone (38 elements) categories. The circadian (involved in the circadian clock) and CAT-box (involved in meristem expression) are found in one and three PEBP members respectively (Table 2).
Gene expression profiles of alfalfa PEBP gene family from RNA-seq dataset
To check the expression patterns in various tissues, we used a public RNA-seq dataset (
O’Rourke et al., 2015) which includes tissues of the elongating stem, post-elongating stem, flower, leaf, nodule and root. It shows that for most alfalfa
PEBP genes, gene expression is detected in all tissues studied (Fig 6).
MsFTa1 has the highest level in post elongating stem, followed by the leaf and elongating stem. Among all the alfalfa
PEBP genes,
MsG0780035995.01 has the highest expression in the leaf, followed by the flower (Fig 6). Furthermore, gene expression retrieved from a recent study
(Ma et al., 2021) of two genotypes, early flowering genotype 80 and late flowering genotype 195, showed that
MsFTa1 and
MsFTa2 are two strong candidates for florigen in alfalfa, while
MsG0480024000.01 is a candidate which functions as a floral repressor (Fig 7). In addition,
MsG0780035995.01 shows similar expression profiles in both the early and late flowering genotypes and higher expression is detected in the mature leaf than in the new leaf (Fig 7).
Another RNA-seq dataset from our previous study
(Zhang et al., 2015) which used leaf tissues of one fall dormant cultivar (Maverick, fall dormancy class of 1) and one non-fall dormant cultivar (CUF101, fall dormancy class of 9) under two-time points (May and September in China: 34° N, 113° E), was also retrieved to explore the expression of alfalfa
PEBP genes. In our previous study, both cultivars showed similar phenotypes in May, while a fall dormancy-specific phenotype was observed in the fall dormant Maverick in September rather than in non-fall dormant CUF101. Gene expression heatmap shows that
MsFTa2 has higher expression in the non-fall dormant cultivar CUF101, compared to the fall dormant type Maverick (Fig 8). Unsurprisingly,
MsG0780035995.01 has the highest expression among all
PEBP genes.
MsG0780035995.01 has higher expression in May compared to September. In September when fall dormancy-specific phenotypes were observed, lower expression of
MsG0780035995.01 is shown in the fall dormant cultivar Maverick which might suggest that
MsG0780035995.01 is playing a role in fall dormancy (Fig 8).