Legume Research

  • Chief EditorJ. S. Sandhu

  • Print ISSN 0250-5371

  • Online ISSN 0976-0571

  • NAAS Rating 6.80

  • SJR 0.391

  • Impact Factor 0.8 (2024)

Frequency :
Monthly (January, February, March, April, May, June, July, August, September, October, November and December)
Indexing Services :
BIOSIS Preview, ISI Citation Index, Biological Abstracts, Elsevier (Scopus and Embase), AGRICOLA, Google Scholar, CrossRef, CAB Abstracting Journals, Chemical Abstracts, Indian Science Abstracts, EBSCO Indexing Services, Index Copernicus

Transcriptome-based Medicago varia Codon Preference Analysis

Liu Bai1, Fengling Shi1,*, Yingtong Mu1, Yuanyuan Cui1
1College of Grassland Science, Key Laboratory of Grassland Resources of the Ministry of Education, Key Laboratory of Forage Cultivation, Processing and Higher Efficient Utilization of the Ministry of Agriculture and Rural Affairs Inner Mongolia Agricultural University, Hohhot 010019, Inner Mongolian, P.R. China.
  • Submitted06-08-2024|

  • Accepted07-02-2025|

  • First Online 22-03-2025|

  • doi 10.18805/LRF-827

Background: The study explored codon usage bias in medicago varia transcriptome coding sequences, aiming to provide data support for understanding gene expression and enhancing Molecular breeding methods in medicago varia.

Methods: In this study, Medicago varia was used as the research material and 11722 complete open reading frame sequences were screened from the transcriptome sequencing results. Codon usage patterns and preferences were analyzed using software such as CodonW, R and Excel.

Result: According to the 11722 complete open reading frame sequences screened from the transcriptome sequencing results, Codon was used to calculate the Codon preference index. A total of 41354958 Unigenes were obtained. A total of 80757 Unigenes (39.66%) were annotated, with the majority being annotated in the NR database. GO annotations include 20 Biological processes, 16 Cellular components and 11 molecular function GO terms, respectively. 25713 genes were annotated into KOG, including 25 subclasses. The content of average GC was 42.87%, respectively. The study revealed that the effective number of codons (ENC) ranged from 23.9 to 61.0. Mutation and selection affected the codon preference. Medicago varia preferred 30 codons and preferred the third base of the codon as A/U, so it can be inferred that Medicago prefers the third codon with A/U.

Transcriptome sequencing is a technique used to transcribe all RNA molecules from a particular cell, tissue, or organism at a given time comprehensively and rapidly. This method has gained popularity in the fields of genetic breeding, pathological analysis and physiological mechanism research (Feng et al., 2022). All functional states of a cell produce specific RNA, including mRNA and non-coding RNA, which are transcribed from that cell (Wang et al., 2021). Genetic codons serve as the connection between the nucleic acid and an organism’s protein to transmit information. The arrangement of triple codons results in a total of 61 codons in the organism, with 3 stop codons removed. These codons encode 20 amino acids, resulting in multiple codons corresponding to a single amino acid. These codons encode the same amino acid and are synonymous codons (Xi et al., 2021). Intriguingly, synonymous codons are not randomly or equally used during encoding (Ma et al., 2015). Over time, due to gene mutations and selection pressure, certain codons are used more frequently than others, resulting in what’s known as codon usage bias (CUB). This bias is characterized by the common occurrence of using synonymous codons of different frequencies (Qin et al., 2022). CUB is widespread across species and differs from species as a code within the genetic code or the second genetic code (Xu et al., 2021; Liu et al., 2020). The analysis of CUB will aid in understanding molecular evolution, environmental adaptation, genomic features and gene functional requirements of different species. It will also help elucidate the superior agronomic performance of cultivated species (Tang et al., 2000; Zhang et al., 2018).
       
At present, codon usage bias analysis based on transcriptome data has been studied in the use of codons ending in A or U in the protein expression process of Ginkgo biloba (He et al., 2016) and Medicago ruthenica (Peng et al., 2024), Helianthus annuus WRKY Transcription factors (Gao et al., 2022), Paeonia lactiflora (Wu et al., 2015) and Morus cathayana (Kong et al., 2017) prefer to use codons ending in A or T. Species differences in the preferred use of codons are demonstrated. The production of CUB is related to many factors such as gene expression level, translational initiation effect, base composition of the gene, GC content, gene length and RNA structure (Brenner et al., 2002). The most frequently used codon is called the optimal codon. Through the determination of the optimal codon, the expression vector of genetic engineering can be designed in a targeted manner and the expression amount of the target gene can be increased (Liang et al., 2019; Wang et al ., 2024). Analyzing the codon preference and optimal codon of plants can provide valuable information about gene sequence characteristics such as base content and codon bias. This information can help us gain a better understanding of the development of biomolecules, the utilization of resources and the optimization of gene expression (Alfalahi et al., 2022). It lays a strong theoretical foundation for exploring these topics. In breeding research, the optimal expression of genes can be achieved through the re-optimization and synthesis of genes and codon optimization is the key step to improve gene expression, including replacing rare codons, optimizing RNA secondary structure, adjusting GC content, avoiding restriction enzyme cleavage sites and deleting complex structures (such as hairpins and repetitive structures) to improve gene expression efficiency, so as to speed up the breeding process. The Escherichia coli codon GBS surface protein Lrrc was optimized and its expression in E. coli was effectively improved. After codon optimization in plants, the codon SpCas9 could be stably expressed in plants (Chen et al., 2018). The optimization of Cas9 codon based on gene editing technology significantly increased the expression of Cas9 protein in bananas and the gene editing efficiency was increased by 4 times after protoplast transformation (Shi et al., 2018).
       
Medicago varia
Martin. cv. Caoyuan No.1 is an excellent variety of alfalfa artificially cultivated, which is widely distributed in semi-arid regions such as Northeast China, North China and Northwest China. Due to its well-developed root system and strong stress resistance, it not only plays the role of maintaining water and soil, windbreak and sand fixation but also is an excellent variety for the establishment of mixed artificial grassland grazing utilization (Zhang et al., 2024). Based on the important ecological use and feed value of alfalfa, to effectively utilize the resources of alfalfa and comprehensively develop its potential value, basic research on its genome and transcriptome has become an urgent problem to be solved.
       
In this study, 11722 coding sequences (CDS) obtained by transcriptome sequencing were used as the research object and the codon composition and usage parameters were statistically analyzed by CodonW and Excel software. The codon use preference characteristics of the transcriptome of Caoyuan No. 1 alfalfa were revealed. It provides data support for future genetic variation, genetic mapping, gene mapping, molecular breeding and resource development.
Medicago varia Martin. cv. Caoyuan No.1 used in the experiment was collected from the forage experimental field of Inner Mongolia Agricultural University, at 111o71¢ E and 40o81¢N. Full, uniform and consistent tissues were selected and tested in 2020-2021. Before sequencing, the material was placed in a cryovial and treated with liquid nitrogen for more than 20 minutes (30 minutes is appropriate), which is not easy to degrade. Samples are stored at -80oC for total RNA extraction and cDNA synthesis. Total RNA was using an RNA extraction kit, reverse transcription synthesizes the first cDNA reaction strand were sequenced using the IlluminaHiSeq™ 4000 and 6188391094 high-quality sequences were obtained after splicing, removing linkers and low-quality sequences. A total of 80,757 Unigenes were assembled by De novo for screening and 11,722 high-quality transcribed gene sequences were finally obtained for codon analysis.
       
To obtain comprehensive gene function information, Unigenes were compared with NR, KOG, GO, Swissprot, KEGG, TrEMBL and other databases by NCBI Blast+v2.60 (Altschul et al., 1997). The GO functional annotation information was obtained based on the protein annotation results of Swissprot and TrEMBL (Gao et al., 2023). The transcripts were aligned to the database and TransDecoder v3.0.1 was used to predict the coding sequence (CDS) of Medicago varia Martin. cv. Caoyuan No.1.
       
The codon adaptation index (CAI), the effective number of codons (ENC) and relative synonymous codon usage (RSCU) of coding sequences of Medicago varia Martin. cv. Caoyuan No.1 was calculated by Codon W software. The total GC content of the statistical sequence and the GC content of the nucleotide at codon position 3 (GC3), the GC content of codon positions 1 and 2 are expressed by GC1, GC2 and the average value of GC1 and GC2 is recorded as GC12.
       
Relative synonymous codon usage indicates the relative use probability of encoding amino acids between specific synonymous codons. If the codon has no preference, the RSCU value is 1 and if the RSCU value of a synonymous codon is greater than 1 or less than 1, it indicates that the codon is used relatively more or less (Liu et al., 2009), according to the results of RSCU, the high-frequency codon of Medicago varia Martin. cv. Caoyuan No.1 transcriptome was identified.
       
The effective codon number ENC is an important reference index to measure the codon use preference, its value is between 20~61, the closer to 61, the weaker the codon use preference, when the ENC value is 61, the gene sample is completely random and the closer to 20, the stronger the codon use preference. ENC plotting was related to gene use bias and ENC plotting used ENC values and GC3s as ordinate and abscissa as two-dimensional scatter plots and to detect the effect of base composition on codon bias (Wright et al., 1990). The formula for calculating the standard curve is as follows:
 
 
  
The GC12 of the transcriptome codon of Medicago varia Martin. cv. Caoyuan No.1 was used as the ordinate and GC3 was used as the abscissa to plot the scatter plot and the linear fitting regression analysis was performed. To assess the correlation between GC3 and GC12. It is used to measure the degree to which natural selection pressure and mutation affect codon use preference.
       
The bias analysis (PR2-plot) was based on A3/(A3+T3) as the abscissa and G3/(G3+C3) as the ordinate to reveal the base composition of each gene, which was used to evaluate the relationship between purines and pyrimidines in the codons of each gene (Quax et al., 2015).
       
Comparison of codon use preference between Sativa and other organisms in Medicago varia Martin. cv. Caoyuan No.1. The occurrence frequency of each codon in the Medicago varia was obtained from the CodonW software and compared with that of Arabidopsis thaliana, Glycine max, Nicotiana tabacum, Saccharomyces, Escherichia coli (Duret et al., 1999).
Transcriptome sequencing with assembly
 
A total of 41843596 Raw Reads were obtained by sequencing the cDNA library of Medicago varia Martin. cv. Caoyuan No.1. Removing low-quality reads (Q value less than 20 and length less than 35 nt) and connectors yielded a total of 41354958 clean reads. High-quality sequences (sequences with more than 20 base masses) account for 98.84%. GC content accounted for 42.74% of the total base. The results showed that the transcriptome data of Medicago varia from Caoyuan No.1 met the quality standards of transcriptome data analysis. A total of 80757 Unigenes were obtained by assembly and redundancy of de novo, with an average length of 655 bp and an N50 value of 1067 bp and the length of Unigenes (Table 1; Table 2), the number of Unigenes between 200~300 nt was the largest, with 31256 entries, accounting for 38.70%, 45354 entries, accounting for 56.16 % of 300~2 000 nt and 4147 entries, accounting for 5.14%, which were larger than 2 000 nt (Table 3).

Table 1: Data filtering and statistical results.



Table 2: Transcriptome data splicing results of Medicago varia martin. cv. caoyuan No.1.



Table 3: The proportion of Unigene in each length interval of medicago varia martin. cv. caoyuan No.1.


 
Functional annotation of Medicago varia Martin. cv. Caoyuan No.1 unigene
 
The Unigene of Medicago varia Martin.cv.Caoyuan No.1 was annotated into 4 databases (NR, SwissProt, KOG and KEGG library). From the annotation results, it can be concluded that 52753 (65.32%) of Unigene were successfully annotated. NR and SwissProt had the largest number of successful annotations, with 49353 (93.55%) and 35292 (66.90%) annotations, 25713 (48.74%) annotations in KOG and 18589 (35.24%) annotations in KEGG (Table 4).

Table 4: Statistics of Unigene annotation rate of medicago varia martin. cv. caoyuan No.1.



GO and KOG functional annotation classification

In the GO functional annotation classification of Medicago varia Martin. cv. Caoyuan No. 1 (Fig 1), biological processes were annotated to 20 GO terms, cell composition was annotated to 16 GO terms and molecular functions were annotated to 12 GO terms. According to the KOG note (Fig 2), 25 metabolic pathways were injected into 25713 Unigene, with the highest number being universal functional prediction, followed by post-translational modifications. The least number of Unigene annotations are extracellular and nuclear structures.

Fig 1: Go terms of Medicago varia Martin. cv. Caoyuan No.1.



Fig 2: KOG function classification of medicago varia martin. cv. caoyuan No.1.


 
GC content analysis and neutral plot analysis
 
Codons play a crucial role in translating gene sequences into proteins and analyzing their usage patterns is essential for studying protein translation efficiency and functions. (Liu et al., 2020). Due to the presence of codons, they typically do not impact the coding of amino acids, thus altering the structure and function of proteins. However, over a long period of evolution, some codons have been used more frequently than other codons in genes, i.e., secrets Preference for the use of yards. Codon preference use is an adaptive selection formed by species in the long-term process of natural selection and evolution, which is mainly influenced by genetic mutation pressure and natural selection pressure (Sueoka et al., 1988).
       
GC content is an important indicator of an organism’s genome base composition. The type of amino acids may be affected by GC1 and GC2, but not by the 3rd base mutation. When there is no selective pressure, mutations do not typically lead to differences in the base content at the three positions of the codon. However, the codons themselves do influence this content, so there is a strong correlation between GC content and codon usage bias. (Sharp and Li  et al., 1986).
       
Codon W was used for codon use preference analysis for the 11722 selected Unigenes. The results showed that the average values of T3s, A3s, G3s and C3s in the transcriptome of Medicago varia Martin. cv. Caoyuan No. 1 were 45.21%, 34.55%, 23.81% and 22.27%, respectively and the average total GC content was 42.87%, with a fluctuation range of 27.10%~69.60%. T(T3s) and A(A3s) were the most common codons of A, T, G and C, followed by G3s and C3s, which were 45.21% and 34.55%, respectively, followed by G3s and C3s (23.81%) and 22.27%, respectively. The average codon adaptation index (CAI) was 0.215 and the fluctuation range was 0.097~0.883. The average codon preference index (CBI) was -0.038 and the fluctuation range was -0.379~0.868. The average optimal codon usage frequency (Fop) was 0.398 and the fluctuation range was 0.192~0.926. The aromatic amino acid (Aromo) ratio was 8.29%, the average protein hydrophobic level (GRAVY) was -0.331, the average amino acid number (L-aa) was 345.916 and the average synonymous amino acid (L-sym) was 333.625 (Table 5)-the average ENC of the transcriptome of Medicago varia Martin. cv. Caoyuan No.1 was 49.659 and the fluctuation range was 24.77~61. The above analysis showed the codon preference of codon 3 in Medicago varia Martin. cv. Caoyuan No.1 was not high, but the GC3 content of different genes varied more than the total GC content.

Table 5: Codon base composition and parameters of medicago varia martin. cv. caoyuan No.1 transcriptome.


       
The analysis of the neutral plot results showed that the gene samples were concentrated on both sides of the regression line of the neutral map and the changing trend of GC3s and GC12 was consistent, so there was no difference in the base composition of the three codon positions, indicating that the main reason for the influence of the codon preference of the codon of Medicago varia Martin. cv. Caoyuan No.1 was mutation pressure (Fig 3).

Fig 3: Analysis of transcriptome neutral mapping of Medicago varia Martin. cv. Caoyuan No.1.


       
In this study, we analyzed the codon bias of Medicago varia Martin. cv. Caoyuan No.1 found that 27 preferred to use codons ending in base A/U and only 3 preferred to use codons ending in base G/C, indicating that Medicago varia Martin. cv. Caoyuan No.1 preferred to use codons ending in base A/U (Sueoka et al., 1999).

Analysis of the effective codon number of genes in Medicago varia Martin. cv. Caoyuan No.1
 
The ENC plot was plotted with GC3s as the x-axis and ENC as the y-axis (Fig 4). Codon GC3s are distributed between 0.094~0.914. The ENC value is between 23.9~61 and the closer to 61 the value is, the weaker the bias is. The mean effective codon number (ENC) of the transcriptome of Medicago varia Martin. cv. Caoyuan No.1 was 47.98, with a maximum of 61 and a minimum of 23.9. This indicates that only a few sequences are codon-biased. The above analysis showed that the codon preference of Medicago varia Martin. cv. Caoyuan No.1 was not high overall, but the codon preference of different genes was inconsistent. From the ENC-GC3s plot, it can be seen that most of the genes of Medicago varia Martin. cv. Caoyuan No.1 was walked around the standard curve, while a few genes were scattered far away from the standard curve, indicating that mutational pressure, natural selection and some other factors led to the preference for the use of codons in Medicago varia Martin. cv.Caoyuan No.1.

Fig 4: ENC-GC3s mapping analysis of Medicago varia Martin. cv. Caoyuan No.1.


       
The ENC value of the codon of Medicago varia Martin. cv. Caoyuan No.1 was significantly positively correlated with GC3s, indicating that the base composition affected the formation of codon bias of alfalfa gene to a certain extent and the ENC-plot results showed that the ENC value of most genes was close to the expected value.
 
PR2~plot bias analysis
 
The relationship between purines (A and G) and pyrimidines (T and C) at the third base of the codon of Medicago varia Martin. cv. Caoyuan No. 1 was analyzed by PR2~plot bias. The two straight lines in the PR2-plot plot divide the graph into four regions and the codons distributed in the upper half of the line indicate that the frequency of codon A is higher than that of T and vice versa, the frequency of use of T is higher than that of A. Codons in the left half of the line indicate that C is used more frequently than G and the right half of the line indicates that G is used more frequently than C. The frequency of use of T in Medicago varia Martin. cv. Caoyuan No.1 was higher than that of A and the frequency of G and C was relatively unequal (Fig 5) and the application frequency of the third base G of the codon of Medicago varia Martin. cv. Caoyuan No.1 was greater than that of C, indicating that the codon preference of Medicago varia Martin. cv. Caoyuan No.1 was affected by mutant pressure and natural selection (Oliveira et al., 2021). Differences in species, ploidy and monodicots lead to differences in codon preference (Li et al., 2024).

Fig 5: Analysis of PR2 bias plot.


 
Comparison of codon preference between alfalfa and representative species
 
If the ratio is 0.5~2, it indicates that the preference for the use of this codon is similar between the two species and vice versa. It was found that the codon preference of Medicago varia Martin. cv. Caoyuan No.1 was similar to that of the model species and there were 1, 4, 1, 5 and 24 codons with ratios outside the range of 0.5~2 (Table 6). The results indicated that there were different levels of differences between the codons of alfalfa and these model organisms, which were less different from Arabidopsis thaliana and tobacco and the most different from Escherichia coli.

Table 6: Relative usage statistics of the synonymous codon of Medicago varia Martin. cv. Caoyuan No.1.


       
Comparing the codon preference between Medicago varia and Tobacco and Escherichia coli, it was found that Medicago varia and Tobacco were slightly different from each other, while Escherichia coli was quite different, indicating that Arabidopsis thaliana and Tobacco were suitable as recipients for genetic transformation in the verification of gene function of Medicago varia Martin. cv. Caoyuan No.1. Codon optimization can improve plant gene expression efficiency (Zhou et al., 2016). Therefore, when the gene of Medicago varia Martin. cv. Caoyuan No.1 is exogenously expressed, it can be efficiently expressed in Arabidopsis thaliana and tobacco through codon optimization. Compared with Escherichia coli, there was little difference in codon preference between Saccharomyces cerevisiae and Saccharomyces cerevisiae codon, indicating that Saccharomyces cerevisiae was preferred as the protein expression system in Saccharomyces cerevisiae Medicago varia Martin. cv. Caoyuan No.1.
 
Optimal codon analysis of synonymous codons
 
This study compared the high and low codon gene expression sample libraries of Medicago varia Martin. cv. Caoyuan No.1 (Fig 6) and screened out 30 optimal codons, namely UUU, UUA, UUG, CUU, CUA, AUU, AUA, GUU, GUA, UCU, UCA, CCU, CCA, ACU, ACA, GCU, GCA, UAU, UAA, CAU, CAA, AAU, AAG, GAU, UGU, AGU, AGA, AGG, GGU, GGA. Among these 30 optimal codons, except for UUG and AAG, Outside of AGG, the third base of the remaining codon is all A/U, indicating that the Medicago varia Martin. cv. Caoyuan No.1 prefers the third base of the A/U codon, which is consistent with the above results.

Fig 6: Optimal codon of the genome of medicago varia martin. cv. caoyuan No.1. *: RSCU value is greater than 1.

Mutational pressure, natural selection and several other factors contribute to the preference for Medicago varia codon use. Medicago varia preferred 30 codons and preferred the third base of the codon as A/U, so it can be inferred that Medicago prefers the third codon with A/U.
The present study was supported by a Major demonstration project of “the open competition” for seed Industry science and technology innovation in Inner Mongolia (2022JBGS0016) and the Science and Technology Innovation 2030-Major Project (2022ZD04012). Fengling Shi designed the experiments, Liu Bai did the experiments and wrote the manuscript and others contributed equally to the manuscript, Yingtong Mu and Yuanyuan Cui carried out data statistics and drawing.
 
Disclaimers
 
The views and conclusions expressed in this article are solely those of the authors and do not necessarily represent the views of their affiliated institutions. The authors are responsible for the accuracy and completeness of the information provided, but do not accept any liability for any direct or indirect losses resulting from the use of this content.
 
Informed consent
 
All animal procedures for experiments were approved by the Committee of Experimental Animal care and handling techniques were approved by the University of Animal Care Committee.
 
The authors declare that there are no conflicts of interest regarding the publication of this article. No funding or sponsorship influenced the design of the study, data collection, analysis, decision to publish, or preparation of the manuscript.

  1. Alfalahi, A.O., Hussein, Z.T., Khalofah, A., Sadder, M.T., Qasem, J.R.,  Al-Khayri, J.M., Jain, S.M., Almehemdi, A.F. (2022). Epigenetic variation as a new plant breeding tool: A review. Journal of King Saud University Science. 34: 102302. 

  2. Altschul, S., Madden T.L., Schäffer A.A., Zhang J., Zhang Z., Miller, W., Lipman D.J. (1997). Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Research. 25(17): 3389-3402. 

  3. Brenner, S., (H), J.M., Broughton, W.J. (2002). Encyclopedia of Genetics. Codon Usage Bias. pp: 402-406.

  4. Chen K. (2018). Optimization of CRISPR Technology and Modification of PAMRecognition Sites of Cas9 Enzyme. (Master), Chinese Academy of Agricultural Sciences.

  5. Duret, L., Mouchiroud, D. (1999). Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila and Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America. USA. 96(8): 4482-4487. 

  6. Feng, C., Shan, M., Xia, Y., Zheng, Z., He, K., Wei, Y., Song, K., Meng, T., Liu, H., Hao, Y., Liang, Z., Wang, Y., Huang, Y. (2022). Single- cell RNA sequencing reveals distinct immunology profiles in human keloid. Frontiers in Immunology.13. 

  7. Gao, Y., Lu, Y., Song, Y., Jing, L. (2022). Analysis of codon usage bias of WRKY transcription factors in Helianthus annuus. BMC Genomics Data. 23(46). 

  8. Gao, Y., Zhang, D., Li, H. (2023). The Professional Go Annotation Dataset. IEEE Transactions on Games. 15(4): 517-526. 

  9. He, B., Dong, H., Jiang, C., Cao, F., Tao, S., Xu, L. (2016). Analysis of codon usage patterns in Ginkgo biloba reveals codon usage tendency from A/U-ending to G/C-ending. Scientific Reports. 6: 35927. 

  10. Kong, W.Q., Yang, J.H. (2017). The complete chloroplast genome sequence of  Morus cathayana and Morus multicaulis and comparative analysis within genus Morus L. PeerJ. 5: e3037. 

  11. Li L, Xu M, Liu Y. (2024). Codon Bias Analysis of Medicago Genome. Acta Agrestia Sinica. 23(09): 2695-2706.

  12. Liang E., Qi M.J., Ding Y.Q., Zhang L. (2019). Analysis of codon bias in Panax japonicus transcriptome. Jiangsu Nongye Xuebao (Jiangsu Agricultural Sciences). 47(2): 67-71.

  13. Liu, H., He, R., Zhang, H., Huang, Y., Tian, M., Zhang, J. (2009). Analysis of synonymous codon usage in Zea mays. Molecular Biology Reports. 37(2): 677-684. 

  14. Liu, Y. (2020). A code within the genetic code: Codon usage regulates co-translational protein folding. Cell Communication and Signaling. 18: 145. 

  15. Ma, Q., Li, C., Wang, J., Wang, Y., Ding, Z. (2015). Analysis of synonymous codon usage in FAD7 genes from different plant species. Genetics and Molecular Research. 14(1): 1414- 1420.

  16. Oliveira, J.L., Morales, A.C., Hurst, L.D., Urrutia, A.O., Thompson, C.R.L., Wolf, J.B. (2021). Inferring adaptive codon preference to understand sources of selection shaping codon usage bias. Molecular Biology and Evolution. 38(8): 3247-3266. 

  17. Peng, X., Mu, Y., Wu, F., Fu, N., Shi, F., Zhang, Y. (2024). Analysis of Codon Preferences in Medicago ruthenica based on Transcriptome Data. Legumes Research. 

  18. Qin, L., Ding, S., Wang, Z., Jiang, R., He, Z. (2022). Host Plants Shape the Codon Usage Pattern of Turnip Mosaic Virus.Viruses. 14(10): 2267.  

  19. Quax, T.E.F., Claassens, N.J., Söll, D., van der Oost, J. (2015). Codon bias as a means to fine-tune gene expression. Molecular Cell. 59(2): 149-161. 

  20. Sharp, P.M., Li, W. (1986). An evolutionary perspective on synonymous codon usage in unicellular organisms. Journal of Molecular Evolution. 24(1/2): 28-38. 

  21. Shi H, Lin Y, Lai Z. (2018). Research progress on CRISPR/Cas9- mediated genome editing technique in plants. Chinese Journal of Applied and Environmental Biology. 24(3):  640-650.

  22. Sueoka, N. (1988). Directional mutation pressure and neutral molecular evolution. Proceedings of the National Academy of Sciences of the United States of America. 85(8): 2653-2657. 

  23. Sueoka, N. (1999). Two aspects of DNA base composition: G+C content and Translation-Coupled Deviation from Intra-Strand Rule of A=T and G=C. Journal of Molecular Evolution. 49(1): 49-62. 

  24. Tang, L., Shah, S., Chung, L., Carney, J., Katz, L., Khosla, C., Julien, B. (2000). Cloning and heterologous expression of the epothilone gene cluster. Science. 287(5453): 640-642. 

  25. Wang, J., Li, X., Lu, Y., Huang, Q., Sun, Y., Cheng, M., Li, F., Shi, C., Zeng, Y., Wang, C., Cao, X. (2021). Analysis of lncRNAs and mRNA Expression in the ZBTB1 Knockout Monoclonal EL4 Cell Line and Combined Analysis With miRNAs and circRNAs. Frontiers in Cellular and Infection Microbiology. 11. 

  26. Wright, F. (1990). The ‘effective number of codons’ used in a gene.  Gene. 87(1): 23-29. 

  27. Wang S, Shi F, Shi R, Zhang Y (2024). Seed dormancy and germination in alfalfa (Medicago falcata L.). Legume Research. 47(2): 234-241. doi: 10.18805/LRF-758.

  28. Wu, Y., Zhao, D., Tao, J. (2015). Analysis of codon usage patterns in herbaceous peony (Paeonia lactiflora Pall.) based on transcriptome data. Gene. 6(4): 1125-1139. 

  29. Xi, Z., Davis, L., Baxter, K., Tynan, A., Goutou, A., Greiss, S. (2021). Using a quadruplet codon to expand the genetic code of an animal. Nucleic Acids Research. 50(9): 4801-4812. 

  30. Xu, B., Wu, R.N., Gao, C.P., Shi, F.L. (2021). Establishment of tissue culture regeneration system for Medicago ruthenica L. cv. ‘Zhilixing’. Legume Research. 45(2): 162-167. doi: 10. 18805/LRF-659.

  31. Zhang, R., Zhang, L., Wang, W., Zhang, Z., Du, H., Qu, Z., Li, X., Xiang, H. (2018). Differences in codon usage bias between photosynthesis-related genes and genetic system-related genes of chloroplast genomes in cultivated and wild solanum species. International Journal of Molecular Sciences. 19(10): 3142. 

  32. Zhang, Y., Wu, X., Wang, X., Dai, M., Peng, Y. (2024). Crop root system architecture in drought response. Journal of Genetics and Genomics. 52(1): 4-13.

  33. Zhou, Z., Dang, Y., Zhou, M., Li, L., Yu, C., Fu, J., Chen, S., Liu, Y. (2016). Codon usage is an important determinant of gene expression levels largely through its effects on transcription. Proceedings of the National Academy of Sciences of the United States sof America. 113.

Editorial Board

View all (0)