Bioinformatics analysis of leptin in E. miletus
Signal peptide assay
It showed that there was a highly reliable Sec/SPI type signal peptide in the leptin protein of
E. miletus (Fig 1). The probability distribution of different regions of signal peptide is high and the probability of non-signal peptide region is low, which is consistent with the judgment of signal peptide. Cleavage site: It is predicted to be between the 21
st and 22
nd amino acids, with a probability of up to 0.978790. It has high reliability and the cleavage site is clear, which supports the functional localization of its secreted protein. Signal peptide type: it belongs to Sec/SPI signal peptide and the probability of signal peptide is 0.9998.
Protein secondary and tertiary structure prediction
protein secondary structure prediction results
After removal of the signal peptide sequence, the mature leptin protein of
E. miletus consists of 146 amino acid residues. Secondary structure prediction using SOPMA (Fig 2) revealed a predominance of α-helices (h), comprising 94 residues and accounting for 64.38% of the structure (Fig 3). Random coils (c) were the secondary major structural element, with 50 residues, while the proportion of extended strands (e) was minimal at only 1.37%.
Prediction results of protein tertiary structure
SWISS-MODEL was used to construct the tertiary structure model of leptin protein in
E. miletus (Fig 4). The core template: leptin LEPR trimer structure (8 × 80.1. C leptin) was selected to establish the model. The sequence consistency was as high as 82.04%, the global model quality score (GMQE) was 0.76 and qmeandico global was 0.76±0.07, indicating that the quality of the model was good and the construction was reasonable. The spiral structure is prominent in the modeling and the model presents a spatial folding shape dominated by α - helix, with less random curl.
Determination and analysis of physical and chemical properties
The analysis results of physical and chemical properties of rat leptin protein showed that the number of amino acids was 127, including 20 essential amino acids, of which leucine accounted for the highest proportion and serine accounted for the second highest proportion (Table 2). Its molecular formula is C
824H
1353N
221O
252S
8, with a total atomic number of 2658 and a molecular weight of 18644.62. The theoretical isoelectric point of protein is 5.86, which is acidic. The theoretical half-life of
E. miletus leptin protein
in vitro (mammalian reticulocytes) was 30 h, in yeast > 20 h and in
Escherichia coli > 10 h. The aliphatic index of the protein was 119.04; Average hydrophilicity (GRAVY): 0.113, instability index: 50.89, which is an unstable hydrophilic protein.
Lipophilicity/hydrophobicity of leptin protein in E. miletus
The leptin proteins of
O. princeps inhabiting alpine tundra,
T. belangeri from tropical regions,
H. sapiens widely distributed across temperate zones and
E. miletus were analyzed and compared (Fig 5). The four species share similar hydrophobic/hydrophilic fluctuation patterns in their leptin protein sequences. The leptin proteins of the North American pika and the large-eared vole exhibit stronger hydrophobicity, while those of humans and the
T. chinensis display weaker hydrophobicity.
Comparison and analysis of leptin gene sequences in 56 mammalian species
phylogenetic analysis of leptin gene
The phylogenetic tree based on leptin gene sequences of 56 mammalian species (Fig 6) shows: Some Microtus species and
E. miletus cluster into a small branch, with close genetic relationship and distant relationship with Dermoptera. Rodents of the same family cluster closely; squirrels, beavers and other rodents have longer branches and earlier differentiation than
E. miletus. Ochotona and
Lepus europaeus have short branches, adjacent to but independent of Rodentia.
T. belangeri is close to Primates. Dermoptera species cluster into one branch, adjacent to Scandentia and Primates branches.
Distribution of 56 species in different temperature zones
The 56 species are clustered by their main distribution across temperature zones (Fig 7). Most primates,
T. belangeri and Dermaptera cluster together, mainly in tropical areas. Two Ochotonidae pikas cluster, mainly in cold and sub-cold regions, differing from the two Leporidae species, which are mainly in temperate or temperate-subtropical zones. Most rodents are mainly in temperate zones; a few span tropical, subtropical and temperate zones, with some widely distributed from tropical to temperate.
Codon preference and GC content analysis
Analysis of ENC, GC3 and GC contents of leptin genes in 56 species (Table 1) showed: leptin gene GC content ranged 48.90-61.51 (average 54.76); GC3 was 64.07-81.55 (average 71.42), indicating a tendency to use G/C-ending codons (since the third codon rarely changes amino acid type); ENC values were 38.70-58.39 (average ~46.57). Lagomorpha had the highest GC and GC3 contents and the smallest ENC value, while
E. miletus showed medium levels in all three.
Analysis of conservative areas and functional annotation areas
Multi sequence conservative motif analysis
The conservative motif prediction results of 56 leptin gene sequences were visualized with tbtools- aII (Fig 8). Motif 1-5 is stable in most species and its position is relatively concentrated. In rodents, motif 1-3 has more prominent repetition frequency and length; Some motifs are widely distributed in rodents and some primates and the distribution span is from 5' to 3'. Some motifs only appear in specific groups and species, such as motif 7-11 unique to some Primates Macaca and are more densely distributed at the 3' end.
Functional annotation analysis of multiple sequence conservative motifs
Functional annotation was performed on motif 1-11, some conservative motifs had no functional annotation (Fig 9). Leptin was concentrated near the 20-25 amino acids. The region of membrane bound protein predicted to be outside the membrane, in the extracellular region, is expected to be located outside the membrane, occupying the posterior segment of the sequence (about 25-50 positions). Motif 1-3 showed the functional region of the conserved motif of the protein sequence encoded by the
E. miletus leptin gene sequence.
Analysis of selection pressure and positive selection effect
The single ratio model selection pressure analysis (Table 3) shows that the overall ω=0.32993<1 and the leptin gene dN (1.5749) is significantly lower than dS (4.7733), indicating the presence of significant purification (negative) selection.
E. miletus and Microtus gather together. In some branches, synonymous substitutions of rodents decrease, while in others, synonymous substitutions increase. In addition, the average content of GC3 between sequences is 0.7142.
Free ratio model-based branch selection pressure analysis showed a better overall likelihood value (LNL = -5630.47) than the single ratio model (LNL = -5721.88). Likelihood ratio test (LRT) confirmed significant heterogeneity in selection pressure among branches. The overall kappa (ts/tv = 4.13) indicated a high transition/transversion ratio, consistent with mammalian gene evolution characteristics. With total nonsynonymous substitution rate (dN = 1.62) lower than synonymous substitution rate (dS = 4.55; dN < dS), the overall selection remained dominated by purifying selection. Under the free ratio model, Microtus and
E. miletus (ω = 0.5661) clustered into one branch. Most species underwent strong purifying selection, while some branches showed significant positive selection.
Sequence characteristics of leptin gene and protein structure of E. Miletus adapted to their living needs
This study undertook an analysis of the leptin gene in
E. miletus, predicting that its leptin protein contains a Sec/SPI-type signal peptide with a notably high support probability. Leptin functions by binding to its receptor (LepR), with α-helices playing a pivotal role in receptor recognition. The leptin of
E. miletus exhibits a high α-helix ratio, which is conducive to a stable signal transduction domain. Research on Ochotona curzoniae has demonstrated that elevated α-helix ratios enhance receptor affinity and thermogenesis, facilitating adaptation to extreme cold conditions
(Yang et al., 2011). Compared to humans and mice,
E. miletus leptin displays a higher α-helix ratio, which is essential for energy regulation in adapting to high-altitude, cold environments. Its low random coil ratio indicates a compact structure, thereby reducing energy consumption. These findings, coupled with high sequence consistency with the leptin-LepR trimer template, confirm structural compatibility and enhance receptor binding efficiency in cold environments.
E. miletus leptin (127 amino acids) has the highest leucine proportion, potentially enhancing energy reserve and utilization in alpine environments
(Zhang et al., 2009). Its amino acid composition is more ''energy-efficient'' than human leptin (≈11% leucine), aligning with survival needs in high-altitude, food-deficient habitats. As an unstable hydrophilic protein requiring continuous
in vivo synthesis, its theoretical half-life indicates eukaryotic stability, ensuring sustained energy regulation signals.
The hydrophobic region interacts with the phospholipid bilayer in transmembrane structures and maintains the conformational stability of membrane proteins at low temperatures
(Tomczak et al., 2002). E. miletus exhibits high hydrophobicity, which facilitates protein structural maintenance and receptor signal transduction efficiency in cold environments
(Chen et al., 2022). Plateau cold-zone species (
O. princeps and
E. miletus) showed strong hydrophobicity, ensuring stability and response speed of energy regulation signals. In contrast, temperate
H. sapiens and tropical
T. belangeri balance hydrophilicity and hydrophobicity, likely meeting the energy metabolism and environmental stability requirements.
Multidimensional characteristics of E. miletus adapted to living environment
Further evolutionary analysis showed that
E. miletus and Microtus clustered, indicating leptin gene conservation in the same family and genus. Rodentia families have longer branches, with niche differentiation possibly driving leptin gene differentiation. Primates and hominids cluster closely, reflecting high leptin gene conservation.
In temperature distribution, Primates,
T. belangeri and Dermoptera are mostly tropical. Ochotonidae and Leporidae differ in differentiation: some species range from tropical to temperate zones, with their leptin genes scattered in temperate clusters, showing environment-adaptive polymorphism to meet energy needs across temperature zones, reflecting molecular-level "ecological range expansion"
(Londraville et al., 2014; Gong et al., 2013). E. miletus, mainly temperate, uses leptin gene regulation to adapt to the Hengduan Mountains’ extreme, climate-variable, large temperature-difference environment.
Molecular-level analyses of codon preference and conserved motifs further support these conclusions. The leptin gene shows a preference for G/C-ending codons; high GC3 content enhances DNA stability, aiding adaptation to UV-intense tropical environments and cold zones with large temperature fluctuations (
Sambrook and Russell, 2001). A lower ENC value indicates stronger codon preference and faster translation (
Sharp and Li, 1987); the moderate ENC value of
E. miletus leptin suggests a trade-off between rapid translation and sequence variation, balancing core energy regulation functions with temperature zone specialization demands. Motifs 1-5, as core functional domains, ensure cross-species energy regulatory basics. The distribution of motifs 1-3 reveals
E. miletus leptin retains signal peptide-guided secretion, metabolic regulation and extracellular membrane protein functions, linked to adaptation to the Qinghai-Tibet Plateau’s high-altitude, cold and hypoxic conditions
(Chen et al., 2022; Gong et al., 2022).
Selection pressure analyses showed overall purifying selection on the leptin gene, with highly conserved functions. Some Rodentia branches exhibited higher dS, indicating rapid evolution and sequence divergence, potentially linked to adaptive radiation. Most rodents clustered closely with short branch lengths and low genetic distances. Microtus clusters with
E. miletus, supporting their close phylogenetic relationships. Leptin function is highly conserved among congeneric rodent species, with consistent patterns observed in other taxa.