Sequence characteristics of the barcode
The nine DNA barcodes of the lotus chloroplast genome showed high success rates for PCR amplification and sequencing using specific single primer pair. Sequence analysis results showed that the total length of nine chloroplast DNA regions of the two lotus study populations ranged from 8892-8921 bp (white lotus) and 8861-8916 bp (pink lotus) (Table 2). BLAST results on the National Center for Biotechnology Information (NCBI) showed that the obtained nucleotide sequence is very similar to that of the lotus species
N. nucifera (Accecssion number: KF009944.1).
Sequence characteristics of a total of nine regions are present in Table 2 shows the nine sequences the within mean group distance in each lotus population was 0.180±0.900 × 10
-3 (white Lotus) and 3.390±0.450 × 10
-3 (pink Lotus). G+C content contained in the nine genetic regions is between different lotus samples was low, ranging from 0.334-0.335 (Table 2). The appearing percentage of each nucleotide type showed that Adenine (A) and Timin (Uracin) accounted for the highest proportion and there was no high difference between studied lotus samples and ranged from 32.863 to 33.180% (A) and 33.546 to 33.676% (T), reaching an average of 33.085 and 33.595, respectively. Meanwhile, the lowest proportions were Cysteine (C) and Guanidin (G), ranging from 17.207% to 17.454% (C) and from 15.971% to 16.138% (G), reaching an average of 17.266% (C) and 16.054% (G) of the lotus samples studied.
Besides, analysis of a total of nine genomic regions of two lotus populations using DNAsp 6.0 software showed that there are five separate polymorphic positions (S), create five mutant positions (Eta) for the white lotus population; eighty-four separate polymorphic positions (S) create eighty-five mutant positions (Eta) for the pink lotus population and eighty-five separate polymorphic positions (S) create eighty-six mutant positions (Eta) for the total of both lotus populations. A total of nine genomic regions with these differences were divided into three types of haplotypes (h) for the white lotus population; seven types of haplotypes (h) for the pink lotus population and nine types of haplotypes (h) for the total of both lotus populations. The haplotype diversity coefficient (Hd) accounts for 0.618±0.104 (white lotus); 0.818±0.050 (pink lotus) and 0.869±0.026 (total of both lotus populations), the average number of nucleotide differences (k) was 156.364×10
-2 (white lotus); 2908.225×10
-2 (pink lotus) and 2210.985×10
-2 (total of both lotus population). The nucleotide diversity coefficient (Pi) accounts for 0.180±0.050×10
-3 (white lotus), 3.280±0.830×10
-3 (pink lotus) and 2.550±0.720×10
-3 (total of both lotus population), respectively. The minimum number of recombination events (Rm=1) for the pink lotus population but not expressed in the white lotus population. All the indicators were processed with statistical significance
p<0.05 (Table 2).
The results test neutrality shown in Table 3 with D, Li’s D* and F* values are negative for the white lotus population (D= -0.32197, D*= -0.76384 and F*= -0.73647, Not significant,
p>0.10), shows an excess of rare nucleotide site variants and recently emerging haplotypes compared to what would be expected under a neutral model and either population expansion or background selection has occurred concerning for to the evolution of the lotus population under study (
Fu and Li, 1993). While the D value of the pink lotus population and the combination of the total of both lotus populations yielded a positive value was with not significant
p>0.10, this indicates the evolution of the pink lotus population is studied and both total lotus populations may have been suffered a recent bottleneck or we may have evidence for overdominant selection at this locus. Besides, the values of Fu and Li’s D* and F* of the pink lotus population and the combination of the white and pink lotus populations (Not significant:
p>0.10), indicated that the study population has very few individuals show large differences in when compared to other individuals in the population (Table 3). Also, the results of the Fu’s Fs test, based on the distribution of haplotypes, showed positive values for the lotus populations as evidence for a deficiency of alleles, as would be expected from a recent population bottleneck or we may have evidence for overdominant selection at this population.
Following Fu’s Fs test, the hypothesis of natural evolution was significantly rejected for all regions of the lotus population under study. Strobeck’s S, the probability of obtaining equal or fewer haplotypes based on gene frequency and mutation rate is low in the white lotus population (Strobeck’s S statistic=0.446) and does not occur in the pink lotus population and total of both lotus populations (Table 3). These results are not consistent with deviation from neutrality due to either selection or population expansion.
Researchers often use Fst to assess gene flow, a higher Fst value indicates a lower level of gene flow (Nm) and higher genetic differentiation among populations (
Hedrick, 2005). Fst reflects the level of inbreeding within populations (
Wright, 1984) or the extent to which populations are differentiated (Hartl and Clark, 2007). The presence of genetic structure is an outcome of limited gene flow and a high level of genetic drift within each reproductively isolated group. Fst values below 0.05 indicate negligible genetic differentiation, whereas values greater than 0.25 indicate high genetic differentiation within the analyzed population (
Weir, 1996). Fst values of the lotus populations in Thua Thien Hue, Viet Nam were significant but weak (Fst=0.00233;
p<0.05, Table 4). This may be because primitive and highly conservative of the chloroplast genome in this species. This genetic distance implies that the kinship between unrelated individuals of the same ancestry relative to the lotus study population is equivalent to the kinship between individuals in a randomly mating lotus population. The presence of genetic structure is an outcome of limited gene ow and a high level of genetic drift within each reproductively isolated group. This study indicated that the two population genetic structure of lotus collection from different places in Thua Thien Hue, Vietnam are stable in both spatial and temporal scales. The random dispersal and their year-round spawning behavior might inhibit the population differentiation.
@table4
Phylogenetic analysis
The Phylogenetic tree was built based on Maximum Likelihood method (bootstrap=1000). The result is shown the first cluster groups a population of pink lotus (SH02, SH05, SH06, SH07 and SH09); The second cluster includes both the pink and white lotus samples and they are divided into 3 different sub-clusters. Sub-clusters one includes 7 pink lotus samples (SH04, SH10, SH11, SH12, SH20, SH21 and SH22), Sub-clusters two includes 11 white lotus (ST01-ST11) and 1 pink lotus (SH01) samples and Sub-clusters three includes the rest of the pink lotus samples (SH03, SH08, SH13-19). The tree topology is supported by a good bootstrap value. The differences between the two pink and white lotus populations were not found in the total nine regions of barcodes. Although, the two lotus populations have a different flower color, shared the same haplotype for the nine markers of the barcodes region, which are considered the most variable coding and non-coding regions of the plastid genome
(Chase et al., 2007) (Fig 2).