Isolation of genomic DNA
The genomic DNA isolated from the blood shown a single band on 0.6% agarose gel electrophoresis. The concentration of DNA was found to be 500 µg/ml. The isolated DNA had an A
260/A
280 of 1.8 indicating a highly purified preparation. The size of the amplified PCR product carried out at an annealing temperature of 59°C revealed a 1000 bp product as expected (Fig 1).
Cloning of PCR amplified LHβ gene
The T/A cloning vector pGEMT-Easy and purified 1006 bp product was ligated in the ratio of 1:3 using T4 DNA ligase. The resulting construct was used for transformation of DH5α competent cells. After plating the transformed.
E. coli cells over LB agar plates containing ampicillin, IPTG, X-gal, the plates were incubated at 37°C overnight which resulted in the formation of white and blue colonies. The white colonies with the recombinant plasmids were grown overnight in LB containing penicillin and subsequently the plasmids were isolated.
Characterization of recombinant pGEMT-Easy plasmids
Characterization by PCR
The isolated plasmids analyzed for the insert by PCR using gene specific primers. The product run on 1.5% agarose gel gave a gene amplicon of size 1006 bp (Fig 2; Lane 6 and 7). No amplification was observed in plasmid isolated from blue colonies (Fig 2; Lane 1) as negative control.
Characterization by restriction endonucleases
The plasmids that gave positive results with the PCR was further characterized by restriction endonucleases digestion. Insert of LHβ gene was released form pGEMT-Easy vector using
EcoRI (Fig 2; Lane 4 and 5). Digestion of
NotI produced two distinct bands of 399 and 607 bp (Fig 3; Lane 3 and 4).
Sequencing
The recombinant clone was sequenced and found to be 1006 bp and the sequence was submitted to GenBank (accession number AM258985). The sequence shows start codon ATG at position 19-21 and the stop codon TAA at 974-976, thus it represents the complete coding region (CDS) of the gene. It codes for 141 amino acids and contain two introns and three exons. A BLAST search for this sequence in NCBI confirmed its similarity with other mammalian LHβ sequences. The sequencing results revealed a high similarity with other mammalian species. The nucleotide sequence and the deduced amino acid sequence is shown in (Fig 4). The ORF of the goat LHβ subunit encodes 141 amino acids with a predicted molecular mass of 15.17 kDa. Goat LHβ subunit consists of a signal peptide 20 amino acids long and a mature subunit 121 amino acids long. A total of 12 cysteines and one putative N-glycosylation sites exist in this subunit (Fig 4).
Sequence analysis
Align report
Some of the available nucleotide and amino acid sequence of LHβ gene from various mammalian species were aligned and compared with respect to the goat LHβ sequence. The alignment report of amino acid is depicted in (Fig 5).
Homology
The nucleotide and deduced amino acid sequences were compared with other mammalian sequences available in GenBank. Amino acid and nucleotide sequence alignment was performed using the Clustal method (
Higgins and Sharp, 1988). Clustal W analysis of nucleotide sequences revealed a high level of similarity with the nucleotide sequences of other mammalian species. The goat LHβ sequence had the highest homology with buffalo (99.6%) and sheep (97.3%) at the nucleotide level. At the amino acid sequence level, the goat LHβ sequence had 100% homology with sheep and buffalo
(Basavarajappa et al., 2008). The unique amino acid residues, CAGY (34
th to 37
th amino acid residues), were found conserved across different species. These conserved residues are responsible for binding to the common alpha subunit
(Gharib et al., 1990).
Phylogenetic analysis of the LH-β
hormone family
Based on the multiple sequence alignment of 141 amino acid residues, a phylogenetic tree was constructed (Fig 6).
The present work was undertaken to clone and characterize the gene encoding LHβ of Indian Goat. The action of lutropin (LH), follitropin (FSH) and in some species, chorionic gonadotropin (CG) on gonadal receptors are required for fertility. These gonadotropins and other member of the glycoprotein hormone family, thyrotropin (TSH) are composed of a conserved a-subunit and a hormone specific β-subunit that enables them to distinguish LH, FSH and TSH receptors. Both hormone subunits have similar architectures and are of 15 kDa and 23 kDa respectively. The hormone specificity, species specificity and relative biological activity are determined almost exclusively by the β-subunit
(Ward et al., 1991).
The genomic DNA isolated from goat blood was amplified using specific LH gene primers. At an annealing temperature of 59
oC, PCR gave distinct band of 1006 bp as expected. The cloned LHβ-subunit of goat is 1006 bp fragment long and it extends immediately from 5’ of the translational start codon (ATG) to 3’ of translational stop codon (TAA) and thus spanned the entire coding region. The gene consists of two introns and three exons. The sequence of junction splice sites is identical with consensus sequences at exon intron boundaries. Introns length were 297 and 235 bp. Intron 1 is inserted between the amino acid codons -16/-15 in the apoprotein signal sequence. Intron 2 is inserted between codons +41 and +42. While the lengths of these introns are slightly shorter than those of other mammalian species, their relative location within the transcriptional unit are identical indicating strong conservation of gene structure within the glycoprotein hormone family as well as between mammalian species.
The nucleotide base at position 330 was ‘A’ in the studied goat LHβ sequence while it was ‘G’ in all other mammalian species used to compare this sequence. Since, this positioned at intron-exon junction, the sequence should be ‘G’. The goat LHβ subunit was amplified by PCR using Taq polymerase which is known to introduce error in the amplified fragment. It is therefore every possibility that the ‘A’ at position 330 is an artefact introduced by Taq polymerase. This error can be checked by resequencing the other clones containing LHβ gene.
The LHβ subunit is 97.7% and 92.3% similar with 9 and 15 bp shorter than the ovine (Accession number S64695) and bovine (Accession number M11506) LHβ subunit respectively. The additional bases present in intron doesn’t affect the reading frame. Almost all differed and inserted nucleotides were found in the intron region only irrespective of species which is 100% in case with that of sheep and cattle.
Nucleic acid sequence comparison allows determination of not only the number of base changes which results in amino acid changes (replacement changes) but also the number of base changes which have no effect on amino acid sequence (silient changes). Determination of both silient and replacement substitution thereby provides an index of the rate of neutral genetic drift at silient or intron sites as well as on index of divergence at replacement sites. For a given set of related genes, the rate of sequence divergence at replacement sites is nearly linear when analyzed over recent evolution.
A comparison of the amino acid sequence of polypeptide deduced from coding sequence as presented in (Fig 4) indicate that the LHβ is 141 amino acids long with a leader (signal) sequence of 20 amino acids and a mature subunit of 121 amino acids. The molecular weight calculated is 15.2kDa. The signal sequence is rich in hydrophobic residues and remain same in sheep, cattle and buffalo but the difference is observed with other mammalian species. The difference observed with other mammalian species may not affect the biological activity of the hormone, since this sequence is cleaved from the nascent peptide and it plays an important role in maturation and secretion of hormone (
von Heijne, 1985).
In apoprotein molecule the position of 12 cysteine residues is conserved among all species. These twelve cysteine residues form six disulfide bridges helping in the formation of three large loop, which in turn play a critical role in heterodimer formation and/or secretion
(Furuhashi et al., 1995; Sato et al., 1997). Formation of additional loop, known as seat belt, is largely responsible for influence of β-subunit on receptor binding specificity, probably through an influence on hormone conformation. It is noted that a single potential glycosylation site (Asn-X-Thr) at position 13 to 15 is aligned with that of vertebrate species except human. This conserved N-glycosylation site is required for signal transduction, intercellular stability and subunit folding (
Matzuk and Boime, 1988;
Sairam and Bhargavi, 1985). The region important for biological activity, the “CAGY” region
(Hayashizaki et al., 1985; Ledley et al., 1976; Pierce and Parsons, 1981) is also conserved among all these species and this region is found in the beta subunit of other glycoprotein hormones also.
The common tetrapeptide sequence (Cys-Gly-Pro-Cys), which is found in the active site of thioredoxin and LHβ subunit of various mammals, is also found in the goat LHβ sequence. LH has an intrinsic thioredoxin activity and this activity is important in receptor activation and in post receptor signal transduction (
Boniface and Reichert, 1990). This catalytic tetrapeptide sequence does not exist in any non-mammalian LHβ-subunit so far examined, so this hypothesis is restricted to mammals.
A recombinant single chain beta alpha equine chorionic gonadotropin/luteinizing hormone (eCG/LH) with active biological potential was produced for applications in assisted reproductive technologies or diagnostic assays through expression of the eCG/LH gene construct
(Bhardwaj et al., 2018). Based on the sequence of gonadotropin subunit of goat, recombinant lutropin can be produced in an in-vitro system for long term treatment in assisted reproduction of goat without possible risk of antibody production.