SLC45A2 gene variants from the WGS data
The whole genome sequencing (WGS) analysis of the 40 dromedary samples identified a total of four SNPs. Comparison with the
SLC45A2 mRNA reference sequence from the NCBI (accession number: XM_010977623.3) revealed that one SNP was located at exon 1 (c.95 C>T), another was identified at exon 3 (c.633 G>A) and two were found at exon 4 (c.879 T>C and c.975 C>T).
Verification of SNPs and structural analysis
Twenty-five dromedary camels were analyzed to validate the SNPs identified from the WGS data, with five samples selected from each of the five phenotypic groups. The analysis confirmed the presence of the four previously identified SNPs and revealed three additional variants: one SNP in exon 3 (cDNA 783 G>C, G>A,), another in intron 4 (g.23719539) and one in exon 7 (cDNA 1494 C>G); the G allele was detected exclusively in white dromedaries. No sequence variation was observed in exons 2, 5, or 6. Preliminary data suggested an association between variants in exons 1, 3 and 4 for the dark brown coat phenotype. Additionally, the c.783 G>C variant had three alleles (G, C and A), though no AA genotype was observed. A unique SNP was identified in exon 7 (c.1494 C>G), found exclusively in a single white dromedary presenting with abnormal hind limb growth and yellow eyes (Fig 2a and 2b). This phenotype differed from the typical white dromedary phenotype, which is characterized by darker skin, the absence of limb abnormalities and dark brown eyes (Fig 2c and 2d).
Functional analysis
To examine the impact of the exon 1 (c.95 C>T) and exon 7 (c.1494 C>G) variants on protein level, various tools were utilized to assess whether these mutations affected protein function and stability. The dromedary camel
SLC45A2 gene encodes a protein of 525 amino acids, characterized by 10 transmembrane domains. A comparative sequence analysis revealed a high degree of conservation within camelid species, with 99%, 98% and 97% sequence identity to
Camelus ferus,
Camelus bactrianus and
Vicugna pacos, respectively. However, sequence identity decreased when compared to other species, including
Bos taurus (86%),
Ovis aries (86%),
Capra hircus (86%),
Equus quagga (83%),
Canis lupus familiaris (82%),
Homo sapiens (81%) and
Felis catus (81%).
The c.95 C>T resulted in amino acid substitution (Threonine, Thr, T >T32M > Methionine (Met/M)). Furthermore, c.95 C>T was found to be located outside the transmembrane domain at the beginning of the protein structure. Multiple alignment sequences of the SLC45A2 protein sequencing showed that the T was highly conserved across mammalian species and chickens, but the M was different.
Additionally, the c.633 G>A, c. 783 G>C, G>A, c.879 T>C and c.975 C>T variants were silent mutations in which no amino acid changes occurred. These four variants were found in exons 3 and 4.
Finally, the c.1494 C>G variant led to a serine (Ser/S) to arginine (Arg/R) substitution at position 498 (S498R), occurring within the last transmembrane domain. Moreover, multiple alignment analyses of S498R showed that the S was highly conserved across many species, including chicken and zebrafish. This was followed by poly valine (Val/V); some species differed in the number of Vs (Fig 3). Multiple alignment sequences showed that humans had T instead of S at position 498; however, T and S are both amino acids with polar neutral side chains, whereas R is an amino acid with electrically charged side chains - basic.
Association analysis
A total of 117 dromedary camels were sequenced to assess the association between exon 1 variants and the dark brown coat color phenotype. However, no significant association was observed (p-value > 0.05). Additionally, 61 and 62 dromedaries were analyzed to investigate the potential associations of variants in exon 3, exon 4 and intron 4. Similar to exon 1, no significant associations were detected between these variants and dromedary coat color (p-value > 0.05). To investigate the association of the exon 7 variant (c.1494C>G) with coat color phenotypes, 135 dromedaries were sequenced. The G base was exclusively identified in white dromedaries, while all other coat color groups exhibited the CC genotype. Notably, no individuals with the GG genotype were observed. Statistical analysis revealed a significant association between the G base and white coat phenotype (p-value <0.001) (Table 2). No further sequencing was performed for exon1, exon3 and exon 4 because insignificant associations were determined. The different number of samples corresponding to different individuals was due to a failure of sequencing.
We studied the
SLC45A2 gene, identified several variants in the coding region and associated one variant with a white coat color in the dromedary camel from Saudi Arabia. Our data shows that
SLC45A2 variants are diverse, suggesting that there is gene flow between dromedary populations. Also, our data indicates that the
SLC45A2 region is where crossover events occur and in particular, exon 4 variants.
Exon 1 c.95 C>T resulted in amino acid substitution T32M. Although about 44% of the dark brown camels studied carried the TT genotype, the large cohort analysis did not confirm this association. It is possible that increasing the sample size of dark brow dromedary would increase the significance level. However, as the TT genotype is present in all dromedary populations, it is possible that this variant is involved in regulating other genes that might increase the intensity of darker pigmentation production. Threonine is highly conserved across mammalian species, suggesting that it might lead to a functional effect on protein level. Though, as it is located on the outside of the transmembrane domain, it may cause only insignificant damage to protein function. It has been noted that
SLC45A2 is highly expressed in black-headed goats and black skin biopsies
(Xiong et al., 2020). Dark brown dromedaries have darker wool on their humps, necks and tails, which could be due to higher
SLC45A2 expression in these regions. Further, contribution of other coat color genes such as
MC1R,
ASIP and
TYR genes together with
SLC45A2 will lead to have a more comprehensive answer of how these gene can correlate to produce a final melanin product.
Yang et al., (2019) identified more genetic loci in
MC1R,
ASIP and
TYRP1 genes in wild boar that was not discovered in domesticated pigs; however, these genetic bases were underlying the coat color variations in wild boar and domesticated pigs.
The exon 7 c.1494 C>G missense mutation was found only in the white population; thus, this variant was determined to be significantly associated with the white phenotype (p-value < 0.001). The variant replaces serine with arginine at position 498 (S498R). S498R is located in the transmembrane domain suggestion that it may lead to affect the protein function. Furthermore, multiple alignment sequencing shows that the S is highly conserved across mammalian species, chicken and zebrafish (Fig 3). It has been found that c.1478 G>A G493D is responsible for oculocutaneous albinism in dogs and is located in exon 7, where homozygote deletion is lethal (
Wijesena and Schmutz, 2015). Our findings show that the CG genotype is responsible for the white phenotype in dromedary camels and the GG genotype is lethal. It must be noted that 19 white dromedaries were sampled from one herd, 15 of which were heterozygotes. Most of these animals had abnormal tumor-like development in the legs that was neither identified nor investigated (Fig 2a). Furthermore, the breeder noticed that 50% of the offspring died in the first week after delivery, suggesting that the GG genotype is in fact lethal.
We also identified two silent mutations in exon 3 and another two in exon 4. None of these variants were associated with coat color variations in the dromedary. Also, the intron 4 variant was not linked to coat color variations in the sample. It must be noted that c.879 T>C and c.975 C>T were not linked suggesting that exon 4 could likely be a site of crossover events. This may also be related to extensive selection in coat color phenotypes, as differences in haplotypes indicates recent changes in the genome (
Slatkin, 2008).
It has been reported that
MC1R c901 C>T is associated with white coat color in dromedary camels
(Almathen et al., 2018; Alshanbari et al., 2019). However, not all white dromedaries carry the T allele, which is probably due to the involvement of other gene regulation processes such as transcriptional regulation process of microRNAs. It has been shown that melanogenesis signaling pathway involved the transcriptional regulation process of microRNAs in white phenotype of yak (
Basang Wang Dui et al., 2018). This supports our conclusion that
SLC45A2 exon 7 is responsible for white coat color. Also,
SLC45A2 is associated with diluted phenotypes in horses
(Holl et al., 2019). In addition,
SLC45A2 is associated with albinism in cattle and dogs (
Wijesena and Schmutz, 2015;
Rothammer et al., 2017). Furthermore, it has been reported that
SLC45A2 is associated with white coat color in pigs (
Bâlteanu et al., 2021).
Other phenotypes were observed in the dromedaries with a G base in c.1494 C>G, including yellow eyes and tumor-like growths in the hind limbs (Figure 2). It has been determined that
SLC45A2 genetic variants are involved in iris pigmentation in quail
(Huo et al., 2024). Furthermore,
SLC45A2 haplotypes have been associated with the Brazilian population’s dark/light pigmentation in the eyes and skin (
de Aguiar Fracasso et al., 2017). The other phenotypic observation in the present study was abnormal tumor-like growths in the hind limbs of dromedaries with the G base. Due to restrictions by the camel owner, we were not able to take a biopsy from the tumor-like growth. Also, only older camels had this type of growth (older than 7 years), whereas younger dromedaries did not. It appeared like a fibroma with soft tissue overgrowth. It has been determined that
SLC gene family in general and
SLC45A2 in particular are linked to cancer
(Lavoro et al., 2023). In addition, some coat color genes are associated with cancer growth and cancer development. More particularly, melanoma is associated with
SLC45A2 genetic variants (
Ibarrola-Villava et al., 2012).