Pools of DNA of fifty-five Jordanian Awassi ewe sheep (Fig 2), were sequenced at approximately 60× coverage and then analyzed jointly with publicly available the sheep reference genome covering with an average of 97.78% of the reference genome
(Jiang et al., 2014). 99.22 Gb of genome data were generated in the present study. For quality insurance, the raw data was modified by deleting the adapter pollution in reads and then the reads which contain over 50% low-quality bases (quality value <= 12) were removed and the stringent quality filtering yielded 93.88 Gb of genome data.
Results in Table (1) summarized the raw and cleaned reads of the whole genome sequence. The Q20 base rate of each lane is above 97%, so the data quality is very high
(Fujimoto et al., 2010).
The genome size is 2,584,832,510 bp and the effective size is 2,560,768,911bp. Burrows-Wheeler Aligner (BWA) software used for sequence reads alignment to the reference genome. The mapping rate and the final effective mapping depth were 99.28% and 36.32. Based on the consensus sequence, the polymorphic loci between our breed and the reference filtered and a high-fidelity SNP data set generated (Table 2). 18,994,659 SNPs among Jordanian ewe genomes. The SNP results are listed as CDS, exon and genes. In this study, 152,684 synonymous and 102,531 non-synonymous SNPs were annotated among this sheep genome. Furthermore, the distribution of predicted large-effect SNPs is presented in Fig 3. Premature stop codons were 1016 and 1071 were disrupt splicing donor or acceptor sites.
Short InDel detection and annotation
The Genome Analysis Toolkit (GATK) was used to detect InDels. 3,626,673 InDels generated and distributed as 1,782,830 insertions and 1,843,843 were deletions (Table 3). Furthermore, the frameshift InDel based on the InDel annotation has resulted in 43,614.
Structure and copy number variation detection and annotation
The structural variation detected includes deletion, insertion, duplication, inversion and transposition. 43.14% were deletion type of SV. Over 189326 were in gene sites. The SV results are presented in Table (4).
Copy Number Variation (CNV) is a basic form of structural variation among individuals of the same species. Genome regions deleted or duplicated on some chromosomes between the sequencing individual and the reference have corresponded to CNVs. The CNV results listed in Table (5); Over 72% of the CNV were down-regulated. Recently, the whole-genome resequencing of Jordanian Awassi Ram (
Ovis aries) using Hiseq sequencing technology was reported
(Haddad et al., 2020). The report considered this research as the first step towards sheep genomic selection of Awassi ram. Based on this report, the authors urged similar work to be performed on Awassi ewe’s genome. They justified that whole-genome resequencing of Jordanian Awassi Ram was considered a preliminary stage requiring similar whole sequencing of Awassi ewe’s genome to start Awassi sheep genomic research selection. Genome-wide studies used High-throughput sequencing and screening technologies on sheep are reported. They included single-nucleotide polymorphisms, induction variations, structural variation, copy number variants at the whole-genome level, transcriptome and DNA methylation sequences and comprehensive information on functional genes genetic variants associated with economically important traits. Studies to decipher the genetic basis of animal domestication are dramatically increased and considered an active research area. Genomic signatures related to domestication and improvement were found in response to demographic and selective differences between wild and domestic populations
(Florian J. Alberto et al., 2018). (Li et al., 2020) stated that the availability of whole-genome sequences provides an opportunity to study domestication at the gene mutation level. They re-sequenced the whole genomes of 248 wild, landrace and improved sheep and detected genomic regions: targets of domestication, breeding and selection. Furthermore, they found non-synonymous mutations in candidate genes and significant differences in their allele frequency distributions across breeds, such as PDGFD (platelet-derived growth factor D) gene for fat deposition in the tails of sheep.
(Pan et al., 2018) performed whole-genome sequencing of 99 sheep and identified candidate genes revealed local adaptations of the sheep associated with sensory perception, muscle strength, eating habits, mating process and aggressive behavior.
Furthermore,
(Florian J. Alberto et al., 2018) determined certain genomic regions carrying genes involved in the nervous system, immunity and productivity traits differentiating domestic breeds of sheep and goats from wild populations.
(Chang et al., 2018), reported two novel miRNAs from the
Ovis aries may play an important role in inducing embryonic development, sex regulation, production of sperm and ovulation. Moreover,
(Chang et al., 2020), identified few genes serve as potential biomarkers for monitoring disease progression and abortion risk during pregnancy in sheep.
(Adam
Abied et al., 2020) used high-density SNP Chip data to characterize the auto-zygosity of five local Chinese sheep breeds belonging to different geographical locations identified by the genomic regions with high runs of homozygosity frequencies within individuals of each breed. These regions included candidate genes associated with disease resistance traits, the innate and adaptive immune response, digestion and metabolism, growth, body size and developments.
(Brake et al., 2021) demonstrated using the complete mitochondrial genome of the Awassi-Jo breed (
Ovis aries) data to construct the phylogenetic tree. They showed that Awassi-Jo diverged earlier than related breeds (Turkey, Italy, Germany and Netherland) with a common ancestor in haplogroup HB. The results revealed the importance of mitochondrial data in studying sheep evolution and domestication.