Quality evaluation of microbial sequencing in calf diarrhea feces
Use SOAP denovo and CD-HIT software to assemble and de redundant data, use MetaGeneMark for ORF (Open Reading Frame) prediction and filter out information with a length of less than 100 nt based on the prediction results. After sequencing, effective sequences were obtained through quality control, chimerism removal and splicing. A total of 698,029 original sequences were predicted, with an average of 174,507 sequences per sample. After filtering through relevant software, 531,393 valid sequences were ultimately obtained. The average length is 693.82 bp and the GC content is 46.69%. Among them, there are 268,345 complete genes, accounting for 50.5% of the total number of non redundant genes. In addition, the effective data rate for quality control of calf diarrhea fecal samples is 99.65% (Table 2). The number of C1 characteristic sequences in calf diarrhea fecal samples is the highest and the F1 group is the lowest. The highest overlap is 93,221 feature sequences and the lowest is 15,346 feature sequences. The highest order of the four overlapping parts is 22,097 feature sequences and the lowest is 14,783 feature sequences (Fig 1).
The original data volumes for F1, C1, C2 and C3 are 7,802.78, 7,063.78, 7,356.44 and 6212.20, respectively. The final valid data obtained by filtering F1, C1, C2 and C3 through relevant software are 7, 782.27, 7045.09, 7313.78 and 6195.05, respectively. The Q30 of F1, C1, C2 and C3 filtered data were 92.36, 92.84, 93.19 and 93.00, respectively. The GC contents (%) of F1, C1, C2 and C3 filtered data were 45.67, 45.61, 46.35 and 44.98, respectively. The percentages of effective data for F1, C1, C2, and C3 compared to the original data are 99.737%, 99.735%, 99.420% and 99.724%, respectively, indicating that the sequencing data is reliable and of good quality (Table 3).
Beta diversity analysis
Perform PCoA (Principal Co coordinates Analysis) analysis based on Bray Curtis distance and select the principal coordinate combination with the highest contribution rate for graphical display. Observing the differences between sample groups through PCoA, different colors in the results represent different groups. The closer the sample is, the more similar the microbial composition and structure between the samples are. Conversely, the greater the difference. The distance between the C2 and C3 treatment groups is relatively close, while the distance between the C1 group and the other three groups is relatively far, indicating significant differences in microbial community types (Fig 2).
In order to study the similarity of different samples, cluster analysis can also be conducted on the samples to construct a cluster tree of the samples. Non metric multidimensional scale (NMDS) analysis based on species abundance at the genus level can also fully support the above analysis results. If the species composition of the samples is more similar, the distance between them in the NMDS diagram is closer (Fig 3).
Species composition analysis
The number of ORFs that can be annotated into the NR database is 465,393 (87.58%) based on 531,393 predicted genes after original de redundancy. Based on the annotation results of feature sequences and the characteristic tables of each sample, a species abundance table at the level of kingdom, phylum, class, order, family, genus and species was obtained. The species composition and inter group differences of the gut microbiota of four groups of samples were analyzed for different levels of species abundance tables. Among them, the ORFs that can be annotated into the NR database have a proportion of 86.45% at the boundary level, 83.53% at the phylum level, 78.59% at the class level, 77.95% at the order level, 62.78% at the family level, 57.58% at the genus level and 38.18% at the species level. The dominant phylum includes Firmicutes, Bacteroidetes, and Actinobacteria, among others. This study focuses on changes at the phylum, genus and species levels (Table 4).
Microbial composition at the phylum level
At the phylum level, species composition analysis yielded a total of 10 phyla. Among them, Firmicutes, Actinobacteria, and Bacteroidetes are the dominant phyla in F1, C2 and C3. The dominant phyla in C1 are Firmicutes, Actinobacteria, and Proteobacteria (Fig 4). Whether in summer or winter, Firmicutes and Actinobacteria have always been significant dominant bacteria, which is similar to other studies and consistent with the results of studies on the composition of gut microbiota in herbivorous animals (
Fountain et al., 2020;
Oikonomou et al., 2013). These two phyla are the main dominant phyla of ruminants, with the highest relative abundance. They participate in important processes such as food digestion, nutrient regulation and absorption, energy metabolism, and host gut defense against invasion of foreign pathogens
(Wang et al., 2017). However, their relative abundance may vary in different seasons. Proteobacteria is widely present in nature and is a common opportunistic pathogen that can colonize the body’s skin, respiratory tract, gastrointestinal tract,
etc. This study found that the abundance of Proteobacteria in the intestinal microbiota of winter diarrhea calves is higher than other seasons, indicating that the immune system of calves is more likely to decrease in winter, leading to diseases .
Microbial composition at the genus level
At the genus level, species composition analysis yielded a total of 10 genera. The main genera in F1 are
Blautia (17.269%),
Collins ella (13.88%), and
Bacteroides (3.996%). The main bacterial genera in C1 are
Bacteroides (13.804%),
Prevotella (8.035%) and
Clostridium (2.984%). The main genera of bacteria in C2 are
Bacteroides (15.535%),
Lactobacillus (10.581%) and
Faecalibacterium (8.467%). The main genera of bacteria in C3 are
Lactobacillus (18.939%),
Faecalibacterium (6.679%) and
Bacteroides (4.858%) (Fig 5).
Microbial composition at the species level
At the species level, a total of 10 species were obtained through species composition analysis. The main strains in F1 are
Blautia sp. CAG:257 (11.709%),
Clostridium hiranonis (7.248%) and
Firmicutes bacterium CAG:424 (5.044%). The main strains in C1 are
Firmicutes bacterium CAG:110 (3.647%),
Prevotella sp. CAG:485 (2.787%) and
Firmicutes bacterium CAG:424 (0.214%). The main strains in C2 are
Faecalibacterium prausnitzii (3.723%),
Lactobacillus reuteri (2.224%) and
Firmicutes bacterium CAG:424 (0.126%). The main strains in C3 are
Lactobacillus reuteri (4.901%),
Firmicutes bacterium CAG: 424 (3.481%)and
Faecalibacterium prausnitzii (3.084%) (Fig 6).
PICRUS function prediction analysis
Out of 531,393 predicted genes, 380,152 (71.54%) genes can be compared to the KEGG database. Among them, 206,460 (38.85%) genes can be compared to 4,818 KEGG ortholog groups in the database; 378,220 (71.18%) genes can be compared to the egg NOG database; There are 19,712 (3.71%) genes that can be compared to the CAZy database (Table 5).
KEGG function prediction analysis
Based on the functional annotations and abundance information of all samples in the KEEG database, select the top 35 abundant functions and their abundance information in each sample to draw a bar chart. Use the KO based relative abundance bar chart as an example to display (Fig 7). The KEEG database annotation results statistical table shows that amino acid transport metabolism, energy metabolism and other metabolic functions are abundant, as well as genetic functions such as protein modification, folding, and translation are abundant (Table 6). The results indicate that the microbial metabolism and genetic information processing functions of calf diarrhea feces are very rich.
CARD function prediction analysis
Out of 531,393 predicted genes after de redundancy, 307 genes were able to be compared to the CARD database, containing a total of 228 ARO (the Antibiotic Resistance Ontology). Among them, F1 is resistant to 117 drugs, and the top three resistance genes in abundance are
tetO,
tetW/N/W and
mefA; C1 is resistant to 148 drugs, and the top three resistance genes in abundance are
tetW/N/W,
lnuC and
ErmF; C2 is resistant to 170 drugs, and the top three resistance genes in abundance are
APH3-Ib,
tetW/N/W and
OXA-85. C3 is resistant to 163 drugs and the top three resistance genes in abundance are
tetO,
tetW/N/W and
AAC6-Ie-APH2-Ia (Fig 8).
Seasonal changes are often considered a factor that significantly affects microbial communities and antibiotic resistance genes in different environments (
He et al., 2020). The seasonal changes in ruminant populations have a significant impact on the diffusion of antibiotic resistance genes. In this study, the abundance of tetW/N/W was relatively abundant in all four seasons.