Descriptive statistics for biometric measurements of Norduz sheep were given in Table 2. The mean Age of the sheep was determined as 2.21 years, with a standard deviation of 1.07 years. Overall results indicated that variables’ distributions closely approximating normality. Heatmap of Pearson Correlation Coefficients Among biometric parameters with body weight was included in Fig 2. The results of the correlation analysis indicated statistically significant associations between most of the examined biometric variables and BW.
Fig 3 and 4 present training and testing error metrics for MLP networks. As seen in Fig 3 and Fig 4, the TanSig function yielded lower MSE and MAPE values with algorithms such as BR, LM and CGB. The BR algorithm maintained low error levels across neuron counts, demonstrating high stability and generalization performance.
MLP models with varying neuron numbers and TanSig/LogSig activation functions were evaluated in detail, with optimal numerical outcomes reported in Table 3. Most algorithms showed no significant overfitting or underfitting; however, some failed to balance training and test errors, indicating generalization issues. The BR algorithm, configured with 30 hidden neurons and TanSig function, yielded the lowest test errors and the highest , while also achieving the best AIC and BIC scores. In contrast, GD, GDM and GDX algorithms resulted in relatively high errors, with many configurations displaying underfitting. Graphical summaries of Table 3 are provided in Fig 5.
Fig 5 provides graphical representations of performance metrics for configurations that fit the dataset effectively and exhibit strong generalization capabilities within MLP networks. In Fig 5, graphical representations of performance metrics for MLP configurations with strong generalization capability are presented. The BR algorithm with TanSig and LogSig activation functions yielded the lowest MSE and MAPE values and the highest R
2Adj. Additionally, the LM, SCG and CGB algorithms also demonstrated low error rates and high R
2Adj values.
Table 4 presents the statistical performance metrics of the LS-SVM model using the RBF Kernel across various σ and γ combinations. A notable overfitting instance was observed at σ=1 and γ =10, with a large gap between training and test errors. The best performance was achieved at σ=5 and γ =10, yielding low and balanced errors, indicating effective generalization. Conversely, high σ (≥30) and low γ (≤1) led to increased errors in both sets, suggesting underfitting and insufficient learning capacity.
In Fig 6, graphical representations of performance metrics are included for various σ and γ hyperparameter combinations configured with the RBF Kernel function in the LS-SVM technique. As supported by the data presented in Table 4, at low σ (σ = 1) values, as the γ parameter increases, the error rates in the training set decrease rapidly, while the error rates in the test set tend to increase after a certain point. On the other hand, at medium σ values (especially for σ=5 and σ=10), as g increases, both training and test error values decrease significantly and follow a balanced course. As can be seen in Fig 6, at moderate values (notably σ=5 and σ=10), an increment in γ is linked with a pronounced reduction in error rates for both training and testing. Conversely, in conditions with high σ (σ≥50) and low γ (γ≤2.15), a significant rise in error rates for both training and testing is detected, suggesting the emergence of underfitting and a lack of adequate generalization capacity.
In Table 5, the statistical performance metrics for different polynomial degrees of the LS-SVM model with a Polynomial Kernel Function are presented. It was observed that lower degrees (1 and 2) yielded low training errors, while test errors remained relatively higher but acceptable.
According to Tables 3 and 4, the highest prediction performance was achieved by the MLP model configured with the BR algorithm and TanSig activation function. In Fig 7(a), a close alignment between observed and predicted values is illustrated, indicating that the BR-TanSig model captured dataset fluctuations effectively. The Pearson correlation coefficient was calculated as 0.898. Fig 7(b) presents the normalized importance of independent variables. CG, HW and age were identified as the most influential predictors, while BL contributed the least to model performance.
In the domain of sheep farming, it has been reported that machine learning algorithms employing k-fold repeated cross-validation have yielded successful outcomes for body weight prediction, as demonstrated in several studies (
Shahinfar and Kahn, 2018;
Huma and Iqbal, 2019;
Cakmakçı, 2022;
Chay-Canul et al., 2024;
Hamadani and Ganai, 2023;
Tırınk et al., 2023a;
Tırınk et al., 2023b). Unlike in conducted study by
Tırınk (2022), our research has explored neural network methods in a more detailed manner, assessing various configurations comprehensively. Similarly, in the study by
Akkol et al. (2017), BR algorithm outperformed other network structures significantly. In
Norouzian and Alavijeh (2016)’s study, similar to our results, LM algorithm was employed with a sigmoid activation function, demonstrating substantial effectiveness, according to our results. Unlike our study, a measurement system employing Kinect as a sensor for BW prediction was developed by
Chay-Canul et al. (2024). Similar to our study,
Iqbal et al. (2021) also employed the RBF Kernel function; however, it introduced two heuristic algorithms, simulated annealing and the simplex method, for adjusting the hyperparameters of the LS-SVM model. Additionally, research by
Huma and Iqbal (2019) also included support vector machines, it is seen that the generalization abilities of the models are quite successful. In the study executed by
Cakmakçı (2022), contrary to our study, the Boruta algorithm was employed for feature selection. However, align with our results, the most significant variables were found as Chest width, CD and HW. Since the animals in the study had completed their developmental period, it is considered that the significance level of BL differed from the findings of the present study. Another point of divergence is that age was included as an independent variable in our model to assess the effects of different developmental stages. Additionally, the inclusion of the age variable resulted in differences in the correlations among the variables. Contrary to our study results, it was reported that the CART model exhibited lower RMSE and MAPE values compared to the NN mode with different biometric measurements in
Celik et al. (2017)’s study. Similar to our study; thoracic perimeter was among the important variables in estimating body weight of sheep.
Ali et al. (2015) conducted research with different biometric measurements from our study. Our results are aligned with existing studies in the literature, indicating that chest girth (CG) measurement can be regarded as one of the essential biometric parameters for predicting BW
(Sabbioni et al., 2020; Gurgel et al., 2021; Djaout et al., 2022).
The potential of machine learning algorithms to support livestock management has been demonstrated. Body weight estimation based on traits such as chest girth and height at withers has been shown to facilitate the assessment of growth, the optimization of feeding strategies and the selection of superior animals without requiring direct weighing, particularly in resource-limited systems. Certain limitations should be acknowledged. The sample size (n = 312) and the use of data from a single location may limit generalizability. Additionally, variations in biometric traits caused by environmental factors such as nutrition and seasonal conditions may reduce prediction accuracy. To enhance reliability, broader validation across diverse datasets is recommended.