Input variables selection
The descriptive statistics for input variables of American Yorkshire Duroc crossbreed pigs from 10th September to 10th December in 2021 are shown in Table 3. In this study, the correlation coefficient method was performed to examine the association between input and output variables. From Fig 3, there was a positive correlation between growth-related factors and body weights, specifically the height of pig (HP), age (AG), length (LP) and girth length (GL). Other researchers have reported similar findings, in where FI
(Pierozan et al., 2016), DW
(Arulmozhi et al., 2020), HP
(Yang et al., 2019), AG
(Birteeb et al., 2015), LP
(Banik et al., 2021), GL
(Banik et al., 2021) and RCO
2 (
Basak et al., 2022) are highly associated with pig’s body weight. In addition, a negative correlation existed between PBW and the pig’s body temperature (PBT). The negative correlation was noted may be due to the transition from autumn (September to November) to winter (December to March) in Korea. This observation was in accordance with previous research on a variation of temperature stress on growth parameters in pigs
(Caldara et al., 2014). The present study found that the inside room temperature (RT), relative humidity (RRH) and temperature-humidity index (RTHI) had no significant relationship with PBW. Result of correlation analysis showed that FI, DW, HP, AG, LP, GL, CO
2 and PBT were the most important factors (r³0.5) in determining PBW. Therefore, these factors were used for developing and evaluating the performance of MLR and ANNs models.
FFBP and MLR model performance
The precise model selection is important for estimating PBW. Table 4 and 5 showed the goodness of fit of four ANN models in predicting PBW with different transfer functions and neurons in hidden layers. It was found that the FFBP model with a TS transfer function, CFBP with a TS transfer function, EL with a LS transfer function and LR with LS transfer function had the best outcomes. Among these ANN models, the results of the study also showed that the FFBP model with 16 neurons in its hidden layers gave the best result during training (R
2 = 0.991 and RMSE = 0.926), validation (R
2 = 0.970 and RMSE = 1.076) and testing (R
2 = 0.954 and RMSE = 1.260) for the TS transfer function (Fig 4).
The current study also found that ANN models using a linear transfer function (purelin) performed the worst. This decrease in efficiency was most likely due to the nature of the function
(Bhujel et al., 2022). Adding 16 neurons into the two hidden layers of the FFBP model with linear transfer function (purelin) resulted in lower prediction accuracy. When the linear transfer function was changed to the tan-sigmoid transfer function, an increase in precision of predicted body weight was observed.
Wang et al., (2008) used a number of physical features obtained from pig images and connected those with the pig’s live weight. The experimental result indicated that the best performance was achieved when hidden layers had three nodes
(Wang et al., 2008). It was also noted that a higher level of uncertainty in a model is directly connected to the individual parameter coefficient and the architecture of ANN models
(Arulmozhi et al., 2021; Basak et al., 2022; Jaihuni et al., 2022).
The performance of the MLR model was also measured using the same input variables that were used in ANN models for predicting PBW. Equation (3) was employed to measure PBW:
Equation (3) helps to understand how PBW changes as a function of FI, AG, HP, GL, LP, DW, PBT and CO
2. This model determined PBW as a linear relationship between PBW and independent variables. When compared with ANN models, the MLR model had a lower performance in predicting PBW during training (with R
2 = 0.917, RMSE = 1.863 and MAE = 1.532), validation (with R
2 = 0.902, RMSE = 1.952 and MAE = 1.978) and testing (R
2 = 0.897, RMSE = 2.104, MAE = 1.681). The distribution pattern of actual and predicted PBW values for entire datasets were demonstrated using scatter and box plot (Fig 5).
The MLR models are simple and easy to handle and can be used to predict PBW, similar to the other techniques. However, the performance variations between the two models showed the importance of selecting the best one. The ANN methods were shown better performance than MLR in explaining the highly nonlinear relation between the measured and predicted values. Overall, the ANN models predicted PBW more accurately than the MLR model. When compared to MLR, the recommended ANN model (FFBP) predicted PBW with an 8.07%, 7.54% and 7.25% increase in R
2 and a reduction of 50.29%, 44.88% and 46.39% in RMSE during training, validation and testing periods, respectively. By applying the cumulative distribution function, it was found that the predicted PBW data obtained from ANN models were more accurate than the PBW estimated from the MLR (Fig 6). As shown in Figure 6, 90.22% of the data for the ANN model had a residual value between -2 and 2, whereas 67.12% for the MLR had the same range. Furthermore, the relationships between the input variables and PBW were found to be a nonlinear relationship, which may also affect the MLR models’ performance (Fig 6). The high performance in estimating the outcome of ANN modelling compared to the MLR methods for capturing the highly nonlinear and complex relationship between output and input variables has been reported in different studies
(Jaihuni et al., 2022; Sihalath et al., 2021).
Sensitivity analysis of environmental factors
Sensitivity testing was carried out to determine the effect of the individual independent factors in estimating PBW. When the ANN and MLR models were tested without LP, their ability to predict PBW was slightly reduced (Fig 7). It was noted that the ANN and MLR models without LP had the least R
2 (0.919, 0.889) and the maximum RMSE (1.987, 2.918) and MAE (1.595, 2.577), respectively. The study results showed that LP was the most prominent factor in predicting PBW, followed by AG, GL, HP, FI, DW, CO2 and PBT. The selected trait without LP could predict PBW with a 48.58% and 28.26% increase in RMSE and a reduction of 4.60% and 3.50% in R
2 for ANN and MLR models, respectively as compared to the most influential trait,
i.e. without PBT. The significance of these variables in determining the PBW has led to the introduction of these variables in many modelling studies as a crucial indirect indicator to estimate PBW
(Caldara et al., 2014). Along with LP, the four additional traits (
i.e., AG, GL, HP and FI) also significantly affected in predicting PBW in the two models. According to research findings, the ANN model in FFBP network was suggested for modelling PBW in American Yorkshire Duroc crossbreed pigs.