The correlation matrices provided insights into the relationships between body weight and other morphological features (Body height, body length and heart girth) across different age groups for both male and female cattle.
Table 1 shows that the young male cattle’s heart girth has the strongest predictor of body weight (0.90), body length (0.72) and body height (0.65) are moderately correlated but less influential than heart girth. Similar to male cattle, female cattle, heart girth remains the best predictor (0.85). body length (0.70) has a stronger influence than body height (0.52). Therefore, in short heart girth is the dominant predictor for both male and female young cattle, while female cattle show weaker correlations compared to male cattle, indicating slightly more variability in body structure.
Table 2 presents, for early maturity male cattle, heart girth has the strongest correlation with body weight (0.91). This means that heart girth remains the dominant predictor. body height and body length also become more important (0.84 and 0.82, respectively). For female cattle in this group, heart girth remains the strongest predictor (0.90), while body height (0.73) and body length (0.74) have similar but slightly lower correlations. This age group shows the strongest overall correlations, indicating consistent growth relationships. Male cattle show stronger correlations across all features compared to females.
Table 3 presents, for males at group 547-728 Days Old (Growth Peak), Male Cattle, Heart Girth remains the best predictor but shows a slightly weaker correlation (0.76), while Body Length and Body Height remain moderately correlated (0.69 and 0.67, respectively). Female cattle in this group, Heart Girth shows its strongest correlation in this group (0.92), making it an even better predictor for females than for males, while Body Length (0.61) and Body Height (0.63) have weaker correlations compared to younger groups. Overall, for male cattle, correlations begin to weaken slightly, possibly due to growth variability. For female cattle, Heart Girth becomes the strongest predictor across all age groups. Feature importance shifts slightly, meaning additional predictors may improve model accuracy.
Table 4 shows, for males in the age group Cattle of 729-1040 days old (mature cattle for males), Heart Girth (0.88) remains dominant. Body Length (0.78) and Body Height (0.71) maintain a strong influence. For female cattle, correlations weaken significantly compared to earlier stages. Heart Girth is still the best predictor (0.74), but not as strongly correlated as in younger cattle. Body Length (0.53) and Body Height (0.50) become less reliable predictors. Overall, male cattle’s heart Girth still shows strong correlations in adulthood, meaning models can still predict their weight effectively. For female cattle, weight prediction becomes more challenging due to weaker correlations.
It can be seen from Table 5 that by analyzing model performance across the ages, we found that LR consistently performs well across all age groups, RFR sometimes performs well but is less consistent, while SVR struggles in most cases due to the dataset characteristics and NNR do not significantly outperform simpler models, suggesting that simpler models are more suitable for prediction. Overall, RF consistently outperforms SVR and NNR in all age groups. SVR and NNR struggle in this dataset, especially in the 547-728 days group, where its R² score is significantly lower than that of the Random Forest model. For mature cattle (729+days), both models have high errors, indicating a need for feature engineering, alternative models, or ensemble methods.
Table 6 at age group 240-365 days, LR performs best in this age group, RF and PR perform moderately well and have similar R² around 0.71, indicating a moderate predictive power. PR has slightly better RMSE, but the difference is small. The model performance across the ages shows that LR performs well in younger age groups, particularly in 240-365 days. RFR performs better in 366-546 days, likely capturing non-linear relationships in growing cattle. SVR and NNR generally struggle due to dataset characteristics, suggesting insufficient data for deep learning.
The results indicate that the performance of machine learning models in predicting Bali cattle body weight varies significantly across age groups, reinforcing the need for an age-specific modeling approach. PR consistently outperformed other models in the middle-age group (366-728 days), where cattle experience nonlinear growth spurts, aligning with previous studies that highlight its effectiveness in capturing weight progression trends
(Cavalcante et al., 2020). In contrast, RFR demonstrated superior performance for younger (240-365 days) and older (729+ days) cattle, likely due to its ability to handle complex feature interactions and nonlinear dependencies, which have been similarly reported in studies on Brahman cattle and other breeds
(Li et al., 2022). The lower performance of SVR and NNR suggests that these models struggle with the dataset characteristics, particularly in cases where data volume and feature distributions impact convergence and generalization.
Comparing these findings to existing research, prior studies have shown that ensemble models like Random Forest are highly effective in predicting cattle weight based on morphometric measurements (
Sarini and Dharmawan, 2023;
Tırınk et al., 2023), which aligns with this study’s results for young and mature cattle. However, the limitations observed in older female cattle, where correlation values weakened, suggest that additional features (
e.g., age, nutrition, breed variations) should be incorporated for improved model accuracy. Moreover, while polynomial regression captured growth trends well, its performance declined in mature cattle due to irregular weight fluctuations influenced by external factors such as diet and genetics, reinforcing the findings of previous studies that highlight the limitations of polynomial models for highly variable biological data
(Cano et al., 2016; Widyas et al., 2018). These results emphasize the importance of using different models tailored to specific growth phases, providing valuable insights into precision livestock management. However, future studies should explore deep learning approaches or hybrid models that integrate multiple predictive techniques to enhance overall prediction accuracy and applicability. The accuracy of these models varied, with some achieving Mean Absolute Errors as low as 0.35 kg (
Setiawan and Utami, 2024) and others around 13-18 kg (
Setiawan and Utami, 2024;
Weber et al., 2020). By accurately predicting cattle body weight based on age group and morphological features, farmers can adjust feed portions to optimize growth and prevent overfeeding or underfeeding, reducing costs and improving efficiency. For early age (240-365 days), weight predictions also help in health monitoring, enabling farmers to detect abnormal weight fluctuations that may indicate disease, malnutrition, or poor growth rates, allowing for timely intervention
(Segerkvist et al., 2020).
The lower prediction accuracy for older cattle (729+ days) can be attributed to greater biological variability and external influences
(Hassen et al., 1999). Besides, nonlinear growth fluctuations of the cattle are harder for the machine learning models proposed in this research to capture. XGBoost algorithms have demonstrated good performance in predicting daily weights (R
2 = 0.869)
(Awasthi et al., 2024; Awasthi et al., 2023). Unlike younger cattle, which follow more consistent growth trajectories, mature cattle experience weight variations due to factors like diet, genetics, reproductive status and environmental conditions, which were not explicitly included as model inputs.
The outcomes of this study are in line with earlier research exploring the use of body measurements and machine learning for predicting livestock weight.
Karakus (2025) reported that prediction models performed better when lambs were grouped by age, which supports our finding that model accuracy improves when Bali cattle are segmented by developmental stage. A similar conclusion was drawn by
Haldar et al., (2023), who found that decision tree models, particularly Recursive Partitioning and Regression Trees (RPART), worked well for predicting goat weight based on body dimensions-echoing our observation that Random Forest models handle complex cattle morphometrics effectively, especially in younger and mature groups. Additionally,
Banik et al., (2021) demonstrated that nonlinear regression methods were better suited for estimating piglet weight during early development, which aligns with our result showing the strength of polynomial regression during the 366–728 day growth window. Meanwhile,
Hamadani et al., (2024) observed that support vector machines and deep learning models struggled with generalizing predictions across sheep age groups, a challenge that was also evident in our neural network and SVR results for older Bali cattle. From a broader perspective,
Sultana et al., (2022) and
Djeghar et al., (2025) reaffirm the reliability of body measurements like heart girth and body length in estimating cattle weight, reinforcing our conclusion that these traits remain essential across all age groups for accurate prediction.