After completing 50 epochs of training, the convolutional neural network (CNN) model demonstrated remarkable performance metrics on both the training and validation datasets (Fig 4). The loss of the training set dropped to 0.0444, suggesting that the model's predictions closely matched the actual values, with minimum error. In addition, the model achieved an accuracy of 98.48% on the training set, indicating that it properly identified the majority of the training samples. On the validation set, which functions as an autonomous evaluation of the model's capacity to generalize, the loss was marginally elevated at 0.0592, suggesting a little increased mistake in comparison to the training set while still maintaining a commendable degree of accuracy. The validation accuracy remained consistently high at 98.12%, indicating that the model's performance extended well to data that it had not been trained on. In summary, the results demonstrate that the CNN model efficiently acquired significant characteristics from the input images and accurately categorized them into the correct groups. This was achieved through 50 epochs of training, resulting in high accuracy and minimal loss.
The upcoming experiment aims to predict classes, along with their corresponding confidence scores (Fig 5). This involves utilizing the learned Convolutional Neural Network (CNN) model to provide predictions on new or unobserved data. Subsequently, for every prediction, the class with the greatest probability is determined, indicating the expected class label. The confidence score is determined by the probability assigned to the predicted class, which represents the model's level of certainty in its prediction for that specific sample. Ultimately, the predicted category, along with its related level of certainty, is generated for every individual instance in the dataset. This procedure facilitates a more profound comprehension of the model's efficacy by offering insight into the certainty of its predictions, hence assisting in decision-making in practical situations.
The confusion matrix offered has rows that indicate the accurate labels for several classifications, including healthy, rust, gall and chocolate spot (Fig 6). The columns represent the projected labels. The diagonal elements represent accurate predictions, where the true label corresponds to the expected label. For instance, among the group of healthy individuals, all 216 samples were accurately identified as healthy, resulting in a true positive count of 216. In the rust class, all 219 samples were accurately identified as rust, resulting in a true positive total of 219. Within the gall class, there was a single instance of misclassification when a sample was erroneously identified as rust, leading to a tally of 1 false negative. Furthermore, one sample was erroneously categorized as chocolate spot, leading to a count of 1 false positive. In the chocolate spot class, out of a total of 190 samples, 183 were accurately identified as chocolate spot, while 7 were incorrectly classified as rust. This shows 7 false negatives. Finally, the confusion matrix offers a complete analysis of the ability of the model for several classes of diseased leaves. It enables an assessment of precision and identifies potential eras for enhancement of accuracy.
Table 1 summarizes the performance metrics of multiple classes. These metrics provide information regarding the ability of the model to accurately differentiate between categories. In the classification report, it can be shown that the rust and gall classes show a high degree of precision.
From the table, it can be seen that the healthy class shows a remarkable precision, recall and F1 score that is equal to 1. It supports 216 instances. Rust and gall classes show comparable results. The rust shows higher recall than gall but gal has higher precision value than rust. Both show good F1-score. The rust supports 219 events while gall promotes 207 events.
In accordance with the above discussion, the chocolate spot and rust classes yield significant insights. The rust has a perfect recall (1.000) while the chocolate spot has a lower recall of 0.9632. On the other hand, precision for the chocolate class (0.9946) is higher than rust (0.9648). This means the model has a low detection level for rust. The overall accuracy of the prediction for all classes is equal to 98.92%.
The given information includes ROC curves and their accompanying Area Under the Curve (AUC) values for four categories: healthy, rust, gall and chocolate spot (Fig 7). The AUC values for healthy, rust, gall and chocolate spot are 0.5396, 0.5616, 0.5533 and 0.5799, respectively. The ability of the model to differentiate between instances that are positive and instances that are negative for each class is measured by these scores, which indicate the discriminatory power of the model. In general, the AUC values provide valuable information about the model's ability to accurately identify cases for each class. ROC(AUC) curves provide information for the evaluation capability of the model across different classes of disease leaves of plants.
The results obtained for the faba bean leaf diseases are compared with the existing research papers.
Salau et al., (2023) demonstrated the accuracy of the model for processed and unprocessed data. They found that the accuracy of the unprocessed data (92.1%) is lower than that of processed data (98.14%). YOLOv8 model is used by
Slimani et al., (2023) to classify the rust disease in faba bean fields in eastern Morocco. A precision rate of 95.1% is achieved by the authors. Moreover, the evaluation metrics such as mAP (mean average precision), recall and F1-score have high values. The YOLOv8 performs well in identifying diseases in plants. It is also used for tracking and classifying objects. In view of previous studies, it can be concluded that our work is important to detect diseases in the leaves and other parts of the plants. The presented method for faba bean leaf disease detection has reliable and reproducible results that highlight the significance of the findings.
Limitations and future work
In this study, only four diseases affecting the leaves of Faba beans were examined. However, it's important to note that Faba beans can be affected by many different diseases, not just on their leaves but also on other parts of the plant. So, in the future, more research should be done on diseases that might affect different parts of the Faba bean plant. Also, in this study, a certain type of model with a special classifier was used. To make the model better and to see how well it works, different methods could be used in the future, like support vector machines, random forests and decision trees. By doing this, researchers can find better ways to detect and manage diseases in Faba beans. In the future, the collaborative development of intelligent imaging technologies shows the potential to assist farmers, consumers and the environment by promptly identifying and optimizing crop health to prevent threats.