Background: Shrimp rearing and wild-catch shrimp fisheries provide employment and income for millions worldwide. As the global population rises, shrimp plays a key role in addressing food security challenges. It is a nutritious and accessible food, widely consumed in diverse cuisines. Ecologically, it serves as scavengers and detritivores, maintaining stability in aquatic environments. This ecological role has driven the development of sustainable shrimp farming practices. Disease management, especially prevention and early detection, is a critical concern in shrimp farming. Automated image processing could speed up disease detection, ensuring timely intervention. This study focuses on detecting white spot syndrome virus (WSSV) in shrimp using convolutional neural networks (CNN) with promising accuracy.

Methods: The model uses a sequential convolutional neural network (CNN) to identify WSSV in shrimp. It is trained with images categorized into two groups: Healthy and diseased. Features like lesions and discoloration are used to detect infection. The dataset is preprocessed to reduce resolution variation through bilinear interpolation. The model’s architecture consists of multiple layers that extract features and reduce spatial dimensions. Concatenation layers combine information from different network parts to enhance feature extraction. Performance is evaluated to determine the model’s ability to make reliable predictions.

Result: The CNN model achieved a remarkable accuracy (93.75%) in detecting WSSV. It correctly identified all WSSV instances with minimal misclassification. The overall performance of the model demonstrates its robustness. The ROC curves further confirm its ability to classify both healthy and diseased shrimp effectively, highlighting its potential for early disease detection in shrimp farming systems.

The global shrimp industry has seen significant growth in both quantity and quality over the past 15 years, with farmed species, particularly Pacific white leg shrimp, dominating production. Farmed shrimp accounted for 63% of total production in 2022. The rising demand for shrimp is driven by health benefits and the introduction of shrimp-based products, particularly frozen shrimp, which offers extended shelf life. Despite challenges like oversupply and low prices, the industry has recovered post-COVID-19 and is expected to continue expanding. Future growth will depend on climate- smart aquaculture, sustainability certifications and efficient production practices (Mandal and Singh, 2025; Yang et al., 2024).
       
The global demand for seafood has escalated, prompting countries to adopt better aquaculture practices to improve productivity, efficiency and sustainability. Advances in selective breeding and genetics have led to the development of improved shrimp, fish and other shell breeds with enhanced growth rates and nutritional value. Technologies like marker-assisted selection and genomic selection are accelerating breeding processes in marine species (Zenger et al., 2019). Among various seafood products, shrimp holds a significant position. According to the Food and Agriculture Organization (FAO), shrimp is the most traded seafood commodity globally (FAO, 2022) and its consumption continues to rise, ranking alongside popular seafood items like fish, tuna and mollusks.        

Shrimp farming contributes billions of dollars annually to the global economy (Phong et al., 2021). It is considered a sustainable method for meeting the growing food demand and bolstering social and economic stability worldwide.
       
Shrimp plays a vital role as a high-protein, nutrient-rich food source, contributing to global food security amidst a rapidly growing population. Its versatility in various cuisines has made it a favorite among consumers and chefs. Furthermore, shrimp helps maintain aquatic ecosystems by recycling organic matter and ensuring water quality (Depestele et al., 2019; Hai and Duong, 2024; Maltare et al., 2023; Bagga et al., 2024). As scavengers and detritivores, shrimp populations support balanced food webs in aquatic environments. These roles have led to the innovation of shrimp farming models that prioritize environmental sustainability and commercial success. Additionally, shrimp farming holds cultural significance in coastal communities, where traditional practices are passed down through generations, reinforcing local cultural identity.
       
However, the rapid expansion of the shrimp industry has highlighted environmental challenges such as habitat degradation, water pollution, overfishing and disease outbreaks. These issues have prompted the establishment of eco-certification programs and conservation initiatives aimed at minimizing environmental harm and improving resource management. Diseases such as White Spot Syndrome Virus (WSSV), Taura Syndrome Virus (TSV) and Early Mortality Syndrome (EMS) have significant economic and ecological impacts on shrimp farming. Molecular tools, such as polymerase chain reaction (PCR) assays and loop-mediated isothermal amplification (LAMP), have been developed for rapid and accurate disease detection in shrimp farming. However, these methods face limitations in terms of cost, complexity and time sensitivity (Islam et al., 2023). Moreover, traditional disease detection methods rely heavily on farmer expertise, making them time-consuming, labor-intensive and expensive (Estevez et al., 2023; Wang et al., 2024).
       
To address these challenges, there is a growing need for affordable, user-friendly and field-deployable disease surveillance tools. Machine learning (ML) algorithms and artificial intelligence (AI) techniques offer promising solutions by enabling large datasets to be analyzed for early disease detection, health monitoring and environmental monitoring. Non-invasive, low-cost and user-friendly technologies, such as computer vision, are particularly promising for shrimp disease detection (Aung et al., 2024). Using image acquisition equipment, data can be collected and used to train deep learning models that offer high accuracy and quick intervention opportunities (Abade et al., 2021; Duong-Trung et al., 2020; Min et al., 2024; Zhang and Gui, 2023).

Convolutional Neural Networks (CNNs) are emerging as a key technology for analyzing shrimp health. Deep CNN models can classify shrimp images based on disease symptoms, such as lesions, abnormalities and other visual signs. Object detection algorithms, including Faster R-CNN and YOLO (You Only Look Once), are designed to identify regions of interest in shrimp images, providing valuable inputs for disease diagnosis and monitoring. These CNN-based systems offer prompt disease detection with minimal human involvement, making them well-suited for large-scale aquaculture settings. An automated image-capture system, combined with CNN-powered disease recognition models, can significantly enhance disease surveillance, improving shrimp farm productivity and sustainability.
       
Several studies have addressed the need for efficient disease monitoring and classification in shrimp farming. Phong et al., (2021) discuss the market dynamics that drive farmers to optimize shrimp yields and productivity while maintaining environmental sustainability. Close supervision of shrimp farms is essential for adopting pro-environmental practices, such as avoiding excessive antibiotic use to prevent diseases. The shift from traditional manual disease detection to machine learning algorithms is gaining momentum in aquaculture, with various ML models being applied for classification tasks.
       
The detection of shrimp diseases using CNN-based models is an evolving field with significant potential for improving aquaculture practices. The integration of machine learning techniques, such as CNNs, into disease monitoring systems enables more timely, accurate and scalable interventions. While many existing models have shown promising results, further research is needed to optimize these systems for large-scale deployment.
       
The current work proposes a CNN-based approach for classifying shrimp health states and emphasizes the importance of accuracy, efficiency and low-cost technologies in supporting sustainable shrimp farming operations. By enhancing disease detection and early intervention capabilities, this model contributes to the overall productivity and sustainability of shrimp aquaculture.
The following parts outline the procedure employed to construct a sequential Convolutional Neural Network (CNN) model for the identification of shrimp infected with WSSV disease. The task is separated into multiple distinct parts, with the initial one including the collection of images for categorization using deep neural networks.
 
Dataset
 
The dataset is composed of images from two different groups: Healthy and WSSV (White Spot Syndrome Virus). WSSV is a serious viral infection affecting global shrimp aquaculture. Images of shrimp with normal physiological characteristics that is, without any obvious signs of disease or infection are included in the healthy class. On the other hand, images of shrimp in the WSSV class show signs of a WSSV infection, like discoloration, abnormal behavior, or visible lesions on the body surface. The dataset consists of 298 images, divided into three subsets. The training set has 238 images (80%). The validation and testing datasets each contain 30 images (10%). This 80-10-10 split helps ensure a balanced dataset for effective model training and evaluation. Fig 1 illustrates representative images of healthy shrimp and shrimp infected with White spot syndrome virus (WSSV).

Fig 1: Healthy and WSSV-infected shrimp.


 
Image preprocessing
 
Before training, preprocessing techniques are applied to every image to ensure uniformity and improve the quality of the dataset. Resizing the images was done to ensure that the dimensions were consistent across the collection. Bilinear interpolation was used to reduce images of varying resolutions to 128×128 pixels. The images were then transformed into grayscale. A significant number of training data is required at this pre-processing stage to explicitly learn the features of the training data. In the next step, the images of shrimp were categorized based on their type and then annotated with the corresponding acronym representing the specific disease. In this case, the test collection and training dataset exhibited two distinct classes.
 
Data augmentation
 
An array of augmentation techniques is used in this study to add diversity to the original dataset. These methods were rotation, flipping, shifting, zooming and shearing. Rotation added random rotations to the images to simulate different views and orientations, which helped the model identify objects from a range of perspectives. Flipping created mirror images in the dataset by flipping images horizontally, which improved the model's ability to recognize objects in any orientation.
 
Training dataset
 
The technique includes the utilization of a Convolutional Neural Network (CNN) to create a model for assessing performance. The model was trained to utilize image data as input. The flow chart illustrating the classification process of healthy and diseased shrimp using Convolutional Neural Network (CNN) is shown in Fig 2.

Fig 2: Flow Chart of the classification using CNN of healthy and diseased shrimp.


 
Architecture of convolutional neural network (CNN) model
 
The architecture of a neural network model includes various types of layers such as convolutional layers, max-pooling layers, flatten layers and dense layers (Fig 3). Table 1 represents the parameters used in the CNN algorithm. Starting with the input layer, labeled as "input_1," it takes input data with dimensions of 128x128x3, where 3 represents the number of color channels (typically red, green and blue in an image).

Fig 3: Architecture of Sequential CNN model.



Table 1: Parameters of CNN model.


       
The subsequent layers include convolutional layers ("conv2d"), followed by max-pooling layers ("max_pooling2d"). These layers are responsible for extracting features from the input data while reducing spatial dimensions, thereby aiding in capturing important patterns efficiently. As the data progresses through the network, the convolutional and max-pooling layers are stacked to deepen the network's configuration, gradually reducing the spatial dimensions of the data and increasing the number of channels. The architecture includes dense layers, also known as fully connected layers, which are responsible for learning complex patterns and making final predictions. These layers possess a large number of parameters because they are fully connected, which greatly enhances the model's ability to capture complex patterns in the data. The model employs concatenation layers ("concatenate") to combine features from different parts of the network, enhancing the model's ability to capture diverse and complementary information. Finally, the output layer, labeled as "dense_8," consists of two neurons, representing the classes that the model aims to classify. In this case, it's likely a binary classification task, as there are two output neurons. Overall, the presented architecture illustrates a deep neural network designed for processing image data, with convolutional layers for feature extraction, dense layers for pattern recognition and concatenation layers for integrating diverse information, culminating in a model capable of making accurate predictions based on the input data.
 
Performance evaluation parameters
 
The confusion matrix, which consists of the four essential components True Positive (TP), True Negative (TN), False Positive (FP) and False Negative (FN), is used to assess the model's performance.
       
Accuracy is a frequently used metric for assessing performance. It is a measure of the correctness of predictions, taking into account both true positives and true negatives, relative to the total number of predictions made. Usually, it provides insight into the accuracy of a trained model and offers an evaluation of its overall performance.

 
Precision is a metric that quantifies the frequency of accurate predictions generated by a model. The process involves dividing the total number of positive labels by the number of accurate positive predictions.

 
Recall measures the classifier's ability to correctly identify all relevant instances. This percentage is calculated by dividing the total number of positive reviews in the dataset by the number of correctly predicted positive observations. The objective of computing these metrics is to identify the most positive labels. The formula for calculating recall is as follows


Achieving an F1-score of 1 signifies complete performance, whereas a score of 0 signifies total failure. The F-measure is calculated using the following formula

Intelligent aquaculture is essential for identifying shrimp diseases. Precise categorization of both healthy and WSSV is crucial for enhancing productivity and profitability. The main objective of this study was to achieve satisfactory outcomes in the area of disease detection. Fig 4 displays the training and validation loss along with the accuracy metrics.

Fig 4: Loss and Accuracy measurements of training and validation datasets.


       
A training loss of 0.0507 signifies an average difference of 0.0507 units between the model's predicted and actual target labels. Comparably, a validation loss of 0.0376 indicates that an average error of 0.0376 units is produced by the model's performance on unseen data (validation data). After 50 epochs, the model correctly classified every instance in the training and validation sets with an accuracy of 1.0000.
       
Fig 5 displays a collection of images that depict both the actual and the predicted labels. This presentation allows for an assessment of the efficacy and level of certainty of the trained model. The exhibited predicted outcomes are accurate and exhibit a high level of certainty.

Fig 5: Actual and prediction of healthy and diseased shrimp.


       
The classification performance of the proposed model is summarized using a confusion matrix, as shown in Fig 6. The classification performance of the model for both classes is displayed in the confusion matrix. Two instances were mistakenly classified as WSSV (false negatives) out of the 17 cases in the Healthy class that were accurately identified as Healthy (true positives). On the other hand, none of the 13 WSSV class instances were incorrectly classified as Healthy (false positives), while all of the cases were accurately identified as WSSV (true positives). This matrix provides a clear visual representation of the model's ability to distinguish between the two classes, highlighting its potential to decrease misclassifications. This is particularly evident in the model's capacity to identify WSSV instances with accuracy. The performance of the classification model is summarized using key evaluation metrics, as presented in Table 2.

Fig 6: Confusion matrix of instances of healthy and WSSV disease.



Table 2: Classification matrix parameters.


       
The precision, recall, F1-score, support and overall performance measures are shown in the table for both classes. The prediction of all Healthy classes was correct, as evidenced by the precision score of 1.0000. Conversely, the recall score of 0.8947 means that 89.47% of actual occurrences of healthy were properly identified by the model. The F1-score of 0.9444, which strikes a balance between precision and recall, indicates an equitable distribution of true positives and false negatives. This demonstrates a slight limitation in recall but a high degree of accuracy in classifying events as Healthy. However, the WSSV class displays a distinct set of performance indicators. The high recall score of 1.0000 signifies that the model accurately detected every instance of WSSV in the dataset, despite the precision of 0.8667 suggesting a comparatively small percentage of misclassifications within this category. An overall high performance for this class is indicated by the F1-score of 0.9286, which is slightly lower in accuracy than that of Healthy but perfect in recall, indicating robustness in capturing all occurrences of WSSV.
       
The provided ROC (Receiver Operating Characteristic) curves show the rate at which the model performed in distinguishing between the Healthy and WSSV classes (Fig 7). For different threshold levels, each curve illustrates the trade-off between the true positive rate (sensitivity) and the false positive rate (1-specificity). The ROC curves exhibiting identical AUC values of 0.9717 demonstrate robust classification performance for the Healthy and WSSV classes, confirming the approach's efficient ability to differentiate between the two groups.

Fig 7: ROC Curve for both classes (healthy and WSSV).


       
The model was trained for 50 epochs based on initial experiments, where we observed satisfactory convergence and performance within this timeframe. This number of epochs allowed the model to learn essential features from the dataset while reducing the risk of overfitting, as longer training durations can sometimes lead to memorization of the training data. Although increasing the number of epochs might improve accuracy, we considered the trade-off between training time and performance. Future experiments will involve training with more epochs to evaluate if further training leads to meaningful improvements or overfitting.
       
The proposed model is compared with the previously reported work based on a similar use of deep learning architecture. Satoto et al., (2023) employed the Inception ResNetV2 deep learning model for coastal shrimp species classification, achieving an average accuracy of 99.4%. Despite promising results, larger datasets are needed for further validation. Hu et al., (2020) introduced ShrimpNet, a CNN-based system with two fully connected layers for shrimp identification. This model achieved an impressive 95.48% accuracy rate in distinguishing shrimp from other marine species. Lai et al., (2022) used the YOLOv4-tiny CNN model to estimate shrimp body length and classify shrimp into measurable and visible categories. The model showed a precision of 93.24%, though its infrastructure requirements and reliance on cloud computing make it less viable for some shrimp farmers.
       
Zhang et al., (2022) developed a lightweight YOLOv4-based model for automatic shrimp counting, achieving a precision of 92% and recall of 94%. While the model performed well in counting shrimp, its application for disease detection remained limited. Prema et al., (2022) proposed a hybrid CNN-SVM model for shrimp freshness detection, focusing on quality control after harvest. However, their work did not address proactive disease prediction in live shrimp, which would be more beneficial for preventing health issues before harvest. Zhou et al., (2023) introduced an unsupervised learning approach for shrimp segmentation, providing a publicly available dataset for further research. This approach addresses the issue of limited resources for computer vision applications in aquaculture. The proposed model achieved a remarkable accuracy (93.75%).
       
These models heighten the need for more accurate models for disease detection to raise the yields in aquaculture farming. The model is an initiative with further upgrades for larger datasets. The overall performance of the proposed model is adequate and lays the foundation for more evolved CNN-based models with more layering.
In the domain of aquaculture, precise identification of shrimp diseases is essential for enhancing productivity and profitability. This study focuses on achieving satisfactory outcomes in disease detection. The provided table displays comprehensive performance measures, indicating the model's effectiveness in accurately classifying both healthy and diseased shrimp instances. The precision, recall and F1-score metrics highlight the model's ability to distinguish between healthy and WSSV instances, with notable robustness in WSSV detection. The provided ROC curves confirm the model's efficient classification performance for both classes, demonstrating its capability to differentiate between healthy and WSSV instances effectively. One limitation of this study is the small size of the dataset, particularly in terms of the number of images with two classifications. Future research should incorporate a larger number of images for better model performance. This study mainly focuses on the severe infection caused by the White Spot Syndrome Virus (WSSV) in shrimp. However, shrimp are also affected by other diseases, which should be considered in future studies. Additionally, this study uses a fully connected layer classifier for classification. Future research could compare machine learning techniques like support vector machines, random forests and decision trees for improved classification accuracy.
Availability of data and materials
 
Not applicable
 
Use of artificial intelligence
 
Not applicable
 
Declarations
 
Author declares that all works are original and this manuscript has not been published in any other journal.
Author declares that they have no conflict of interest.

  1. Abade, A., Ferreira, P.A. and Vidal, F. de B. (2021). Plant Diseases Recognition on images using convolutional neural net works: A Systematic Review. Computers and Electronics in Agriculture. 185: 106125. https://doi.org/10.1016/j. compag.2021.106125.

  2. Aung, T., Razak, R.A. and Nor, A.R.B.M. (2024). Artificial intelli gence methods used in various aquaculture applications: A systematic literature review. Journal of the World Aquaculture Society. https://doi.org/10.1111/jwas.13107.

  3. Bagga, T., Ansari, A.H., Akhter, S., Mittal, A. and Mittal, A. (2024). Under- standing Indianconsumers' propensity to purchase electric vehicles: An analysis of determining factors in environmentally sustainable transportation. International Journal of Environmental Sciences. 10(1): 1-13

  4. Cho, O.H. (2024). An Evaluation of Various Machine Learning Approaches for Detecting Leaf Diseases in Agriculture. Legume Research. 47(4): 619-627. doi: 10.18805/LRF-787.

  5. Depestele, J., Feekings, J., Reid, D.G., Cook, R., Gascuel, D., Girardin, R., Health, M., Hernvann, P-Y., Morato, T., Soszynski, A. and Savina-Rolland, M. (2019). The impact of fisheries discards on scavengers in the Sea.The European Landing Obligation. Springer, Cham. https://doi.org/10.1007/978-3-030-033 08-8_7. 

  6. Duong-Trung, N., Quach, L., Nguyen, C. (2020). Towards Classification of shrimp diseases using transferred convolutional neural networks. Advances in Science, Technology and Engineering Systems. 5(4): 724-732. http://dx.doi.org/10. 25046/aj050486.

  7. Estevez, J.R., Manco, J.A., Garcia-Arboleda, W., Echeverry, S., Pino, I., Acevedo, A. and Rendon, M.A. (2023). Microen capsulated probiotics in feed for beef cattle are a better alternative to monensin sodium. International Journal of Probiotics and Prebiotics. 18(1): 30-37. https://doi.org/10. 37290/ijpp2641-7197.18:30-37. 

  8. FAO. 2022. World Food and Agriculture - Statistical Yearbook 2022. Rome.https://doi.org/10.4060/cc2211en.

  9. Hai, N.T. and Duong, N.T. (2024). An Improved Environmental Man agement Model for Assuring Energy and Economic Pros perity. Acta Innovations. 52: 9-18. https://doi.org/10. 62441/ActaInnovations.52.2. 

  10. Hu, W.C., Wu, H., Zhang, Y., Zhang, S. and Lo, C. (2020). Shrimp recognition using ShrimpNet based on convolutional neural network. Journal of Ambient Intelligence and Humanized Computing. https://doi.org/10.1007/s12652- 020-01727-3.

  11. Islam, S. I., Mou, M. J., Sanjida, S. and Mahfuj, S. (2023). A review on molecular detection techniques of white spot syndrome virus: Perspectives of problems and solutions in shrimp farming. Veterinary Medicine and Science. 9(2): 778- 801. https://doi.org/10.1002/vms3.979.

  12. Lai, P., Lin, H., Lin, J., Hsu, H., Chu, Y., Liou, C. and Kuo, Y. (2022). Automatic measuring of shrimp body length using CNN and an un derwater imaging system. Biosystems Engineering. 221: 224e235.  https://doi.org/10.1016/j.biosystemseng. 2022. 07.006.

  13. Maltare, N.N., Sharma, D. and Patel, S. (2023). An exploration and prediction of rainfall and groundwater level for the District of Banaskantha, Gujrat, India. International Journal of Environmental Sciences. 9(1): 1-17

  14. Mandal, A. and Singh, P. (2025). Global Scenario of Shrimp Industry: Present Status and Future Prospects. In P. Singh, A. Singh, A. Tyagi and S. Benjakul (Eds.), Shrimp Culture Technology (pp. 1-20). Springer. https://doi.org/10.1007/978-981-97- 8549-0_1

  15. Min, P.K., Mito, K. and Kim, T.H. (2024). The evolving landscape of artificial intelligence applications in animal health. Indian Journal of Animal Research. 58(10): 1793-1798. doi: 10.18805/IJAR.BF-1742.

  16. Phong, T.N., Thang, V.T. and Hoai, N.T. (2021). What motivates farmers to accept good aquaculture practices in devel opment policy? Results from a choice experiment. sur veys with small-scale shrimp farmers in Vietnam. Economic Analysis and Policy. 72: 454-469. https://doi.org/ 10.1016/j.eap.2021.09.015.

  17. Prema, K. and Visumathi, J. (2022). Hybrid approach of CNN and SVM for shrimp freshness diagnosis in aquaculture monitoring system using IoT-based Learning Support System. Journal of Internet Technology. 23(4): 801-810.  https://doi.org/ 10.53106/160792642022072304015.

  18. Satoto, B.D., Khotimah, B.K., Yusuf, M., Hapsari, R.K. and Irmawati, B. (2023). Classification of coastal shrimp species using deep learning InceptionResNetV2 with data augmentation. Technium: Romanian Journal of Applied Sciences and Technology. 16: 250-258, ISSN: 2668-778X.

  19. Wang, D., Zhong, J., Gao, Y., He, J., Hou, J. and Du, K. (2024). Green tea extract improves postoperative outcomes after urological surgery. Current Topics in Nutraceutical Research. 22(3): 974-979. https://doi.org/10.37290/ctnr2641-45 2X.22:974-979.  

  20. Yang, J., Zhang, L., Bao, L. and Wang, M. (2024). Comparison of nucleic acid amplification based white spot syndrome virus (WSSV) detection methods in Penaeus vannamei. Aquaculture. 742052. https://doi.org/10.1016/j.aquaculture.2024. 742052.

  21. Zenger, K.R., Khatkar, M.S., Jones, D.B., Khalilisamani, N., Jerry, D.R. and Raadsma, H.W. (2019). Genomic selection in aquaculture: Application, limitations and opportunities with special reference to marine shrimp and pearl oysters. Frontiers in Genetics. 9: 693.  https://doi.org/10.3389/fgene. 2018. 00693.

  22. Zhang, H. and Gui, F. (2023). The application and research of new digital technology in marine aquaculture. Journal of Marine Science and  Engineering. 11: 401.  https://doi.org/10. 3390/jmse11020401.

  23. Zhang, L., Zhou, X., Li, B., Zhang, H. and Duan, Q. (2022).  Automatic shrimp counting method using local images and light weight YOLOv4. Biosystems Engineering. 220: 39e54. https://doi.org/10.1016/j.biosystemseng.2022.05.011.

  24. Zhou, H., Kim, S.H., Kim, S.C., Kim, C.W., Kang, S.W. and Kim, H. (2023). Instance segmentation of shrimp based on contrastive learning. Applied Sciences. 13: 6979. https://doi.org/10. 3390/app13126979.

Background: Shrimp rearing and wild-catch shrimp fisheries provide employment and income for millions worldwide. As the global population rises, shrimp plays a key role in addressing food security challenges. It is a nutritious and accessible food, widely consumed in diverse cuisines. Ecologically, it serves as scavengers and detritivores, maintaining stability in aquatic environments. This ecological role has driven the development of sustainable shrimp farming practices. Disease management, especially prevention and early detection, is a critical concern in shrimp farming. Automated image processing could speed up disease detection, ensuring timely intervention. This study focuses on detecting white spot syndrome virus (WSSV) in shrimp using convolutional neural networks (CNN) with promising accuracy.

Methods: The model uses a sequential convolutional neural network (CNN) to identify WSSV in shrimp. It is trained with images categorized into two groups: Healthy and diseased. Features like lesions and discoloration are used to detect infection. The dataset is preprocessed to reduce resolution variation through bilinear interpolation. The model’s architecture consists of multiple layers that extract features and reduce spatial dimensions. Concatenation layers combine information from different network parts to enhance feature extraction. Performance is evaluated to determine the model’s ability to make reliable predictions.

Result: The CNN model achieved a remarkable accuracy (93.75%) in detecting WSSV. It correctly identified all WSSV instances with minimal misclassification. The overall performance of the model demonstrates its robustness. The ROC curves further confirm its ability to classify both healthy and diseased shrimp effectively, highlighting its potential for early disease detection in shrimp farming systems.

The global shrimp industry has seen significant growth in both quantity and quality over the past 15 years, with farmed species, particularly Pacific white leg shrimp, dominating production. Farmed shrimp accounted for 63% of total production in 2022. The rising demand for shrimp is driven by health benefits and the introduction of shrimp-based products, particularly frozen shrimp, which offers extended shelf life. Despite challenges like oversupply and low prices, the industry has recovered post-COVID-19 and is expected to continue expanding. Future growth will depend on climate- smart aquaculture, sustainability certifications and efficient production practices (Mandal and Singh, 2025; Yang et al., 2024).
       
The global demand for seafood has escalated, prompting countries to adopt better aquaculture practices to improve productivity, efficiency and sustainability. Advances in selective breeding and genetics have led to the development of improved shrimp, fish and other shell breeds with enhanced growth rates and nutritional value. Technologies like marker-assisted selection and genomic selection are accelerating breeding processes in marine species (Zenger et al., 2019). Among various seafood products, shrimp holds a significant position. According to the Food and Agriculture Organization (FAO), shrimp is the most traded seafood commodity globally (FAO, 2022) and its consumption continues to rise, ranking alongside popular seafood items like fish, tuna and mollusks.        

Shrimp farming contributes billions of dollars annually to the global economy (Phong et al., 2021). It is considered a sustainable method for meeting the growing food demand and bolstering social and economic stability worldwide.
       
Shrimp plays a vital role as a high-protein, nutrient-rich food source, contributing to global food security amidst a rapidly growing population. Its versatility in various cuisines has made it a favorite among consumers and chefs. Furthermore, shrimp helps maintain aquatic ecosystems by recycling organic matter and ensuring water quality (Depestele et al., 2019; Hai and Duong, 2024; Maltare et al., 2023; Bagga et al., 2024). As scavengers and detritivores, shrimp populations support balanced food webs in aquatic environments. These roles have led to the innovation of shrimp farming models that prioritize environmental sustainability and commercial success. Additionally, shrimp farming holds cultural significance in coastal communities, where traditional practices are passed down through generations, reinforcing local cultural identity.
       
However, the rapid expansion of the shrimp industry has highlighted environmental challenges such as habitat degradation, water pollution, overfishing and disease outbreaks. These issues have prompted the establishment of eco-certification programs and conservation initiatives aimed at minimizing environmental harm and improving resource management. Diseases such as White Spot Syndrome Virus (WSSV), Taura Syndrome Virus (TSV) and Early Mortality Syndrome (EMS) have significant economic and ecological impacts on shrimp farming. Molecular tools, such as polymerase chain reaction (PCR) assays and loop-mediated isothermal amplification (LAMP), have been developed for rapid and accurate disease detection in shrimp farming. However, these methods face limitations in terms of cost, complexity and time sensitivity (Islam et al., 2023). Moreover, traditional disease detection methods rely heavily on farmer expertise, making them time-consuming, labor-intensive and expensive (Estevez et al., 2023; Wang et al., 2024).
       
To address these challenges, there is a growing need for affordable, user-friendly and field-deployable disease surveillance tools. Machine learning (ML) algorithms and artificial intelligence (AI) techniques offer promising solutions by enabling large datasets to be analyzed for early disease detection, health monitoring and environmental monitoring. Non-invasive, low-cost and user-friendly technologies, such as computer vision, are particularly promising for shrimp disease detection (Aung et al., 2024). Using image acquisition equipment, data can be collected and used to train deep learning models that offer high accuracy and quick intervention opportunities (Abade et al., 2021; Duong-Trung et al., 2020; Min et al., 2024; Zhang and Gui, 2023).

Convolutional Neural Networks (CNNs) are emerging as a key technology for analyzing shrimp health. Deep CNN models can classify shrimp images based on disease symptoms, such as lesions, abnormalities and other visual signs. Object detection algorithms, including Faster R-CNN and YOLO (You Only Look Once), are designed to identify regions of interest in shrimp images, providing valuable inputs for disease diagnosis and monitoring. These CNN-based systems offer prompt disease detection with minimal human involvement, making them well-suited for large-scale aquaculture settings. An automated image-capture system, combined with CNN-powered disease recognition models, can significantly enhance disease surveillance, improving shrimp farm productivity and sustainability.
       
Several studies have addressed the need for efficient disease monitoring and classification in shrimp farming. Phong et al., (2021) discuss the market dynamics that drive farmers to optimize shrimp yields and productivity while maintaining environmental sustainability. Close supervision of shrimp farms is essential for adopting pro-environmental practices, such as avoiding excessive antibiotic use to prevent diseases. The shift from traditional manual disease detection to machine learning algorithms is gaining momentum in aquaculture, with various ML models being applied for classification tasks.
       
The detection of shrimp diseases using CNN-based models is an evolving field with significant potential for improving aquaculture practices. The integration of machine learning techniques, such as CNNs, into disease monitoring systems enables more timely, accurate and scalable interventions. While many existing models have shown promising results, further research is needed to optimize these systems for large-scale deployment.
       
The current work proposes a CNN-based approach for classifying shrimp health states and emphasizes the importance of accuracy, efficiency and low-cost technologies in supporting sustainable shrimp farming operations. By enhancing disease detection and early intervention capabilities, this model contributes to the overall productivity and sustainability of shrimp aquaculture.
The following parts outline the procedure employed to construct a sequential Convolutional Neural Network (CNN) model for the identification of shrimp infected with WSSV disease. The task is separated into multiple distinct parts, with the initial one including the collection of images for categorization using deep neural networks.
 
Dataset
 
The dataset is composed of images from two different groups: Healthy and WSSV (White Spot Syndrome Virus). WSSV is a serious viral infection affecting global shrimp aquaculture. Images of shrimp with normal physiological characteristics that is, without any obvious signs of disease or infection are included in the healthy class. On the other hand, images of shrimp in the WSSV class show signs of a WSSV infection, like discoloration, abnormal behavior, or visible lesions on the body surface. The dataset consists of 298 images, divided into three subsets. The training set has 238 images (80%). The validation and testing datasets each contain 30 images (10%). This 80-10-10 split helps ensure a balanced dataset for effective model training and evaluation. Fig 1 illustrates representative images of healthy shrimp and shrimp infected with White spot syndrome virus (WSSV).

Fig 1: Healthy and WSSV-infected shrimp.


 
Image preprocessing
 
Before training, preprocessing techniques are applied to every image to ensure uniformity and improve the quality of the dataset. Resizing the images was done to ensure that the dimensions were consistent across the collection. Bilinear interpolation was used to reduce images of varying resolutions to 128×128 pixels. The images were then transformed into grayscale. A significant number of training data is required at this pre-processing stage to explicitly learn the features of the training data. In the next step, the images of shrimp were categorized based on their type and then annotated with the corresponding acronym representing the specific disease. In this case, the test collection and training dataset exhibited two distinct classes.
 
Data augmentation
 
An array of augmentation techniques is used in this study to add diversity to the original dataset. These methods were rotation, flipping, shifting, zooming and shearing. Rotation added random rotations to the images to simulate different views and orientations, which helped the model identify objects from a range of perspectives. Flipping created mirror images in the dataset by flipping images horizontally, which improved the model's ability to recognize objects in any orientation.
 
Training dataset
 
The technique includes the utilization of a Convolutional Neural Network (CNN) to create a model for assessing performance. The model was trained to utilize image data as input. The flow chart illustrating the classification process of healthy and diseased shrimp using Convolutional Neural Network (CNN) is shown in Fig 2.

Fig 2: Flow Chart of the classification using CNN of healthy and diseased shrimp.


 
Architecture of convolutional neural network (CNN) model
 
The architecture of a neural network model includes various types of layers such as convolutional layers, max-pooling layers, flatten layers and dense layers (Fig 3). Table 1 represents the parameters used in the CNN algorithm. Starting with the input layer, labeled as "input_1," it takes input data with dimensions of 128x128x3, where 3 represents the number of color channels (typically red, green and blue in an image).

Fig 3: Architecture of Sequential CNN model.



Table 1: Parameters of CNN model.


       
The subsequent layers include convolutional layers ("conv2d"), followed by max-pooling layers ("max_pooling2d"). These layers are responsible for extracting features from the input data while reducing spatial dimensions, thereby aiding in capturing important patterns efficiently. As the data progresses through the network, the convolutional and max-pooling layers are stacked to deepen the network's configuration, gradually reducing the spatial dimensions of the data and increasing the number of channels. The architecture includes dense layers, also known as fully connected layers, which are responsible for learning complex patterns and making final predictions. These layers possess a large number of parameters because they are fully connected, which greatly enhances the model's ability to capture complex patterns in the data. The model employs concatenation layers ("concatenate") to combine features from different parts of the network, enhancing the model's ability to capture diverse and complementary information. Finally, the output layer, labeled as "dense_8," consists of two neurons, representing the classes that the model aims to classify. In this case, it's likely a binary classification task, as there are two output neurons. Overall, the presented architecture illustrates a deep neural network designed for processing image data, with convolutional layers for feature extraction, dense layers for pattern recognition and concatenation layers for integrating diverse information, culminating in a model capable of making accurate predictions based on the input data.
 
Performance evaluation parameters
 
The confusion matrix, which consists of the four essential components True Positive (TP), True Negative (TN), False Positive (FP) and False Negative (FN), is used to assess the model's performance.
       
Accuracy is a frequently used metric for assessing performance. It is a measure of the correctness of predictions, taking into account both true positives and true negatives, relative to the total number of predictions made. Usually, it provides insight into the accuracy of a trained model and offers an evaluation of its overall performance.

 
Precision is a metric that quantifies the frequency of accurate predictions generated by a model. The process involves dividing the total number of positive labels by the number of accurate positive predictions.

 
Recall measures the classifier's ability to correctly identify all relevant instances. This percentage is calculated by dividing the total number of positive reviews in the dataset by the number of correctly predicted positive observations. The objective of computing these metrics is to identify the most positive labels. The formula for calculating recall is as follows


Achieving an F1-score of 1 signifies complete performance, whereas a score of 0 signifies total failure. The F-measure is calculated using the following formula

Intelligent aquaculture is essential for identifying shrimp diseases. Precise categorization of both healthy and WSSV is crucial for enhancing productivity and profitability. The main objective of this study was to achieve satisfactory outcomes in the area of disease detection. Fig 4 displays the training and validation loss along with the accuracy metrics.

Fig 4: Loss and Accuracy measurements of training and validation datasets.


       
A training loss of 0.0507 signifies an average difference of 0.0507 units between the model's predicted and actual target labels. Comparably, a validation loss of 0.0376 indicates that an average error of 0.0376 units is produced by the model's performance on unseen data (validation data). After 50 epochs, the model correctly classified every instance in the training and validation sets with an accuracy of 1.0000.
       
Fig 5 displays a collection of images that depict both the actual and the predicted labels. This presentation allows for an assessment of the efficacy and level of certainty of the trained model. The exhibited predicted outcomes are accurate and exhibit a high level of certainty.

Fig 5: Actual and prediction of healthy and diseased shrimp.


       
The classification performance of the proposed model is summarized using a confusion matrix, as shown in Fig 6. The classification performance of the model for both classes is displayed in the confusion matrix. Two instances were mistakenly classified as WSSV (false negatives) out of the 17 cases in the Healthy class that were accurately identified as Healthy (true positives). On the other hand, none of the 13 WSSV class instances were incorrectly classified as Healthy (false positives), while all of the cases were accurately identified as WSSV (true positives). This matrix provides a clear visual representation of the model's ability to distinguish between the two classes, highlighting its potential to decrease misclassifications. This is particularly evident in the model's capacity to identify WSSV instances with accuracy. The performance of the classification model is summarized using key evaluation metrics, as presented in Table 2.

Fig 6: Confusion matrix of instances of healthy and WSSV disease.



Table 2: Classification matrix parameters.


       
The precision, recall, F1-score, support and overall performance measures are shown in the table for both classes. The prediction of all Healthy classes was correct, as evidenced by the precision score of 1.0000. Conversely, the recall score of 0.8947 means that 89.47% of actual occurrences of healthy were properly identified by the model. The F1-score of 0.9444, which strikes a balance between precision and recall, indicates an equitable distribution of true positives and false negatives. This demonstrates a slight limitation in recall but a high degree of accuracy in classifying events as Healthy. However, the WSSV class displays a distinct set of performance indicators. The high recall score of 1.0000 signifies that the model accurately detected every instance of WSSV in the dataset, despite the precision of 0.8667 suggesting a comparatively small percentage of misclassifications within this category. An overall high performance for this class is indicated by the F1-score of 0.9286, which is slightly lower in accuracy than that of Healthy but perfect in recall, indicating robustness in capturing all occurrences of WSSV.
       
The provided ROC (Receiver Operating Characteristic) curves show the rate at which the model performed in distinguishing between the Healthy and WSSV classes (Fig 7). For different threshold levels, each curve illustrates the trade-off between the true positive rate (sensitivity) and the false positive rate (1-specificity). The ROC curves exhibiting identical AUC values of 0.9717 demonstrate robust classification performance for the Healthy and WSSV classes, confirming the approach's efficient ability to differentiate between the two groups.

Fig 7: ROC Curve for both classes (healthy and WSSV).


       
The model was trained for 50 epochs based on initial experiments, where we observed satisfactory convergence and performance within this timeframe. This number of epochs allowed the model to learn essential features from the dataset while reducing the risk of overfitting, as longer training durations can sometimes lead to memorization of the training data. Although increasing the number of epochs might improve accuracy, we considered the trade-off between training time and performance. Future experiments will involve training with more epochs to evaluate if further training leads to meaningful improvements or overfitting.
       
The proposed model is compared with the previously reported work based on a similar use of deep learning architecture. Satoto et al., (2023) employed the Inception ResNetV2 deep learning model for coastal shrimp species classification, achieving an average accuracy of 99.4%. Despite promising results, larger datasets are needed for further validation. Hu et al., (2020) introduced ShrimpNet, a CNN-based system with two fully connected layers for shrimp identification. This model achieved an impressive 95.48% accuracy rate in distinguishing shrimp from other marine species. Lai et al., (2022) used the YOLOv4-tiny CNN model to estimate shrimp body length and classify shrimp into measurable and visible categories. The model showed a precision of 93.24%, though its infrastructure requirements and reliance on cloud computing make it less viable for some shrimp farmers.
       
Zhang et al., (2022) developed a lightweight YOLOv4-based model for automatic shrimp counting, achieving a precision of 92% and recall of 94%. While the model performed well in counting shrimp, its application for disease detection remained limited. Prema et al., (2022) proposed a hybrid CNN-SVM model for shrimp freshness detection, focusing on quality control after harvest. However, their work did not address proactive disease prediction in live shrimp, which would be more beneficial for preventing health issues before harvest. Zhou et al., (2023) introduced an unsupervised learning approach for shrimp segmentation, providing a publicly available dataset for further research. This approach addresses the issue of limited resources for computer vision applications in aquaculture. The proposed model achieved a remarkable accuracy (93.75%).
       
These models heighten the need for more accurate models for disease detection to raise the yields in aquaculture farming. The model is an initiative with further upgrades for larger datasets. The overall performance of the proposed model is adequate and lays the foundation for more evolved CNN-based models with more layering.
In the domain of aquaculture, precise identification of shrimp diseases is essential for enhancing productivity and profitability. This study focuses on achieving satisfactory outcomes in disease detection. The provided table displays comprehensive performance measures, indicating the model's effectiveness in accurately classifying both healthy and diseased shrimp instances. The precision, recall and F1-score metrics highlight the model's ability to distinguish between healthy and WSSV instances, with notable robustness in WSSV detection. The provided ROC curves confirm the model's efficient classification performance for both classes, demonstrating its capability to differentiate between healthy and WSSV instances effectively. One limitation of this study is the small size of the dataset, particularly in terms of the number of images with two classifications. Future research should incorporate a larger number of images for better model performance. This study mainly focuses on the severe infection caused by the White Spot Syndrome Virus (WSSV) in shrimp. However, shrimp are also affected by other diseases, which should be considered in future studies. Additionally, this study uses a fully connected layer classifier for classification. Future research could compare machine learning techniques like support vector machines, random forests and decision trees for improved classification accuracy.
Availability of data and materials
 
Not applicable
 
Use of artificial intelligence
 
Not applicable
 
Declarations
 
Author declares that all works are original and this manuscript has not been published in any other journal.
Author declares that they have no conflict of interest.

  1. Abade, A., Ferreira, P.A. and Vidal, F. de B. (2021). Plant Diseases Recognition on images using convolutional neural net works: A Systematic Review. Computers and Electronics in Agriculture. 185: 106125. https://doi.org/10.1016/j. compag.2021.106125.

  2. Aung, T., Razak, R.A. and Nor, A.R.B.M. (2024). Artificial intelli gence methods used in various aquaculture applications: A systematic literature review. Journal of the World Aquaculture Society. https://doi.org/10.1111/jwas.13107.

  3. Bagga, T., Ansari, A.H., Akhter, S., Mittal, A. and Mittal, A. (2024). Under- standing Indianconsumers' propensity to purchase electric vehicles: An analysis of determining factors in environmentally sustainable transportation. International Journal of Environmental Sciences. 10(1): 1-13

  4. Cho, O.H. (2024). An Evaluation of Various Machine Learning Approaches for Detecting Leaf Diseases in Agriculture. Legume Research. 47(4): 619-627. doi: 10.18805/LRF-787.

  5. Depestele, J., Feekings, J., Reid, D.G., Cook, R., Gascuel, D., Girardin, R., Health, M., Hernvann, P-Y., Morato, T., Soszynski, A. and Savina-Rolland, M. (2019). The impact of fisheries discards on scavengers in the Sea.The European Landing Obligation. Springer, Cham. https://doi.org/10.1007/978-3-030-033 08-8_7. 

  6. Duong-Trung, N., Quach, L., Nguyen, C. (2020). Towards Classification of shrimp diseases using transferred convolutional neural networks. Advances in Science, Technology and Engineering Systems. 5(4): 724-732. http://dx.doi.org/10. 25046/aj050486.

  7. Estevez, J.R., Manco, J.A., Garcia-Arboleda, W., Echeverry, S., Pino, I., Acevedo, A. and Rendon, M.A. (2023). Microen capsulated probiotics in feed for beef cattle are a better alternative to monensin sodium. International Journal of Probiotics and Prebiotics. 18(1): 30-37. https://doi.org/10. 37290/ijpp2641-7197.18:30-37. 

  8. FAO. 2022. World Food and Agriculture - Statistical Yearbook 2022. Rome.https://doi.org/10.4060/cc2211en.

  9. Hai, N.T. and Duong, N.T. (2024). An Improved Environmental Man agement Model for Assuring Energy and Economic Pros perity. Acta Innovations. 52: 9-18. https://doi.org/10. 62441/ActaInnovations.52.2. 

  10. Hu, W.C., Wu, H., Zhang, Y., Zhang, S. and Lo, C. (2020). Shrimp recognition using ShrimpNet based on convolutional neural network. Journal of Ambient Intelligence and Humanized Computing. https://doi.org/10.1007/s12652- 020-01727-3.

  11. Islam, S. I., Mou, M. J., Sanjida, S. and Mahfuj, S. (2023). A review on molecular detection techniques of white spot syndrome virus: Perspectives of problems and solutions in shrimp farming. Veterinary Medicine and Science. 9(2): 778- 801. https://doi.org/10.1002/vms3.979.

  12. Lai, P., Lin, H., Lin, J., Hsu, H., Chu, Y., Liou, C. and Kuo, Y. (2022). Automatic measuring of shrimp body length using CNN and an un derwater imaging system. Biosystems Engineering. 221: 224e235.  https://doi.org/10.1016/j.biosystemseng. 2022. 07.006.

  13. Maltare, N.N., Sharma, D. and Patel, S. (2023). An exploration and prediction of rainfall and groundwater level for the District of Banaskantha, Gujrat, India. International Journal of Environmental Sciences. 9(1): 1-17

  14. Mandal, A. and Singh, P. (2025). Global Scenario of Shrimp Industry: Present Status and Future Prospects. In P. Singh, A. Singh, A. Tyagi and S. Benjakul (Eds.), Shrimp Culture Technology (pp. 1-20). Springer. https://doi.org/10.1007/978-981-97- 8549-0_1

  15. Min, P.K., Mito, K. and Kim, T.H. (2024). The evolving landscape of artificial intelligence applications in animal health. Indian Journal of Animal Research. 58(10): 1793-1798. doi: 10.18805/IJAR.BF-1742.

  16. Phong, T.N., Thang, V.T. and Hoai, N.T. (2021). What motivates farmers to accept good aquaculture practices in devel opment policy? Results from a choice experiment. sur veys with small-scale shrimp farmers in Vietnam. Economic Analysis and Policy. 72: 454-469. https://doi.org/ 10.1016/j.eap.2021.09.015.

  17. Prema, K. and Visumathi, J. (2022). Hybrid approach of CNN and SVM for shrimp freshness diagnosis in aquaculture monitoring system using IoT-based Learning Support System. Journal of Internet Technology. 23(4): 801-810.  https://doi.org/ 10.53106/160792642022072304015.

  18. Satoto, B.D., Khotimah, B.K., Yusuf, M., Hapsari, R.K. and Irmawati, B. (2023). Classification of coastal shrimp species using deep learning InceptionResNetV2 with data augmentation. Technium: Romanian Journal of Applied Sciences and Technology. 16: 250-258, ISSN: 2668-778X.

  19. Wang, D., Zhong, J., Gao, Y., He, J., Hou, J. and Du, K. (2024). Green tea extract improves postoperative outcomes after urological surgery. Current Topics in Nutraceutical Research. 22(3): 974-979. https://doi.org/10.37290/ctnr2641-45 2X.22:974-979.  

  20. Yang, J., Zhang, L., Bao, L. and Wang, M. (2024). Comparison of nucleic acid amplification based white spot syndrome virus (WSSV) detection methods in Penaeus vannamei. Aquaculture. 742052. https://doi.org/10.1016/j.aquaculture.2024. 742052.

  21. Zenger, K.R., Khatkar, M.S., Jones, D.B., Khalilisamani, N., Jerry, D.R. and Raadsma, H.W. (2019). Genomic selection in aquaculture: Application, limitations and opportunities with special reference to marine shrimp and pearl oysters. Frontiers in Genetics. 9: 693.  https://doi.org/10.3389/fgene. 2018. 00693.

  22. Zhang, H. and Gui, F. (2023). The application and research of new digital technology in marine aquaculture. Journal of Marine Science and  Engineering. 11: 401.  https://doi.org/10. 3390/jmse11020401.

  23. Zhang, L., Zhou, X., Li, B., Zhang, H. and Duan, Q. (2022).  Automatic shrimp counting method using local images and light weight YOLOv4. Biosystems Engineering. 220: 39e54. https://doi.org/10.1016/j.biosystemseng.2022.05.011.

  24. Zhou, H., Kim, S.H., Kim, S.C., Kim, C.W., Kang, S.W. and Kim, H. (2023). Instance segmentation of shrimp based on contrastive learning. Applied Sciences. 13: 6979. https://doi.org/10. 3390/app13126979.
In this Article
Published In
Indian Journal of Animal Research

Editorial Board

View all (0)