Bhartiya Krishi Anusandhan Patrika, volume 38 issue 1 (march 2023) : 06-08

CNV Deep Learning based Methodology for Recognition

Sarika Jaiswal1,*, Nitesh Kumar Sharma1, Uma1, Mir Asif Iquebal1, Anil Rai1, Dinesh Kumar1
1Agricultural Bioinformatics Centre, ICAR-Indian Agricultural Statistics Research Institute, New Delhi-110 012, India.
  • Submitted25-08-2022|

  • Accepted05-01-2023|

  • First Online 19-04-2023|

  • doi 10.18805/BKAP582

Cite article:- Jaiswal Sarika, Sharma Kumar Nitesh, Uma, Iquebal Asif Mir, Rai Anil, Kumar Dinesh (2023). CNV Deep Learning based Methodology for Recognition . Bhartiya Krishi Anusandhan Patrika. 38(1): 06-08. doi: 10.18805/BKAP582.
Background: Copy number variants (CNVs) account for a significant amount of genetic variation. Since many CNVs include genes that result in differential levels of gene expression, substantial normal phenotypic variation can be explained. Current efforts are directed toward a more comprehensive characterization of CNVs that will provide the basis for determining how genomic diversity impacts biological function, evolution and common diseases in human as well as plants. 

Methods: The analytical variability in next generation sequencing (NGS) and artifacts in coverage data along with lack of robust bioinformatics tools for CNV detection have limited the utility of targeted NGS data to identify CNVs. Literature has the evidence of development of deep learning-based pipeline that incorporates a machine learning component to identify CNVs from targeted NGS data. 

Result: It is believed that combining this with clinical “gold standard” (e.g. FISH) information, the CNV detection could be more accurate. This would lead to a new research direction, supplementing the existing NGS methods.

  1. Coe, B.P., Girirajan, S. and Eichler, E.E. (2012). The genetic variability and commonality of neurodevelopmental disease. In American Journal of Medical Genetics Part C: Seminars in Medical Genetics. 160(2): 118-129.

  2. Freeman, J.L., Perry, G.H., Feuk, L., Redon, R., McCarroll, S.A., Altshuler, D.M. and Carter, N.P. (2006). Copy number variation: New insights in genome diversity. Genome Research. 16(8): 949-961.

  3. LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998). Gradient- based learning applied to document recognition. Proceedings of the IEEE. 86(11): 2278-2324.

  4. Liu, P., Carvalho, C.M., Hastings, P.J. and Lupski, J.R. (2012). Mechanisms for recurrent and complex human genomic rearrangements. Current Opinion in Genetics and Development. 22(3): 211-220.

  5. Poplin, R., Newburger, D., Dijamco, J., Nguyen, N., Loy, D., Gross, S.S. and DePristo, M.A. (2016). Creating a universal SNP and small indel variant caller with deep neural networks. BioRxiv. 092890.

  6. Teo, S.M., Pawitan, Y., Ku, C.S., Chia, K.S. and Salim, A. (2012). Statistical challenges associated with detecting copy number variations with next-generation sequencing. Bioinformatics. 28(21): 2711-2718.

  7. Zhang, Y.X., Jin, L.C., Wang, B., Hu, D., Wang, L.Q., Li, P. and Yang, J. (2019). DL-CNV: A deep learning method for identifying copy number variations based on next generation target sequencing. Mathematical Biosciences and Engineering. 17(1): 202-215.

Editorial Board

View all (0)