The basis of all improvement in (re)production performance of animals and plants lies in the genetic variation. The underlying genetic variation can be further explored through investigations using molecular markers including single nucleotide polymorphism (SNP) and microsatellite, and more recently structural variants like copy number variations (CNVs). Unlike SNPs, CNVs affect a larger proportion of the genome, making them more impactful vis-à-vis variation at the phenotype level. They significantly contribute to genetic variation and provide raw material for natural and artificial selection for improved performance. CNVs are characterized as unbalanced structural variations that arise from four major mechanisms viz., non-homologous end joining (NHEJ), non-allelic homologous recombination (NAHR), fork stalling and template switching (FoSTeS), and retrotransposition. Various detection methods have been developed to identify CNVs, including molecular techniques and massively parallel sequencing. Next-generation sequencing (NGS)/high-throughput sequencing offers higher resolution and sensitivity, but challenges remain in delineating CNVs in regions with repetitive sequences or high GC content. High-throughput sequencing technologies utilize different methods based on read-pair, split-read, read depth, and assembly approaches (or their combination) to detect CNVs. Read-pair based methods work by mapping discordant reads, while the read-depth approach works on detecting the correlation between read depth and copy number of genetic segments or a gene. Split-read methods involve mapping segments of reads to different locations on the genome, while assembly methods involve comparing contigs to a reference or de novo sequencing. Similar to other marker-trait association studies, CNV-association studies are not uncommon in humans and farm animals. Soon, extensive studies will be needed to deduce the unique evolutionary trajectories and underlying molecular mechanisms for targeted genetic improvements in different farm animal species. The present review delineates the importance of CNVs in genetic studies, their generation along with programs and principles to efficiently identify them, and finally throw light on the existing literature on studies in farm animal species vis-à-vis CNVs.
Keywords: CNVR; GC content; High-throughput sequencing; InDel; Read; SNP; Variation.
Copyright © 2024 Elsevier B.V. All rights reserved.