Next generation sequencing technologies enabled high-density genotyping for large numbers of samples. Nowadays SNP calling pipelines produce up to millions of such markers, but which need to be filtered in various ways according to the type of analyses. One of the main challenges still lies in the management of an increasing volume of genotyping files that are difficult to handle for many applications. Here, we provide a practical guide for efficiently managing large genomic variation data using Gigwa, a user-friendly, scalable and versatile application that may be deployed either remotely on web servers or on a local machine.
Keywords: INDELs; Interoperability; NoSQL database; SNP markers; VCF; Web tool.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.