A unified framework to analyze transposable element insertion polymorphisms using graph genomes

Nat Commun. 2024 Oct 16;15(1):8915. doi: 10.1038/s41467-024-53294-2.

Abstract

Transposable elements are ubiquitous mobile DNA sequences generating insertion polymorphisms, contributing to genomic diversity. We present GraffiTE, a flexible pipeline to analyze polymorphic mobile elements insertions. By integrating state-of-the-art structural variant detection algorithms and graph genomes, GraffiTE identifies polymorphic mobile elements from genomic assemblies or long-read sequencing data, and genotypes these variants using short or long read sets. Benchmarking on simulated and real datasets reports high precision and recall rates. GraffiTE is designed to allow non-expert users to perform comprehensive analyses, including in models with limited transposable element knowledge and is compatible with various sequencing technologies. Here, we demonstrate the versatility of GraffiTE by analyzing human, Drosophila melanogaster, maize, and Cannabis sativa pangenome data. These analyses reveal the landscapes of polymorphic mobile elements and their frequency variations across individuals, strains, and cultivars.

MeSH terms

  • Algorithms
  • Animals
  • DNA Transposable Elements* / genetics
  • Drosophila melanogaster* / genetics
  • Genome, Plant / genetics
  • Genomics / methods
  • Humans
  • Polymorphism, Genetic*
  • Sequence Analysis, DNA / methods
  • Software
  • Zea mays* / genetics

Substances

  • DNA Transposable Elements