The complexity of genome rearrangement combinatorics under the infinite sites model

J Theor Biol. 2020 Sep 21:501:110335. doi: 10.1016/j.jtbi.2020.110335. Epub 2020 May 22.

Abstract

Rearrangements are discrete processes whereby discrete segments of DNA are deleted, replicated and inserted into novel positions. A sequence of such configurations, termed a rearrangement evolution, results in jumbled DNA arrangements, frequently observed in cancer genomes. We introduce a method that allows us to precisely count these different evolutions for a range of processes including breakage-fusion-bridge-cycles, tandem-duplications, inverted-duplications, reversals, transpositions and deletions, showing that the space of rearrangement evolution is super-exponential in size. These counts assume the infinite sites model of unique breakpoint usage.

Keywords: Cancer; Combinatorics; Complexity; Rearrangements.

MeSH terms

  • DNA*
  • Gene Rearrangement / genetics
  • Genome* / genetics

Substances

  • DNA