DNA recombinant processes can involve gene segments that overlap or interleave with gene segments of another gene. Such gene segment appearances relative to each other are called here gene segment organization. We use graphs to represent the gene segment organization in a chromosome locus. Vertices of the graph represent contigs resulting after the recombination and the edges represent the gene segment organization prior to rearrangement. To each graph we associate a vector whose entries correspond to graph properties, and consider this vector as a point in a higher dimensional Euclidean space such that cluster formations and analysis can be performed with a hierarchical clustering method. The analysis is applied to a recently sequenced model organism Oxytricha trifallax, a species of ciliate with highly scrambled genome that undergoes massive rearrangement process after conjugation. The analysis shows some emerging star-like graph structures indicating that segments of a single gene can interleave, or even contain all of the segments from fifteen or more other genes in between its segments. We also observe that as many as six genes can have their segments mutually interleaving or overlapping.
Keywords: Gene segment organizations; Hierarchical cluster analysis (HCA); Oxytricha trifallax; Point cloud from graph properties; Scrambled genome.
Copyright © 2020. Published by Elsevier Ltd.