A Bayesian approach to discriminate between alternative DNA sequence segmentations

Bioinformatics. 2002 Feb;18(2):226-34. doi: 10.1093/bioinformatics/18.2.226.

Abstract

Motivation: As a result of recombination or rate variation, a DNA sequence alignment may have a mosaic structure, where different segments correspond to different evolutionary histories. While several methods have been developed to predict DNA mosaic structures, they do not properly address the question of whether the predicted segmentation itself is statistically significant, or whether it is significantly better than an alternative mosaic structure predicted with another method. The objective of the present article is to devise an approximate Bayesian hypothesis test to discriminate between alternative candidate mosaic structures.

Results: We have applied the proposed discrimination scheme to various synthetic and real-world DNA sequence alignments. On the synthetic data, the algorithm identified the true mosaic structure in nine out of ten cases. On the real-world sequence alignments, it selected the same mosaic structures as predicted in the literature.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem*
  • Computational Biology
  • DNA / genetics*
  • Databases, Nucleic Acid
  • Dengue Virus / genetics
  • Hepatitis B virus / genetics
  • Models, Genetic
  • Mosaicism
  • Neisseria / genetics
  • Sequence Alignment / statistics & numerical data*
  • Software

Substances

  • DNA