A method for identifying splice sites and translational start sites in eukaryotic mRNA

Comput Appl Biosci. 1997 Aug;13(4):365-76. doi: 10.1093/bioinformatics/13.4.365.

Abstract

This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Binding Sites
  • Biometry
  • Consensus Sequence
  • Eukaryotic Cells
  • Exons
  • Humans
  • Introns
  • Probability
  • Protein Biosynthesis*
  • RNA Splicing*
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism*
  • Software

Substances

  • RNA, Messenger