Gradient-based optimization of kernel-target alignment for sequence kernels applied to bacterial gene start detection

IEEE/ACM Trans Comput Biol Bioinform. 2007 Apr-Jun;4(2):216-26. doi: 10.1109/TCBB.2007.070208.

Abstract

Biological data mining using kernel methods can be improved by a task-specific choice of the kernel function. Oligo kernels for genomic sequence analysis have proven to have a high discriminative power and to provide interpretable results. Oligo kernels that consider subsequences of different lengths can be combined and parameterized to increase their flexibility. For adapting these parameters efficiently, gradient-based optimization of the kernel-target alignment is proposed. The power of this new, general model selection procedure and the benefits of fitting kernels to problem classes are demonstrated by adapting oligo kernels for bacterial gene start detection.

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Base Sequence
  • Codon, Initiator / genetics*
  • DNA, Bacterial / genetics*
  • Molecular Sequence Data
  • Pattern Recognition, Automated / methods*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Transcription Initiation Site

Substances

  • Codon, Initiator
  • DNA, Bacterial