Unrecognizable genes are an unsettling problem in genomics. Here, we survey the various types of cryptic genes and the corresponding deciphering strategies employed by cells. Encryption that renders genes substantially different from homologs in other species includes sequence substitution, insertion, deletion, fragmentation plus scrambling, and invasion by mobile genetic elements. Cells decode cryptic genes at the DNA, RNA or protein level. We will focus on a recently discovered case of unparalleled encryption involving massive gene fragmentation and nucleotide deletions and substitutions, occurring in the mitochondrial genome of a poorly understood protist group, the diplonemids. This example illustrates that comprehensive gene detection requires not only auxiliary sequence information - transcriptome and proteome data - but also knowledge about a cell's deciphering arsenal.
Keywords: RNA editing; cryptic genes; diplonemids; gene fragmentation; multipartite mitochondrial DNA (mtDNA); trans-splicing.
Copyright © 2016 Elsevier Ltd. All rights reserved.