Transposable elements (TEs) have recently been shown to play many regulatory roles within the genome. In this chapter, we examine two in silico methods for analyzing TEs and identifying families that may have acquired such functions. The first method tests whether a repeat family is overrepresented in a set of genomic features. We illustrate it with OCT4 binding sites that originate from LTR7 TE sequences. The second method determines whether a TE family exhibits a cell type-specific expression pattern. As an example, we look at the expression of HERV-H, an endogenous retrovirus known to act as an lncRNA in embryonic stem cells, and use it to demonstrate how RNA-seq data can be used to compare the expression of repeats across cell types.
Keywords: Bioinformatics; Endogenous retrovirus; RNA-seq; Repeats; Transcriptional regulation; Transposable elements; lncRNAs.