Estimating enrichment of repetitive elements from high-throughput sequence data

Genome Biol. 2010;11(6):R69. doi: 10.1186/gb-2010-11-6-r69. Epub 2010 Jun 28.

Abstract

We describe computational methods for analysis of repetitive elements from short-read sequencing data, and apply them to study histone modifications associated with the repetitive elements in human and mouse cells. Our results demonstrate that while accurate enrichment estimates can be obtained for individual repeat types and small sets of repeat instances, there are distinct combinatorial patterns of chromatin marks associated with major annotated repeat families, including H3K27me3/H3K9me3 differences among the endogenous retroviral element classes.

MeSH terms

  • Animals
  • Base Sequence
  • CD4-Positive T-Lymphocytes / metabolism
  • Cell Line
  • Databases, Nucleic Acid*
  • Embryo, Mammalian / cytology
  • Fibroblasts / metabolism
  • High-Throughput Screening Assays / methods*
  • Histones / metabolism
  • Humans
  • Mice
  • Phylogeny
  • Protein Processing, Post-Translational
  • Repetitive Sequences, Nucleic Acid / genetics*
  • Sequence Analysis, DNA / methods*

Substances

  • Histones