Comparing multiple ChIP-sequencing experiments

Hatice Gulcin Ozer; Yi-Wen Huang; Jiejun Wu; Jeffrey D Parvin; Tim Hui-Ming Huang; Kun Huang

doi:10.1142/s0219720011005483

Comparing multiple ChIP-sequencing experiments

J Bioinform Comput Biol. 2011 Apr;9(2):269-82. doi: 10.1142/s0219720011005483.

Authors

Hatice Gulcin Ozer¹, Yi-Wen Huang, Jiejun Wu, Jeffrey D Parvin, Tim Hui-Ming Huang, Kun Huang

Affiliation

¹ Department of Biomedical Informatics, The Ohio State University , Columbus, OH, 43210, USA. gulcin.ozer@osumc.edu

Abstract

New high-throughput sequencing technologies can generate millions of short sequences in a single experiment. As the size of the data increases, comparison of multiple experiments on different cell lines under different experimental conditions becomes a big challenge. In this paper, we investigate ways to compare multiple ChIP-sequencing experiments. We specifically studied epigenetic regulation of breast cancer and the effect of estrogen using 50 ChIP-sequencing data from Illumina Genome Analyzer II. First, we evaluate the correlation among different experiments focusing on the total number of reads in transcribed and promoter regions of the genome. Then, we adopt the method that is used to identify the most stable genes in RT-PCR experiments to understand background signal across all of the experiments and to identify the most variable transcribed and promoter regions of the genome. We observed that the most variable genes for transcribed regions and promoter regions are very distinct. Gene ontology and function enrichment analysis on these most variable genes demonstrate the biological relevance of the results. In this study, we present a method that can effectively select differential regions of the genome based on protein-binding profiles over multiple experiments using real data points without any normalization among the samples.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Breast Neoplasms / genetics
Breast Neoplasms / metabolism
Cell Line
Cell Line, Tumor
Chromatin Immunoprecipitation / statistics & numerical data*
Computational Biology
Epigenesis, Genetic
Female
Genome, Human
Humans
Protein Binding

Abstract

Publication types

MeSH terms

Grants and funding