The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

Jacob Schreiber; Carles Boix; Jin Wook Lee; Hongyang Li; Yuanfang Guan; Chun-Chieh Chang; Jen-Chien Chang; Alex Hawkins-Hooker; Bernhard Schölkopf; Gabriele Schweikert; Mateo Rojas Carulla; Arif Canakoglu; Francesco Guzzo; Luca Nanni; Marco Masseroli; Mark James Carman; Pietro Pinoli; Chenyang Hong; Kevin Y Yip; Jeffrey P Spence; Sanjit Singh Batra; Yun S Song; Shaun Mahony; Zheng Zhang; Wuwei Tan; Yang Shen; Yuanfei Sun; Minyi Shi; Jessika Adrian; Richard Sandstrom; Nina Farrell; Jessica Halow; Kristen Lee; Lixia Jiang; Xinqiong Yang; Charles Epstein; J Seth Strattan; Bradley Bernstein; Michael Snyder; Manolis Kellis; William Stafford; Anshul Kundaje; ENCODE Imputation Challenge Participants

doi:10.1186/s13059-023-02915-y

The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

Genome Biol. 2023 Apr 18;24(1):79. doi: 10.1186/s13059-023-02915-y.

Authors

Jacob Schreiber^#¹, Carles Boix^#², Jin Wook Lee², Hongyang Li², Yuanfang Guan², Chun-Chieh Chang², Jen-Chien Chang², Alex Hawkins-Hooker², Bernhard Schölkopf², Gabriele Schweikert², Mateo Rojas Carulla², Arif Canakoglu², Francesco Guzzo², Luca Nanni², Marco Masseroli², Mark James Carman², Pietro Pinoli², Chenyang Hong², Kevin Y Yip², Jeffrey P Spence², Sanjit Singh Batra², Yun S Song², Shaun Mahony², Zheng Zhang², Wuwei Tan², Yang Shen², Yuanfei Sun², Minyi Shi², Jessika Adrian², Richard Sandstrom², Nina Farrell², Jessica Halow², Kristen Lee², Lixia Jiang², Xinqiong Yang², Charles Epstein², J Seth Strattan², Bradley Bernstein², Michael Snyder², Manolis Kellis², William Stafford², Anshul Kundaje²; ENCODE Imputation Challenge Participants

Affiliations

¹ Stanford University School of Medicine, Stanford, CA, USA. jmschreiber91@gmail.com.
² Stanford University School of Medicine, Stanford, CA, USA.

^# Contributed equally.

Abstract

A promising alternative to comprehensively performing genomics experiments is to, instead, perform a subset of experiments and use computational methods to impute the remainder. However, identifying the best imputation methods and what measures meaningfully evaluate performance are open questions. We address these questions by comprehensively analyzing 23 methods from the ENCODE Imputation Challenge. We find that imputation evaluations are challenging and confounded by distributional shifts from differences in data collection and processing over time, the amount of available data, and redundancy among performance measures. Our analyses suggest simple steps for overcoming these issues and promising directions for more robust research.

The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding