Proteomic profiling dataset of chemical perturbations in multiple biological backgrounds

Sci Data. 2021 Aug 25;8(1):226. doi: 10.1038/s41597-021-01008-4.

Abstract

While gene expression profiling has traditionally been the method of choice for large-scale perturbational profiling studies, proteomics has emerged as an effective tool in this context for directly monitoring cellular responses to perturbations. We previously reported a pilot library containing 3400 profiles of multiple perturbations across diverse cellular backgrounds in the reduced-representation phosphoproteome (P100) and chromatin space (Global Chromatin Profiling, GCP). Here, we expand our original dataset to include profiles from a new set of cardiotoxic compounds and from astrocytes, an additional neural cell model, totaling 5300 proteomic signatures. We describe filtering criteria and quality control metrics used to assess and validate the technical quality and reproducibility of our data. To demonstrate the power of the library, we present two case studies where data is queried using the concept of "connectivity" to obtain biological insight. All data presented in this study have been deposited to the ProteomeXchange Consortium with identifiers PXD017458 (P100) and PXD017459 (GCP) and can be queried at https://clue.io/proteomics .

Publication types

  • Dataset
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antineoplastic Agents / toxicity*
  • Astrocytes / drug effects*
  • Astrocytes / metabolism*
  • Cardiotoxins / toxicity*
  • Cell Line, Tumor
  • Humans
  • Phosphorylation / drug effects
  • Protein Kinase Inhibitors / toxicity*
  • Protein Processing, Post-Translational / drug effects
  • Proteome
  • Proteomics*

Substances

  • Antineoplastic Agents
  • Cardiotoxins
  • Protein Kinase Inhibitors
  • Proteome