Combining Precursor and Fragment Information for Improved Detection of Differential Abundance in Data Independent Acquisition

Mol Cell Proteomics. 2020 Feb;19(2):421-430. doi: 10.1074/mcp.RA119.001705. Epub 2019 Dec 30.

Abstract

In bottom-up, label-free discovery proteomics, biological samples are acquired in a data-dependent (DDA) or data-independent (DIA) manner, with peptide signals recorded in an intact (MS1) and fragmented (MS2) form. While DDA has only the MS1 space for quantification, DIA contains both MS1 and MS2 at high quantitative quality. DIA profiles of complex biological matrices such as tissues or cells can contain quantitative interferences, and the interferences at the MS1 and the MS2 signals are often independent. When comparing biological conditions, the interferences can compromise the detection of differential peptide or protein abundance and lead to false positive or false negative conclusions.We hypothesized that the combined use of MS1 and MS2 quantitative signals could improve our ability to detect differentially abundant proteins. Therefore, we developed a statistical procedure incorporating both MS1 and MS2 quantitative information of DIA. We benchmarked the performance of the MS1-MS2-combined method to the individual use of MS1 or MS2 in DIA using four previously published controlled mixtures, as well as in two previously unpublished controlled mixtures. In the majority of the comparisons, the combined method outperformed the individual use of MS1 or MS2. This was particularly true for comparisons with low fold changes, few replicates, and situations where MS1 and MS2 were of similar quality. When applied to a previously unpublished investigation of lung cancer, the MS1-MS2-combined method increased the coverage of known activated pathways.Since recent technological developments continue to increase the quality of MS1 signals (e.g. using the BoxCar scan mode for Orbitrap instruments), the combination of the MS1 and MS2 information has a high potential for future statistical analysis of DIA data.

Keywords: Cancer Biomarker(s); Label-Free Quantification; Lung Cancer; Mass Spectrometry; Quantification; SWATH-MS.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Caenorhabditis elegans
  • Cerebellum / metabolism
  • Data Interpretation, Statistical
  • HeLa Cells
  • Humans
  • Lung / metabolism
  • Lung Neoplasms / metabolism
  • Mass Spectrometry
  • Mice
  • Proteomics / methods*
  • Saccharomyces cerevisiae