An in-depth examination of requirements for disclosure risk assessment

Proc Natl Acad Sci U S A. 2023 Oct 24;120(43):e2220558120. doi: 10.1073/pnas.2220558120. Epub 2023 Oct 13.

Abstract

The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. We argue that any proposal for quantifying disclosure risk should be based on prespecified, objective criteria. We illustrate this approach to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. More research is needed, but in the near term, the counterfactual approach appears best-suited for privacy versus utility analysis.

Keywords: data access; data disclosure risk; federal statistical system.

MeSH terms

  • Censuses
  • Confidentiality*
  • Disclosure*
  • Privacy
  • Risk Assessment